Skip to content

Popular repositories Loading

  1. rmbg-1.4 rmbg-1.4 Public template

    State-of-the-art background removal model, designed to effectively separate foreground from background. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

    Python 20 11

  2. triton-co-pilot triton-co-pilot Public

    Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments

    Python 19 3

  3. Smaug-72B Smaug-72B Public

    Smaug-72B - which topped the Hugging Face LLM leaderboard and it’s the first model with an average score of 80, making it the world’s best open-source foundation model.

    Python 17 5

  4. whisper-large-v3 whisper-large-v3 Public

    State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

    Python 16 13

  5. qwq-32b-preview qwq-32b-preview Public template

    A 32B experimental reasoning model for advanced text generation and robust instruction following. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 16 6

  6. deepseek-r1-distill-qwen-32b deepseek-r1-distill-qwen-32b Public template

    A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 16 20

Repositories

Showing 10 of 159 repositories
  • Document-RAG-Upload Public

    This is a semantic search application build using Inferless and Pinecone.

    inferless/Document-RAG-Upload’s past year of commit activity
    Python 0 4 0 0 Updated Mar 28, 2025
  • inferless/Customer-Service-Voicebot’s past year of commit activity
    Python 2 2 0 0 Updated Mar 28, 2025
  • inferless/Voice-Conversational-Chatbot’s past year of commit activity
    Python 2 2 0 0 Updated Mar 28, 2025
  • inferless/Logo-Generator’s past year of commit activity
    Python 2 6 0 0 Updated Mar 28, 2025
  • inferless/spatiallm-qwen-0.5b’s past year of commit activity
    Python 0 2 0 0 Updated Mar 28, 2025
  • inferless/spatiallm-llama-1b’s past year of commit activity
    Python 0 2 0 0 Updated Mar 28, 2025
  • mistral-small-3.1-24b-instruct Public template

    Advanced multimodal language model developed by Mistral AI with enhanced text performance, robust vision capabilities, and an expanded context window of up to 128,000 tokens. <metadata> gpu: A10 | collections: ["HF Transformers"] </metadata>

    inferless/mistral-small-3.1-24b-instruct’s past year of commit activity
    Python 0 7 0 0 Updated Mar 28, 2025
  • gemma-3-27b-it Public template

    Gemma-3-27B-it is a multimodal model that handles both text and image inputs, supports over 140 languages, and features a context window of up to 128,000 tokens. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>

    inferless/gemma-3-27b-it’s past year of commit activity
    Python 0 6 0 0 Updated Mar 28, 2025
  • gemma-2b-it Public template

    2B instruct-tuned model for delivering coherent and instruction-following responses across a wide range of tasks. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    inferless/gemma-2b-it’s past year of commit activity
    Python 1 2 0 0 Updated Mar 25, 2025
  • stable-diffusion-v1-5 Public template

    A text-to-image model by Stability AI, renowned for generating high-quality, diverse images from text prompts. <metadata> gpu: T4 | collections: ["Diffusers"] </metadata>

    inferless/stable-diffusion-v1-5’s past year of commit activity
    Python 0 1 0 0 Updated Mar 25, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…