Support

AI-powered help

Welcome!

Please introduce yourself before we start.

    LLM Gateway
    • Docs
    • Pricing
    • Pricing
    • Docs
    • Models
    1.2k
    Log InGet Started

    Models

    Comprehensive list of all supported models and their providers

    Compare

    Use Case

    Capabilities

    Provider

    Input Price ($/M tokens)

    Output Price ($/M tokens)

    Context Size (tokens)

    240/240
    Models
    29/30
    Providers
    115
    Vision Models (filtered)
    157
    Tool-enabled (filtered)
    3
    Free Models (filtered)
    Features
    Together AI
    kimi-k2.5
    $0.50$2.80—
    Alibaba Cloud(cn-beijing)
    kimi-k2.5
    $0.57$3.01—
    Nebius AI
    qwen3-30b-a3b
    $0.10$0.30—
    NovitaAI
    qwen3-vl-8b-instruct
    $0.08$0.50—
    Nebius AI
    llama-3.3-70b-instruct
    $0.13$0.40—
    Cerebras
    llama-3.3-70b-instruct
    $0.85$1.20—
    NovitaAI
    llama-3.3-70b-instruct
    $0.14$0.40—
    MiniMax
    minimax-m2.1
    $0.27$1.10—
    NovitaAI
    minimax-m2.1
    $0.30$1.20$0.03
    xAI
    grok-4-20-beta-0309-reasoning
    $2.00$6.00$0.20
    NovitaAI
    llama-3-70b-instruct
    $0.51$0.74—
    MiniMax
    minimax-m2
    $0.20$1.00$0.03
    MiniMax
    minimax-m2.5-highspeed
    $0.60$2.40$0.03
    NovitaAI
    qwen3-next-80b-a3b-thinking
    $0.15$1.50—
    Alibaba Cloud
    qwen3-next-80b-a3b-thinking
    $0.50$0.40
    -20% off
    $6.00$4.80
    -20% off
    —
    Nebius AI
    qwen3-next-80b-a3b-thinking
    $0.15$1.20—
    Alibaba Cloud(cn-beijing)
    qwen-plus-latest
    $0.12$0.09
    -20% off
    $0.29$0.23
    -20% off
    $0.02$0.02
    -20% off
    Alibaba Cloud(singapore)
    qwen-plus-latest
    $0.40$0.32
    -20% off
    $1.20$0.96
    -20% off
    $0.08$0.06
    -20% off
    Alibaba Cloud
    qwen-plus-latest
    $0.40$0.32
    -20% off
    $1.20$0.96
    -20% off
    $0.08$0.06
    -20% off
    Nebius AI
    qwen3-30b-a3b-thinking-2507
    $0.10$0.30—
    Nebius AI
    qwen25-72b-instruct
    $0.13$0.40—
    Nebius AI
    qwen-qwq-32b
    $0.15$0.45—
    Alibaba Cloud
    qwen3-coder-plus
    $6.00$4.80
    -20% off
    $60.00$48.00
    -20% off
    —
    Nebius AI
    qwen3-32b
    $0.10$0.30—
    Cerebras
    qwen3-32b
    $0.40$0.80—
    Nebius AI
    deepseek-v3
    $0.50$1.50—
    Nebius AI
    qwen3-14b
    $0.08$0.24—
    Nebius AI
    qwen3-30b-a3b-instruct-2507
    $0.10$0.30—
    xAI
    grok-4-1-fast-non-reasoning
    $0.20$0.50$0.05
    Azure AI Foundry
    grok-4-1-fast-non-reasoning
    $0.20$0.16
    -20% off
    $0.50$0.40
    -20% off
    —
    Xiaomi
    mimo-v2.5
    $0.40$2.00$0.08
    Xiaomi
    mimo-v2.5-pro
    $1.00$3.00$0.20
    xAI
    grok-imagine-image-pro
    $0.070/req——
    xAI
    grok-4-fast-reasoning
    $0.20$0.50$0.05
    Xiaomi
    mimo-v2-omni
    $0.40$2.00$0.08
    NovitaAI
    qwen3-235b-a22b-thinking-2507
    $0.30$3.00—
    Nebius AI
    qwen3-235b-a22b-thinking-2507
    $0.20$0.60—
    Inference.net
    llama-3.2-11b-instruct
    $0.07$0.33—
    Nebius AI
    qwen3-235b-a22b-instruct-2507
    $0.20$0.60—
    NovitaAI
    qwen3-235b-a22b-instruct-2507
    $0.09$0.58—
    Cerebras
    qwen3-235b-a22b-instruct-2507
    $0.60$1.20—
    Groq
    llama-guard-4-12b
    $0.20$0.20—
    Azure AI Foundry
    grok-4-1-fast-reasoning
    $0.20$0.16
    -20% off
    $0.50$0.40
    -20% off
    —
    xAI
    grok-4-1-fast-reasoning
    $0.20$0.50$0.05
    DeepSeek
    deepseek-v3.2
    $0.28$0.24
    -15% off
    $0.42$0.36
    -15% off
    $0.03$0.02
    -15% off
    Alibaba Cloud
    deepseek-v3.2
    $0.57$0.46
    -20% off
    $1.71$1.37
    -20% off
    $0.11$0.09
    -20% off
    NovitaAI
    deepseek-v3.2
    $0.27$0.40$0.13
    ByteDance
    deepseek-v3.2
    $0.28$0.42$0.06
    Nebius AI
    deepseek-v3.2
    $0.30$0.45—
    Alibaba Cloud(singapore)
    deepseek-v3.2
    $0.57$0.46
    -20% off
    $1.71$1.37
    -20% off
    $0.11$0.09
    -20% off
    Page 5 of 9

    Newsletter

    Stay ahead of the curve

    Join developers who get weekly insights on LLM routing, new model launches, and cost optimization — straight to their inbox.

    • New models & providers as they drop
    • Tips to cut latency & costs
    • Early access to beta features

    No spam. Unsubscribe anytime.

    LLM Gateway

    Product

    • Features
    • Models
    • Providers
    • Chat Playground
    • Changelog
    • DevPass
    • Compare Models
    • Enterprise

    Resources

    • Apps
    • Templates
    • Agents
    • MCP Server
    • Blog
    • Documentation
    • Integrations
    • Guides
    • Brand Assets
    • Token Cost Calculator
    • Referral Program
    • GitHub
    • Contact Us

    Community

    • Twitter
    • Discord

    Compare

    • OpenRouter
    • LiteLLM

    Models

    • Text Generation
    • Text to Image
    • Image to Image
    • Vision
    • Reasoning
    • Tool Calling
    • Web Search
    • Discounted

    Providers

    • OpenAI
    • Anthropic
    • Google AI Studio
    • Glacier
    • Google Vertex AI
    • Quartz
    • Avalanche
    • Groq
    • Cerebras
    • xAI
    • DeepSeek
    • Alibaba Cloud
    • NovitaAI
    • AWS Bedrock
    • Azure
    • Azure AI Foundry
    • Z AI
    • Moonshot AI
    • Perplexity
    • Nebius AI
    • Mistral AI
    • Inference.net
    • Together AI
    • Custom
    • NanoGPT
    • ByteDance
    • MiniMax
    • EmberCloud
    • Xiaomi

    © 2026 LLM Gateway. All rights reserved.

    All systems operationalPrivacy PolicyTerms of Use