Model Catalog

One API for all models. Search our library, deploy, and run inference on NVIDIA GPUs in seconds.

Unlock $1 in free API credit on your first deposit of $5, enough to generate up to about 4M tokens.

Qubrid AI Model Catalog
| Category | Context | Provider | Parameters | Input price (per 1M tokens) | Output price (per 1M tokens) |
|---|---|---|---|---|---|
| Vision | Up to 1M Tokens | Anthropic | | $6.50 | $32.50 |
| Chat | 300K Tokens | NVIDIA | Undisclosed | $0.06 | $0.24 |
| Vision | 256K Tokens | Moonshot AI | 1T (32B active) | $0.89 | $3.71 |
| Chat | 256K Tokens (up to 1M) | NVIDIA | 120B (12B active) | $0.10 | $0.50 |
| Chat | 393,216 Tokens | DeepSeek | V4 family | $0.14 | $0.28 |
| Chat | 200K Tokens | Z.ai (Zhipu AI) | 744B (40B active) | $0.80 | $3.13 |
| Chat | 200K Tokens | MiniMax | 230B (10B active) | $0.30 | $1.20 |
| Vision | 256K Tokens | Moonshot AI | 1T (32B active) | $0.60 | $3.00 |
| Code | Up to 1M Tokens | Alibaba (Cloud) | | $0.10 | $5.00 |
| Vision | Up to 1M Tokens | Anthropic | | $6.50 | $32.50 |
| Vision | 1,050,000 Tokens | OpenAI | | $5.00 | $22.00 |
| Vision | Up to 1M Tokens | Google | | $2.00 | $12.00 |
| Vision | Up to 256K Tokens / 10 images | Alibaba (Cloud) | Undisclosed (frontier-scale) | $0.26 | $1.80 |
| Chat | 128K Tokens | Alibaba (Cloud) | Undisclosed | $1.30 | $7.80 |
| Chat | 256K Tokens | Moonshot AI | 1T (32B active) | $0.60 | $2.50 |
| Vision | 1M Tokens (API) / 256K Tokens (self-hosted base) | Alibaba (Cloud) | 35B (3B active), hosted | $0.10 | $0.40 |
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 27B (dense) | $0.30 | $2.40 |
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 35B (3B active) | $0.25 | $2.00 |
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 122B (10B active) | $0.40 | $3.20 |
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 27B | $0.60 | $3.60 |
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 35B (A3B active) | $0.25 | $1.49 |
| Chat | 128K Tokens | DeepSeek | 685B | $0.56 | $1.68 |
| Chat | 128K Tokens | DeepSeek | 671B (37B active) | $0.90 | $3.20 |
| Chat | 128K Tokens | Alibaba (Cloud) | 235B (22B active) | $1.20 | $6.00 |
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 235B (22B active) | $0.40 | $4.00 |
| Code | 256K Tokens | Alibaba (Cloud) | 480B (35B active) | $1.50 | $7.50 |
| Chat | 256K Tokens | Alibaba (Cloud) | 80B (3.9B active) | $0.20 | $1.80 |
| Chat | 262K Tokens | NVIDIA | 31.6B (3.2B active) | $0.04 | $0.22 |
| Chat | 128K Tokens | Meta | 70B | $0.27 | $0.85 |
| OCR | 16K Tokens | Tencent Hunyuan | 1.0B | $0.21 | $0.35 |
| Chat | 64K Tokens | DeepSeek | 70B | $1.20 | $1.80 |
| Chat | 8192 Tokens | Microsoft | 7B | $0.21 | $0.25 |
| Code | N/A | Alibaba (Cloud) | 1.1B | $0.79 | $0.79 |
| Chat | 256K Tokens | OpenAI | 121.7B | $0.15 | $0.61 |
| Code | 262K Tokens | Alibaba (Cloud) | 79.7B (3B active) | $0.30 | $1.50 |
| Chat | 32K Tokens | MistralAI | 7.3B | $0.21 | $0.25 |
| Vision | 32K Tokens | Alibaba (Cloud) | 9B | $0.25 | $0.44 |
| Code | Up to 1M Tokens | Alibaba (Cloud) | | $0.30 | $5.00 |
| Chat | Up to 1M Tokens | Alibaba (Cloud) | | $0.40 | $1.20 |
| Vision | Up to 128K Tokens | Alibaba (Cloud) | 235B | $0.40 | $1.60 |
| Vision | Up to 256K Tokens | Alibaba (Cloud) | | $0.05 | $0.40 |
| Vision | Up to 256K Tokens | Alibaba (Cloud) | | $0.20 | $1.60 |
| Vision | 128K Tokens | Alibaba (Cloud) | 30B | $1.15 | $1.17 |
| Chat | 128K Tokens | DeepSeek | 671B (37B active) | $0.30 | $1.30 |
| Chat | 393,216 Tokens | DeepSeek | V4 family | $1.74 | $3.48 |
| Vision | 1M Tokens (API) / 262K Tokens (self-hosted base) | Alibaba (Cloud) | 397B (17B active), hosted | $0.40 | $2.40 |
| Chat | 205K Tokens | Z.ai (Zhipu AI) | 355B (32B active) | $0.60 | $2.20 |
| Chat | 256K Tokens | Moonshot AI | 1T (32B active) | $0.50 | $2.40 |
| Vision | Up to 1M Tokens | Google | | $0.50 | $3.00 |
| Vision | Up to 1M Tokens | Google | | $1.25 | $10.00 |
| Vision | Up to 1M Tokens | Google | | $0.15 | $0.25 |
| Vision | Up to 200K Tokens | Anthropic | | $6.50 | $32.50 |
| Vision | Up to 1M Tokens | Anthropic | | $3.90 | $19.50 |
| Vision | Up to 200K Tokens (1M via beta header `context-1m-2025-08-07`) | Anthropic | | $3.90 | $19.50 |
| Vision | Up to 200K Tokens | Anthropic | | $1.30 | $6.50 |
| Vision | 128K Tokens | OpenAI | | $2.50 | $10.00 |
| Vision | 128K Tokens | OpenAI | | $0.40 | $1.60 |
| Vision | 1,047,576 Tokens | OpenAI | | $2.00 | $8.00 |
| Vision | 400K Tokens | OpenAI | | $1.50 | $6.00 |
| Vision | 400K Tokens | OpenAI | | $0.20 | $1.25 |
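
Because catalog prices are quoted per 1M tokens, with separate input and output rates, the cost of a workload is the sum of the two token counts scaled by their respective rates. The sketch below is a minimal illustration of that arithmetic only. The function names `estimate_cost` and `tokens_for_budget` are hypothetical helpers, not part of any Qubrid API, and the token counts in the example are made up.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # catalog prices are quoted per 1M tokens


def estimate_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the estimated cost in USD for one workload.

    Prices are USD per 1M tokens, passed as strings so Decimal stays exact.
    (Hypothetical helper for illustration; not a Qubrid API.)
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * Decimal(input_price)
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * Decimal(output_price)
    return input_cost + output_cost


def tokens_for_budget(budget, price):
    """Return how many tokens a USD budget buys at one per-1M-token price."""
    return int(Decimal(budget) / Decimal(price) * TOKENS_PER_UNIT)


# Example: a catalog row priced at $0.10 input / $0.50 output per 1M tokens,
# with an assumed workload of 2M input tokens and 500K output tokens.
cost = estimate_cost(2_000_000, 500_000, "0.10", "0.50")
print(f"${cost:.2f}")  # prints "$0.45"
```

As a rough check on the promotional figure, `tokens_for_budget("1.00", "0.25")` returns 4,000,000, so $1 of credit at an output rate of $0.25 per 1M tokens buys about 4M tokens. The page does not say which model or rate the "~4M tokens" figure assumes, so this is only one combination that is consistent with it.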

Sign up to get $1.00 in free API credit on your first deposit of $5. Test out the latest models now.

Access enterprise-grade open-source AI models including Llama 3, DeepSeek, Qwen, and more via our high-performance serverless API. Experience low-latency inference on the latest NVIDIA GPUs optimized for production workloads.

"Qubrid enabled us to deploy production AI agents with reliable tool-calling and step tracing. We now ship agents faster with full visibility into every decision and API call."

AI Agents Team

Agent Systems & Orchestration