Model Catalog

One API for all models. Search our library, deploy and run inference on NVIDIA GPUs in seconds

Unlock $1 free API credit on first deposit of $5 - generate up to ~4M tokens

Qubrid AI Model Catalog
Model Category Context Provider Parameters Price(/1M tokens) Action
Vision Up to 1M Tokens Anthropic
Input: $6.50
Output: $32.50
Chat 300K Tokens NVIDIA Undisclosed
Input: $0.06
Output: $0.24
Vision 256K Tokens Moonshot AI 1T (32B active)
Input: $0.89
Output: $3.71
Chat 256K Tokens (up to 1M) NVIDIA 120B (12B active)
Input: $0.10
Output: $0.50
Chat 393,216 Tokens DeepSeek V4 family
Input: $0.14
Output: $0.28
Chat 200K Tokens Z.ai (Zhipu AI) 744B (40B active)
Input: $0.80
Output: $3.13
Chat 200K Tokens MiniMax 230B (10B active)
Input: $0.30
Output: $1.20
Vision 256K Tokens Moonshot AI 1T (32B active)
Input: $0.60
Output: $3.00
Code Up to 1M Tokens Alibaba (Cloud)
Input: $0.10
Output: $5.00
Vision Up to 1M Tokens Anthropic
Input: $6.50
Output: $32.50
Vision 1,050,000 Tokens OpenAI
Input: $5.00
Output: $22.00
Vision Up to 1M Tokens Google
Input: $2.00
Output: $12.00
Vision Up to 256K tokens / 10 images Alibaba (Cloud) Undisclosed (frontier-scale)
Input: $0.26
Output: $1.80
Chat 128K Tokens Alibaba (Cloud) Undisclosed
Input: $1.30
Output: $7.80
Chat 256K Tokens Moonshot AI 1T (32B active)
Input: $0.60
Output: $2.50
Vision 1M Tokens (API) / 256K Tokens (self-hosted base) Alibaba (Cloud) 35B (3B active) — hosted
Input: $0.10
Output: $0.40
Vision 256K Tokens (up to 1M) Alibaba (Cloud) 27B (dense)
Input: $0.30
Output: $2.40
Vision 256K Tokens (up to 1M) Alibaba (Cloud) 35B (3B active)
Input: $0.25
Output: $2.00
Vision 256K Tokens (up to 1M) Alibaba (Cloud) 122B (10B active)
Input: $0.40
Output: $3.20
Vision 256K Tokens (up to 1M) Alibaba (Cloud) 27B
Input: $0.60
Output: $3.60
Vision 256K Tokens (up to 1M) Alibaba (Cloud) 35B (A3B active)
Input: $0.25
Output: $1.49
Chat 128K Tokens DeepSeek 685B
Input: $0.56
Output: $1.68
Chat 128K Tokens DeepSeek 671B (37B active)
Input: $0.90
Output: $3.20
Chat 128K Tokens Alibaba (Cloud) 235B (22B active)
Input: $1.20
Output: $6.00
Vision 256K Tokens (up to 1M) Alibaba (Cloud) 235B (22B active)
Input: $0.40
Output: $4.00
Code 256K Tokens Alibaba (Cloud) 480B (35B active)
Input: $1.50
Output: $7.50
Chat 256K Tokens Alibaba (Cloud) 80B (3.9B active)
Input: $0.20
Output: $1.80
Chat 262k Tokens NVIDIA 31.6B Total / 3.2B Active
Input: $0.04
Output: $0.22
Chat 128K Tokens Meta 70B
Input: $0.27
Output: $0.85
OCR 16K Tokens Tencent Hunyuan 1.0B
Input: $0.21
Output: $0.35
Chat 64k Tokens DeepSeek 70B
Input: $1.20
Output: $1.80
Chat 8192 Tokens Microsoft 7B
Input: $0.21
Output: $0.25
Code N/A Alibaba (Cloud) 1.1B
Input: $0.79
Output: $0.79
Chat 256k Tokens OpenAI 121.7B
Input: $0.15
Output: $0.61
Code 262K Tokens Alibaba (Cloud) 79.7B (3B active)
Input: $0.30
Output: $1.50
Chat 32K Tokens MistralAI 7.3B
Input: $0.21
Output: $0.25
Vision 32K Tokens Alibaba (Cloud) 9B
Input: $0.25
Output: $0.44
Code Up to 1M Tokens Alibaba (Cloud)
Input: $0.30
Output: $5.00
Chat Up to 1M Tokens Alibaba (Cloud)
Input: $0.40
Output: $1.20
Vision Up to 128K Tokens Alibaba (Cloud) 235B
Input: $0.40
Output: $1.60
Vision Up to 256K Tokens Alibaba (Cloud)
Input: $0.05
Output: $0.40
Vision Up to 256K Tokens Alibaba (Cloud)
Input: $0.20
Output: $1.60
Vision 128K Tokens Alibaba (Cloud) 30B
Input: $1.15
Output: $1.17
Chat 128K Tokens DeepSeek 671B (37B active)
Input: $0.30
Output: $1.30
Chat 393,216 Tokens DeepSeek V4 family
Input: $1.74
Output: $3.48
Vision 1M Tokens (API) / 262K Tokens (self-hosted base) Alibaba (Cloud) 397B (17B active) — hosted
Input: $0.40
Output: $2.40
Chat 205K Tokens Z.ai (Zhipu AI) 355B (32B active)
Input: $0.60
Output: $2.20
Chat 256K Tokens Moonshot AI 1T (32B active)
Input: $0.50
Output: $2.40
Vision Up to 1M Tokens Google
Input: $0.50
Output: $3.00
Vision Up to 1M Tokens Google
Input: $1.25
Output: $10.00
Vision Up to 1M Tokens Google
Input: $0.15
Output: $0.25
Vision Up to 200K Tokens Anthropic
Input: $6.50
Output: $32.50
Vision Up to 1M Tokens Anthropic
Input: $3.90
Output: $19.50
Vision Up to 200K Tokens (1M via beta header `context-1m-2025-08-07`) Anthropic
Input: $3.90
Output: $19.50
Vision Up to 200K Tokens Anthropic
Input: $1.30
Output: $6.50
Vision 128K Tokens OpenAI
Input: $2.50
Output: $10.00
Vision 128K Tokens OpenAI
Input: $0.40
Output: $1.60
Vision 1,047,576 Tokens OpenAI
Input: $2.00
Output: $8.00
Vision 400K Tokens OpenAI
Input: $1.50
Output: $6.00
Vision 400K Tokens OpenAI
Input: $0.20
Output: $1.25
Showing 0 of 0 models

Sign up to get $1.00 free API credit on first deposit of $5. Test out the latest models now.

Access enterprise-grade open-source AI models including Llama 3, DeepSeek, Qwen, and more via our high-performance serverless API. Experience low-latency inference on the latest NVIDIA GPUs optimized for production workloads.

"Qubrid's medical OCR and research parsing cut our document extraction time in half. We now have traceable pipelines and reproducible outputs that meet our compliance requirements."

Clinical AI Team

Research & Clinical Intelligence