Model Catalog
One API for all models. Search our library, deploy and run inference on NVIDIA GPUs in seconds
Unlock $1 free API credit on first deposit of $5 - generate up to ~4M tokens
| Model | Category | Context | Provider | Parameters | Price(/1M tokens) | Action |
|---|---|---|---|---|---|---|
| Vision | Up to 1M Tokens | Anthropic | Input: $6.50 Output: $32.50 | |||
| Chat | 300K Tokens | NVIDIA | Undisclosed | Input: $0.06 Output: $0.24 | ||
| Vision | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.89 Output: $3.71 | ||
| Chat | 256K Tokens (up to 1M) | NVIDIA | 120B (12B active) | Input: $0.10 Output: $0.50 | ||
| Chat | 393,216 Tokens | DeepSeek | V4 family | Input: $0.14 Output: $0.28 | ||
| Chat | 200K Tokens | Z.ai (Zhipu AI) | 744B (40B active) | Input: $0.80 Output: $3.13 | ||
| Chat | 200K Tokens | MiniMax | 230B (10B active) | Input: $0.30 Output: $1.20 | ||
| Vision | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.60 Output: $3.00 | ||
| Code | Up to 1M Tokens | Alibaba (Cloud) | Input: $0.10 Output: $5.00 | |||
| Vision | Up to 1M Tokens | Anthropic | Input: $6.50 Output: $32.50 | |||
| Vision | 1,050,000 Tokens | OpenAI | Input: $5.00 Output: $22.00 | |||
| Vision | Up to 1M Tokens | Input: $2.00 Output: $12.00 | ||||
| Vision | Up to 256K tokens / 10 images | Alibaba (Cloud) | Undisclosed (frontier-scale) | Input: $0.26 Output: $1.80 | ||
| Chat | 128K Tokens | Alibaba (Cloud) | Undisclosed | Input: $1.30 Output: $7.80 | ||
| Chat | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.60 Output: $2.50 | ||
| Vision | 1M Tokens (API) / 256K Tokens (self-hosted base) | Alibaba (Cloud) | 35B (3B active) — hosted | Input: $0.10 Output: $0.40 | ||
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 27B (dense) | Input: $0.30 Output: $2.40 | ||
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 35B (3B active) | Input: $0.25 Output: $2.00 | ||
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 122B (10B active) | Input: $0.40 Output: $3.20 | ||
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 27B | Input: $0.60 Output: $3.60 | ||
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 35B (A3B active) | Input: $0.25 Output: $1.49 | ||
| Chat | 128K Tokens | DeepSeek | 685B | Input: $0.56 Output: $1.68 | ||
| Chat | 128K Tokens | DeepSeek | 671B (37B active) | Input: $0.90 Output: $3.20 | ||
| Chat | 128K Tokens | Alibaba (Cloud) | 235B (22B active) | Input: $1.20 Output: $6.00 | ||
| Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 235B (22B active) | Input: $0.40 Output: $4.00 | ||
| Code | 256K Tokens | Alibaba (Cloud) | 480B (35B active) | Input: $1.50 Output: $7.50 | ||
| Chat | 256K Tokens | Alibaba (Cloud) | 80B (3.9B active) | Input: $0.20 Output: $1.80 | ||
| Chat | 262k Tokens | NVIDIA | 31.6B Total / 3.2B Active | Input: $0.04 Output: $0.22 | ||
| Chat | 128K Tokens | Meta | 70B | Input: $0.27 Output: $0.85 | ||
| OCR | 16K Tokens | Tencent Hunyuan | 1.0B | Input: $0.21 Output: $0.35 | ||
| Chat | 64k Tokens | DeepSeek | 70B | Input: $1.20 Output: $1.80 | ||
| Chat | 8192 Tokens | Microsoft | 7B | Input: $0.21 Output: $0.25 | ||
| Code | N/A | Alibaba (Cloud) | 1.1B | Input: $0.79 Output: $0.79 | ||
| Chat | 256k Tokens | OpenAI | 121.7B | Input: $0.15 Output: $0.61 | ||
| Code | 262K Tokens | Alibaba (Cloud) | 79.7B (3B active) | Input: $0.30 Output: $1.50 | ||
| Chat | 32K Tokens | MistralAI | 7.3B | Input: $0.21 Output: $0.25 | ||
| Vision | 32K Tokens | Alibaba (Cloud) | 9B | Input: $0.25 Output: $0.44 | ||
| Code | Up to 1M Tokens | Alibaba (Cloud) | Input: $0.30 Output: $5.00 | |||
| Chat | Up to 1M Tokens | Alibaba (Cloud) | Input: $0.40 Output: $1.20 | |||
| Vision | Up to 128K Tokens | Alibaba (Cloud) | 235B | Input: $0.40 Output: $1.60 | ||
| Vision | Up to 256K Tokens | Alibaba (Cloud) | Input: $0.05 Output: $0.40 | |||
| Vision | Up to 256K Tokens | Alibaba (Cloud) | Input: $0.20 Output: $1.60 | |||
| Vision | 128K Tokens | Alibaba (Cloud) | 30B | Input: $1.15 Output: $1.17 | ||
| Chat | 128K Tokens | DeepSeek | 671B (37B active) | Input: $0.30 Output: $1.30 | ||
| Chat | 393,216 Tokens | DeepSeek | V4 family | Input: $1.74 Output: $3.48 | ||
| Vision | 1M Tokens (API) / 262K Tokens (self-hosted base) | Alibaba (Cloud) | 397B (17B active) — hosted | Input: $0.40 Output: $2.40 | ||
| Chat | 205K Tokens | Z.ai (Zhipu AI) | 355B (32B active) | Input: $0.60 Output: $2.20 | ||
| Chat | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.50 Output: $2.40 | ||
| Vision | Up to 1M Tokens | Input: $0.50 Output: $3.00 | ||||
| Vision | Up to 1M Tokens | Input: $1.25 Output: $10.00 | ||||
| Vision | Up to 1M Tokens | Input: $0.15 Output: $0.25 | ||||
| Vision | Up to 200K Tokens | Anthropic | Input: $6.50 Output: $32.50 | |||
| Vision | Up to 1M Tokens | Anthropic | Input: $3.90 Output: $19.50 | |||
| Vision | Up to 200K Tokens (1M via beta header `context-1m-2025-08-07`) | Anthropic | Input: $3.90 Output: $19.50 | |||
| Vision | Up to 200K Tokens | Anthropic | Input: $1.30 Output: $6.50 | |||
| Vision | 128K Tokens | OpenAI | Input: $2.50 Output: $10.00 | |||
| Vision | 128K Tokens | OpenAI | Input: $0.40 Output: $1.60 | |||
| Vision | 1,047,576 Tokens | OpenAI | Input: $2.00 Output: $8.00 | |||
| Vision | 400K Tokens | OpenAI | Input: $1.50 Output: $6.00 | |||
| Vision | 400K Tokens | OpenAI | Input: $0.20 Output: $1.25 |
Showing 0 of 0 models
No models match your search. Try a different keyword or category.
Sign up to get $1.00 free API credit on first deposit of $5. Test out the latest models now.
Access enterprise-grade open-source AI models including Llama 3, DeepSeek, Qwen, and more via our high-performance serverless API. Experience low-latency inference on the latest NVIDIA GPUs optimized for production workloads.
"Qubrid enabled us to deploy production AI agents with reliable tool-calling and step tracing. We now ship agents faster with full visibility into every decision and API call."
AI Agents Team
Agent Systems & Orchestration
