Model Catalog
One API for all models. Search our library, then deploy and run inference on NVIDIA GPUs in seconds.
Unlock $1 of free API credit on your first deposit of $5 and generate up to ~4M tokens.
| Model | Category | Context | Provider | Parameters | Price (per 1M tokens) |
|---|---|---|---|---|---|
| | Vision | Up to 1M Tokens | Anthropic | | Input: $6.50 Output: $32.50 |
| | Chat | 300K Tokens | NVIDIA | Undisclosed | Input: $0.06 Output: $0.24 |
| | Vision | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.89 Output: $3.71 |
| | Chat | 256K Tokens (up to 1M) | NVIDIA | 120B (12B active) | Input: $0.10 Output: $0.50 |
| | Chat | 393,216 Tokens | DeepSeek | V4 family | Input: $0.14 Output: $0.28 |
| | Chat | 200K Tokens | Z.ai (Zhipu AI) | 744B (40B active) | Input: $0.80 Output: $3.13 |
| | Chat | 200K Tokens | MiniMax | 230B (10B active) | Input: $0.30 Output: $1.20 |
| | Vision | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.60 Output: $3.00 |
| | Code | Up to 1M Tokens | Alibaba (Cloud) | | Input: $0.10 Output: $5.00 |
| | Vision | Up to 1M Tokens | Anthropic | | Input: $6.50 Output: $32.50 |
| | Vision | 1,050,000 Tokens | OpenAI | | Input: $5.00 Output: $22.00 |
| | Vision | Up to 1M Tokens | | | Input: $2.00 Output: $12.00 |
| | Vision | Up to 256K Tokens / 10 images | Alibaba (Cloud) | Undisclosed (frontier-scale) | Input: $0.26 Output: $1.80 |
| | Chat | 128K Tokens | Alibaba (Cloud) | Undisclosed | Input: $1.30 Output: $7.80 |
| | Chat | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.60 Output: $2.50 |
| | Vision | 1M Tokens (API) / 256K Tokens (self-hosted base) | Alibaba (Cloud) | 35B (3B active), hosted | Input: $0.10 Output: $0.40 |
| | Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 27B (dense) | Input: $0.30 Output: $2.40 |
| | Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 35B (3B active) | Input: $0.25 Output: $2.00 |
| | Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 122B (10B active) | Input: $0.40 Output: $3.20 |
| | Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 27B | Input: $0.60 Output: $3.60 |
| | Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 35B (3B active) | Input: $0.25 Output: $1.49 |
| | Chat | 128K Tokens | DeepSeek | 685B | Input: $0.56 Output: $1.68 |
| | Chat | 128K Tokens | DeepSeek | 671B (37B active) | Input: $0.90 Output: $3.20 |
| | Chat | 128K Tokens | Alibaba (Cloud) | 235B (22B active) | Input: $1.20 Output: $6.00 |
| | Vision | 256K Tokens (up to 1M) | Alibaba (Cloud) | 235B (22B active) | Input: $0.40 Output: $4.00 |
| | Code | 256K Tokens | Alibaba (Cloud) | 480B (35B active) | Input: $1.50 Output: $7.50 |
| | Chat | 256K Tokens | Alibaba (Cloud) | 80B (3.9B active) | Input: $0.20 Output: $1.80 |
| | Chat | 262K Tokens | NVIDIA | 31.6B (3.2B active) | Input: $0.04 Output: $0.22 |
| | Chat | 128K Tokens | Meta | 70B | Input: $0.27 Output: $0.85 |
| | OCR | 16K Tokens | Tencent Hunyuan | 1.0B | Input: $0.21 Output: $0.35 |
| | Chat | 64K Tokens | DeepSeek | 70B | Input: $1.20 Output: $1.80 |
| | Chat | 8,192 Tokens | Microsoft | 7B | Input: $0.21 Output: $0.25 |
| | Code | N/A | Alibaba (Cloud) | 1.1B | Input: $0.79 Output: $0.79 |
| | Chat | 256K Tokens | OpenAI | 121.7B | Input: $0.15 Output: $0.61 |
| | Code | 262K Tokens | Alibaba (Cloud) | 79.7B (3B active) | Input: $0.30 Output: $1.50 |
| | Chat | 32K Tokens | MistralAI | 7.3B | Input: $0.21 Output: $0.25 |
| | Vision | 32K Tokens | Alibaba (Cloud) | 9B | Input: $0.25 Output: $0.44 |
| | Code | Up to 1M Tokens | Alibaba (Cloud) | | Input: $0.30 Output: $5.00 |
| | Chat | Up to 1M Tokens | Alibaba (Cloud) | | Input: $0.40 Output: $1.20 |
| | Vision | Up to 128K Tokens | Alibaba (Cloud) | 235B | Input: $0.40 Output: $1.60 |
| | Vision | Up to 256K Tokens | Alibaba (Cloud) | | Input: $0.05 Output: $0.40 |
| | Vision | Up to 256K Tokens | Alibaba (Cloud) | | Input: $0.20 Output: $1.60 |
| | Vision | 128K Tokens | Alibaba (Cloud) | 30B | Input: $1.15 Output: $1.17 |
| | Chat | 128K Tokens | DeepSeek | 671B (37B active) | Input: $0.30 Output: $1.30 |
| | Chat | 393,216 Tokens | DeepSeek | V4 family | Input: $1.74 Output: $3.48 |
| | Vision | 1M Tokens (API) / 262K Tokens (self-hosted base) | Alibaba (Cloud) | 397B (17B active), hosted | Input: $0.40 Output: $2.40 |
| | Chat | 205K Tokens | Z.ai (Zhipu AI) | 355B (32B active) | Input: $0.60 Output: $2.20 |
| | Chat | 256K Tokens | Moonshot AI | 1T (32B active) | Input: $0.50 Output: $2.40 |
| | Vision | Up to 1M Tokens | | | Input: $0.50 Output: $3.00 |
| | Vision | Up to 1M Tokens | | | Input: $1.25 Output: $10.00 |
| | Vision | Up to 1M Tokens | | | Input: $0.15 Output: $0.25 |
| | Vision | Up to 200K Tokens | Anthropic | | Input: $6.50 Output: $32.50 |
| | Vision | Up to 1M Tokens | Anthropic | | Input: $3.90 Output: $19.50 |
| | Vision | Up to 200K Tokens (1M via beta header `context-1m-2025-08-07`) | Anthropic | | Input: $3.90 Output: $19.50 |
| | Vision | Up to 200K Tokens | Anthropic | | Input: $1.30 Output: $6.50 |
| | Vision | 128K Tokens | OpenAI | | Input: $2.50 Output: $10.00 |
| | Vision | 128K Tokens | OpenAI | | Input: $0.40 Output: $1.60 |
| | Vision | 1,047,576 Tokens | OpenAI | | Input: $2.00 Output: $8.00 |
| | Vision | 400K Tokens | OpenAI | | Input: $1.50 Output: $6.00 |
| | Vision | 400K Tokens | OpenAI | | Input: $0.20 Output: $1.25 |
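
Because prices in the table are quoted per 1M tokens, with separate input and output rates, the cost of a single request is simple arithmetic. The sketch below is illustrative only: the function names `request_cost_usd` and `tokens_for_budget` are hypothetical helpers, not part of any API described on this page, and the token counts are made-up inputs. The example rates (Input: $0.06, Output: $0.24) are taken from one row of the table. The page does not say which model the "~4M tokens" credit estimate assumes; the last line only shows that $1 at a rate of $0.25 per 1M tokens works out to 4M tokens.

```python
def request_cost_usd(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Cost in USD of one request, given prices quoted per 1M tokens."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


def tokens_for_budget(budget_usd, price_per_m):
    """Whole number of tokens a budget buys at a single per-1M-token price."""
    return int(budget_usd / price_per_m * 1_000_000)


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens,
    # priced at Input: $0.06 / Output: $0.24 per 1M tokens (one table row).
    cost = request_cost_usd(10_000, 2_000, 0.06, 0.24)
    print(f"Request cost: ${cost:.5f}")

    # Tokens that $1.00 of credit buys at $0.25 per 1M tokens.
    print(f"Tokens for $1.00: {tokens_for_budget(1.00, 0.25):,}")
```

Real usage is usually a mix of input and output tokens, so an estimate based on a single rate is only an upper or lower bound, depending on which rate is used.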
Sign up to get $1.00 of free API credit on your first deposit of $5. Test out the latest models now.
Access enterprise-grade open-source AI models including Llama 3, DeepSeek, Qwen, and more via our high-performance serverless API. Experience low-latency inference on the latest NVIDIA GPUs optimized for production workloads.
"Qubrid's medical OCR and research parsing cut our document extraction time in half. We now have traceable pipelines and reproducible outputs that meet our compliance requirements."
Clinical AI Team
Research & Clinical Intelligence
