Chat
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
Input/1M $0.10
Output/1M $0.50
Parameters 120B (12B active)
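For a rough sense of what these rates mean in practice, here is a minimal Python sketch that turns the listed per-million-token prices into a per-request cost and a token budget. The helper names `estimate_cost` and `tokens_for_budget` are hypothetical and not part of any Qubrid SDK. The sketch assumes billing is strictly linear in token count, with no minimums, rounding, or other fees.

```python
# Prices copied from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.50


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request (hypothetical helper, linear pricing assumed)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def tokens_for_budget(budget_usd: float, price_per_m: float) -> int:
    """Number of tokens a budget buys at a single per-1M rate (hypothetical helper)."""
    return round(budget_usd / price_per_m * 1_000_000)


if __name__ == "__main__":
    # A request with 2,000 prompt tokens and 500 completion tokens.
    print(f"${estimate_cost(2_000, 500):.5f} per request")
    # How far $1 goes if spent only on input or only on output.
    print(tokens_for_budget(1.0, INPUT_PRICE_PER_M), "input tokens per $1")
    print(tokens_for_budget(1.0, OUTPUT_PRICE_PER_M), "output tokens per $1")
```

Under these assumptions, $1 buys between 2M tokens (all output) and 10M tokens (all input) on this model, so the real figure depends on the prompt-to-completion ratio of the workload.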
One API for all models. Search our library, then deploy and run inference on NVIDIA GPUs in seconds.
Unlock $1 in free API credit on your first recharge and generate up to about 4M tokens.
Access enterprise-grade open-source AI models including Llama 3, DeepSeek, Qwen, and more via our high-performance serverless API. Experience low-latency inference on the latest NVIDIA GPUs optimized for production workloads.
"Qubrid enabled us to deploy production AI agents with reliable tool-calling and step tracing. We now ship agents faster with full visibility into every decision and API call."
AI Agents Team
Agent Systems & Orchestration