DeepSeek R1 0528

DeepSeek R1 0528 API

Released May 2025 | 128K-token context | 671B parameters (37B active)

Documentation

The DeepSeek R1 0528 API supports advanced mathematical reasoning, code generation and debugging, complex multi-step problem solving, research and analysis, JSON-structured output generation, and function calling with tool use.

DeepSeek-R1-0528 is the May 2025 update to the original DeepSeek-R1. It is built on the DeepSeek-V3 backbone, a mixture-of-experts (MoE) architecture with 671B total parameters, of which 37B are active per inference pass, and it achieves performance on par with OpenAI o1. Key improvements include 87.5% on AIME 2025 (up 17.5 percentage points from 70% for the previous version), reduced hallucinations, enhanced front-end capabilities, and newly added JSON output and function calling support.

It is optimized for production agent and assistant workloads where response quality, latency, and predictable operating cost all matter.
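The headline figures above are easier to compare once the arithmetic is spelled out. The short sketch below uses only the numbers quoted in the description; the variable names are illustrative. It shows that 37B active parameters is a small share of the 671B total, and that the AIME 2025 gain is 17.5 percentage points, which is a different quantity from the relative improvement.

```python
# Figures taken from the model description; variable names are illustrative.
TOTAL_PARAMS_B = 671    # total parameters, in billions
ACTIVE_PARAMS_B = 37    # parameters active per inference pass, in billions
AIME_PREVIOUS = 70.0    # AIME 2025 score of the previous version, in percent
AIME_CURRENT = 87.5     # AIME 2025 score of R1-0528, in percent

# Share of the network that is active for a single pass (MoE routing).
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

# Absolute gain in percentage points versus gain relative to the old score.
point_gain = AIME_CURRENT - AIME_PREVIOUS
relative_gain = point_gain / AIME_PREVIOUS

print(f"Active parameters per pass: {active_fraction:.1%}")
print(f"AIME 2025 gain: {point_gain:.1f} points ({relative_gain:.0%} relative)")
```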

from openai import OpenAI

# Initialize the OpenAI client with Qubrid base URL
client = OpenAI(
    base_url="https://platform.qubrid.com/v1",
    api_key="QUBRID_API_KEY",
)

stream = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-R1-0528",
    messages=[
        {
            "role": "user",
            "content": "Explain quantum computing in simple terms"
        }
    ],
    max_tokens=16384,
    temperature=0.6,
    top_p=0.95,
    stream=True
)

for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)

print("\n")
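The description lists function calling among the newly added features, so a tool-use round trip may look roughly like the sketch below. It assumes the Qubrid endpoint accepts the OpenAI-style `tools` parameter and returns `tool_calls` in the same shape as the OpenAI Chat Completions API; the page does not confirm this. The `get_weather` tool, its schema, and the helper names `run_tool_call` and `ask_with_tools` are hypothetical and exist only for illustration.

```python
import json

# Hypothetical tool: the name, schema, and stubbed result are illustrative only.
def get_weather(city: str) -> dict:
    return {"city": city, "forecast": "sunny"}  # stub, not a real lookup

# OpenAI-style tool description (assumed to be accepted by the endpoint).
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return a stubbed weather forecast for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

LOCAL_FUNCTIONS = {"get_weather": get_weather}

def run_tool_call(name: str, arguments_json: str) -> str:
    """Run the local function the model asked for and return its result as JSON."""
    args = json.loads(arguments_json)
    return json.dumps(LOCAL_FUNCTIONS[name](**args))

def ask_with_tools(client, question: str) -> str:
    """One round trip: the model may request a tool, we run it and send the result back."""
    messages = [{"role": "user", "content": question}]
    first = client.chat.completions.create(
        model="deepseek-ai/DeepSeek-R1-0528", messages=messages, tools=TOOLS
    )
    reply = first.choices[0].message
    if not reply.tool_calls:
        return reply.content  # the model answered directly
    messages.append(reply)
    for call in reply.tool_calls:
        messages.append({
            "role": "tool",
            "tool_call_id": call.id,
            "content": run_tool_call(call.function.name, call.function.arguments),
        })
    final = client.chat.completions.create(
        model="deepseek-ai/DeepSeek-R1-0528", messages=messages, tools=TOOLS
    )
    return final.choices[0].message.content
```

With a client configured as in the sample above, `ask_with_tools(client, "What is the weather in Paris?")` would return the model's final answer. The tool dispatch is kept in a separate function so it can be tested without network access.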

Serverless

API access

Input: $0.90 / 1M tokens
Cached input: $0.28 / 1M tokens
Output: $3.20 / 1M tokens
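For budgeting, the serverless prices above can be turned into a rough per-request estimate. The sketch below assumes that "/1M" means per million tokens and that cached input tokens are billed only at the cached rate, not additionally at the standard input rate; neither detail is spelled out on this page. The function name `estimate_cost` is illustrative.

```python
# Per-million-token prices in USD, copied from the serverless price list.
PRICE_PER_MILLION = {"input": 0.90, "cached_input": 0.28, "output": 3.20}

def estimate_cost(input_tokens: int, cached_input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost, assuming each token class is billed only at its own rate."""
    usage = {
        "input": input_tokens,
        "cached_input": cached_input_tokens,
        "output": output_tokens,
    }
    return sum(PRICE_PER_MILLION[kind] * count / 1_000_000 for kind, count in usage.items())

if __name__ == "__main__":
    # Example workload: 200K fresh input, 800K cached input, 50K output tokens.
    print(f"${estimate_cost(200_000, 800_000, 50_000):.3f}")
```

Because output tokens cost several times more than input tokens, long reasoning traces are likely to dominate the bill, so the `max_tokens` setting is worth choosing deliberately.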

Dedicated

Cloud GPU VM

Price starts at $1.25 / GPU / hr

Interactive

Playground

Input: $0.90 / 1M tokens
Cached input: $0.28 / 1M tokens
Output: $3.20 / 1M tokens

Enterprise
Platform Integration

Docker

Docker Support

Official Docker images for containerized deployments

Kubernetes

Kubernetes Ready

Production-grade K8s manifests and Helm charts

SDK

SDK Libraries

Official SDKs for Python, JavaScript, Go, and Java


"Qubrid enabled us to deploy production AI agents with reliable tool-calling and step tracing. We now ship agents faster with full visibility into every decision and API call."

AI Agents Team

Agent Systems & Orchestration