Qwen3 Coder Flash API

Model IDqwen3-coder-flash

Input <= 32k

Save 20%

$0.43$0.35/1M

input tokens

$2.16$1.72/1M

output tokens

Cached $0.04$0.03/1M cached tokens

32k < Input <= 128k

Save 20%

$0.72$0.58/1M

input tokens

$3.59$2.88/1M

output tokens

Cached $0.07$0.06/1M cached tokens

128k < Input <= 256k

Save 20%

$1.15$0.92/1M

input tokens

$5.75$4.60/1M

output tokens

Cached $0.11$0.09/1M cached tokens

256k < Input <= 1m

Save 20%

$2.30$1.84/1M

input tokens

$13.80$11.04/1M

output tokens

Cached $0.23$0.18/1M cached tokens

Get API Key Open in Playground

from openai import OpenAI  # Initialize the OpenAI client with Qubrid base URL client = OpenAI(  base_url="https://platform.qubrid.com/v1",  api_key="QUBRID_API_KEY", )  stream = client.chat.completions.create(  model="Qwen/Qwen3-Coder-Flash",  messages=[  {  "role": "user",  "content": "Write a Python function to calculate fibonacci sequence"  }  ],  max_tokens=8962,  temperature=0.1,  top_p=1,  stream=True )  for chunk in stream:  if chunk.choices and chunk.choices[0].delta.content:  print(chunk.choices[0].delta.content, end="", flush=True)  print("\n")

Qwen3 Coder Flash

EnterprisePlatform Integration

Docker Support

Kubernetes Ready

SDK Libraries

Don't let your AI control you. Control your AI the Qubrid way!

Enterprise
Platform Integration