← Back to model catalog

Qwen3 32B

Provider
Brightnode-hosted
Context
131,072 tokens
Pricing
$0.10 / 1M input · $1.20 / 1M output
APAC regions
Singapore
Residency
in-region
Task
Chat
APAC performance

Latency profile

SingaporeTokyoSydney
TTFT p50314539
TTFT p95497263
E2E latency p50688477
E2E latency p95104129117
Pricing

Input: $0.10 per 1M tokens

Output: $1.20 per 1M tokens

Billing: Per-token, charged against wallet balance

Dedicated endpoint option
  • A100 80GB: $4.01/hr (Singapore)
  • H100 80GB: $14.29/hr (Singapore)
View full pricing and calculator →
Capabilities

Reasoning-focused large model tuned for structured analysis, tool use, and long prompts in multilingual production settings.

Best for: Reasoning tasks, Agent tool use, Code generation, Long-context workflows

Quickstart code snippets
Python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.brightnode.cloud/v1",
    api_key="YOUR_BRIGHTNODE_API_KEY",
)

response = client.chat.completions.create(
    model="Qwen/Qwen3-32B",
    messages=[{"role": "user", "content": "Hello from APAC"}],
)
Node
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.brightnode.cloud/v1",
  apiKey: process.env.BRIGHTNODE_API_KEY,
});

const response = await client.chat.completions.create({
  model: "Qwen/Qwen3-32B",
  messages: [{ role: "user", content: "Hello from APAC" }],
});
Other models on Brightnode