Pricing

Inference-first pricing with clear paths for dedicated capacity and GPU Workspaces.

Transparent pricing for every deployment path

Inference first. Dedicated endpoints for enterprise traffic. GPU Workspaces when you need to build before serving.

Monthly inference estimator

Uses model pricing from this repository's catalog. No synthetic cross-provider discount assumptions.

Selected model usage
$5,025.00

Monthly estimate

Dedicated endpoints
Coming soon

Reserved capacity pricing will be published at launch.

For proprietary-routed models, token rates are pass-through from provider list pricing; Brightnode adds routing, latency, and observability layers.

Managed model pricing (per 1M tokens)

Popular live models pulled from the repository model catalog.

FamilyModelRouteContextInputOutputRegionStatus
ClaudeClaude Sonnet 4Proprietary models200,000$3.00$15.00Singapore, Sydney, Tokyo, Thailand, Malaysia, Jakarta, New Zealand, Seoul, Taiwan, MumbaiLive
ClaudeClaude Haiku 4.5Proprietary models200,000$1.00$5.00Singapore, Jakarta, Malaysia, Thailand, Tokyo, Seoul, Taiwan, Mumbai, Sydney, New ZealandLive
LlamaLlama 3.3 70B InstructBrightnode-hosted131,072$0.22$0.50SingaporeLive
QwenQwen3 32BBrightnode-hosted131,072$0.10$1.20SingaporeLive
DeepseekDeepSeek V3Proprietary models163,840$0.60$1.74Jakarta, Singapore, Malaysia, Thailand, Tokyo, Seoul, Taiwan, Mumbai, Sydney, New ZealandLive
MistralMistral NemoBrightnode-hosted131,072$0.15$0.15SingaporeLive

Bnodes (GPU Workspaces)

Secondary product lane for fine-tuning and eval before deploy-to-inference.

GPUvRAMPrice fromRegionBest for
T416GB$0.50/hrSingaporeComfyUI, prototyping, lightweight model work
L424GB$0.87/hrSingaporeEmbedding pipelines and medium-size inference tests
A10080GB$4.01/hrSingaporeFine-tuning, evaluation suites, 70B+ experimentation
H10080GB$14.29/hrSingaporeHeavy training and high-throughput pre-production validation
B200180GBOn requestSingaporeFrontier-scale workloads with reserved capacity

Dedicated endpoints (coming soon)

Enterprise-grade reserved capacity with regional deployment control.

A100 Endpoint

A100 80GB · Singapore

Dedicated throughput for production chat and agents

Coming soon

H100 Endpoint

H100 80GB · Singapore

High-throughput enterprise inference and heavy traffic

Coming soon

Pay per second. No hourly minimums. No commitments.

Network egress: free within APAC regions

Persistent storage: $0.044/GB/month

Try It Risk-Free

Deploy Your First Workload in 60 Seconds

No credit card required
$100 trial credit on signup
Deploy in 60 seconds
Pre-configured workloads ready to run
Delete anytime
One click to stop, pay only for what you use
Deploy Your Workload