Model pricing

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 API Pricing

NVIDIA · 128K context

tools · structured-output · reasoning · budget
Provider prices

Pricing across providers

Prices are shown per 1M tokens with cache pricing, source links, observation dates, and confidence labels.

Provider                              Input / Output   Cached   Source type            Source   Observed     Confidence
DeepInfra (deepinfra · endpoint)      $0.40 / $0.40    -        OpenRouter endpoint    Source   2026-06-08   high
OpenRouter (openrouter · aggregator)  $0.40 / $0.40    -        OpenRouter aggregate   Source   2026-06-08   high
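
Because the prices above are quoted per 1M tokens, the cost of a single request has to be scaled down by the token counts actually used. The following is a minimal sketch of that arithmetic, not any provider's billing logic. The function name `request_cost` and the token counts are hypothetical, the default prices are the $0.40 / $0.40 figures listed above, and cached-token pricing is left out because no cached price is listed.

```python
from decimal import Decimal

# Prices are quoted per this many tokens.
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def request_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: Decimal = Decimal("0.40"),   # $ per 1M input tokens (from the table)
    output_price: Decimal = Decimal("0.40"),  # $ per 1M output tokens (from the table)
) -> Decimal:
    """Estimate the dollar cost of one request from per-1M-token prices.

    Hypothetical helper for illustration only; cached-token discounts
    are not modeled because the table lists no cached price.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_price * input_tokens / TOKENS_PER_PRICE_UNIT
    output_cost = output_price * output_tokens / TOKENS_PER_PRICE_UNIT
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative token counts: 12,000 input tokens and 3,000 output tokens.
    print(request_cost(12_000, 3_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when many requests are summed.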
Same organization

Related models

Other tracked models from NVIDIA.