NVIDIA coding models

9 free NIM models, curated for Claude Code — map any to Sonnet, Opus, or Haiku.

Model Slot Mapping

How Claude Code maps to NVIDIA models

Claude Code relies on internal sub-slots (Sonnet, Opus, and Haiku) for optimal tooling workflows. This chart outlines exactly how to pipe those Anthropic models into free NVIDIA endpoints for top-tier performance.

Claude Slot	CLI Command	Target NVIDIA Model	Optimization
Sonnet (default)	/model claude-sonnet-4-6	mistralai/mistral-medium-3.5-128b	Daily coding — fast and reliable
Opus (powerful)	/model claude-opus-4-6	deepseek-ai/deepseek-v4-pro	Complex reasoning & multi-file work
Haiku (quick)	/model claude-haiku-4-5	deepseek-ai/deepseek-v4-flash	Background tasks — fast
Specialty — Mistral	/model claude-mistral	mistralai/mistral-medium-3.5-128b	Fast general coding alternative
Specialty — DeepSeek	/model claude-deepseek	deepseek-ai/deepseek-v4-pro	Deep reasoning (explicit access)
Specialty — GLM	/model claude-glm	z-ai/glm-5.2	Long agentic sessions
Specialty — MiniMax	/model claude-minimax	minimaxai/minimax-m3	General purpose coding + vision
Specialty — Nemotron	/model claude-nemotron	nvidia/nemotron-3-ultra-550b-a55b	NVIDIA flagship — complex instruction following

Available Models

Free NVIDIA NIM computing

Browse the full registry of open-weights models available on NVIDIA's platform compatible through your LiteLLM proxy.

Mistral

Mistral Medium 3.5

Recommended Default

NIM API Target

mistralai/mistral-medium-3.5-128b

Performance

⚡ Very Fast (~0.6s)

Coding Rating

⭐⭐⭐⭐

Ideal use case: Daily driver — fast, clean English

DeepSeek

DeepSeek V4 Pro

Top Tier

NIM API Target

deepseek-ai/deepseek-v4-pro

Performance

⚡ Fast (~6s)

Coding Rating

⭐⭐⭐⭐⭐

Ideal use case: Deep reasoning, hard bugs, multi-file work

DeepSeek

DeepSeek V4 Flash

Fast & Efficient

NIM API Target

deepseek-ai/deepseek-v4-flash

Performance

⚡ Fast

Coding Rating

⭐⭐⭐⭐

Ideal use case: Fast coding & background tasks

Z.AI

GLM 5.2

Agentic

NIM API Target

z-ai/glm-5.2

Performance

🔵 Medium

Coding Rating

⭐⭐⭐⭐

Ideal use case: Long agentic sessions, tool-heavy workflows

MiniMax

MiniMax M3

Fast & Efficient

NIM API Target

minimaxai/minimax-m3

Performance

⚡ Fast

Coding Rating

⭐⭐⭐⭐

Ideal use case: General coding + vision, strong reasoning

NVIDIA

Nemotron 3 Ultra 550B

Top Tier

NIM API Target

nvidia/nemotron-3-ultra-550b-a55b

Performance

🔵 Medium

Coding Rating

⭐⭐⭐⭐⭐

Ideal use case: NVIDIA's flagship — complex reasoning & instruction following

Coding Capability Rating

⭐⭐⭐⭐⭐

Exceptional for agentic coding & complex refactors

⭐⭐⭐⭐

Reliable fallback and fast general generation

⭐⭐⭐

Basic coding support and documentation

Stay Updated

New models added daily

NVIDIA NIM constantly adds new models. Visit the official registry to explore the full catalog and discover the latest models for your coding needs.

Explore All Models