Skip to main content

NVIDIA coding models

9 free NIM models, curated for Claude Code — map any to Sonnet, Opus, or Haiku.

Model Slot Mapping

How Claude Code maps to NVIDIA models

Claude Code relies on internal sub-slots (Sonnet, Opus, and Haiku) for optimal tooling workflows. This chart outlines exactly how to pipe those Anthropic models into free NVIDIA endpoints for top-tier performance.

Claude SlotCLI CommandTarget NVIDIA ModelOptimization
Sonnet (default)/model claude-sonnet-4-6mistralai/mistral-medium-3.5-128bDaily coding — fast and reliable
Opus (powerful)/model claude-opus-4-6deepseek-ai/deepseek-v4-proComplex reasoning & multi-file work
Haiku (quick)/model claude-haiku-4-5deepseek-ai/deepseek-v4-flashBackground tasks — fast
Specialty — Mistral/model claude-mistralmistralai/mistral-medium-3.5-128bFast general coding alternative
Specialty — DeepSeek/model claude-deepseekdeepseek-ai/deepseek-v4-proDeep reasoning (explicit access)
Specialty — DeepSeek Flash/model claude-deepseek-flashdeepseek-ai/deepseek-v4-flashFast reasoning, 1M context
Specialty — GLM/model claude-glmz-ai/glm-5.1Long agentic sessions
Specialty — MiniMax/model claude-minimaxminimaxai/minimax-m3General purpose coding + vision
Specialty — Gemma/model claude-gemmagoogle/gemma-4-31b-itVision + code (screenshots, UI work)
Specialty — Step/model claude-stepstepfun-ai/step-3.7-flashHeavy reasoning specialist (slow)
Specialty — Kimi/model claude-kimimoonshotai/kimi-k2.6Vision specialist — use sparingly
Specialty — Nemotron/model claude-nemotronnvidia/nemotron-3-ultra-550b-a55bNVIDIA flagship — complex instruction following

Available Models

Free NVIDIA NIM computing

Browse the full registry of open-weights models available on NVIDIA's platform compatible through your LiteLLM proxy.

Mistral logo

Mistral

Mistral Medium 3.5

Recommended Default

NIM API Target

mistralai/mistral-medium-3.5-128b

Performance

⚡ Very Fast (~0.6s)

Coding Rating

⭐⭐⭐⭐

Ideal use case: Daily driver — fast, clean English

DeepSeek logo

DeepSeek

DeepSeek V4 Pro

Top Tier

NIM API Target

deepseek-ai/deepseek-v4-pro

Performance

⚡ Fast (~6s)

Coding Rating

⭐⭐⭐⭐⭐

Ideal use case: Deep reasoning, hard bugs, multi-file work

DeepSeek logo

DeepSeek

DeepSeek V4 Flash

Fast & Efficient

NIM API Target

deepseek-ai/deepseek-v4-flash

Performance

⚡ Fast

Coding Rating

⭐⭐⭐⭐

Ideal use case: Fast coding & background tasks

Google logo

Google

Gemma 4 31B

Fast & Efficient

NIM API Target

google/gemma-4-31b-it

Performance

⚡ Very Fast

Coding Rating

⭐⭐⭐⭐

Ideal use case: Vision + fast coding, screenshots/UI

Z.AI

GLM 5.1

Agentic

NIM API Target

z-ai/glm-5.1

Performance

🔵 Medium

Coding Rating

⭐⭐⭐⭐

Ideal use case: Long agentic sessions, tool-heavy workflows

MiniMax logo

MiniMax

MiniMax M3

Fast & Efficient

NIM API Target

minimaxai/minimax-m3

Performance

⚡ Fast

Coding Rating

⭐⭐⭐⭐

Ideal use case: General coding + vision, strong reasoning

StepFun

Step 3.7 Flash

Reasoning & Logic

NIM API Target

stepfun-ai/step-3.7-flash

Performance

🔵 Medium (~2.6s, reasoning-heavy)

Coding Rating

⭐⭐⭐⭐

Ideal use case: Reasoning specialist — switch to explicitly

Moonshot AI logo

Moonshot AI

Kimi K2.6

Vision & Creative

NIM API Target

moonshotai/kimi-k2.6

Performance

🐢 Slow (can exceed 2min on free tier)

Coding Rating

⭐⭐⭐⭐⭐

Ideal use case: Vision specialist — heavy, use sparingly

NVIDIA logo

NVIDIA

Nemotron 3 Ultra 550B

Top Tier

NIM API Target

nvidia/nemotron-3-ultra-550b-a55b

Performance

🔵 Medium

Coding Rating

⭐⭐⭐⭐⭐

Ideal use case: NVIDIA's flagship — complex reasoning & instruction following

Coding Capability Rating

⭐⭐⭐⭐⭐

Exceptional for agentic coding & complex refactors

⭐⭐⭐⭐

Reliable fallback and fast general generation

⭐⭐⭐

Basic coding support and documentation

Stay Updated

New models added daily

NVIDIA NIM constantly adds new models. Visit the official registry to explore the full catalog and discover the latest models for your coding needs.

Explore All Models