Comparisons

Opus vs Sonnet vs Haiku: Which Claude Model? (2026)

Comparison4 min readUpdated June 13, 2026

At a glance

Anthropic ships Claude in three sizes, and picking the right one is the simplest, biggest lever on both quality and your bill. Opus is the most capable model for the hardest reasoning, Sonnet is the balanced workhorse that handles most coding and agent tasks well, and Haiku is the fast, cheap model for high-volume and latency-sensitive work. They share the same core abilities and, in 2026, a 1M-token context at standard pricing; what changes is depth of reasoning, speed and cost per token. This page compares them honestly so you can match the model to the task instead of paying for Opus on work Haiku would nail, or sending a genuinely hard problem to Haiku and getting frustrated. It pairs with our Choosing an AI Model article. Prices are list rates as of 2026 and move over time.

The options

Claude Opus

The most capable Claude, for the hardest problems.

Best for: Complex reasoning, tricky multi-file refactors, architecture and the agentic tasks where quality matters most.

Strengths

  • Highest reasoning quality and the strongest results on hard, multi-step coding tasks.
  • 1M-token context at standard pricing in 2026 for large-codebase work.
  • The model to reach for when getting it right the first time saves the most time.

Trade-offs

  • Most expensive per token (about USD 5 input / USD 25 output per million in 2026).
  • Slower than Sonnet and Haiku, so it is overkill for simple, high-volume work.

Claude Sonnet

The balanced workhorse for everyday building.

Best for: Most day-to-day coding, agent loops and content work where you want strong quality at a sensible price.

Strengths

  • Excellent quality-to-cost ratio; handles the large majority of real coding tasks well.
  • Faster and far cheaper than Opus (about USD 3 input / USD 15 output per million in 2026).
  • The sensible default for most Claude Code sessions and agent workflows.

Trade-offs

  • Can fall short of Opus on the very hardest reasoning or gnarly refactors.
  • Pricier than Haiku for simple, bulk tasks that do not need its depth.

Claude Haiku

The fast, cheap model for volume and latency.

Best for: High-volume classification, extraction, simple edits, subagent side work and anything latency-sensitive.

Strengths

  • Cheapest per token (about USD 1 input / USD 5 output per million in 2026) and the fastest.
  • Ideal for high-throughput pipelines, simple tasks and read-only subagents.
  • Great cost lever: route narrow work to Haiku while a stronger model leads.

Trade-offs

  • Less capable on deep reasoning and complex, multi-step coding.
  • Can need more supervision or retries on ambiguous or hard tasks.

Side by side

DimensionClaude OpusClaude SonnetClaude Haiku
TierMost capableBalancedFastest and cheapest
Best forHardest reasoning, architecture, complex refactorsMost coding and agent workHigh volume, simple tasks, latency-sensitive work
Relative capabilityHighestHighGood for its size
Relative speedSlowestFastFastest
Input price per million (as of 2026)About USD 5About USD 3About USD 1
Output price per million (as of 2026)About USD 25About USD 15About USD 5
Context window (2026)1M tokens at standard pricing1M tokens at standard pricingLarge context

The verdict

Make Sonnet your default: it handles the large majority of coding and agent work at a strong quality-to-cost ratio, which is why Claude Code defaults to it. Reach for Opus on the genuinely hard problems, complex reasoning, tricky refactors and architecture, where its extra depth pays for itself by getting things right the first time. Drop to Haiku for high-volume, simple or latency-sensitive work like classification, extraction and read-only subagents, where speed and cost matter more than peak reasoning. A great pattern is to mix them: a stronger model leads while Haiku does the cheap, narrow side work. To cut spend further, use prompt caching for repeated context and batch processing for non-urgent jobs. When in doubt, start on Sonnet and only move up or down when a task clearly demands it.

Frequently asked questions

Next step

Ready to put AI to work as a real workflow?

Start with the foundations course, keep your progress locally and sync everything to your free account whenever you like.