Claude Fable 5 and Claude Opus 4.8 are both high-end Anthropic models, but they sit in different tiers. Fable 5 is the first generally available Mythos-class model — a capability level above Opus — while Opus 4.8 remains Anthropic’s flagship of the previous generation and is still excellent for complex reasoning and agentic coding.

Quick answer: Claude Fable 5 outperforms Claude Opus 4.8 on every benchmark Anthropic published, with the biggest gap on long-horizon agentic coding: 80.3% vs 69.2% on SWE-bench Pro and 29.3% vs 13.4% on FrontierCode Diamond. The trade-off is cost and compliance: Fable 5 is priced at $10/$50 per million input/output tokens (double Opus 4.8’s $5/$25) and carries a 30-day data-retention requirement, while Opus 4.8 supports zero data retention. Switch to Fable 5 when cheaper models fail or need too much human rescue — not by default.

This is the deep-dive companion to our full Claude Fable 5 guide, which covers release details, API access, safeguards, and use cases.

Fable 5 vs Opus 4.8 at a Glance

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.
Comparison Claude Fable 5 Claude Opus 4.8
Model class Mythos-class (tier above Opus) Opus-class flagship
Release date June 9, 2026 Previous generation
API model ID claude-fable-5 claude-opus-4-8
Input price $10 / MTok $5 / MTok
Output price $50 / MTok $25 / MTok
Context window 1M tokens Smaller
Max output 128k tokens Lower
Thinking mode Adaptive thinking, always on (effort parameter) Configurable thinking
Safeguard fallback Flagged cyber/bio/chem/distillation requests route to Opus 4.8 No routing layer
Data retention Covered model, 30-day retention Zero data retention available
Best fit Hardest long-horizon, agentic, long-context work Complex reasoning where cost and ZDR matter

Benchmarks: Fable 5 vs Opus 4.8

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.

Anthropic published a launch-day benchmark table comparing the new Mythos-class models against Opus 4.8 and competing frontier models. The deltas over Opus 4.8 are unusually large for a single generation.

Benchmark Claude Fable 5 Claude Opus 4.8 Delta
SWE-bench Pro (agentic coding) 80.3% 69.2% +11.1 pts
FrontierCode Diamond (Cognition) 29.3% 13.4% +15.9 pts (2.2×)
Terminal-Bench 2.1 88.0% 82.7% +5.3 pts
GDPval-AA (knowledge-work Elo) 1932 1890 +42 Elo
SWE-bench Verified 95.0% approaching ceiling

A few notes on reading these numbers correctly:

  • SWE-bench Pro is the headline. SWE-bench Verified is close to saturation across frontier models, so the +11-point gap on the harder Pro variant — end-to-end resolution of real GitHub issues — is the more meaningful coding signal. For context, the Fable-over-Opus gap is larger than the gap between Opus 4.8 and Gemini 3.1 Pro (54.2%).
  • FrontierCode Diamond measures maintainable code, not just passing tests. Fable 5 more than doubles Opus 4.8 here, and Anthropic reports it leads frontier models even at medium reasoning effort — relevant if you plan to run the effort parameter below maximum to control cost.
  • Token efficiency matters as much as raw scores. Anthropic and GitHub both report that Fable 5 completes equivalent autonomous coding work with fewer tool calls and lower token consumption than Opus-tier models. On agentic workloads, fewer turns can partially offset the 2× per-token price.
  • The safeguard asterisk. On benchmarks touching cybersecurity or biology, published scores reflect the restricted Claude Mythos 5; the deployable Fable 5 performs closer to Opus 4.8 in those domains because flagged requests fall back to Opus. Outside safeguarded domains, Fable 5 and Mythos 5 share the same underlying model and the same scores.

Early production anecdotes back the benchmarks: Stripe reported a migration in a 50-million-line Ruby codebase completed in one day that was estimated at two-plus months of team effort. Treat all vendor-published results as directional and validate on your own tasks.

Pricing: What the 2× Premium Actually Costs

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.

Fable 5 doubles Opus 4.8 on every line of the rate card:

Rate Claude Fable 5 Claude Opus 4.8
Base input $10 / MTok $5 / MTok
Output $50 / MTok $25 / MTok
Batch API input $5 / MTok $2.50 / MTok
Batch API output $25 / MTok $12.50 / MTok
Cache hits $1 / MTok $0.50 / MTok

The naive read is “Fable costs twice as much.” The realistic read depends on workload shape:

  • Single-shot, short tasks: the 2× premium is pure overhead. Opus 4.8 (or Sonnet 4.6) wins on cost-performance.
  • Multi-turn agentic tasks: if Fable 5 finishes in fewer turns with fewer tool calls and less repeated exploration, total tokens per completed task can land closer to parity — and the success rate is part of the math. A failed Opus run that needs a human rescue costs more than its token bill.
  • Long-context analysis: a near-1M-token prompt at $10/MTok is a $10 input bill per call before output. Use prompt caching aggressively for repeated context, and reserve full-context calls for work where missing a detail is expensive.

The rational routing rule: start with the cheapest model that reliably completes the job, and promote tasks to Fable 5 only when Opus 4.8 demonstrably fails, loses the plan mid-task, or burns more total tokens through retries.

Data Retention: The Hidden Switching Cost

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.

This is the difference most comparison tables miss. Claude Opus 4.8 is available under zero data retention (ZDR). Claude Fable 5 is a covered model with a 30-day data-retention requirement, because Anthropic needs retained prompts and outputs to operate the safety classifiers. Retained data is not used to train Anthropic’s models, but it is retained.

For individual developers this is usually a non-issue. For enterprises it is a procurement gate: if your contracts, regulators, or clients require ZDR for source code, personal data, or confidential material, Fable 5 may be blocked regardless of how good the benchmarks are. The same constraint follows the model into GitHub Copilot — Fable 5 is the only Claude model there that requires retention, as we cover in Claude Fable 5 in GitHub Copilot.

Behavior Differences That Affect Production Code

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.

Swapping claude-opus-4-8 for claude-fable-5 is not a drop-in change. Three behaviors differ:

  1. Adaptive thinking is always on. You cannot disable thinking on Fable 5; you control depth with the effort parameter. Budget for thinking tokens in cost projections.
  2. Refusals arrive as HTTP 200. A safeguard-flagged request can return a successful response with stop_reason: "refusal", or be answered by Opus 4.8 via fallback. Production code must check stop reasons instead of treating every 200 as a completed Fable answer. Anthropic reports fallback affects under 5% of sessions for typical workloads, and you are not billed Fable prices for Opus-fallback responses.
  3. Raw chain-of-thought is not returned. If any Opus-era tooling parsed reasoning text, migrate it to summarized thinking before switching.

When to Use Which

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.
Scenario Pick Why
Routine code edits, helpers, explanations Opus 4.8 or Sonnet 4.6 Fable 5 is overkill at 2× the price
Large codebase migration or multi-repo refactor Fable 5 Largest measured capability gap
Long-running autonomous agent tasks Fable 5 Fewer turns, better plan retention
Budget-capped, high-volume pipelines Opus 4.8 Per-token economics dominate
ZDR-mandated data Opus 4.8 Fable 5 requires 30-day retention
Work near cyber/bio boundaries Opus 4.8 (or vetted Mythos access) Fable 5 routes those requests to Opus anyway
1M-token document or research analysis Fable 5 Context window plus reasoning gains

Migration Checklist: Opus 4.8 → Fable 5

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.
  1. Benchmark 10–20 of your real tasks on both models with identical prompts; score completion rate, edits required, turns, tool calls, and total cost per completed task — not vibes.
  2. Confirm 30-day retention is acceptable for every data category in the workload.
  3. Add refusal and fallback handling (stop_reason: "refusal", fallback metadata) before flipping traffic.
  4. Re-tune the effort parameter; medium effort may already beat Opus on your tasks at lower cost.
  5. Route by task class: keep Sonnet/Opus for routine work, send only the failure-prone tail to Fable 5.

FAQ

Try it in practice Make this section actionable Practice the workflow instead of only comparing tools.
Is Claude Fable 5 better than Opus 4.8?
On Anthropic’s published benchmarks, yes — Claude Fable 5 leads Claude Opus 4.8 on every reported test, most notably 80.3% vs 69.2% on SWE-bench Pro and 29.3% vs 13.4% on FrontierCode Diamond. Opus 4.8 remains the better choice when cost, zero data retention, or routine workloads dominate.
How much more does Claude Fable 5 cost than Opus 4.8?
Exactly double on the rate card: $10 vs $5 per million input tokens and $50 vs $25 per million output tokens, with the same 2× ratio on batch and caching rates. On agentic tasks, Fable 5’s better token efficiency and higher success rate can narrow the effective gap.
Does Claude Opus 4.8 support zero data retention while Fable 5 does not?
Yes. Opus 4.8 can operate under zero data retention. Fable 5 is a covered model with a 30-day retention requirement needed to run Anthropic’s safety classifiers; retained data is not used for training.
Why did my Claude Fable 5 request get answered by Opus 4.8?
Fable 5’s safeguards route flagged cybersecurity, biology, chemistry, and distillation-related requests to Claude Opus 4.8 instead. Anthropic says this affects a small minority of sessions, and Opus-fallback responses are not billed at Fable prices.
Should I migrate my Opus 4.8 workloads to Fable 5?
Only the failure-prone tail. Benchmark both models on your real tasks, keep Opus 4.8 (or Sonnet 4.6) for work it already completes reliably, and move long-horizon, long-context, or repeatedly failing tasks to Fable 5 where the higher success rate justifies the price.

For the full picture — release details, API specs, safeguards, and use cases — go back to the main Claude Fable 5 guide. For coding-workflow context, see Claude Code vs Codex and Claude vs ChatGPT for coding.