Claude Fable 5 and Claude Opus 4.8 are both high-end Anthropic models, but they sit in different tiers. Fable 5 is the first generally available Mythos-class model — a capability level above Opus — while Opus 4.8 remains Anthropic’s flagship of the previous generation and is still excellent for complex reasoning and agentic coding.
Quick answer: Claude Fable 5 outperforms Claude Opus 4.8 on every benchmark Anthropic published, with the biggest gap on long-horizon agentic coding: 80.3% vs 69.2% on SWE-bench Pro and 29.3% vs 13.4% on FrontierCode Diamond. The trade-off is cost and compliance: Fable 5 is priced at $10/$50 per million input/output tokens (double Opus 4.8’s $5/$25) and carries a 30-day data-retention requirement, while Opus 4.8 supports zero data retention. Switch to Fable 5 when cheaper models fail or need too much human rescue — not by default.
This is the deep-dive companion to our full Claude Fable 5 guide, which covers release details, API access, safeguards, and use cases.
Fable 5 vs Opus 4.8 at a Glance
| Comparison | Claude Fable 5 | Claude Opus 4.8 |
|---|---|---|
| Model class | Mythos-class (tier above Opus) | Opus-class flagship |
| Release date | June 9, 2026 | Previous generation |
| API model ID | claude-fable-5 |
claude-opus-4-8 |
| Input price | $10 / MTok | $5 / MTok |
| Output price | $50 / MTok | $25 / MTok |
| Context window | 1M tokens | Smaller |
| Max output | 128k tokens | Lower |
| Thinking mode | Adaptive thinking, always on (effort parameter) |
Configurable thinking |
| Safeguard fallback | Flagged cyber/bio/chem/distillation requests route to Opus 4.8 | No routing layer |
| Data retention | Covered model, 30-day retention | Zero data retention available |
| Best fit | Hardest long-horizon, agentic, long-context work | Complex reasoning where cost and ZDR matter |
Benchmarks: Fable 5 vs Opus 4.8
Anthropic published a launch-day benchmark table comparing the new Mythos-class models against Opus 4.8 and competing frontier models. The deltas over Opus 4.8 are unusually large for a single generation.
| Benchmark | Claude Fable 5 | Claude Opus 4.8 | Delta |
|---|---|---|---|
| SWE-bench Pro (agentic coding) | 80.3% | 69.2% | +11.1 pts |
| FrontierCode Diamond (Cognition) | 29.3% | 13.4% | +15.9 pts (2.2×) |
| Terminal-Bench 2.1 | 88.0% | 82.7% | +5.3 pts |
| GDPval-AA (knowledge-work Elo) | 1932 | 1890 | +42 Elo |
| SWE-bench Verified | 95.0% | approaching ceiling | — |
A few notes on reading these numbers correctly:
- SWE-bench Pro is the headline. SWE-bench Verified is close to saturation across frontier models, so the +11-point gap on the harder Pro variant — end-to-end resolution of real GitHub issues — is the more meaningful coding signal. For context, the Fable-over-Opus gap is larger than the gap between Opus 4.8 and Gemini 3.1 Pro (54.2%).
- FrontierCode Diamond measures maintainable code, not just passing tests. Fable 5 more than doubles Opus 4.8 here, and Anthropic reports it leads frontier models even at medium reasoning effort — relevant if you plan to run the
effortparameter below maximum to control cost. - Token efficiency matters as much as raw scores. Anthropic and GitHub both report that Fable 5 completes equivalent autonomous coding work with fewer tool calls and lower token consumption than Opus-tier models. On agentic workloads, fewer turns can partially offset the 2× per-token price.
- The safeguard asterisk. On benchmarks touching cybersecurity or biology, published scores reflect the restricted Claude Mythos 5; the deployable Fable 5 performs closer to Opus 4.8 in those domains because flagged requests fall back to Opus. Outside safeguarded domains, Fable 5 and Mythos 5 share the same underlying model and the same scores.
Early production anecdotes back the benchmarks: Stripe reported a migration in a 50-million-line Ruby codebase completed in one day that was estimated at two-plus months of team effort. Treat all vendor-published results as directional and validate on your own tasks.
Pricing: What the 2× Premium Actually Costs
Fable 5 doubles Opus 4.8 on every line of the rate card:
| Rate | Claude Fable 5 | Claude Opus 4.8 |
|---|---|---|
| Base input | $10 / MTok | $5 / MTok |
| Output | $50 / MTok | $25 / MTok |
| Batch API input | $5 / MTok | $2.50 / MTok |
| Batch API output | $25 / MTok | $12.50 / MTok |
| Cache hits | $1 / MTok | $0.50 / MTok |
The naive read is “Fable costs twice as much.” The realistic read depends on workload shape:
- Single-shot, short tasks: the 2× premium is pure overhead. Opus 4.8 (or Sonnet 4.6) wins on cost-performance.
- Multi-turn agentic tasks: if Fable 5 finishes in fewer turns with fewer tool calls and less repeated exploration, total tokens per completed task can land closer to parity — and the success rate is part of the math. A failed Opus run that needs a human rescue costs more than its token bill.
- Long-context analysis: a near-1M-token prompt at $10/MTok is a $10 input bill per call before output. Use prompt caching aggressively for repeated context, and reserve full-context calls for work where missing a detail is expensive.
The rational routing rule: start with the cheapest model that reliably completes the job, and promote tasks to Fable 5 only when Opus 4.8 demonstrably fails, loses the plan mid-task, or burns more total tokens through retries.
Data Retention: The Hidden Switching Cost
This is the difference most comparison tables miss. Claude Opus 4.8 is available under zero data retention (ZDR). Claude Fable 5 is a covered model with a 30-day data-retention requirement, because Anthropic needs retained prompts and outputs to operate the safety classifiers. Retained data is not used to train Anthropic’s models, but it is retained.
For individual developers this is usually a non-issue. For enterprises it is a procurement gate: if your contracts, regulators, or clients require ZDR for source code, personal data, or confidential material, Fable 5 may be blocked regardless of how good the benchmarks are. The same constraint follows the model into GitHub Copilot — Fable 5 is the only Claude model there that requires retention, as we cover in Claude Fable 5 in GitHub Copilot.
Behavior Differences That Affect Production Code
Swapping claude-opus-4-8 for claude-fable-5 is not a drop-in change. Three behaviors differ:
- Adaptive thinking is always on. You cannot disable thinking on Fable 5; you control depth with the
effortparameter. Budget for thinking tokens in cost projections. - Refusals arrive as HTTP 200. A safeguard-flagged request can return a successful response with
stop_reason: "refusal", or be answered by Opus 4.8 via fallback. Production code must check stop reasons instead of treating every 200 as a completed Fable answer. Anthropic reports fallback affects under 5% of sessions for typical workloads, and you are not billed Fable prices for Opus-fallback responses. - Raw chain-of-thought is not returned. If any Opus-era tooling parsed reasoning text, migrate it to summarized thinking before switching.
When to Use Which
| Scenario | Pick | Why |
|---|---|---|
| Routine code edits, helpers, explanations | Opus 4.8 or Sonnet 4.6 | Fable 5 is overkill at 2× the price |
| Large codebase migration or multi-repo refactor | Fable 5 | Largest measured capability gap |
| Long-running autonomous agent tasks | Fable 5 | Fewer turns, better plan retention |
| Budget-capped, high-volume pipelines | Opus 4.8 | Per-token economics dominate |
| ZDR-mandated data | Opus 4.8 | Fable 5 requires 30-day retention |
| Work near cyber/bio boundaries | Opus 4.8 (or vetted Mythos access) | Fable 5 routes those requests to Opus anyway |
| 1M-token document or research analysis | Fable 5 | Context window plus reasoning gains |
Migration Checklist: Opus 4.8 → Fable 5
- Benchmark 10–20 of your real tasks on both models with identical prompts; score completion rate, edits required, turns, tool calls, and total cost per completed task — not vibes.
- Confirm 30-day retention is acceptable for every data category in the workload.
- Add refusal and fallback handling (
stop_reason: "refusal", fallback metadata) before flipping traffic. - Re-tune the
effortparameter; medium effort may already beat Opus on your tasks at lower cost. - Route by task class: keep Sonnet/Opus for routine work, send only the failure-prone tail to Fable 5.
FAQ
Is Claude Fable 5 better than Opus 4.8?
How much more does Claude Fable 5 cost than Opus 4.8?
Does Claude Opus 4.8 support zero data retention while Fable 5 does not?
Why did my Claude Fable 5 request get answered by Opus 4.8?
Should I migrate my Opus 4.8 workloads to Fable 5?
For the full picture — release details, API specs, safeguards, and use cases — go back to the main Claude Fable 5 guide. For coding-workflow context, see Claude Code vs Codex and Claude vs ChatGPT for coding.