Claude Opus 4.1 advances the state-of-the-art coding performance to 74.5% on SWE-bench Verified, and improves Claude's in-depth research and data analysis skills, especially around detail tracking and agentic search. GitHub notes that Claude Opus 4.1 improves across most capabilities relative to Opus 4, with particularly notable performance gains in multi-file code refactoring. Rakuten Group finds that Opus 4.1 excels at pinpointing exact corrections within large codebases without making unnecessary adjustments or introducing bugs.
Windsurf reports that Opus 4.1 delivers a one standard deviation improvement over Opus 4 on their junior developer benchmark, showing roughly the same performance leap as the jump from Sonnet 3.7 to Sonnet 4. To get started with Claude Opus 4.1, developers can simply use claud-opus-4-1-20250805 via the API. The system card, model page, pricing page, and docs are also available to learn more about this upgrade. Feedback from users is encouraged to help improve the model further.