Skip to content

Claude Opus 4.7: Everything New You Need to Know

A clear, beginner-friendly breakdown of Claude Opus 4.7's new features, breaking changes, and behavior updates, including high-res image support, task budgets, adaptive thinking, and migration tips.

Claude Opus 4.7.

If you've been using Claude for anything serious, such as building agents, processing documents, or handling complex reasoning tasks, you know how much the model version matters. A new release can mean faster results, smarter outputs, and fewer workarounds in your code.

Claude Opus 4.7 is Anthropic's most capable generally available model yet. It's built for long, autonomous tasks, better vision, and smarter memory. But it also comes with real breaking changes that will affect your existing API code if you don't update it.

This post cuts through the official docs and gives you only what matters: what's new, what broke, what changed in behavior, and how to migrate cleanly.


What Is Claude Opus 4.7?

Claude Opus 4.7 (claude-opus-4-7) is the flagship model for complex reasoning and agentic coding. It supports:

  • A 1 million token context window
  • 128k max output tokens
  • Adaptive thinking
  • All tools and platform features from Opus 4.6

It's designed for long-horizon tasks where the model needs to plan, act, and self-correct over many steps.


New Features

High-Resolution Image Support

Opus 4.7 is the first Claude model to support high-resolution images. The maximum resolution jumped from 1568px / 1.15MP to 2576px / 3.75MP.

This matters most for:

  • Computer use (screenshots, UI interaction)
  • Document and artifact understanding
  • Image localization and bounding-box detection

One practical improvement: coordinates are now 1:1 with actual pixels. You no longer need to apply a scale factor when mapping pixel positions.

Heads up: High-res images consume more tokens. If you don't need the extra detail, downsample your images before sending them to avoid unnecessary cost.


New xhigh Effort Level

The effort parameter lets you trade model intelligence for speed and cost. Opus 4.7 adds a new xhigh level at the top.

Effort LevelBest For
xhighCoding, agentic tasks
highMost intelligence-sensitive tasks
mediumBalanced speed and quality
lowFast, low-cost tasks

This is available in the Messages API only. If you use Claude Managed Agents, effort is handled automatically.


Task Budgets (Beta)

Task budgets let you tell Claude roughly how many tokens to use across an entire agentic loop, including thinking, tool calls, and output. Claude sees a running countdown and adjusts its work accordingly.

To use it, set the beta header and add task_budget to your output_config:

python
response = client.beta.messages.create(
    model="claude-opus-4-7",
    max_tokens=128000,
    output_config={
        "effort": "high",
        "task_budget": {"type": "tokens", "total": 128000},
    },
    messages=[
        {"role": "user", "content": "Review the codebase and propose a refactor plan."}
    ],
    betas=["task-budgets-2026-03-13"],
)

Key distinctions:

ParameterTypeWhat It Does
max_tokensHard limitCaps tokens per request, model is not aware of it
task_budgetAdvisoryGuides the model to self-limit across the full loop

Use task_budget when you want the model to manage its own scope. Use max_tokens as a safety ceiling. Minimum task budget is 20k tokens.


Breaking Changes (Messages API Only)

These will cause errors in your existing code if not addressed. If you use Claude Managed Agents, none of these apply.

Extended Thinking Budgets Removed

Setting budget_tokens in the thinking config now returns a 400 error.

python
# Before (Opus 4.6)
thinking = {"type": "enabled", "budget_tokens": 32000}

# After (Opus 4.7)
thinking = {"type": "adaptive"}
output_config = {"effort": "high"}

Adaptive thinking is the only supported thinking mode now, and it's off by default. You must enable it explicitly.


Sampling Parameters Removed

Setting temperature, top_p, or top_k to any non-default value now returns a 400 error.

The fix: remove these parameters from your requests entirely. If you used temperature = 0 for determinism, note that it never actually guaranteed identical outputs anyway.


Thinking Content Omitted by Default

Thinking blocks are still generated but the thinking field in the response is now empty by default. No error is raised, it just silently omits the content.

If you stream reasoning to users, this will appear as a long pause before output begins. To restore visible progress:

python
thinking = {
    "type": "adaptive",
    "display": "summarized",  # or "omitted" (default)
}

Updated Token Counts

Opus 4.7 uses a new tokenizer. Expect roughly 1x to 1.35x more tokens compared to Opus 4.6 for the same content, depending on the workload.

What to do:

  • Update your max_tokens values to give more headroom
  • Revisit compaction triggers if you use them
  • Use task_budget and effort to control cost without sacrificing too much quality

Capability Improvements

Knowledge Work

Opus 4.7 is better at tasks that require visual verification of its own outputs:

  • DOCX redlining and PPTX editing: Improved at tracking changes and checking slide layouts.
  • Chart and figure analysis: Better at using image-processing libraries like PIL to extract data from charts, including pixel-level transcription.

If you added prompts like "double-check the slide layout before returning," try removing those and re-testing. The model may handle it natively now.


Memory

The model is better at writing to and reading from file-system-based memory. If your agent uses a scratchpad or structured notes file, expect improved consistency across turns.

You can also use the built-in memory tool to give Claude a managed scratchpad without building your own.


Behavior Changes (No API Errors, But May Need Prompt Updates)

BehaviorWhat Changed
Instruction followingMore literal, will not infer unstated requests
Response lengthCalibrates to task complexity, not a fixed verbosity
Tool usageFewer tool calls by default, uses reasoning more
ToneMore direct and opinionated, fewer emoji
Progress updatesMore frequent status updates during long agentic runs
SubagentsFewer spawned by default, steerable via prompting
CybersecurityMay refuse high-risk topics, apply to Cyber Verification Program for legitimate work

Q&A

1. Is Claude Opus 4.7 a drop-in replacement for Opus 4.6?

Mostly, but not entirely. If you use the Messages API and set temperature, top_p, top_k, or budget_tokens, your requests will return 400 errors. Review the breaking changes section before switching.

2. What is adaptive thinking and how do I enable it?

Adaptive thinking is Claude's only supported reasoning mode in Opus 4.7. It's off by default. Enable it by setting thinking: {"type": "adaptive"} in your request.

3. What happens if I don't update my sampling parameters?

Any non-default value for temperature, top_p, or top_k will return a 400 error. The safest fix is to remove those parameters entirely.

4. What is the difference between task_budget and max_tokens?

max_tokens is a hard per-request cap that the model is not aware of. task_budget is an advisory budget across the full agentic loop that the model actively uses to pace itself.

5. Why do my images now cost more tokens?

High-resolution image support uses more tokens per image. If you don't need the extra detail, resize or compress images before sending them to Claude.

6. My streaming UI now shows a long pause before output. What's wrong?

Thinking content is now omitted by default. Set "display": "summarized" in your thinking config to restore visible progress during reasoning.

7. Will existing Opus 4.6 prompts work as-is?

Possibly, but expect behavior differences. The model is more literal, more direct, uses fewer tools by default, and calibrates response length to task complexity. Test your key prompts before migrating in production.

8. Can I still use extended thinking with a custom token budget?

No. Extended thinking budgets are removed entirely. Use adaptive thinking with an effort level instead. Internally, adaptive thinking outperforms the old extended thinking mode.

9. What is the xhigh effort level for?

It's the highest effort level, recommended for coding and agentic tasks where quality matters most. Use at least high for any intelligence-sensitive work.

10. How do I migrate my codebase automatically?

If you use Claude Code or the Agent SDK, the Claude API skill can apply migration changes to your codebase automatically. Otherwise, follow the step-by-step migration guide in the official docs.

References

Last updated:

Made with ❤️ by Mun Bock Ho

Copyright ©️ 2026