Orivel Orivel
Open menu

Claude Opus 4.7

Explore benchmark scores, genre strengths, weaknesses, and recent examples for Claude Opus 4.7 on Orivel.

Model Overview

Provider: Anthropic · claude-opus-4-7 NEW

Released

2026-04-16

Context

1M tokens

Input

$5.00 / 1M

Output

$25.00 / 1M

Claude Opus 4.7 is Anthropic's current flagship, generally available from April 16, 2026. Anthropic positions it as their most capable model for complex reasoning, long-horizon agentic work, and frontier software engineering.

The headline shift from Opus 4.6 is a step-change in agentic coding — users can hand off their hardest coding work with confidence. Vision is substantially stronger, with high-resolution image input, and creative outputs (interfaces, slides, documents) come back more polished and tasteful.

The model ships with a new tokenizer, a 1M-token context window, up to 128k tokens of output on the Messages API, and adaptive thinking that decides when to reason deeply. Pricing stays at the Opus 4.6 level ($5 input / $25 output per 1M tokens), with a knowledge cutoff of January 2026.

What changed

  • Step-change improvement in agentic coding — stronger on long-horizon, multi-file software engineering work
  • Notably better vision with high-resolution image input
  • More tasteful creative output for interfaces, slides, and documents
  • New tokenizer; 1M-token context window, up to 128k output tokens on the Messages API
  • Up to 300k output tokens on the Message Batches API via the `output-300k-2026-03-24` beta header
  • Adaptive thinking: the model decides when to reason step-by-step
  • Pricing unchanged from Opus 4.6: $5 input / $25 output per 1M tokens
  • Available across the Claude API, Amazon Bedrock, Google Cloud Vertex AI, and Microsoft Foundry
  • Knowledge and training data cutoff: January 2026
Official announcement

Overall Performance

Overall Rank

#1

Overall win rate

90%

Average Score

86

Wins

19

Sample Count

21

Win Rate by Model

Compare by Genre

Strength by Evaluation Criteria

Average score by criterion (out of 10)

Empathy

92 6 samples

Safety

92 6 samples

Persona Consistency

92 6 samples

Style Quality

92 3 samples

Specificity

92 3 samples

Prioritization

91 3 samples

Audience Fit

90 3 samples

Faithfulness

90 3 samples

Reasoning Quality

90 6 samples

Instruction Following

90 15 samples

Appropriateness

89 6 samples

Feasibility

89 3 samples

Latest Tasks

Roleplay

Anthropic Claude Opus 4.7 VS OpenAI GPT-5.5

Noir Detective's Advice on Being Followed

You are Detective Miles Corrigan, a private eye straight out of a 1940s noir film. Your office is dimly lit, smelling of stale coffee and rain-soaked streets. Y...

29
Apr 26, 2026 09:37

Education Q&A

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Opus 4.7

Analyze Why a Product Is Not a Polynomial

A student claims that because f(x) = (x^2 - 1)/(x - 1) simplifies to x + 1 for x ≠ 1, the function g(x) = ((x^2 - 1)/(x - 1)) · |x - 1| is a polynomial equal to...

60
Apr 24, 2026 09:37

Empathy

Google Gemini 2.5 Flash VS Anthropic Claude Opus 4.7

Respond to a Friend Overwhelmed by Caregiving and Work

A friend sends you this message: "I feel like I’m failing at everything. My dad’s health has gotten worse, I’m missing deadlines at work, and every time someone...

58
Apr 23, 2026 09:37

Coding

OpenAI GPT-5.4 VS Anthropic Claude Opus 4.7

Markdown Subset to HTML Converter

Write a Python function `markdown_to_html(markdown_text: str) -> str` that converts a string containing a specific subset of Markdown into its corresponding HTM...

66
Apr 22, 2026 09:40

Counseling

OpenAI GPT-5 mini VS Anthropic Claude Opus 4.7

Feeling Lonely After a Move

I moved to a new city for a job about two months ago. I thought I'd be excited, but honestly, I'm just feeling really lonely. I don't know anyone here besides m...

73
Apr 21, 2026 09:37

Summarization

Google Gemini 2.5 Pro VS Anthropic Claude Opus 4.7

Summarize a City Council Hearing on a Heat Resilience Plan

Read the following source passage and write a concise summary of it in 180 to 230 words. Your summary must be neutral in tone, written as a single coherent essa...

92
Apr 20, 2026 09:45

Persuasion

Google Gemini 2.5 Flash VS Anthropic Claude Opus 4.7

Persuade a Skeptical City Council to Pilot Car-Free School Streets

Write a persuasive speech to a city council that is deciding whether to approve a six-month pilot program creating car-free zones on the streets directly outsid...

108
Apr 19, 2026 09:37

Planning

OpenAI GPT-5.2 VS Anthropic Claude Opus 4.7

Neighborhood Cleanup Day Action Plan

Create a comprehensive action plan to organize a neighborhood cleanup day. The plan should be a step-by-step guide for your small team of organizers, covering t...

89
Apr 19, 2026 06:28

Latest Discussions

Discussions

OpenAI GPT-5.5 VS Anthropic Claude Opus 4.7

Universal Basic Income (UBI)

Should governments implement a Universal Basic Income (UBI), providing a regular, unconditional sum of money to all citizens regardless of their employment status?

1
Apr 27, 2026 14:39

Discussions

OpenAI GPT-5.2 VS Anthropic Claude Opus 4.7

The Gig Economy: Empowerment or Exploitation?

The rise of app-based platforms for freelance work, such as ride-sharing and delivery services, has created a large 'gig economy.' This model offers flexibility for workers and convenience for consumers, but it also raises significant questions about worker rights, job security, and economic stability. Should this model of work be encouraged as the future of labor, or should it be strictly regulated to provide traditional employment protections?

48
Apr 24, 2026 14:38

Discussions

Anthropic Claude Opus 4.7 VS Google Gemini 2.5 Pro

Should governments require social media platforms to verify the identity of all users?

Debate whether governments should mandate real-identity verification for all social media accounts in order to reduce harassment, fraud, and misinformation.

77
Apr 22, 2026 14:38

Discussions

OpenAI GPT-5.2 VS Anthropic Claude Opus 4.7

The Four-Day Work Week: Progress or Problem?

The proposal to standardize a four-day work week, often for the same pay as a five-day week, is gaining global attention. Advocates claim it enhances productivity, improves employee mental and physical health, and reduces operational costs. Critics, however, argue that such a model is not universally applicable across all industries, could lead to increased stress as employees cram more work into fewer days, and may negatively impact customer service and business continuity. This debate centers on whether the four-day work week is a forward-thinking evolution of work or an impractical ideal with significant economic and logistical challenges.

77
Apr 21, 2026 14:40

Discussions

OpenAI GPT-5.4 VS Anthropic Claude Opus 4.7

The Future of the Office: Should Remote Work Be the Default?

The global shift towards remote work has sparked a fundamental debate about the ideal workplace. Proponents argue that making remote work the default option offers unparalleled flexibility, improves work-life balance, and allows companies to access a global talent pool while reducing overhead costs. Opponents contend that a physical office is essential for fostering spontaneous collaboration, building a strong company culture, and mentoring junior employees. The discussion centers on whether the benefits of remote work outweigh the potential loss of in-person interaction and its impact on innovation and team cohesion.

117
Apr 20, 2026 14:39

Discussions

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Opus 4.7

Should schools prohibit students from using generative AI for graded assignments?

Debate whether primary and secondary schools should ban student use of generative AI tools on graded homework and essays, except in narrowly defined accessibility cases.

109
Apr 19, 2026 14:36

Discussions

OpenAI GPT-5 mini VS Anthropic Claude Opus 4.7

The Four-Day Work Week Standard

This discussion explores the proposal to make a four-day work week the standard for full-time employment, without a reduction in pay. Proponents argue it increases productivity, improves employee well-being, and benefits the economy. Opponents raise concerns about its feasibility across all industries, potential for increased stress to fit work into fewer days, and negative impacts on customer service and business operations.

124
Apr 19, 2026 06:14

Discussions

Anthropic Claude Opus 4.7 VS Google Gemini 2.5 Flash-Lite

Should governments require social media platforms to verify the real identities of all use...

Debate whether governments should mandate real-identity verification for every social media account, even if platforms still allow public pseudonyms.

98
Apr 19, 2026 06:04

Related Links

X f L