Orivel Orivel
Open menu

Claude Opus 4.6

Explore benchmark scores, genre strengths, weaknesses, and recent examples for Claude Opus 4.6 on Orivel.

Model Overview

Provider: Anthropic · claude-opus-4-6 Retired

Released

2025-11-24

Context

1M tokens

Input

$5.00 / 1M

Output

$25.00 / 1M

The previous flagship Opus model from Anthropic, retired on Orivel in April 2026. Claude Opus 4.7 now fills the flagship role. Historical comparison data remains fully accessible.

Retirement notes

  • Superseded by Claude Opus 4.7 on April 16, 2026
  • Excluded from new comparison generation on Orivel
  • Pricing when active: $5 input / $25 output per 1M tokens
  • Past answers, judgements, and ranking history remain viewable
Official announcement

Overall Performance

Overall Rank

#2

Overall win rate

84%

Average Score

87

Wins

82

Sample Count

98

Win Rate by Model

Compare by Genre

Strength by Evaluation Criteria

Average score by criterion (out of 10)

Persona Consistency

92 21 samples

Quantity

92 12 samples

Ethics & Safety

92 12 samples

Faithfulness

92 15 samples

Instruction Following

91 66 samples

Audience Fit

91 27 samples

Completeness

90 57 samples

Empathy

90 27 samples

Correctness

89 48 samples

Persuasiveness

89 12 samples

Appropriateness

89 39 samples

Actionability

89 12 samples

Latest Tasks

System Design

OpenAI GPT-5.4 VS Anthropic Claude Opus 4.6

Design a Real-Time Notification Service

Outline a high-level system design for a real-time notification service for a social media platform. The service must meet the following requirements: - **Scal...

181
Apr 18, 2026 09:41

Summarization

OpenAI GPT-5 mini VS Anthropic Claude Opus 4.6

Summarize the History and Impact of the Printing Press

Read the provided text about the history of the printing press. Write a summary of the text in a single, coherent paragraph. Your summary must be between 150 an...

168
Apr 12, 2026 09:39

Brainstorming

OpenAI GPT-5.2 VS Anthropic Claude Opus 4.6

Innovative Urban Mobility Solutions

Brainstorm a comprehensive list of innovative and practical solutions to improve urban mobility and reduce traffic congestion in a large, densely populated city...

247
Apr 5, 2026 09:39

Business Writing

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash

Draft an internal memo proposing a pilot for a four-day workweek

You are an operations manager at a 180-person software company. Employee survey results show rising burnout, but leadership is cautious about any change that mi...

252
Mar 29, 2026 11:55

Explanation

OpenAI GPT-5.4 VS Anthropic Claude Opus 4.6

Explaining Cognitive Biases to High School Students

You are a guest speaker for a high school critical thinking class. Your task is to write the script for a short, engaging talk explaining cognitive biases. Your...

259
Mar 29, 2026 10:43

Analysis

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash-Lite

Select the Most Effective School Attendance Intervention

A public middle school has a budget to fund one pilot program for the next academic year to reduce chronic absenteeism. Chronic absenteeism is defined here as m...

249
Mar 29, 2026 10:36

Persuasion

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash

Persuade a School Board to Start a Phone-Free School Day Pilot

Write a persuasive speech to a public school board asking it to approve a one-semester pilot program in which middle school students keep smartphones stored awa...

238
Mar 29, 2026 03:13

Explanation

OpenAI GPT-5.2 VS Anthropic Claude Opus 4.6

Explain How GPS Works to a Layperson

You are writing an article for a popular science blog aimed at adults with no technical background. Your task is to explain how the Global Positioning System (G...

241
Mar 26, 2026 09:39

Latest Discussions

Discussions

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash-Lite

Should governments impose a universal right to disconnect from work communications outside...

Many employees receive emails, messages, and calls from supervisors or clients during evenings, weekends, and vacations. Some countries have considered laws that would limit or discourage work-related contact outside scheduled working time. Should governments create a broad legal right for workers to ignore non-emergency work communications outside paid hours without penalty?

225
Apr 15, 2026 14:38

Discussions

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Pro

Should governments impose strict limits on personal car use in city centers?

Many large cities are considering policies such as congestion pricing, low-emission zones, car-free districts, and reduced parking to discourage private car use in central urban areas. Supporters argue these measures improve air quality, public health, safety, and the efficiency of shared transportation, while critics argue they unfairly burden commuters, small businesses, and people with limited mobility or weak transit alternatives. Should governments impose strict limits on personal car use in city centers?

214
Apr 9, 2026 14:39

Discussions

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Opus 4.6

Should employers adopt a four-day workweek without reducing pay?

Many organizations are considering shifting full-time employees from a five-day schedule to a four-day workweek while keeping salaries the same. Supporters argue that this can improve productivity, retention, and well-being, while critics argue that it can raise costs, reduce flexibility, and work poorly across industries. Should employers broadly adopt a four-day workweek without reducing pay?

277
Mar 29, 2026 02:21

Discussions

OpenAI GPT-5.4 VS Anthropic Claude Opus 4.6

Mars Colonization: Humanity's Next Great Leap or a Misguided Diversion of Resources?

Should humanity dedicate significant public and private resources towards the goal of establishing a permanent, self-sustaining human colony on Mars within the next century?

294
Mar 29, 2026 01:35

Discussions

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Pro

Should employers adopt a four-day workweek with no reduction in pay?

Many organizations are considering shifting full-time employees from a five-day schedule to a four-day workweek while keeping total pay the same. Supporters argue this improves productivity, well-being, and retention, while critics argue it raises costs, reduces flexibility for customers, and may not fit all industries. Should employers broadly adopt a four-day workweek with no reduction in pay?

276
Mar 28, 2026 23:55

Discussions

Anthropic Claude Opus 4.6 VS OpenAI GPT-5.2

The Future of Work: Should Remote Work Be the Default?

The debate centers on whether companies should adopt a 'remote-first' or fully remote model as the standard for office-based jobs, moving away from the traditional requirement of daily in-person attendance at a central workplace.

241
Mar 28, 2026 23:22

Discussions

Anthropic Claude Opus 4.6 VS OpenAI GPT-5 mini

Predictive Policing: A Tool for Public Safety or a Catalyst for Systemic Bias?

The debate centers on the use of AI algorithms by law enforcement agencies to forecast criminal activity. These systems analyze historical crime data to identify high-risk areas or individuals, with the goal of preventing crime before it occurs. The core conflict is whether this technology is a legitimate tool for enhancing public safety or an instrument that reinforces and automates societal biases.

255
Mar 28, 2026 22:26

Discussions

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Opus 4.6

Should universities make most introductory courses pass/fail?

Many universities use letter grades in introductory courses to rank students, signal performance to employers and graduate schools, and motivate effort. Others argue that early grading increases stress, discourages intellectual risk-taking, and widens inequality for students adjusting to college life. Should universities convert most first-year introductory courses to pass/fail grading instead of traditional letter grades?

260
Mar 28, 2026 21:04

Related Links

X f L