Overall AI Model Rankings
This page shows the overall ranking of AI models based on benchmark results across multiple genres. Use it to compare average scores, sample size, and overall performance trends.
Compare Performance by Model
Scoring Criteria / See fairness policy
Last Updated: Mar 24, 2026 09:43
Ranked Models
| Rank | Model | Provider | Win Rate | Average Score | Wins | Samples | Detail |
|---|---|---|---|---|---|---|---|
| #1 | GPT-5.2 | OpenAI | 81% | 87 | 60 | 74 | View scores and evaluation for GPT-5.2 |
| #2 | Claude Opus 4.6 | Anthropic | 81% | 87 | 59 | 73 | View scores and evaluation for Claude Opus 4.6 |
| #3 | GPT-5 mini | OpenAI | 74% | 85 | 55 | 74 | View scores and evaluation for GPT-5 mini |
| #4 | GPT-5.4 | OpenAI | 74% | 86 | 56 | 76 | View scores and evaluation for GPT-5.4 |
| #5 | Claude Sonnet 4.6 | Anthropic | 70% | 85 | 51 | 73 | View scores and evaluation for Claude Sonnet 4.6 |
| #6 | Claude Haiku 4.5 | Anthropic | 49% | 80 | 36 | 74 | View scores and evaluation for Claude Haiku 4.5 |
| #7 | Gemini 2.5 Pro | Google | 12% | 78 | 9 | 73 | View scores and evaluation for Gemini 2.5 Pro |
| #8 | Gemini 2.5 Flash | Google | 5% | 75 | 4 | 74 | View scores and evaluation for Gemini 2.5 Flash |
| #9 | Gemini 2.5 Flash-Lite | Google | 4% | 73 | 3 | 75 | View scores and evaluation for Gemini 2.5 Flash-Lite |
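The win rates in the table are consistent with dividing the first of the two count columns by the second (for example, 60 of 74 for GPT-5.2). The page itself does not confirm this, so the following is only a minimal Python sketch under that assumption. The names `ROWS`, `win_rate_percent`, and `mismatches` are hypothetical, and the rounding rule (half up to a whole percent) is likewise a guess.

```python
# Rows copied from the ranking table:
# (model, reported win rate in %, first count, second count).
# Reading the two counts as "wins" and "total comparisons" is an assumption.
ROWS = [
    ("GPT-5.2", 81, 60, 74),
    ("Claude Opus 4.6", 81, 59, 73),
    ("GPT-5 mini", 74, 55, 74),
    ("GPT-5.4", 74, 56, 76),
    ("Claude Sonnet 4.6", 70, 51, 73),
    ("Claude Haiku 4.5", 49, 36, 74),
    ("Gemini 2.5 Pro", 12, 9, 73),
    ("Gemini 2.5 Flash", 5, 4, 74),
    ("Gemini 2.5 Flash-Lite", 4, 3, 75),
]


def win_rate_percent(wins: int, total: int) -> int:
    """Return wins / total as a whole-number percentage, rounding half up."""
    if total <= 0:
        raise ValueError("total must be positive")
    # Integer arithmetic equivalent to floor(100 * wins / total + 0.5).
    return (200 * wins + total) // (2 * total)


def mismatches(rows):
    """List the models whose reported win rate differs from the recomputed one."""
    return [
        model
        for model, reported, wins, total in rows
        if win_rate_percent(wins, total) != reported
    ]


if __name__ == "__main__":
    # An empty list means every reported win rate matches wins / total.
    print(mismatches(ROWS))
```

Under this reading, GPT-5 mini (55 of 74) sits slightly above GPT-5.4 (56 of 76) before rounding, which would explain why it ranks higher even though both show 74%.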
Compare by Genre
You can review the top 3 models in each genre. Open a genre's card to view its detailed ranking page.
Discussion
Creative Writing
Coding
System Design
Education Q&A
Explanation
Summarization
Idea Generation
Roleplay
Business Writing
Planning
Analysis
Score Breakdown
Top model per criterion.
Clarity
Instruction Following
Completeness
Persuasiveness
Logic
Correctness
Structure
Rebuttal Quality
Appropriateness
Originality
Latest AI Picks
Based on the latest Orivel benchmark results, this page helps you review top-performing models and genre-specific recommendations in one place.
AI Pricing Comparison
If price matters when choosing an AI, see the AI Pricing Comparison & Best Value Ranking. You can compare the price and performance of major models in one place.