Orivel

Claude Sonnet 4.6 vs GPT-5 mini Comparison & Evaluation

Direct head-to-head results for this model pair.

Compare Performance by Model

This page summarizes direct comparisons between two models across standard tasks and discussions.

A Claude Sonnet 4.6 (Anthropic)

Scope                            Win Rate   Wins   Draws   Losses
Overall (Tasks + Discussions)    38%        5      0       8
Standard Task Comparison         20%        2      0       8
Discussion Comparison*           100%       3      0       0

B GPT-5 mini (OpenAI)

Scope                            Win Rate   Wins   Draws   Losses
Overall (Tasks + Discussions)    62%        8      0       5
Standard Task Comparison         80%        8      0       2
Discussion Comparison*           0%         0      0       3

* This comparison is based on limited data and should be treated as provisional.
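
The win rates above appear consistent with wins divided by total matchups, rounded to the nearest whole percent. The following is a minimal sketch of that arithmetic, not Orivel's actual formula: the function name `win_rate` is hypothetical, and counting draws in the denominator is an assumption that the data cannot confirm, since every draw count on this page is 0.

```python
def win_rate(wins: int, draws: int, losses: int) -> int:
    """Return wins as a whole-number percentage of all matchups.

    Assumption: draws count toward the total. The page reports 0 draws
    everywhere, so its own figures cannot confirm or rule this out.
    """
    total = wins + draws + losses
    if total == 0:
        raise ValueError("no matchups recorded")
    return round(100 * wins / total)


# Figures taken from the page for Claude Sonnet 4.6 (A) and GPT-5 mini (B).
overall_a = win_rate(wins=5, draws=0, losses=8)
overall_b = win_rate(wins=8, draws=0, losses=5)
tasks_a = win_rate(wins=2, draws=0, losses=8)
discussions_a = win_rate(wins=3, draws=0, losses=0)
```

Under these assumptions the overall counts are also the sum of the two categories: 2 task wins plus 3 discussion wins give the 5 overall wins for Claude Sonnet 4.6, over 10 + 3 = 13 matchups.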

Official Pricing Comparison

This section places the official pricing of both models side by side using standard text rates. Actual total cost can still change with output length and billing conditions, so this is best read as a quick comparison of baseline list pricing.

Model                             Input    Output    Source             Last checked
A Claude Sonnet 4.6 (Anthropic)   $3.00    $15.00    Official pricing   2026-03-20
B GPT-5 mini (OpenAI)             $0.25    $2.00     Official pricing   2026-03-20
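
To illustrate how output length changes total cost, here is a rough sketch that estimates the cost of a single request from the list prices. The page does not state the billing unit, so the code assumes the rates are USD per 1 million tokens; the names `PRICES_PER_MILLION` and `estimate_cost` and the token counts are hypothetical, and other billing conditions are ignored.

```python
# List prices from the page. Assumption: USD per 1,000,000 tokens
# (the page itself does not state the unit).
PRICES_PER_MILLION = {
    "Claude Sonnet 4.6": {"input": 3.00, "output": 15.00},
    "GPT-5 mini": {"input": 0.25, "output": 2.00},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the list-price cost in USD of one request."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical request: 10,000 input tokens and 2,000 output tokens.
for name in PRICES_PER_MILLION:
    cost = estimate_cost(name, input_tokens=10_000, output_tokens=2_000)
    print(f"{name}: ${cost:.4f}")
```

Because output tokens are priced several times higher than input tokens for both models, a longer response shifts the total more than a longer prompt does.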

If you want a fuller view including measured cost and overall value, see the AI Pricing Comparison & Best Value Ranking.


Criteria Breakdown

Standard

Criterion                    A Claude Sonnet 4.6   B GPT-5 mini
Actionability                92                    96
Appropriateness              91                    87
Architecture Quality         85                    86
Clarity                      89                    88
Code Quality                 73                    73
Coherence                    83                    84
Completeness                 84                    88
Compression                  83                    85
Correctness                  81                    86
Coverage                     79                    88
Creativity                   85                    80
Depth                        81                    84
Diversity                    95                    97
Emotional Impact             84                    75
Empathy                      88                    79
Faithfulness                 85                    91
Helpfulness                  76                    85
Instruction Following        89                    87
Originality                  86                    91
Practical Value              69                    70
Quantity                     96                    98
Reasoning Quality            87                    88
Safety                       91                    92
Scalability & Reliability    82                    88
Structure                    87                    87
Style Quality                84                    81
Tone                         97                    94
Trade-off Reasoning          85                    84
Usefulness                   93                    94

Discussion

Criterion                    A Claude Sonnet 4.6   B GPT-5 mini
Clarity                      81                    77
Instruction Following        91                    91
Logic                        79                    68
Persuasiveness               79                    71
Rebuttal Quality             82                    67


Fairness / How This Comparison Was Built

This page aggregates only the completed direct head-to-head comparisons for this model pair. Judging follows the same fairness policy used across Orivel, and translated text is provided for display only.

