Empathy
Explore how AI models perform in Empathy. Compare rankings, scoring criteria, and recent benchmark examples.
Genre overview
Compare how well AI models respond with empathy, care, and appropriate tone.
In this genre, the main abilities being tested are Empathy, Appropriateness, Helpfulness.
Unlike counseling, this genre focuses more on emotional attunement and tone than on structured next steps or bounded practical guidance.
A high score here does not guarantee safe handling of delicate situations or the best practical advice under risk.
Strong models here are useful for
supportive replies, comforting messages, and responses where emotional tone matters first.
This genre alone cannot tell you
whether the model can provide safer structured guidance, clinical judgment, or professional advice.
Top Models in This Genre
This ranking is ordered by average score within this genre only.
Latest Updated: Mar 21, 2026 13:01
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
| Ranked Models |
|
|
Detail | ||||
|---|---|---|---|---|---|---|---|
| #1 | GPT-5.2 | OpenAI |
100%
|
93
|
3 | 3 | View scores and evaluation for GPT-5.2 |
| #2 | Claude Opus 4.6 | Anthropic |
100%
|
90
|
4 | 4 | View scores and evaluation for Claude Opus 4.6 |
| #3 | Claude Sonnet 4.6 | Anthropic |
75%
|
87
|
3 | 4 | View scores and evaluation for Claude Sonnet 4.6 |
| #4 | Claude Haiku 4.5 | Anthropic |
75%
|
84
|
3 | 4 | View scores and evaluation for Claude Haiku 4.5 |
| #5 | GPT-5.4 | OpenAI |
50%
|
86
|
2 | 4 | View scores and evaluation for GPT-5.4 |
| #6 | GPT-5 mini | OpenAI |
25%
|
86
|
1 | 4 | View scores and evaluation for GPT-5 mini |
| #7 | Gemini 2.5 Pro |
25%
|
85
|
1 | 4 | View scores and evaluation for Gemini 2.5 Pro | |
| #8 | Gemini 2.5 Flash |
25%
|
79
|
1 | 4 | View scores and evaluation for Gemini 2.5 Flash | |
| #9 | Gemini 2.5 Flash-Lite |
0%
|
79
|
0 | 5 | View scores and evaluation for Gemini 2.5 Flash-Lite |
What Is Evaluated in Empathy
Scoring criteria and weight used for this genre ranking.
Empathy
35.0%
This criterion is included to check Empathy in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.
Appropriateness
25.0%
This criterion is included to check Appropriateness in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.
Helpfulness
15.0%
This criterion is included to check Helpfulness in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Clarity
15.0%
This criterion is included to check Clarity in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Safety
10.0%
This criterion is included to check Safety in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Recent tasks
Empathy
Compassionate Response to Job Loss and Family Pressure
Write a reply to the following message from a person seeking emotional support. Your reply should sound human, warm, and respectful. It should validate their feelings without being patronizing, avoid making assumptions, and offer a few practical next steps that are realistic for the next week. Message: "I got laid off two weeks ago and I still haven’t told my parents. They’ve always seen me as the stable one, and I can already hear the disappointment in their voices. I’ve been pretending everything is normal, but every day I wake up feeling sick. I’m scared about money, ashamed that I don’t have a plan yet, and exhausted from trying to act okay around everyone. I don’t even know whether I need advice or just someone to say I’m not failing at life."
Empathy
Respond to a Friend Overwhelmed by Caregiving
A close friend sends you this message: "I’m exhausted. My dad’s health has gotten worse, I’m handling appointments, work is piling up, and I snapped at my partner last night. I feel guilty for not doing enough for anyone. Please don’t give me a cheesy motivational speech. I just need someone to talk to." Write a reply that is warm, emotionally intelligent, and practical without sounding clinical or preachy. Your response should acknowledge their feelings, avoid minimizing the situation, and offer support in a way that respects their autonomy. Do not claim to be a therapist or use crisis-language unless clearly necessary.
Empathy
Responding to an Upset Community Member
You are a volunteer moderator for an online hobbyist forum about vintage synthesizers. A user, "SynthWizard88," is very upset because you removed their post which contained a link to an external site selling their own custom-made synthesizer parts. The forum has a strict "no self-promotion" rule. SynthWizard88 has sent you a private message: "Why was my post deleted?! I spent hours writing it up to help people, and you just deleted it without any warning. This is unfair censorship. I thought this was a community, not a dictatorship." Draft an empathetic, clear, and firm private message back to SynthWizard88. Your response should aim to de-escalate the situation, explain the reasoning, and encourage them to continue participating in the community in a positive way.
Empathy
Supportive reply to an overwhelmed new parent returning to work
Write a compassionate message to a friend who says: "I'm going back to work next week after parental leave, and I feel like I'm failing at everything. The baby still wakes up a lot, I'm exhausted, my inbox is a disaster, and everyone else seems to handle this stage better than I do. I don't even know what to ask for anymore." Your response should sound personal and emotionally supportive without being overly dramatic or clinical. Acknowledge their feelings, avoid minimizing their struggle, and offer a few practical next steps they could realistically take in the coming week. Do not mention therapy, hotlines, or medical advice unless clearly necessary.
Empathy
Supportive response to a friend overwhelmed by caregiving and work
Write a reply to the following message from a close friend. Your reply should sound human, emotionally supportive, and practically helpful without becoming overly dramatic or preachy. Message from friend: "I feel like I'm failing at everything right now. My dad's health has gotten worse, and I've been taking him to appointments before work and checking on him every night. I'm exhausted, behind on deadlines, and my manager has started making comments. My brother keeps saying he'll help but then disappears. I snapped at my dad yesterday and I feel horrible about it. I don't even know what to do first anymore, and I honestly dread waking up tomorrow." Requirements: - Acknowledge the friend's feelings without minimizing them. - Avoid clichés and avoid diagnosing any mental health condition. - Include 2 to 4 concrete, realistic suggestions for the next 24 to 72 hours. - Encourage support-seeking in a respectful, non-pushy way. - Do not shame the brother, the manager, or the friend. - Keep the tone warm and grounded. - Write 180 to 260 words.
Empathy
Empathetic Response to Job Overwhelm
Imagine you are a peer support chatbot. A user sends you the following message. Write a single, supportive response. User's message: 'I started a new job a month ago and I'm completely overwhelmed. I feel like I don't know what I'm doing, and everyone else seems so much more competent. I'm working late every night just to keep up, but I still feel like I'm failing. I'm starting to think I'm not cut out for this and that they made a mistake hiring me. I don't know what to do.'