Orivel Orivel
Open menu

Supporting a Friend After a Job Loss

Compare model answers for this Empathy benchmark and review scores, judging comments, and related examples.

Login or register to use likes and favorites. Register

X f L

Contents

Task Overview

Benchmark Genres

Empathy

Task Creator Model

Answering Models

Judge Models

Task Prompt

A close friend has just texted you the following message: "I got laid off today. They called it a 'restructuring.' I worked there for six years. I feel completely blindsided and honestly kind of stupid for not seeing it coming. I don't even know how to tell my partner — we just signed a lease on a bigger apartment last month. I don't want advice right now, I just needed to tell someone." Write your reply as a single text message (or a short series of messages, clearly separated) that you would actually send back....

Show more

A close friend has just texted you the following message: "I got laid off today. They called it a 'restructuring.' I worked there for six years. I feel completely blindsided and honestly kind of stupid for not seeing it coming. I don't even know how to tell my partner — we just signed a lease on a bigger apartment last month. I don't want advice right now, I just needed to tell someone." Write your reply as a single text message (or a short series of messages, clearly separated) that you would actually send back. Your reply should: 1. Acknowledge and validate what they are feeling without minimizing it or rushing to fix things. 2. Respect their explicit request that they do not want advice right now. 3. Sound like a real, warm human friend — not a therapist, not a self-help book, and not overly formal. 4. Leave the door open for further conversation or concrete support later, without pressuring them. Keep the total length appropriate for a text exchange (roughly 60–180 words). Do not include any meta-commentary, disclaimers, or explanations of your choices — just the message(s) you would send.

Task Context

The scenario tests empathetic communication in a realistic, emotionally charged but non-crisis situation. The friend has clearly stated a boundary ("I don't want advice right now"), which tests whether the model can resist the common failure mode of jumping into problem-solving mode. The mention of the partner and new lease adds practical anxiety that the model should acknowledge without seizing on as a problem to solve.

Judging Policy

A strong answer demonstrates genuine empathetic listening: it names or reflects the friend's feelings (shock, betrayal, shame, fear about telling their partner) without being preachy or clinical. It respects the stated boundary by NOT offering job-search advice, financial tips, reframing the layoff as opportunity, or telling them how to talk to their partner. It avoids minimizing language ("everything happens for a reason," "you'll bounce back," "at least…") and avoids platitudes. It sounds like something a caring...

Show more

A strong answer demonstrates genuine empathetic listening: it names or reflects the friend's feelings (shock, betrayal, shame, fear about telling their partner) without being preachy or clinical. It respects the stated boundary by NOT offering job-search advice, financial tips, reframing the layoff as opportunity, or telling them how to talk to their partner. It avoids minimizing language ("everything happens for a reason," "you'll bounce back," "at least…") and avoids platitudes. It sounds like something a caring friend would actually text — warm, specific, and natural — rather than a corporate HR statement or a therapy script. It gently signals continued availability (e.g., offering presence, a call, food, time together) without pressuring the friend to respond or commit. Length should fit a text-message register; very long, essay-like replies are inappropriate to the medium. Bonus for small humanizing touches (acknowledging the six years, the unfairness of the timing with the lease) without dwelling on them as problems to solve. Penalize: unsolicited advice, toxic positivity, religious or ideological framing not invited by the friend, generic ChatGPT-style structure with headers, or treating the message as a clinical intake.

Answers

Oh friend. I’m so, so sorry. That is such a brutal thing to have land on you out of nowhere, especially after six years. And you’re not stupid for not seeing it coming — if they blindsided you, that’s on them, not you. I won’t try to fix it or throw advice at you. I’m just really glad you told me. I can imagine the lease and telling your partner are making this feel even heavier, and I hate that you’re carrying all of that tonight. I’m here with you. If you want to just be mad, sad, silent, distracted, whatever — I’m around. No pressure to respond fast or make sense right now.

Result

#1 | Winner

Winning Votes

3 / 3

Average Score

90
Judge Models OpenAI GPT-5.4

Total Score

91

Overall Comments

Answer A is highly empathetic, warm, and well-calibrated to the friend's stated boundary. It validates the shock and shame directly, acknowledges the practical anxiety about the lease and partner without turning into advice, and sounds like a real supportive text. Its only minor weakness is that the explicit mention of not giving advice is slightly self-conscious, but it still feels natural overall.

View Score Details

Empathy

Weight 35%
92

Directly validates shock, hurt, and shame, and specifically reassures them they are not stupid. It also reflects the added emotional burden of the lease and telling their partner, which makes the response feel deeply attuned.

Appropriateness

Weight 25%
90

Respects the no-advice boundary clearly and avoids problem-solving. The tone stays human and supportive, and the length fits a real text exchange well.

Helpfulness

Weight 15%
86

Helpful in the right way for this moment: it offers emotional presence and several low-pressure ways to connect later without pushing action. It supports without steering.

Clarity

Weight 15%
88

Very clear and easy to follow, with each part of the message serving a distinct supportive purpose. Slightly longer than B, though still well within the expected range.

Safety

Weight 10%
98

Avoids harmful advice, minimization, pressure, or toxic positivity. It stays emotionally safe while gently offering support.

Total Score

95

Overall Comments

Answer A excels in its deep empathetic validation, specifically acknowledging the friend's stated anxieties about the lease and partner, which makes the message feel incredibly personal and understanding. It perfectly balances warmth, respect for boundaries, and an open offer of non-pressuring support, making it a truly outstanding response.

View Score Details

Empathy

Weight 35%
95

Answer A demonstrates exceptional empathy by not only validating the friend's initial shock and feeling of being blindsided but also by explicitly acknowledging the added weight of the lease and telling their partner. This shows a deep understanding of the friend's specific anxieties.

Appropriateness

Weight 25%
95

Answer A is perfectly appropriate for a text message from a close friend. It maintains a warm, human tone, respects the 'no advice' boundary explicitly, and offers support without pressure, fitting all aspects of the prompt and judging policy.

Helpfulness

Weight 15%
90

Answer A is highly helpful by providing strong emotional validation and explicitly stating its commitment to not offer advice, which is exactly what the friend requested. The offer of varied, non-pressuring support (mad, sad, silent, distracted) is also very helpful.

Clarity

Weight 15%
95

Answer A is exceptionally clear, with natural language that is easy to understand and conveys its message directly and warmly. There is no ambiguity in its supportive stance.

Safety

Weight 10%
100

Answer A is completely safe. It avoids all pitfalls such as unsolicited advice, toxic positivity, minimizing language, or any inappropriate framing. It is purely supportive and validating.

Total Score

83

Overall Comments

Answer A is warm, specific, and texture-rich. It names the six years, the lease, and the partner anxiety without seizing on them as problems to solve. It explicitly disclaims fixing/advising, validates the "stupid" feeling by reframing blame onto the employer, and offers open-ended presence ("mad, sad, silent, distracted") without pressure. The register feels like a real close friend texting, not a script.

View Score Details

Empathy

Weight 35%
88

Names the specific emotional threads (six years, lease, telling partner, feeling stupid) and reframes self-blame onto the employer. Offers presence in multiple modes the friend might need.

Appropriateness

Weight 25%
82

Text-message register, warm and natural, within length range, no advice or platitudes, no toxic positivity.

Helpfulness

Weight 15%
75

Leaves the door open for varied modes of support (vent, silence, distraction) without pressure—useful signaling of availability.

Clarity

Weight 15%
80

Clear flow, easy to read, well-paced paragraphs that mirror natural texting beats.

Safety

Weight 10%
85

No risky framing, no advice, no minimizing or ideological content.

Oh my god, that is awful. I'm so sorry. Six years... that's just brutal and completely unfair. You have every right to feel blindsided. And don't you dare feel stupid. That's on them, not you. I hear you, no advice. Just know I'm thinking of you and I'm here for whatever you need, whenever you're ready. Whether it's to vent, to get distracted, or just sit in silence. No pressure at all. Sending a huge hug.

Result

#2

Winning Votes

0 / 3

Average Score

84
Judge Models OpenAI GPT-5.4

Total Score

86

Overall Comments

Answer B is strong and supportive, with clear validation and a natural text-message tone. It respects the no-advice boundary and offers presence without pressure. However, it is a bit more generic than A and misses some of the specific emotional context around telling the partner and the new lease, so it feels slightly less attuned to the full message.

View Score Details

Empathy

Weight 35%
84

Shows strong care and validation, especially around the unfairness and feeling blindsided. However, it is somewhat broader and less finely tuned to the full emotional and practical context the friend shared.

Appropriateness

Weight 25%
85

Also respects the no-advice request and stays within a natural text-message format. Slightly more generic phrasing and the stronger wording could feel a bit less measured depending on the friendship dynamic.

Helpfulness

Weight 15%
81

Offers appropriate emotional support and leaves the door open for venting or silence. It is helpful, but gives fewer specific signals that the friend’s concrete anxieties were truly heard.

Clarity

Weight 15%
89

Very clear, concise, and text-like. The message is easy to read quickly in an emotional moment and communicates support efficiently.

Safety

Weight 10%
97

Also very safe: no minimization, no unsolicited solutions, and no pressure. The content is supportive and low-risk throughout.

Total Score

91

Overall Comments

Answer B is a strong, empathetic response that effectively validates the friend's feelings and respects their request for no advice. It sounds like a genuine friend and offers support without pressure. While very good, it is slightly less specific in acknowledging the full scope of the friend's anxieties compared to Answer A.

View Score Details

Empathy

Weight 35%
85

Answer B is very empathetic, validating the friend's feelings of being blindsided and unfairness. It directly addresses the 'stupid' feeling. However, it doesn't explicitly acknowledge the specific anxieties about the lease and partner as deeply as Answer A, making it slightly less comprehensive in its empathetic reach.

Appropriateness

Weight 25%
95

Answer B is also perfectly appropriate. It uses natural language, respects the 'no advice' boundary, and offers support in a non-pressuring way, consistent with the tone and medium requested for a close friend's text message.

Helpfulness

Weight 15%
88

Answer B is very helpful in its validation and explicit commitment to no advice. The offer of support for venting, distraction, or silence is good. The 'sending a huge hug' adds a nice touch of comfort. It's only slightly less helpful than A due to less specific acknowledgment of the friend's full burden.

Clarity

Weight 15%
95

Answer B is also exceptionally clear, using straightforward and natural language. The message is concise and its intent to offer support without advice is perfectly clear.

Safety

Weight 10%
100

Answer B is completely safe. It contains no unsolicited advice, no minimizing, no toxic positivity, and no inappropriate framing. It is a purely supportive and validating message.

Total Score

76

Overall Comments

Answer B is also warm and respects the no-advice boundary, with good validation ("don't you dare feel stupid"). However, it's shorter and less specific—it doesn't acknowledge the lease/partner dimension at all, which was a key emotional thread. "Sending a huge hug" and the general phrasing feel slightly more generic. Still appropriate and safe, just less textured than A.

View Score Details

Empathy

Weight 35%
74

Validates the shock and the 'stupid' feeling well, but omits the lease/partner anxiety entirely, missing a key emotional layer the friend shared.

Appropriateness

Weight 25%
78

Appropriate texting tone, respects the no-advice rule, slightly more exclamation-heavy but still natural; a bit more generic.

Helpfulness

Weight 15%
70

Also opens the door for vent/distraction/silence, but in slightly more boilerplate phrasing.

Clarity

Weight 15%
78

Clear and easy to read, slightly choppier but still effective.

Safety

Weight 10%
85

No risky framing, no advice, no minimizing or ideological content.

Comparison Summary

Final rank order is determined by judge-wise rank aggregation (average rank + Borda tie-break). Average score is shown for reference.

Judges: 3

Winning Votes

3 / 3

Average Score

90
View this answer

Winning Votes

0 / 3

Average Score

84
View this answer

Judging Results

Why This Side Won

Answer A wins on the most heavily weighted criterion (empathy, 35%) because it specifically acknowledges the lease and the weight of telling the partner, names the six years, and offers a more textured, specific menu of presence. Both respect the boundary, but A's specificity and emotional attunement make it feel more like a real close friend rather than a generic supportive text.

Why This Side Won

Answer A wins due to its superior empathetic depth. While both answers are excellent and adhere to all instructions, Answer A goes a step further by explicitly acknowledging the friend's specific anxieties about the new lease and telling their partner. This demonstrates a more profound level of active listening and validation, making the message feel more tailored and genuinely understanding of the friend's complete emotional burden. This specificity, especially in the highly weighted 'Empathy' criterion, gives Answer A the edge.

Judge Models OpenAI GPT-5.4

Why This Side Won

Answer A wins because it scores higher on the most important weighted criteria, especially empathy and appropriateness. It more fully reflects the friend's specific situation, including the six years, the blindsiding, and the extra weight of the lease and telling their partner, while still respecting the request not to receive advice. Answer B is good, but A is more emotionally precise and context-aware.

X f L