Orivel Orivel
Open menu

Humor

Explore how AI models perform in Humor. Compare rankings, scoring criteria, and recent benchmark examples.

Genre overview

Compare comedic originality and how effectively AI models produce humor.

In this genre, the main abilities being tested are Humor Effectiveness, Originality, Coherence.

Unlike creative writing, this genre cares more specifically about whether the output actually lands as humor for the intended audience.

A high score here does not guarantee safe tone for sensitive situations, factual precision, or professional communication skill.

Strong models here are useful for

jokes, playful copy, light entertainment, and prompts where comic effect matters.

This genre alone cannot tell you

whether the model is best for serious guidance, careful support, or exact business communication.

Experimental

Top Models in This Genre

This ranking is ordered by average score within this genre only.

Latest Updated: May 10, 2026 09:38

#1
GPT-5 mini OpenAI

Win Rate

100%

Average Score

82
#2
GPT-5.2 OpenAI

Win Rate

83%

Average Score

87
#3
Claude Opus 4.6 Anthropic

Win Rate

75%

Average Score

86
#4
GPT-5.4 OpenAI

Win Rate

75%

Average Score

84
#5
Claude Haiku 4.5 Anthropic

Win Rate

67%

Average Score

76
#6
Claude Sonnet 4.6 Anthropic

Win Rate

50%

Average Score

82
#7
GPT-5.5 OpenAI

Win Rate

0%

Average Score

82
#8
Gemini 2.5 Pro Google

Win Rate

0%

Average Score

71
#9
Gemini 2.5 Flash Google

Win Rate

0%

Average Score

68
#10
Gemini 2.5 Flash-Lite Google

Win Rate

0%

Average Score

65

What Is Evaluated in Humor

Scoring criteria and weight used for this genre ranking.

Humor Effectiveness

35.0%

This criterion is included to check Humor Effectiveness in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.

Originality

25.0%

This criterion is included to check Originality in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.

Coherence

15.0%

This criterion is included to check Coherence in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Clarity

15.0%

This criterion is included to check Clarity in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Instruction Following

10.0%

This criterion is included to check Instruction Following in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.

Recent tasks

Humor

OpenAI GPT-5.5 VS Anthropic Claude Sonnet 4.6

Stand-up Routine for a Tech Conference

Write a 2-minute stand-up comedy routine for a comedian performing at a major tech conference. The audience consists primarily of software engineers and project managers. The routine should focus on the funny or absurd aspects of remote work and 'agile' development methodologies. The tone should be sarcastic and observational, but ultimately good-natured and safe for a corporate environment.

63
May 10, 2026 09:38

Humor

OpenAI GPT-5 mini VS Google Gemini 2.5 Flash

Write a Stand-Up Comedy Set About the Absurdities of Grocery Shopping

Write a short stand-up comedy set (approximately 400–600 words) performed by a fictional comedian at an open-mic night. The entire set should revolve around the everyday absurdities of grocery shopping — from navigating the aisles, to self-checkout machines, to the unspoken social rules among shoppers. Requirements: 1. The set must be written in first person as if spoken on stage, including natural pauses, crowd work cues, or callbacks that a real comedian might use. 2. The humor should be observational and relatable — no shock humor, crude language, or mean-spirited jokes targeting specific groups of people. 3. Include at least three distinct comedic bits (mini-topics) within the grocery shopping theme, with smooth transitions between them. 4. End the set with a strong closing joke or callback that ties back to something mentioned earlier in the set. 5. The tone should be suitable for a general adult audience (think a clean comedy club night).

221
Mar 31, 2026 09:37

Humor

Google Gemini 2.5 Flash VS OpenAI GPT-5.2

Corporate Jargon Roast: A Satirical Office Memo

Write a satirical internal company memo (approximately 300–500 words) from a fictional middle manager named "Derek from Synergy Solutions" announcing a new, absurdly unnecessary corporate policy. The memo should: 1. Be written in exaggerated corporate jargon and buzzwords (e.g., "synergize," "circle back," "leverage," "move the needle"). 2. Announce a policy that sounds important but is completely pointless or counterproductive when you think about it. 3. Maintain a deadpan, serious tone throughout — the humor should come from the contrast between the formal delivery and the ridiculous content. 4. Include at least one made-up acronym or initiative name that sounds plausible. 5. End with a signature block that adds one final comedic touch. The memo should be funny to anyone who has worked in a corporate office environment, but it must remain workplace-appropriate (no profanity, no targeting of protected groups, no mean-spirited content about real companies or individuals).

262
Mar 29, 2026 11:47

Humor

Anthropic Claude Haiku 4.5 VS Google Gemini 2.5 Flash-Lite

Clean Stand-Up Monologue for a Nervous Science Museum Opening

Write a clean, original stand-up monologue of 220 to 320 words for a host opening a new science museum exhibition about everyday household objects. The audience is mixed: children aged 10+, parents, teachers, and local donors. The speaker is a little nervous but trying to sound confident and charming. Required constraints: - Keep it suitable for a general family audience. - Use exactly 6 jokes or comedic beats. - At least 3 jokes must be about ordinary objects being treated as if they have dramatic secret lives. - Include 1 brief callback to an earlier joke near the end. - Mention all 5 of these objects naturally: toaster, umbrella, sock, vacuum cleaner, and refrigerator. - Avoid insults, politics, religion, dating humor, bathroom humor, and references to celebrities. - The monologue should feel like one continuous performance, not a list of unrelated one-liners. Aim for humor that works both for kids and adults, with clear setup and payoff.

257
Mar 21, 2026 09:09

Humor

Google Gemini 2.5 Pro VS Anthropic Claude Opus 4.6

Write a Funny Wedding Toast for Two Librarians

Write a humorous wedding toast of 250 to 350 words for a couple who are both librarians and are getting married in a small town public library after hours. The audience includes grandparents, coworkers, a few teenagers from the library’s book club, and one very serious city council member who approved the event permit. Keep the humor warm, clever, and family-friendly rather than edgy or insulting. Include at least three specific library-related jokes or playful references, but make sure the toast still sounds sincere and celebratory. Do not use clichés about marriage being terrible, do not mock reading itself, and do not rely on puns in every sentence. End with a short, heartfelt closing line suitable for raising a glass.

253
Mar 21, 2026 05:47

Humor

Anthropic Claude Opus 4.6 VS OpenAI GPT-5.4

The Cynical Pilot's In-Flight Announcement

Write a short, humorous in-flight announcement from the perspective of a pilot who is completely fed up with their job. The announcement should be delivered over the plane's intercom. Your tone should be dry, sarcastic, and world-weary, but not genuinely alarming. Cover the usual topics like welcome, flight time, and weather, but infuse them with the pilot's cynical perspective on air travel.

300
Mar 20, 2026 11:34

Related Links

X f L