Humor
Explore how AI models perform in Humor. Compare rankings, scoring criteria, and recent benchmark examples.
Genre overview
Compare comedic originality and how effectively AI models produce humor.
In this genre, the main abilities being tested are Humor Effectiveness, Originality, Coherence.
Unlike creative writing, this genre cares more specifically about whether the output actually lands as humor for the intended audience.
A high score here does not guarantee safe tone for sensitive situations, factual precision, or professional communication skill.
Strong models here are useful for
jokes, playful copy, light entertainment, and prompts where comic effect matters.
This genre alone cannot tell you
whether the model is best for serious guidance, careful support, or exact business communication.
Top Models in This Genre
This ranking is ordered by average score within this genre only.
Latest Updated: Mar 21, 2026 09:09
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
Win Rate
Average Score
| Ranked Models |
|
|
Detail | ||||
|---|---|---|---|---|---|---|---|
| #1 | GPT-5 mini | OpenAI |
100%
|
84
|
3 | 3 | View scores and evaluation for GPT-5 mini |
| #2 | GPT-5.2 | OpenAI |
80%
|
87
|
4 | 5 | View scores and evaluation for GPT-5.2 |
| #3 | Claude Opus 4.6 | Anthropic |
75%
|
86
|
3 | 4 | View scores and evaluation for Claude Opus 4.6 |
| #4 | GPT-5.4 | OpenAI |
75%
|
84
|
3 | 4 | View scores and evaluation for GPT-5.4 |
| #5 | Claude Haiku 4.5 | Anthropic |
67%
|
76
|
2 | 3 | View scores and evaluation for Claude Haiku 4.5 |
| #6 | Claude Sonnet 4.6 | Anthropic |
33%
|
83
|
1 | 3 | View scores and evaluation for Claude Sonnet 4.6 |
| #7 | Gemini 2.5 Pro |
0%
|
71
|
0 | 3 | View scores and evaluation for Gemini 2.5 Pro | |
| #8 | Gemini 2.5 Flash |
0%
|
70
|
0 | 3 | View scores and evaluation for Gemini 2.5 Flash | |
| #9 | Gemini 2.5 Flash-Lite |
0%
|
65
|
0 | 4 | View scores and evaluation for Gemini 2.5 Flash-Lite |
What Is Evaluated in Humor
Scoring criteria and weight used for this genre ranking.
Humor Effectiveness
35.0%
This criterion is included to check Humor Effectiveness in the answer. It carries heavier weight because this part strongly shapes the overall result in this genre.
Originality
25.0%
This criterion is included to check Originality in the answer. It has meaningful weight because it affects quality in a visible way, even if it is not the only thing that matters.
Coherence
15.0%
This criterion is included to check Coherence in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Clarity
15.0%
This criterion is included to check Clarity in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Instruction Following
10.0%
This criterion is included to check Instruction Following in the answer. It is weighted more lightly because it supports the main goal rather than defining the genre by itself.
Recent tasks
Humor
Clean Stand-Up Monologue for a Nervous Science Museum Opening
Write a clean, original stand-up monologue of 220 to 320 words for a host opening a new science museum exhibition about everyday household objects. The audience is mixed: children aged 10+, parents, teachers, and local donors. The speaker is a little nervous but trying to sound confident and charming. Required constraints: - Keep it suitable for a general family audience. - Use exactly 6 jokes or comedic beats. - At least 3 jokes must be about ordinary objects being treated as if they have dramatic secret lives. - Include 1 brief callback to an earlier joke near the end. - Mention all 5 of these objects naturally: toaster, umbrella, sock, vacuum cleaner, and refrigerator. - Avoid insults, politics, religion, dating humor, bathroom humor, and references to celebrities. - The monologue should feel like one continuous performance, not a list of unrelated one-liners. Aim for humor that works both for kids and adults, with clear setup and payoff.
Humor
Write a Funny Wedding Toast for Two Librarians
Write a humorous wedding toast of 250 to 350 words for a couple who are both librarians and are getting married in a small town public library after hours. The audience includes grandparents, coworkers, a few teenagers from the library’s book club, and one very serious city council member who approved the event permit. Keep the humor warm, clever, and family-friendly rather than edgy or insulting. Include at least three specific library-related jokes or playful references, but make sure the toast still sounds sincere and celebratory. Do not use clichés about marriage being terrible, do not mock reading itself, and do not rely on puns in every sentence. End with a short, heartfelt closing line suitable for raising a glass.
Humor
The Cynical Pilot's In-Flight Announcement
Write a short, humorous in-flight announcement from the perspective of a pilot who is completely fed up with their job. The announcement should be delivered over the plane's intercom. Your tone should be dry, sarcastic, and world-weary, but not genuinely alarming. Cover the usual topics like welcome, flight time, and weather, but infuse them with the pilot's cynical perspective on air travel.
Humor
Write a Comedic Dialogue Between a Time Traveler and a Medieval Peasant Trying to Explain Modern Technology
Write a comedic dialogue between a time traveler from the year 2024 who has accidentally landed in a medieval English village in the year 1320, and a local peasant named Aldric. The time traveler is desperately trying to explain what a smartphone is so that Aldric can help them find a power source to charge it. Requirements: - The dialogue should be between 400 and 600 words. - Aldric should consistently misinterpret modern concepts through a medieval worldview (for example, interpreting "the cloud" as actual clouds, or "apps" as some kind of food). - The time traveler should grow increasingly frustrated but remain polite. - Include at least three distinct modern technology concepts that Aldric hilariously misunderstands. - The dialogue should have a satisfying comedic ending or punchline. - The humor should be clever and character-driven, not relying on crude jokes or slurs. - Format the dialogue with character names followed by colons before each line of speech, with brief stage directions in parentheses where appropriate.
Humor
Write a Comedic Dialogue Between a Time Traveler and a Medieval Peasant Trying to Explain Modern Technology
Write a comedic dialogue between a time traveler from the year 2024 who has accidentally landed in a medieval English village in the year 1320, and a local peasant named Aldric. The time traveler is desperately trying to explain what a smartphone is so that Aldric can help them find a power source to charge it. The dialogue should be at least 20 exchanges long (10 per character minimum). Constraints and tone guidelines: - The humor should arise naturally from the cultural and technological misunderstanding between the two characters, not from mean-spirited mockery of either one. - Aldric should be portrayed as genuinely intelligent but working entirely within a medieval worldview (he might interpret things through religion, alchemy, farming, or feudal politics). - The time traveler should grow increasingly frustrated but remain fundamentally polite. - Include at least one moment where Aldric's medieval logic accidentally arrives at a surprisingly insightful or almost-correct conclusion about modern technology. - The dialogue should have a satisfying comedic ending or punchline. - Keep the tone suitable for a general audience (no profanity, slurs, or crude humor).
Humor
Historical Figures as Modern Roommates
Write a short, humorous dialogue between Marie Antoinette and a Spartan warrior who are roommates in a modern apartment. The topic of their argument is that the Spartan has used all the hot water for his 4 AM ice bath and cold shower routine, and now Marie Antoinette can't have her two-hour-long bubble bath.