Orivel Orivel
Open menu

Stand-up Routine for a Tech Conference

Compare model answers for this Humor benchmark and review scores, judging comments, and related examples.

Login or register to use likes and favorites. Register

X f L

Contents

Task Overview

Benchmark Genres

Humor

Task Creator Model

Answering Models

Judge Models

Task Prompt

Write a 2-minute stand-up comedy routine for a comedian performing at a major tech conference. The audience consists primarily of software engineers and project managers. The routine should focus on the funny or absurd aspects of remote work and 'agile' development methodologies. The tone should be sarcastic and observational, but ultimately good-natured and safe for a corporate environment.

Task Context

The event is 'InnovateSphere 2024'. The comedian is the opening act on the final day, right before the main keynote. The goal is to warm up the crowd with relatable humor about their daily work lives. The jokes should be clever enough for a technical audience but avoid overly niche jargon. The routine should be structured as a monologue script.

Judging Policy

A high-quality answer will present a coherent and well-structured comedy routine with clear setups and punchlines. The humor should be original, avoiding common tech clichés. The content must be highly relevant and relatable to an audience of software engineers, demonstrating an understanding of their work culture (e.g., daily stand-ups, Slack notifications, code reviews, 'sprints'). The tone must be appropriate for a corporate setting—funny and sarcastic without being offensive or unprofessional.

Answers

Good morning, InnovateSphere 2024! Final day! Give yourselves a hand for surviving three days of keynotes, breakout sessions, and pretending you understood the blockchain panel. I know it’s early, but don’t worry — I’ll keep this tight. I’m basically the human loading spinner before the keynote. Now, this room is mostly software engineers and project managers, which means half of you are thinking, “Please be funny,” and the other half are wondering if this routine has acceptance criteria. Remote work has changed everything. It used to be, if you were late to a meeting, people assumed traffic. Now they assume y...

Show Full Answer

Good morning, InnovateSphere 2024! Final day! Give yourselves a hand for surviving three days of keynotes, breakout sessions, and pretending you understood the blockchain panel. I know it’s early, but don’t worry — I’ll keep this tight. I’m basically the human loading spinner before the keynote. Now, this room is mostly software engineers and project managers, which means half of you are thinking, “Please be funny,” and the other half are wondering if this routine has acceptance criteria. Remote work has changed everything. It used to be, if you were late to a meeting, people assumed traffic. Now they assume you’re trapped behind a Zoom update that says, “Installing... 99% complete,” which is technology’s way of saying, “I have no idea what time is.” And every remote meeting starts the same way: “Can you hear me?” That’s not a technical question anymore — that’s the modern workplace prayer. Then there’s the camera dilemma. Camera on means, “I’m engaged.” Camera off means, “I’m still engaged, but spiritually, I am laundry.” We’ve all seen that one coworker with the perfect home office: plants, lighting, acoustic panels, standing desk. Meanwhile, I’m on a folding chair in a corner of the bedroom, trying to look strategic while my neighbor’s leaf blower is contributing to the roadmap. And agile — I love agile. Agile is incredible because it took the simple idea of “let’s communicate better” and turned it into 14 ceremonies, 6 boards, 3 dashboards, and one person named Brian asking, “Can we take that offline?” Brian, everything is offline. We work remote. The daily stand-up is my favorite. It’s called a stand-up, but everyone is sitting, muted, and emotionally horizontal. You go around the room: “Yesterday I worked on the thing. Today I’ll continue working on the thing. Blockers? The thing.” Project managers are amazing, though. Truly. They can say, “We’re slightly behind schedule” with the same calm tone a pilot uses for, “We’ve lost an engine, but the snacks are still complimentary.” Engineers have their own language too. If an engineer says, “That’s interesting,” it means, “This is broken in a way that offends me personally.” If they say, “Should be a quick fix,” that means anywhere from 10 minutes to the heat death of the universe. And sprint planning — beautiful ritual. Everyone gathers to estimate work using story points, which are like currency in a country no one can locate. “Is this a 3 or a 5?” I don’t know, Karen, is it Tuesday? Are we emotionally available? Has the API team responded? But despite all the absurdity, it works somehow. Code ships. Products launch. Managers align stakeholders. Engineers heroically fix bugs they definitely did not create. So enjoy the final day, InnovateSphere. May your Wi-Fi be stable, your backlog be groomed, your meetings have agendas, and may every “quick sync” actually be quick. You’ve been great — now please welcome the keynote before someone schedules a retrospective on this set.

Result

#2

Winning Votes

1 / 3

Average Score

82
Judge Models OpenAI GPT-5.4

Total Score

77

Overall Comments

Answer A is a solid, polished corporate-friendly routine with strong relevance to remote work and agile culture. It has a clear monologue structure, consistent tone, and several effective lines such as the loading spinner opener, emotionally horizontal stand-up joke, and story points bit. Its main weakness is that much of the material feels familiar and safe rather than especially fresh, and some punchlines are more clever observations than big laughs.

View Score Details

Humor Effectiveness

Weight 35%
74

Consistently amusing with multiple solid lines, especially the loading spinner, modern workplace prayer, emotionally horizontal, and heat death of the universe jokes. However, several bits are more mildly witty than laugh-out-loud, so the set entertains more than it surprises.

Originality

Weight 25%
67

The material is relevant and polished, but many observations are familiar corporate-tech comedy territory: Zoom issues, camera on/off, Jira boards, story points, and quick fixes. There are a few fresh phrasings, yet the overall premise set feels fairly standard.

Coherence

Weight 15%
83

The routine flows smoothly from conference opener to remote work, agile, engineers versus project managers, and a clean closing callback to retrospectives. Transitions are natural and the set feels complete.

Instruction Following

Weight 10%
90

It closely follows the brief: a 2-minute monologue for InnovateSphere 2024, aimed at engineers and project managers, focused on remote work and agile, with sarcastic but safe corporate humor. It avoids overly niche jargon while still feeling technical.

Clarity

Weight 15%
86

Very clear wording and straightforward setups make the routine easy to follow, with concise lines and minimal clutter. The script reads naturally and cleanly as a spoken monologue.

Total Score

87

Overall Comments

Answer A is a very well-written and witty monologue. It contains several highly original and clever jokes, such as describing story points as "currency in a country no one can locate" and stand-ups as "emotionally horizontal." The routine is coherent and perfectly tailored to the target audience. Its main weakness is that it reads more like a humorous essay than a performable stand-up script, lacking the clear pacing and punchline structure that would elevate it as a performance piece.

View Score Details

Humor Effectiveness

Weight 35%
85

The routine is very funny with several strong, relatable jokes. Lines like "spiritually, I am laundry" and the analogy for project managers are great. However, the overall rhythm feels more like a continuous humorous speech than a stand-up routine with distinct setup-punchline structures.

Originality

Weight 25%
88

The answer demonstrates high originality with several unique phrases and analogies. The description of story points as "currency in a country no one can locate" is brilliant and fresh. It also cleverly avoids many of the most overused tech clichés.

Coherence

Weight 15%
85

The routine is well-structured and flows logically from one topic to the next, starting with remote work and moving smoothly into agile methodologies. The transitions are natural, creating a cohesive monologue.

Instruction Following

Weight 10%
100

The answer perfectly follows all instructions. It provides a 2-minute monologue for the specified audience and event, covers the required topics of remote work and agile, and maintains the requested sarcastic yet corporate-safe tone.

Clarity

Weight 15%
85

The language is clear and the jokes are easy for the target audience to understand. The script is well-written and easy to read as a piece of text.

Total Score

80

Overall Comments

Answer A delivers a tight, polished routine with a high density of original, well-crafted punchlines (e.g., 'emotionally horizontal,' 'currency in a country no one can locate,' the pilot analogy for project managers). Setups are economical and callbacks land. The voice is consistently sarcastic yet good-natured, and the closing benediction ('may your Wi-Fi be stable, your backlog be groomed') is a strong sign-off. Jargon is balanced—technical enough to feel insider but accessible. Minor weakness: a couple of beats (camera on/off, 'Can you hear me?') are familiar tropes, though freshly phrased.

View Score Details

Humor Effectiveness

Weight 35%
82

High joke density with multiple genuinely funny, well-constructed punchlines (pilot analogy, 'emotionally horizontal,' Brian/offline callback, story points as foreign currency). Rhythm is tight and lines build.

Originality

Weight 25%
78

Fresh phrasings and unexpected metaphors dominate ('human loading spinner,' 'modern workplace prayer,' 'spiritually, I am laundry,' leaf blower 'contributing to the roadmap'). Even familiar topics get novel angles.

Coherence

Weight 15%
80

Clear arc: opener → remote work → agile ceremonies → standups → PM/engineer language → sprint planning → warm close. Transitions are smooth and the ending callbacks tie it together.

Instruction Following

Weight 10%
80

Hits all requirements: ~2-minute length, monologue, sarcastic-but-safe tone, references InnovateSphere 2024, opening-act framing, closes by handing off to keynote. Audience-appropriate jargon level.

Clarity

Weight 15%
80

Clean prose, easy to read aloud, punchlines clearly delineated. No confusing references; jargon is explained through the joke itself.

Good morning, InnovateSphere 2024! Wow, look at this crowd. So many brilliant minds in one room — and yet, I guarantee at least half of you are also in a Slack channel right now called "urgent" that hasn't been urgent since 2021. [pause for laughter] It's great to be here on the final day. The final day of a tech conference is a special thing. It's like the last sprint of a two-week sprint — everyone's exhausted, half the tickets are still open, and someone in leadership just added three new "high priority" items at 4:45 on Friday. [pause] Speaking of sprints — can we talk about Agile for a second? Because I...

Show Full Answer

Good morning, InnovateSphere 2024! Wow, look at this crowd. So many brilliant minds in one room — and yet, I guarantee at least half of you are also in a Slack channel right now called "urgent" that hasn't been urgent since 2021. [pause for laughter] It's great to be here on the final day. The final day of a tech conference is a special thing. It's like the last sprint of a two-week sprint — everyone's exhausted, half the tickets are still open, and someone in leadership just added three new "high priority" items at 4:45 on Friday. [pause] Speaking of sprints — can we talk about Agile for a second? Because I love Agile. I love the *idea* of Agile. The manifesto says, "Individuals and interactions over processes and tools." Beautiful. Poetry. And then we spend forty-five minutes in a Jira board arguing about whether a bug fix is a story point two or a story point three. Schrödinger's ticket — it's both done and not done until the product owner opens the sprint review. [pause] And the daily standup! Fifteen minutes. That's the rule. Fifteen minutes. I have been in standups that outlasted relationships. I once attended a standup where someone said, "I'm blocked," and we spent the next hour unblocking them in real time. That's not a standup. That's a hostage negotiation. [pause] But honestly, nothing — *nothing* — has changed our work lives more than remote work. Remote work is incredible. I wake up, I roll out of bed, I open my laptop, and I am immediately, professionally, a disaster. My background is a bookshelf I specifically bought to look smart on camera. I have read zero of those books. One of them is upside down. Nobody has said anything. That's the real agile — adapting to your environment. [pause] The best part of remote work is the camera etiquette. We've all developed this unspoken rule: camera on means you're engaged, camera off means you're either eating, walking your dog, or quietly having an existential crisis. And we respect that. We respect the camera-off. It's a safe space. [pause] My favorite remote work moment is the "Can everyone hear me?" opener. Every. Single. Meeting. We have been doing video calls for years. We have sent rockets to space. We have trained AI to write poetry. And yet, every Monday at nine a.m., someone with a PhD is going, "Hello? Am I coming through? I think I'm on mute." You ARE on mute, Gerald. You have ALWAYS been on mute. [pause] And then there's the hybrid model — the compromise nobody asked for. You're in the office, but half the team is remote, so you're all on a video call... from the same building. You're sitting in a glass conference room, waving at your colleague on a screen, who is sitting at home in pajamas, and you think: one of us made the right choice today, and it wasn't me. [pause] But look — at the end of the day, we're all here because we love what we do. We love building things, solving problems, and occasionally deploying to production on a Friday afternoon and then immediately regretting every decision we've ever made. [pause for laughter] You are the people who keep the internet running, the apps working, and the world moving forward — one story point at a time. Give yourselves a hand. You've earned it. Now let's get to that keynote — I hear it's going to be *transformative*. And if it runs long, don't worry. We'll just move it to the next sprint. Thank you, InnovateSphere! You've been amazing!

Result

#1 | Winner

Winning Votes

2 / 3

Average Score

82
Judge Models OpenAI GPT-5.4

Total Score

83

Overall Comments

Answer B delivers a well-structured stand-up set with stronger comedic escalation and more memorable punchlines. It stays highly relevant to the audience, uses agile and remote-work references naturally, and maintains a sarcastic but good-natured tone suitable for a tech conference. It is somewhat more conventional in structure due to repeated pause tags, but the jokes land more sharply and feel more dynamic overall.

View Score Details

Humor Effectiveness

Weight 35%
84

Stronger laugh density and sharper punchlines throughout, including urgent Slack, Schrödinger's ticket, hostage negotiation, bookshelf prop, and hybrid-office joke. The routine builds momentum well and delivers more distinct comedic peaks.

Originality

Weight 25%
76

While still grounded in recognizable tech-work tropes, the answer adds more inventive framing such as Schrödinger's ticket, hostage negotiation, the decorative unread bookshelf, and the hybrid model comparison. These give the set a somewhat fresher identity.

Coherence

Weight 15%
85

Very coherent progression from conference fatigue to agile, standups, remote work, hybrid work, and a closing transition into the keynote. The pacing and sequencing support the comedy effectively, with each section connecting clearly.

Instruction Following

Weight 10%
92

It fulfills the prompt extremely well: monologue script format, correct event framing, strong focus on remote work and agile, audience-specific references, and a polished corporate-safe tone. The material is highly on-brief and conference-ready.

Clarity

Weight 15%
83

Clear and easy to follow, with good spoken delivery cues and accessible references. The repeated pause markers slightly interrupt reading flow compared with A, but overall the script remains very understandable and performance-ready.

Total Score

89

Overall Comments

Answer B delivers an exceptional stand-up routine that is both hilarious and perfectly structured for performance. The inclusion of pauses and stage directions makes the rhythm and delivery crystal clear. The jokes are highly relatable and effective, with standout bits about stand-ups becoming "hostage negotiations" and the absurdity of the hybrid work model. While it uses a few more common tech humor tropes than Answer A, its overall execution as a comedy script is superior.

View Score Details

Humor Effectiveness

Weight 35%
90

This routine is exceptionally effective. The jokes are punchy and well-structured, and the inclusion of pauses demonstrates a strong understanding of comedic timing. The 'hostage negotiation' standup and the hybrid work jokes are particularly hilarious and land perfectly for the audience.

Originality

Weight 25%
82

The routine contains strong original material, such as the 'Schrödinger's ticket' and 'hostage negotiation' concepts. However, it also relies on some more common tech humor tropes, like the 'you're on mute' joke and the curated Zoom bookshelf, which slightly reduces its overall originality score compared to A.

Coherence

Weight 15%
90

The routine is highly coherent, with excellent flow and transitions between different bits. The structure feels very deliberate and polished, guiding the audience through the topics in a way that builds momentum, making it feel like a professionally crafted set.

Instruction Following

Weight 10%
100

The answer flawlessly adheres to every aspect of the prompt. The length, audience targeting, topics, tone, and format are all exactly as requested. The inclusion of stage directions is a nice touch that fits the 'dialogue' format well.

Clarity

Weight 15%
92

The routine is exceptionally clear, not just in its language but in its intended delivery. The explicit inclusion of `[pause]` and `[pause for laughter]` makes the comedic timing and rhythm unambiguous, which is a major asset for a performance script.

Total Score

73

Overall Comments

Answer B is competent and readable, with stage directions ([pause for laughter]) that emphasize its performative nature. It has some strong lines ('Schrödinger's ticket,' 'hostage negotiation,' the upside-down bookshelf book). However, several bits lean on well-worn observations (mute jokes, Friday deploys, hybrid meetings from the same building) without fresh twists. The 'Gerald' mute bit is a familiar trope. Pacing is good but the routine is slightly longer and looser than A, and the stage directions, while helpful, clutter the read.

View Score Details

Humor Effectiveness

Weight 35%
74

Several strong jokes (Schrödinger's ticket, hostage negotiation, upside-down book) but interspersed with more predictable beats (mute jokes, Friday deploys). Funny but less consistently sharp.

Originality

Weight 25%
65

Some original moments (bookshelf bit, Schrödinger's ticket) but leans on common tech-comedy tropes—mute jokes, hybrid meetings, Friday deploys—without strongly subverting them.

Coherence

Weight 15%
75

Logical flow from agile to standups to remote to hybrid to close, but slightly more episodic. Stage directions help pacing but the segments feel more like discrete bits than a building arc.

Instruction Following

Weight 10%
80

Also hits all requirements clearly—names the conference, frames as opening before keynote, sarcastic and safe, monologue format with explicit pause cues. Slightly longer than 2 minutes when read aloud.

Clarity

Weight 15%
78

Clear and readable, with stage directions aiding delivery. A few lines run long, and the bracketed [pause] cues, while useful, slightly interrupt the read-through flow.

Comparison Summary

Final rank order is determined by judge-wise rank aggregation (average rank + Borda tie-break). Average score is shown for reference.

Judges: 3

Winning Votes

1 / 3

Average Score

82
View this answer

Winning Votes

2 / 3

Average Score

82
View this answer

Judging Results

Why This Side Won

Answer A wins on the most heavily weighted criteria: humor effectiveness and originality. Its punchlines are denser, more surprising, and more linguistically inventive (e.g., 'emotionally horizontal,' the pilot/snacks analogy, 'has the API team responded?'). A also maintains tighter coherence and a more distinctive comedic voice while still hitting all the required tech touchpoints. B is solid and well-structured but relies more on familiar tropes, which costs it on originality and humor effectiveness—the two highest-weighted criteria.

Why This Side Won

Answer B wins because it is more effective as a stand-up comedy routine. While both answers are very funny and well-written, Answer B's structure, pacing (indicated by pauses), and punchline delivery are better suited for a live performance. It excels in the most heavily weighted criterion, Humor Effectiveness, by creating a script that is not just funny to read but is clearly designed to be performed and to land jokes with an audience. Its slight edge in coherence and clarity as a script also contributes to its win.

Judge Models OpenAI GPT-5.4

Why This Side Won

Answer B wins because it scores higher on the most heavily weighted criteria, especially humor effectiveness and originality, while also remaining strong in coherence, clarity, and instruction following. Both answers fit the brief well, but B has more standout punchlines, better comedic rhythm, and slightly fresher observations, giving it the higher weighted overall result.

X f L