Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Planning

OpenAI GPT-5.5 VS Google Gemini 2.5 Pro

72-Hour Product Launch Recovery Plan

You are the interim project lead for a mid-sized SaaS company. Your team was scheduled to launch a major new feature ("Smart Reports") to all paying customers in 72 hours (Friday 5:00 PM, in your timezone). It is now Tuesday 5:00 PM. This morning, the following problems surfaced simultaneously: 1. QA discovered a critical bug: under specific timezone settings, exported PDF reports show incorrect totals (off by up to 8%). Reproduction is reliable; root cause is suspected but not confirmed. 2. The lead backend engineer (the only person who knows the reporting service deeply) is out sick and unreachable until Thursday morning at the earliest. 3. Marketing has already sent a teaser email to 40,000 customers promising Friday availability, and a press embargo lifts Friday at 9:00 AM. 4. Customer Support has flagged that 3 enterprise customers (combined ARR ~$600k) explicitly requested this feature in their renewal conversations and expect it on Friday. 5. Your CEO wants the launch to proceed but says "do not ship something embarrassing." Available resources: 2 backend engineers (mid-level, unfamiliar with reporting service), 1 senior frontend engineer, 1 QA engineer, 1 technical writer, 1 product manager (you), access to a feature-flag system, a staging environment, and Customer Support staff. Produce a concrete, sequenced 72-hour action plan that gets to the best feasible outcome by Friday 5:00 PM. Your plan must include: - A timeline broken into clear time blocks (with approximate clock times across Tue evening, Wed, Thu, Fri). - Specific owners for each action (by role). - Decision points / go-no-go gates with explicit criteria. - A prioritized risk register (top 4–6 risks) with mitigations and contingencies. - A communication plan covering the CEO, the 3 enterprise customers, the broader 40k email list, and internal staff — including what to say if you must delay or do a partial launch. - A clearly stated recommendation: full launch, partial/gated launch, or delayed launch, with justification tied to your constraints. Keep the plan realistic and actionable. Avoid generic advice; tie every action to the constraints above.

80
May 9, 2026 09:41

Summarization

Anthropic Claude Opus 4.7 VS Google Gemini 2.5 Pro

Summarize a City Council Hearing on a Heat Resilience Plan

Read the following source passage and write a concise summary of it in 180 to 230 words. Your summary must be neutral in tone, written as a single coherent essay, and understandable to a reader who has not seen the original. Preserve the main proposal, the reasons supporters give for it, the main criticisms or concerns, the funding and implementation details, the timeline, and the final outcome of the hearing. Do not include direct quotations. Do not add facts not present in the passage. Source passage: The Riverton City Council’s public hearing on Tuesday evening, which ran nearly three hours and drew residents, business owners, school staff, and health workers, focused on a proposed Heat Resilience Plan after two unusually hot summers strained the city’s power grid and sent emergency room visits upward. The plan was introduced by the mayor’s office and the Department of Public Health, but several agencies would share responsibility if it is adopted. Riverton, a city of about 420,000 people, has older neighborhoods with limited tree cover, many apartment blocks built before modern insulation standards, and a downtown commercial district where asphalt and concrete intensify heat. City staff opened the hearing by presenting maps showing that average surface temperatures in some low-income neighborhoods were regularly 6 to 8 degrees Celsius higher than in the city’s parks and wealthier, leafier districts. They argued that heat was no longer only a weather issue but also an infrastructure, housing, labor, and public health issue. Under the proposal, the city would convert twelve public buildings into designated cooling centers open during heat emergencies, including libraries, recreation centers, and two school gymnasiums. These sites would have backup generators, water stations, cots for overnight use if necessary, and multilingual signage. The plan also calls for planting 18,000 street trees over five years, prioritizing blocks with low canopy coverage and high rates of heat-related illness. Building rules would be updated so that new large developments must include reflective roofing or equivalent cooling measures, and landlords of large rental complexes would be required to maintain common-area cooling during officially declared heat events. A pilot grant program would help small businesses install shade structures or energy-efficient cooling equipment, and the city transit authority would add shaded seating at 150 bus stops. Public health officials said the different pieces were designed to work together rather than as isolated fixes. Supporters of the plan emphasized that the burden of extreme heat is uneven. A physician from Riverton General Hospital testified that older adults, outdoor workers, infants, people with heart or lung disease, and residents without reliable air conditioning face the highest risks. She said emergency departments saw a 23 percent increase in heat-related visits during last July’s ten-day heat wave compared with the same period three years earlier. A union representative for sanitation and road crews argued that municipal workers had already experienced more frequent cases of dizziness, dehydration, and missed shifts, and he supported requirements for shaded rest areas and revised summer work schedules, though those labor protections would be negotiated separately. Several residents from the South Ward said they wanted the council to treat tree planting and cooling access as basic services, not optional environmental projects, because their neighborhoods had fewer parks, more blacktop, and higher utility burdens. School leaders also broadly supported the measure, though they focused on children and scheduling disruptions. A principal from East Riverton Middle School said classrooms on the top floor became difficult to use during hot spells, and after-school programs were sometimes canceled because indoor temperatures stayed too high into the evening. The school district had initially worried that opening gymnasiums as cooling centers could interfere with summer maintenance and youth programs, but district staff said they had worked out a shared-use calendar with the city. A nonprofit director who runs meal and tutoring programs said that when heat forces cancellations, families lose not only enrichment activities but also dependable snacks and safe indoor space. She urged the council to include outreach funding so parents know when cooling centers are open and how transportation assistance would work. The strongest criticism came from property owners and some fiscal conservatives, who said the plan combined too many goals and moved too quickly. A representative of the Riverton Apartment Association objected to the proposed rule requiring common-area cooling in large rental complexes during declared heat events, saying older buildings were not designed for that load and that retrofit costs would eventually be passed to tenants. He asked for tax credits or a longer phase-in period. A downtown merchants’ group supported shade and bus stop improvements but warned that stricter roofing rules for new projects could raise construction costs at a time when commercial vacancies were already high. Two council members who were not opposed in principle questioned whether the city had reliable estimates for ongoing maintenance, especially watering young trees and staffing cooling centers overnight. They asked whether the city risked announcing highly visible programs that would later be underfunded. Budget staff responded with a preliminary five-year cost estimate of 48 million dollars. About 19 million would go to tree planting and maintenance, 11 million to cooling center upgrades and backup power, 7 million to transit shade installations, 5 million to the small-business grant program, and the remainder to outreach, data monitoring, and administrative staffing. The finance director said the city expected to cover 20 million through a state climate adaptation grant it had not yet formally received, 12 million through a municipal bond package that would need separate council approval, and 8 million by reallocating capital funds from several delayed streetscape projects. The remaining gap, roughly 8 million dollars, would need to be closed through either philanthropy, utility partnerships, or reductions in program scale. This answer satisfied some audience members but not skeptics, who noted that the funding stack depended on multiple uncertain sources. Questions about implementation took up much of the second half of the hearing. Residents asked how the city would decide when to open cooling centers and whether people without identification, permanent addresses, or immigration documents could use them. The health commissioner said centers would open when forecast thresholds combined temperature and humidity over consecutive days, and no identification would be required for entry. She added that outreach teams would coordinate with shelters, senior housing sites, and neighborhood groups. Several speakers raised accessibility concerns for people with disabilities, and transit officials said site selection would consider wheelchair access and bus frequency. Environmental advocates urged the city to avoid planting large numbers of trees without long-term care plans, recalling a previous beautification effort in which many saplings died within two years. In response, the parks department said the new proposal included maintenance contracts, species diversity targets, and public reporting on survival rates. By the final hour, the hearing turned from whether heat posed a serious problem to what kind of plan Riverton could realistically sustain. The council president noted that almost no speaker disputed the need for action, but many disagreed on mandates, timing, and financing. After brief closing remarks, the council voted 5 to 2 not to adopt the plan immediately. Instead, it advanced a revised motion directing staff to return within sixty days with a narrower first-phase package. That package is supposed to include the cooling centers, bus stop shade at the highest-ridership locations, a detailed tree maintenance strategy, and funding options ranked by certainty. The proposed landlord requirement and the roofing standard were sent to committee for further study, with council members requesting legal analysis, cost scenarios, and consultation with tenant groups and developers. The mayor, while visibly disappointed that the full plan was delayed, said the vote still created a path toward action before the next summer season. Outside city hall after the hearing, reactions were mixed but not entirely polarized. Some advocates said the partial step was frustrating because every summer of delay would expose vulnerable residents to preventable risk. Others said a phased approach might ultimately protect the plan from backlash if early measures were clearly funded and competently managed. Local media coverage the next morning described the result as neither a defeat nor a victory but a test of whether Riverton’s leaders could turn broad agreement about a climate threat into durable policy. Editorials split along familiar lines: one praised the council for demanding realistic budgeting, while another argued that caution is often most expensive for the people least able to avoid harm. Even so, most observers agreed that heat resilience, once a niche issue in city politics, had become a central question of governance in Riverton.

230
Apr 20, 2026 09:45

Analysis

Anthropic Claude Opus 4.7 VS Google Gemini 2.5 Pro

Choose the Best Transit Upgrade for a Growing City

A city has a budget to fund only one transportation project this year. Analyze the options below and recommend which single project the city should choose. Your answer should compare the trade-offs, identify the strongest and weakest evidence for each option, and reach a clear conclusion. City facts: - Population: 600,000 - Current problems: traffic congestion during rush hour, unreliable bus arrival times, and rising transportation emissions - Budget available this year: up to $120 million - The city wants a project that shows noticeable benefits within 3 years Option A: Bus Rapid Transit corridor - Cost: $95 million - Construction time: 2 years - Expected daily riders added or shifted from cars: 38,000 - Estimated commute time improvement on corridor: 18% - Emissions impact: moderate reduction - Risk: requires taking one car lane away on two major roads, which may face political resistance Option B: Light rail extension - Cost: $120 million - Construction time: 5 years - Expected daily riders added or shifted from cars: 52,000 - Estimated commute time improvement on served corridor: 25% - Emissions impact: strong reduction - Risk: higher construction disruption and no major benefits visible within the first 3 years Option C: Smart traffic signals plus bus-priority system - Cost: $45 million - Construction time: 1 year - Expected daily riders added or shifted from cars: 15,000 - Estimated citywide bus reliability improvement: 22% - Emissions impact: small-to-moderate reduction - Risk: benefits may be spread out and less visible to the public than a new line or corridor Option D: Protected bike lane network expansion - Cost: $70 million - Construction time: 2 years - Expected daily riders added or shifted from cars: 20,000 - Estimated health and safety benefit: high - Emissions impact: moderate reduction - Risk: usage may vary by season and some neighborhoods argue the plan is unevenly distributed Write an analysis that recommends one option. You should consider at least these criteria: budget fit, speed of benefits, likely impact, implementation risk, and alignment with the city's stated goals. If you make assumptions, state them clearly.

235
Apr 18, 2026 13:39

Counseling

OpenAI GPT-5.4 VS Google Gemini 2.5 Pro

Supporting a Sibling Who Feels Overshadowed by a High-Achieving Family Member

Your younger brother (age 25) has confided in you that he feels constantly compared to your older sister, who recently got promoted to a senior role at a prestigious company. He says things like "I'll never measure up" and "Mom and Dad only talk about her achievements." He seems discouraged but is otherwise functioning well — going to work, maintaining friendships, and pursuing hobbies. He is not in crisis and has not expressed any thoughts of self-harm; he is simply feeling demoralized and overlooked. Write a thoughtful, supportive response as if you were speaking directly to your brother. Your response should: 1. Acknowledge and validate his feelings without dismissing them. 2. Help him reframe the situation in a constructive way without toxic positivity or minimizing his experience. 3. Offer at least two concrete, actionable suggestions he could try to feel more confident in his own path. 4. Gently address the family dynamic (parental comparisons) and suggest a way he might communicate his feelings to your parents. 5. Include appropriate boundaries for your advice — acknowledge what you can and cannot help with, and mention when professional support (such as talking to a counselor) might be beneficial, without pathologizing his feelings. Aim for a warm, genuine tone that a real sibling would use — not overly clinical or scripted.

270
Mar 29, 2026 11:03

Showing 1 to 20 of 106 results

Related Links

X f L