Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Summarization

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash

Summarize a Policy Memo with Balanced Tradeoffs

Read the memo below and write a concise summary of 140 to 180 words for a city council member who has not read it. Your summary must cover the problem, the proposed pilot program, expected benefits, main risks or criticisms, and how success would be measured. Do not quote directly. Memo: Riverton's public buses have lost riders for six consecutive years, even though the city's population has grown. A transportation department review found several causes: routes are infrequent outside downtown, schedules are hard to understand, and buses are often delayed by traffic congestion. Low-income residents and older adults reported the greatest difficulty reaching jobs, clinics, and grocery stores without long waits or costly ride-hailing services. In response, staff propose a two-year "Frequent Corridors" pilot. Instead of spreading service thinly across the entire network, the city would increase weekday frequency to every 10 minutes on five major corridors from 6 a.m. to 9 p.m. Two underused neighborhood routes would be replaced by on-demand shuttles that riders could book by phone or app. The plan would also add larger bus-stop signs, simplified maps, and a real-time arrival display at the central transfer station. Supporters argue that riders value reliability and simplicity more than broad but infrequent coverage. They say concentrating resources on the busiest corridors could attract new riders, reduce missed transfers, and improve access to major employers and the community college. They also note that on-demand shuttles may serve low-density areas more efficiently than nearly empty fixed-route buses. Critics raise several concerns. Some disability advocates worry that app-based booking could disadvantage riders without smartphones, although the proposal includes phone reservations. Labor representatives warn that the shuttle service could be outsourced later, potentially affecting union jobs. Environmental groups support transit investment overall but question whether replacing fixed routes with smaller vehicles might reduce total passenger capacity. Some residents also fear that neighborhoods losing direct bus lines will feel abandoned, even if average wait times fall. The pilot is estimated to cost 8 million dollars over two years. Staff suggest funding it through a mix of state transit grants, parking revenue, and delaying a planned downtown streetscape project. They propose evaluating the pilot using ridership changes, average wait times, on-time performance, transfer success rates, customer satisfaction surveys, and access to essential destinations for low-income households. If the pilot fails to improve ridership and reliability within 18 months, staff recommend ending it early or redesigning it.

434
Mar 13, 2026 02:31

Business Writing

OpenAI GPT-5.2 VS Google Gemini 2.5 Flash

Restructuring a Poorly Written Business Email

Below is a poorly written email from a regional sales manager to the executive leadership team. The email attempts to propose a new quarterly incentive program for the sales team but suffers from numerous problems: unclear structure, informal tone, buried key information, missing call to action, and lack of supporting data presentation. Rewrite this email so that it is professional, well-structured, persuasive, and appropriate for an executive audience. Your rewritten version should preserve all the factual content from the original but present it effectively. Include a clear subject line. --- Original email: "hey team, so i was thinking about this for a while and talked to a few ppl on my team and basically we think we should do something about the incentive structure because honestly its not really working anymore. last quarter we only hit 78% of target which is like the worst in 3 years and i think part of the reason is ppl arent motivated enough. sarah and jake both told me they dont even look at the bonus structure anymore because its too complicated and the payouts are too small to matter. what i want to propose is we do a tiered system where if you hit 100% you get 5% bonus, 110% gets 8%, and 120%+ gets 12%. right now everyone just gets a flat 3% regardless which doesnt really push anyone. i ran some numbers and if we had this in place last quarter and it motivated even 30% of the team to hit 110%+ we would have actually exceeded our $4.2M target by about $380K which more than covers the extra bonus cost of roughly $95K. i also think we should do monthly recognition not just quarterly because people forget about targets when theyre 3 months away. maybe a leaderboard or something. anyway lmk what you think, would love to discuss at the next leadership meeting if possible. thanks mike"

452
Mar 9, 2026 16:19

Showing 101 to 119 of 119 results

Related Links

X f L