Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

System Design

OpenAI GPT-5.4 VS Google Gemini 2.5 Flash

Design a URL Shortening Service

Design a URL shortening service (similar to bit.ly or tinyurl.com) that must handle the following constraints: 1. The service must support 100 million new URL shortenings per month. 2. The read-to-write ratio is 100:1 (i.e., for every URL created, it is accessed 100 times on average). 3. Shortened URLs must remain accessible for at least 5 years. 4. The system must achieve 99.9% uptime. 5. Redirect latency (from receiving a short URL request to issuing the HTTP redirect) must be under 50ms at the 95th percentile. Your design should address all of the following areas: A. **Short URL Generation Strategy**: How will you generate unique, compact short codes? Discuss the encoding scheme, expected URL length, and how you handle collisions or exhaustion of the key space. B. **Data Storage**: What database(s) will you use and why? Estimate the total storage needed over 5 years. Explain your schema design and any partitioning or sharding strategy. C. **Read Path Architecture**: How will you serve redirect requests at scale to meet the latency and throughput requirements? Discuss caching layers, CDN usage, and any replication strategies. D. **Write Path Architecture**: How will you handle the ingestion of 100M new URLs per month reliably? Discuss any queuing, rate limiting, or consistency considerations. E. **Reliability and Fault Tolerance**: How does your system handle node failures, data center outages, or cache invalidation? What is your backup and recovery strategy? F. **Key Trade-offs**: Identify at least two significant trade-offs in your design (e.g., consistency vs. availability, storage cost vs. read performance, simplicity vs. scalability) and explain why you chose the side you did. Present your answer as a structured design document with clear sections corresponding to A through F above.

290
Mar 20, 2026 17:43

System Design

Google Gemini 2.5 Flash VS Anthropic Claude Sonnet 4.6

Design a Global URL Shortening Service

Design a public URL shortening service similar to Bitly. Users can submit a long URL and receive a short alias; visiting the short link should redirect quickly to the original URL. The system must support custom aliases, optional expiration dates, basic click analytics, and abuse mitigation for malicious links. Requirements and constraints: - Functional requirements: - Create short URLs for long URLs. - Redirect short URLs to original URLs. - Support custom aliases when available. - Support optional expiration time per link. - Record click events for analytics. - Allow users to disable a link manually. - Scale assumptions: - 120 million new short URLs per month. - 1.5 billion redirects per day. - Redirect traffic is globally distributed and read-heavy. - Analytics data should be queryable within 15 minutes. - Performance targets: - Redirect p95 latency under 80 ms for most regions. - Short-link creation p95 under 300 ms. - 99.99% availability for redirects. - Data and retention: - Links may live indefinitely unless expired or disabled. - Raw click events may be retained for 90 days; aggregated analytics for 2 years. - Operational constraints: - Use commodity cloud infrastructure; do not assume one exotic managed product solves everything. - Budget matters: justify any replication, caching, and storage choices. - Short codes should be compact and reasonably hard to guess at large scale, but perfect secrecy is not required. In your answer, provide: 1. A high-level architecture with major components and data flow. 2. Storage choices for link metadata, redirect path, and analytics events, with rationale. 3. A short-code generation strategy, including how to avoid collisions and handle custom aliases. 4. A scaling plan for global traffic, including caching, partitioning/sharding, and multi-region considerations. 5. A reliability plan covering failures, hot keys, disaster recovery, and degraded-mode behavior. 6. Key APIs and core data models. 7. Abuse mitigation and security considerations. 8. The main trade-offs you made and why.

264
Mar 20, 2026 11:03

Analysis

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Flash

Select the Most Promising School Lunch Reform

A public school district can fund only one lunch reform for the next two years. Analyze the options below and recommend which single option the district should choose. Your answer should compare the tradeoffs, address likely objections, and reach a clear conclusion. District goals: 1. Improve student nutrition 2. Increase the number of students actually eating school lunch 3. Keep implementation realistic within two years 4. Avoid large ongoing cost overruns Current situation: - 12,000 students across 18 schools - 46% of students currently choose school lunch - Surveys suggest students often skip lunch because of taste, long lines, or lack of appealing choices - The district can afford only one of the following options now Option A: Hire trained chefs to redesign menus - Upfront training and consulting cost: medium - Ongoing food cost: slightly higher - Expected effects: meals taste better, healthier recipes become more appealing, moderate increase in participation - Risks: benefits depend on staff adoption and recipe consistency across schools Option B: Add self-serve salad and fruit bars in every school - Upfront equipment cost: high - Ongoing food waste risk: high - Expected effects: strong nutrition improvement for students who use the bars, modest participation increase overall - Risks: staffing, sanitation, and uneven use by age group Option C: Launch a mobile pre-order system for lunches - Upfront technology and training cost: medium - Ongoing cost: low to medium - Expected effects: shorter lines, better forecasting, moderate participation increase, little direct nutrition improvement unless menus stay the same - Risks: unequal access for families with limited technology use, adoption challenges at first Option D: Replace sugary desserts and fried sides with healthier defaults - Upfront cost: low - Ongoing cost: neutral - Expected effects: direct nutrition improvement for all school lunch users, possible small drop in participation if students dislike changes - Risks: student backlash, perception that lunch became less enjoyable Write an analysis that identifies the best choice given the district goals and constraints. Do not invent new budget numbers or outside facts; reason only from the information provided.

240
Mar 19, 2026 21:45

Brainstorming

Google Gemini 2.5 Flash VS OpenAI GPT-5.4

Revenue Streams for a Small-Town Public Library Facing Budget Cuts

A small-town public library (serving a population of roughly 12,000) has just learned that its annual municipal funding will be cut by 30% starting next fiscal year. The library has the following assets and constraints: Assets: - A 6,000 sq ft building with a 200-person capacity community room - A small parking lot (20 spaces) - Two full-time librarians and three part-time staff - A collection of 40,000 physical books and a modest digital catalog - A makerspace with a 3D printer, laser cutter, and sewing machines - Reliable high-speed internet and 15 public-use computers - A small fenced garden area behind the building Constraints: - The library must remain free to enter and must continue lending books at no charge - It cannot sell alcohol or host gambling - Any new revenue activity must be legal in a typical U.S. municipality - Staff cannot increase; volunteers may be recruited - The library board will not approve anything that generates significant noise complaints from adjacent residential neighbors Brainstorm as many distinct, practical revenue-generating or cost-saving ideas as you can. For each idea, provide: 1. A short name 2. A one-to-two sentence description of how it works 3. Which library asset it leverages Aim for breadth across different categories (e.g., events, partnerships, services, space rental, grants, merchandising, digital, etc.).

290
Mar 19, 2026 19:59

System Design

Google Gemini 2.5 Flash VS Anthropic Claude Haiku 4.5

Design a Global URL Shortening Service

Design a globally available URL shortening service similar to Bitly. The service must let users create short links that redirect to long URLs, support custom aliases for paid users, track click analytics, and allow links to expire at a specified time. Requirements: - Handle 120 million new short links per day. - Handle 4 billion redirects per day. - Peak traffic can reach 3 times the daily average. - Redirect latency target: p95 under 80 ms for users in North America, Europe, and Asia. - Short-link creation latency target: p95 under 300 ms. - Service availability target: 99.99% for redirects. - Analytics data can be eventually consistent within 5 minutes. - Custom aliases must be unique globally. - Expired or deleted links must stop redirecting quickly. - The system should tolerate regional failures without total service outage. Assumptions you may use: - Average long URL length is 500 bytes. - Analytics events include timestamp, link ID, country, device type, and referrer domain. - Read traffic is much higher than write traffic. - You may choose SQL, NoSQL, cache, stream, CDN, and messaging technologies as needed, but justify them. In your answer, provide: 1. A high-level architecture with main components and request flows. 2. Data model and storage choices for links, aliases, and analytics. 3. A scaling strategy for read-heavy traffic, including caching and regional routing. 4. A reliability strategy covering failover, consistency decisions, and handling regional outages. 5. Key trade-offs, bottlenecks, and at least three risks with mitigations. 6. A brief capacity estimate for storage and throughput using the numbers above.

319
Mar 19, 2026 18:51

Summarization

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash

Summarize a City Council Hearing on Flood Resilience

Read the source passage below and write a concise summary for a busy mayor who did not attend the hearing. Your summary must: - be 220 to 280 words long - be written in clear prose, not bullet points - accurately capture the main problem, the major proposals, the biggest disagreements, and the most important evidence or examples mentioned - include the timeline pressures and funding constraints - mention at least four distinct stakeholder perspectives - remain neutral in tone and avoid adding facts not stated in the passage - not use direct quotations Source passage: The Riverton City Council held a three-hour public hearing on Tuesday night to decide whether to move forward with the first phase of a flood-resilience program for the Harbor District, a low-lying waterfront area that has seen repeated street flooding during heavy rain and seasonal high tides. City engineers opened the meeting with maps showing that nuisance flooding days have increased from about four per year a decade ago to thirteen last year, and they warned that a storm comparable to the one that hit neighboring Bay County in 2021 would likely shut down the district’s main bus corridor, damage electrical equipment in several apartment basements, and temporarily isolate the public health clinic. They said the district’s vulnerability comes from a combination of aging storm drains, land subsidence measured at roughly three millimeters per year, and a seawall built in the 1970s that was never designed for current peak water levels. The Public Works Department presented a draft first-phase plan with three linked components. The largest item, estimated at 24 million dollars, would replace undersized stormwater pipes along Mercer Avenue and install two pump stations near the canal. A second item, costing about 11 million dollars, would raise three intersections by up to eighteen inches and rebuild sidewalks with permeable paving intended to reduce runoff. The third component, projected at 8 million dollars, would launch a home-elevation and flood-proofing grant program for small residential buildings and ground-floor businesses, with priority for properties that have filed repeated flood claims. Public Works Director Elena Torres argued that the package was designed to reduce frequent flooding quickly while keeping options open for larger long-term choices such as a new tide gate or partial seawall reconstruction. She stressed that the city had a limited window to apply for a state resilience grant due in eleven weeks, and that delaying a council vote until autumn would almost certainly push construction start dates back by a full year. Torres also emphasized that the city could not afford to do everything at once. Riverton has identified only 18 million dollars in local capital funds over the next two budget cycles for the Harbor District, meaning any first phase would depend on outside money. If the state grant were approved, it could cover up to 60 percent of eligible infrastructure costs, but not all building-level retrofits. The finance office cautioned that debt service is already rising because of a new fire station and school roof repairs, and it advised against borrowing more than 12 million dollars without cutting other planned projects. Several council members noted that residents have grown skeptical after earlier promises to fix flooding produced only minor drain cleaning and temporary barriers. Business owners from the Harbor Merchants Association backed fast action but pressed for street work to be staged block by block. Their president, Malik Chen, said even short full-road closures on Mercer Avenue could cripple restaurants and small shops that rely on weekend foot traffic, especially after two difficult years of inflation and insurance premium increases. He supported the pump stations and pipe replacement as the most visible and urgent investments, but he opposed raising intersections before the city completed a parking access study. According to Chen, delivery trucks already struggle to reach loading zones, and poorly sequenced construction could create a second economic shock in a district still trying to recover. Residents from the Bayside Homes tenants’ council offered a different emphasis. They said street flooding matters, but repeated basement flooding, mold, and power shutoffs inside older apartment buildings create the most serious day-to-day harms. Council speaker Rosa Alvarez described families carrying children through standing water to reach school buses and elderly tenants losing medications when refrigerators fail during outages. She urged the city not to treat household grants as an optional add-on that could be dropped if state aid fell short. Several tenant advocates asked for anti-displacement protections, warning that landlords might use publicly funded upgrades as a reason to raise rents or decline lease renewals. Environmental groups supported green infrastructure but criticized the draft for giving it a secondary role. The nonprofit Clean Estuary Now argued that pumps and larger pipes may move water faster in the short term but could worsen downstream pollution unless paired with wetlands restoration and stricter runoff controls uphill from the district. Its director, Naomi Reed, pointed to two nearby cities where bioswales, rain gardens, and restored marsh edges reduced flood depth while also improving water quality and urban habitat. Reed said Riverton should reserve land now for living-shoreline projects before waterfront parcels become more expensive or are redeveloped. The Harbor District Community Clinic focused on continuity of care. Clinic administrator Dev Patel testified that the building itself has avoided major flood damage so far, but staff and patients often cannot reach it when the bus corridor floods or when ankle-deep water covers the nearest crosswalks. He said missed dialysis follow-ups, delayed prenatal visits, and interruptions to mental health appointments have become more common on heavy-rain days. Patel supported intersection raising and sidewalk reconstruction because, in his view, access failures produce public-health costs that are easy to overlook when discussion centers on property damage alone. A representative of the school district added another layer to the debate. Harbor Middle School sits just outside the worst flood zone, but its buses cross Mercer Avenue and nearby low spots. Deputy superintendent Lila Morgan said transportation delays have doubled on the wettest days, and after-school programs have seen irregular attendance because parents worry that children will get stranded. She favored quick infrastructure upgrades but asked the city to coordinate construction schedules with the school calendar and to maintain safe pedestrian detours. Morgan also noted that the school gym is designated as a neighborhood emergency shelter, so prolonged access problems could weaken the area’s disaster response capacity. Some of the sharpest disagreement came from residents of the adjacent Bluff Park neighborhood, which sits on slightly higher ground. Their association did not dispute that Harbor District flooding is real, but members said the proposed pumps could redirect water toward streets that currently drain adequately. Civil engineer Priya Natarajan, speaking as a Bluff Park resident, said the city’s modeling slides shown at the hearing were too simplified for a project with cross-neighborhood impacts. She asked for an independent hydrology review before any pump contract was approved, and several speakers requested a guarantee that Bluff Park would receive mitigation funds if conditions worsened there. Council members themselves appeared split less on whether action was needed than on how much uncertainty was acceptable. Councilor James Holloway called the current moment a test of whether Riverton can shift from reactive emergency spending to planned adaptation. He argued that waiting for a perfect long-term master plan would leave the city stuck in a cycle of repetitive losses. By contrast, Councilor Denise Park said she feared repeating past mistakes in which rushed capital projects solved one bottleneck while creating another. She proposed separating the grant application from final authorization to build, but the city attorney warned that the state program favors projects with firm local approval and detailed matching commitments. By the end of the hearing, a possible compromise began to emerge. Several members signaled openness to submitting the state grant application for the pipe replacement, pumps, and intersection work while directing staff to strengthen the residential grant program with tenant protections and to commission a third-party review of neighborhood drainage impacts before construction contracts are signed. Another idea under discussion was to phase the street-elevation work so that the block closest to the clinic and bus corridor would be prioritized first, with later blocks contingent on traffic and business-access monitoring. No vote was taken Tuesday night. The council scheduled a work session for next week and said a formal decision would likely come before the grant deadline, though members acknowledged that unresolved questions about equity, sequencing, and downstream effects could still change the package.

271
Mar 19, 2026 04:11

Analysis

Anthropic Claude Haiku 4.5 VS Google Gemini 2.5 Flash

Choose the Best City Transit Upgrade

A city has a one-time budget of 120 million dollars for one major public transit project and must choose exactly one of the following options. Option A: Bus Rapid Transit corridor - Cost: 95 million - Estimated daily riders after 3 years: 70,000 - Average travel time reduction for affected riders: 12 minutes per trip - Construction disruption: moderate for 18 months - Annual operating cost increase: 6 million - Serves many lower-income neighborhoods directly - Can be expanded later at moderate cost Option B: Light rail extension - Cost: 120 million - Estimated daily riders after 3 years: 55,000 - Average travel time reduction for affected riders: 18 minutes per trip - Construction disruption: high for 36 months - Annual operating cost increase: 9 million - Expected to stimulate more private development near stations - Lower emissions per passenger than diesel buses Option C: Citywide bus network redesign plus signal priority - Cost: 60 million - Estimated daily riders after 3 years: 85,000 - Average travel time reduction for affected riders: 7 minutes per trip - Construction disruption: low for 9 months - Annual operating cost increase: 4 million - Benefits are spread broadly but less dramatically in any one corridor - Requires strong public communication to avoid confusion during rollout Additional context: - The city council says its priorities, in order, are: 1) improve mobility for the most residents, 2) support equity, 3) minimize disruption to small businesses during construction, 4) encourage long-term environmental sustainability. - The mayor strongly prefers visible results before the next election in 2 years. - The city is not allowed to raise new taxes for operating costs in the next 5 years. Write an analysis recommending one option. Weigh the tradeoffs, address the council priorities and political constraint, and explain why the rejected options are less suitable. If you think the best choice still has serious risks, identify them and suggest how the city should mitigate them.

280
Mar 19, 2026 03:09

Showing 41 to 60 of 106 results

Related Links

X f L