Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Summarization

Google Gemini 2.5 Pro VS Anthropic Claude Sonnet 4.6

Summarize a Public Consultation Brief on Nighttime Delivery in a Historic City Center

Read the following consultation brief and write a concise summary for a city council member who has not read the document. Your summary must: - be 220 to 300 words long - use neutral, non-promotional language - explain the problem the city is trying to solve - capture the main evidence and viewpoints from supporters and critics - include the proposed pilot program, its safeguards, and how success would be measured - mention at least three specific operational details or numbers from the brief - avoid quoting full sentences from the source - not add facts or opinions not supported by the source Source passage: The City of Larkhaven is considering a 12-month pilot program that would allow a limited number of nighttime deliveries in the Old Market district, a dense mixed-use neighborhood known for narrow streets, heritage buildings, restaurants, small grocers, apartments above shops, and heavy daytime foot traffic. At present, most commercial deliveries are concentrated between 7:00 a.m. and 2:00 p.m. As a result, box trucks often double-park on streets that were laid out long before modern freight vehicles existed. Delivery drivers unload beside bus stops, riders on bicycles weave into traffic to pass stopped trucks, and pedestrians spill off crowded sidewalks when hand carts block storefronts. According to the city’s transportation department, freight activity is not the largest source of congestion in Old Market, but it is among the most disruptive because the disruptions occur on the narrowest streets and at the busiest times. A staff report prepared for the council argues that shifting some deliveries to late evening or overnight hours could reduce daytime conflicts without increasing the total number of trips. The proposal would not create new delivery demand; instead, it would move selected restocking trips to lower-traffic periods. Staff cite examples from other cities where off-hour deliveries shortened average unloading times because drivers could park legally closer to destinations and complete routes more predictably. The report also notes potential environmental benefits from smoother driving speeds and less idling while searching for curb space. However, staff acknowledge that the same studies found uneven results when neighborhoods had many residents living directly above commercial premises, especially where building insulation was poor. The draft pilot would cover only the four-block core of Old Market and would limit participation to 18 businesses in its first phase. Eligible businesses would include food retailers, pharmacies, and hospitality venues that already receive at least four deliveries per week. Participating carriers would need to use vehicles no larger than 7.5 tons gross weight and comply with a quiet-delivery code. That code would prohibit metal roll cages, require rubberized cart wheels, ban unloading with engine idling beyond two minutes, and require drivers to complete noise-awareness training. Routine delivery windows under the pilot would run from 9:30 p.m. to 6:00 a.m., but no unloading could begin after midnight within 20 meters of a residential entrance unless the destination business had submitted a building-specific mitigation plan. To address concerns about resident sleep disturbance, the city proposes several safeguards. First, the pilot would exclude streets with documented nighttime noise complaints above the district median during the previous 18 months. Second, each participating business would have to designate an on-site receiver so drivers would not need to buzz apartments or repeatedly knock on locked service doors. Third, the city would install temporary sound monitors at 12 locations and publish monthly readings, along with a log of complaints, parking citations, and observed curb-blocking incidents. Fourth, the pilot could be suspended on any block where overnight complaints exceeded a trigger threshold for two consecutive months. The threshold in the draft is six verified complaints per 100 residents, though staff say this number is open to revision after public comment. Business groups strongly support the pilot. The Old Market Merchants Association says morning deliveries frequently arrive after shops open, forcing staff to restock shelves while also serving customers. Restaurant owners argue that receiving produce and beverages at dawn or late night would free curb space during lunch preparation and reduce the need for workers to drag pallets through crowded dining streets. A coalition of independent grocers adds that more predictable delivery times could cut spoilage for chilled goods, because drivers would spend less time stuck in queues. Several carriers also support the plan, saying a truck can sometimes spend more time circling for legal curb access than actually unloading. They argue that if routes become more reliable, fewer backup vehicles may be needed to complete the same volume of deliveries. Resident organizations are divided. Some acknowledge that daytime freight activity has become chaotic and that blocked sidewalks are especially difficult for older adults, parents with strollers, wheelchair users, and delivery workers on cargo bikes. Others say the burden is being shifted from shoppers to people trying to sleep. The Old Market Tenants Forum submitted comments noting that many apartments have single-glazed windows and bedrooms facing service alleys. The forum argues that even if average noise readings stay within acceptable ranges, repeated short bursts from tail lifts, rolling containers, reversing alarms, and late conversations can still wake residents. Preservation advocates have raised a related concern: because many buildings are protected, retrofitting loading areas or installing acoustic barriers may be expensive, restricted, or visually inappropriate. Labor representatives have offered conditional support but say the pilot should not depend on unpaid schedule flexibility from retail staff or unsafe expectations for drivers. The local drivers’ union says quieter equipment is welcome, but nighttime operations can create pressure to unload faster with fewer workers present. They want clear rules on staffing, access, lighting, and restroom availability. A union representing shop employees says receiving deliveries at 5:00 a.m. should not become an informal expectation for junior workers without revised contracts, transport allowances, or secure entry procedures. City staff responded by stating that labor conditions would be monitored through employer attestations and random compliance checks, though details remain limited in the current draft. The consultation brief includes preliminary cost estimates. The city expects to spend about $420,000 over 12 months: roughly $160,000 for monitoring equipment and data analysis, $110,000 for curbside signage and temporary loading zone adjustments, $90,000 for program administration and inspections, and $60,000 for driver training subsidies and business onboarding. Staff propose funding the pilot from the existing mobility innovation budget rather than from the general fund. They argue that if daytime curb conflicts decline, the city may avoid or defer more expensive street redesigns. Critics reply that the estimate may be incomplete because it does not clearly price enforcement during overnight hours or any mitigation measures for affected residents. The brief also explains why the city is pursuing a pilot instead of a permanent rule change. Freight patterns vary sharply by street, season, and business type, and council members previously rejected a citywide nighttime delivery ordinance as too broad. Staff now argue that a smaller trial with block-by-block reporting would generate better local evidence. The proposed evaluation framework would compare pilot streets with similar non-pilot streets using measures such as average unloading duration, illegal parking observations, daytime travel speeds for buses, complaint rates, worker injury reports, and business delivery reliability. The city would also survey residents, drivers, and participating businesses at three points: before launch, at six months, and near the end of the trial. A final recommendation would return to council only if the data showed meaningful daytime benefits without disproportionate nighttime harms. At a recent public meeting, council members signaled interest but asked for revisions. One requested a stricter cap on the number of participating vehicles per night. Another asked staff to clarify whether electric refrigeration units would be required for chilled-food suppliers, since diesel-powered units can create a persistent hum even when engines are off. A third questioned whether the complaint trigger should be based on residents, dwelling units, or building frontages, noting that each method could produce different outcomes on mixed-use blocks. Staff said they would revise the draft before the formal vote next month and might narrow the eligible street list further if consultation feedback shows concentrated concern. In short, the debate is not simply about whether goods should move at night. It is about whether carefully managed off-hour deliveries can reduce visible daytime disorder in a fragile, busy district without transferring the costs to residents, workers, or historic buildings. The consultation asks respondents to comment on the proposed hours, business eligibility rules, quiet-delivery standards, complaint thresholds, labor protections, and evaluation metrics. Written comments remain open until the 28th of this month, after which staff will publish a response summary and a revised pilot design for council consideration.

52
Mar 20, 2026 11:21

System Design

Google Gemini 2.5 Flash VS Anthropic Claude Sonnet 4.6

Design a Global URL Shortening Service

Design a public URL shortening service similar to Bitly. Users can submit a long URL and receive a short alias; visiting the short link should redirect quickly to the original URL. The system must support custom aliases, optional expiration dates, basic click analytics, and abuse mitigation for malicious links. Requirements and constraints: - Functional requirements: - Create short URLs for long URLs. - Redirect short URLs to original URLs. - Support custom aliases when available. - Support optional expiration time per link. - Record click events for analytics. - Allow users to disable a link manually. - Scale assumptions: - 120 million new short URLs per month. - 1.5 billion redirects per day. - Redirect traffic is globally distributed and read-heavy. - Analytics data should be queryable within 15 minutes. - Performance targets: - Redirect p95 latency under 80 ms for most regions. - Short-link creation p95 under 300 ms. - 99.99% availability for redirects. - Data and retention: - Links may live indefinitely unless expired or disabled. - Raw click events may be retained for 90 days; aggregated analytics for 2 years. - Operational constraints: - Use commodity cloud infrastructure; do not assume one exotic managed product solves everything. - Budget matters: justify any replication, caching, and storage choices. - Short codes should be compact and reasonably hard to guess at large scale, but perfect secrecy is not required. In your answer, provide: 1. A high-level architecture with major components and data flow. 2. Storage choices for link metadata, redirect path, and analytics events, with rationale. 3. A short-code generation strategy, including how to avoid collisions and handle custom aliases. 4. A scaling plan for global traffic, including caching, partitioning/sharding, and multi-region considerations. 5. A reliability plan covering failures, hot keys, disaster recovery, and degraded-mode behavior. 6. Key APIs and core data models. 7. Abuse mitigation and security considerations. 8. The main trade-offs you made and why.

47
Mar 20, 2026 11:03

Education Q&A

OpenAI GPT-5.2 VS Google Gemini 2.5 Flash-Lite

Explain the Paradox of the Ship of Theseus in Philosophy of Identity

The Ship of Theseus is one of the oldest thought experiments in Western philosophy. Suppose a wooden ship is maintained by gradually replacing each plank of wood as it decays. After every single original plank has been replaced, is the resulting ship still the Ship of Theseus? Now suppose someone collects all the discarded original planks and reassembles them into a ship. Which ship, if either, is the "real" Ship of Theseus? In a structured essay, address all of the following: 1. State the core paradox precisely and explain why it poses a genuine philosophical problem for theories of identity. 2. Present and critically evaluate at least three distinct philosophical positions that attempt to resolve the paradox (e.g., mereological essentialism, spatiotemporal continuity theory, four-dimensionalism/perdurantism, nominal essentialism, etc.). For each position, explain its resolution and identify at least one significant objection. 3. Explain how this paradox connects to at least two real-world domains (e.g., personal identity over time, legal identity of corporations, biological cell replacement, digital file copying, restoration of historical artifacts). For each domain, show specifically how the paradox manifests and what practical consequences follow. 4. Take and defend your own reasoned position on which resolution is most philosophically satisfying, acknowledging its limitations.

49
Mar 20, 2026 10:48

Analysis

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Flash

Select the Most Promising School Lunch Reform

A public school district can fund only one lunch reform for the next two years. Analyze the options below and recommend which single option the district should choose. Your answer should compare the tradeoffs, address likely objections, and reach a clear conclusion. District goals: 1. Improve student nutrition 2. Increase the number of students actually eating school lunch 3. Keep implementation realistic within two years 4. Avoid large ongoing cost overruns Current situation: - 12,000 students across 18 schools - 46% of students currently choose school lunch - Surveys suggest students often skip lunch because of taste, long lines, or lack of appealing choices - The district can afford only one of the following options now Option A: Hire trained chefs to redesign menus - Upfront training and consulting cost: medium - Ongoing food cost: slightly higher - Expected effects: meals taste better, healthier recipes become more appealing, moderate increase in participation - Risks: benefits depend on staff adoption and recipe consistency across schools Option B: Add self-serve salad and fruit bars in every school - Upfront equipment cost: high - Ongoing food waste risk: high - Expected effects: strong nutrition improvement for students who use the bars, modest participation increase overall - Risks: staffing, sanitation, and uneven use by age group Option C: Launch a mobile pre-order system for lunches - Upfront technology and training cost: medium - Ongoing cost: low to medium - Expected effects: shorter lines, better forecasting, moderate participation increase, little direct nutrition improvement unless menus stay the same - Risks: unequal access for families with limited technology use, adoption challenges at first Option D: Replace sugary desserts and fried sides with healthier defaults - Upfront cost: low - Ongoing cost: neutral - Expected effects: direct nutrition improvement for all school lunch users, possible small drop in participation if students dislike changes - Risks: student backlash, perception that lunch became less enjoyable Write an analysis that identifies the best choice given the district goals and constraints. Do not invent new budget numbers or outside facts; reason only from the information provided.

46
Mar 19, 2026 21:45

Brainstorming

Google Gemini 2.5 Flash VS OpenAI GPT-5.4

Revenue Streams for a Small-Town Public Library Facing Budget Cuts

A small-town public library (serving a population of roughly 12,000) has just learned that its annual municipal funding will be cut by 30% starting next fiscal year. The library has the following assets and constraints: Assets: - A 6,000 sq ft building with a 200-person capacity community room - A small parking lot (20 spaces) - Two full-time librarians and three part-time staff - A collection of 40,000 physical books and a modest digital catalog - A makerspace with a 3D printer, laser cutter, and sewing machines - Reliable high-speed internet and 15 public-use computers - A small fenced garden area behind the building Constraints: - The library must remain free to enter and must continue lending books at no charge - It cannot sell alcohol or host gambling - Any new revenue activity must be legal in a typical U.S. municipality - Staff cannot increase; volunteers may be recruited - The library board will not approve anything that generates significant noise complaints from adjacent residential neighbors Brainstorm as many distinct, practical revenue-generating or cost-saving ideas as you can. For each idea, provide: 1. A short name 2. A one-to-two sentence description of how it works 3. Which library asset it leverages Aim for breadth across different categories (e.g., events, partnerships, services, space rental, grants, merchandising, digital, etc.).

54
Mar 19, 2026 19:59

System Design

Google Gemini 2.5 Flash VS Anthropic Claude Haiku 4.5

Design a Global URL Shortening Service

Design a globally available URL shortening service similar to Bitly. The service must let users create short links that redirect to long URLs, support custom aliases for paid users, track click analytics, and allow links to expire at a specified time. Requirements: - Handle 120 million new short links per day. - Handle 4 billion redirects per day. - Peak traffic can reach 3 times the daily average. - Redirect latency target: p95 under 80 ms for users in North America, Europe, and Asia. - Short-link creation latency target: p95 under 300 ms. - Service availability target: 99.99% for redirects. - Analytics data can be eventually consistent within 5 minutes. - Custom aliases must be unique globally. - Expired or deleted links must stop redirecting quickly. - The system should tolerate regional failures without total service outage. Assumptions you may use: - Average long URL length is 500 bytes. - Analytics events include timestamp, link ID, country, device type, and referrer domain. - Read traffic is much higher than write traffic. - You may choose SQL, NoSQL, cache, stream, CDN, and messaging technologies as needed, but justify them. In your answer, provide: 1. A high-level architecture with main components and request flows. 2. Data model and storage choices for links, aliases, and analytics. 3. A scaling strategy for read-heavy traffic, including caching and regional routing. 4. A reliability strategy covering failover, consistency decisions, and handling regional outages. 5. Key trade-offs, bottlenecks, and at least three risks with mitigations. 6. A brief capacity estimate for storage and throughput using the numbers above.

57
Mar 19, 2026 18:51

Showing 61 to 80 of 333 results

Related Links

X f L