Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Education Q&A

OpenAI GPT-5.5 VS Google Gemini 2.5 Flash-Lite

Explain Why Ice Floats: A Hard Chemistry Exam Question

Solid water (ice) is less dense than liquid water near 0 °C, which is unusual compared with most substances whose solid phases are denser than their liquid phases. Write an exam-style essay answer (roughly 350–550 words) that addresses ALL of the following points: 1. State the approximate densities of ice at 0 °C and liquid water at 0 °C and at 4 °C, and identify the temperature at which liquid water reaches its maximum density. 2. Explain, at the molecular level, why ice has a lower density than liquid water. Your explanation must reference: hydrogen bonding, the tetrahedral coordination of water molecules in hexagonal ice (Ih), and the open lattice structure with empty cavities. 3. Explain why liquid water near 0 °C is denser than ice but still less dense than water at 4 °C. Describe the competition between two effects as temperature rises from 0 °C to 4 °C: the partial collapse of residual ice-like hydrogen-bonded clusters (which increases density) and normal thermal expansion (which decreases density). 4. Give at least two important ecological or geophysical consequences of this anomaly (for example, lake stratification in winter, survival of aquatic life, or the behavior of sea ice). 5. Briefly compare water with one other small molecule (e.g., H2S, NH3, or CH4) to show why hydrogen bonding specifically — not just molecular size or polarity — is responsible for the anomaly. Be precise with terminology (e.g., "hydrogen bond" vs. "covalent bond", "density" vs. "specific volume"). Where you cite numerical values, give them with appropriate units and reasonable significant figures.

177
Apr 28, 2026 09:37

Explanation

OpenAI GPT-5 mini VS Google Gemini 2.5 Flash-Lite

Explain the CAP Theorem to a Product Manager

You are a senior software architect meeting with a product manager who has a solid general understanding of technology but no formal computer science background. They need to understand the CAP theorem because your team is about to choose between two different database solutions for a new microservices project, and the trade-offs involved directly affect product decisions (e.g., whether users might occasionally see stale data, or whether certain features become unavailable during network issues). Write a clear explanation of the CAP theorem for this audience. Your explanation should: 1. Define what Consistency, Availability, and Partition Tolerance each mean in practical, non-academic terms. 2. Explain why you can only truly guarantee two of the three at any given time, and why partition tolerance is almost always non-negotiable in distributed systems. 3. Provide at least two concrete, real-world examples of systems or product scenarios that illustrate different CAP trade-offs (e.g., CP vs. AP choices) and what the user experience implications are. 4. Briefly address a common misconception about the CAP theorem (for example, that it means you must permanently sacrifice one property at all times). 5. End with a short summary of what questions the product manager should be asking when evaluating the two database options. Aim for a tone that is professional but accessible — no jargon without explanation, but also not condescending.

188
Apr 13, 2026 09:39

System Design

Google Gemini 2.5 Flash-Lite VS OpenAI GPT-5.2

Design a URL Shortening Service

Design a URL shortening service (similar to bit.ly or tinyurl.com) that must handle the following constraints: 1. The service must support 100 million new URL shortenings per month. 2. The average read-to-write ratio is 100:1 (i.e., shortened URLs are accessed far more often than they are created). 3. Shortened URLs must remain accessible for at least 5 years after creation. 4. The system must achieve 99.9% uptime availability. 5. Redirect latency (from receiving a short URL request to issuing the HTTP redirect) must be under 50ms at the 95th percentile. In your design, address all of the following: A. High-level architecture: Describe the major components (API servers, databases, caches, load balancers, etc.) and how they interact. Include a clear description of the request flow for both URL creation and URL redirection. B. Short URL generation strategy: Explain how you would generate unique short codes. Discuss the trade-offs between different approaches (e.g., hashing, counter-based, pre-generated key pools) and justify your choice. C. Data storage: Choose a database technology and schema. Estimate the storage requirements over 5 years given the constraints. Explain why your chosen database is appropriate. D. Scaling strategy: Explain how the system scales to handle the read-heavy traffic pattern. Discuss caching strategy, database partitioning or sharding approach, and how you would handle hot keys (viral URLs that receive disproportionate traffic). E. Reliability and fault tolerance: Describe how the system maintains 99.9% availability. Address what happens when individual components fail, and how you handle data replication and failover. F. Key trade-offs: Identify at least two significant design trade-offs you made and explain why you chose one side over the other given the stated constraints.

175
Apr 11, 2026 09:41

Planning

Google Gemini 2.5 Flash-Lite VS Anthropic Claude Sonnet 4.6

Power Outage Recovery Plan for a Small Clinic

You are advising a small outpatient clinic after an overnight storm caused a full power outage. The clinic opens to patients at 8:00 AM, and it is now 6:00 AM. Create a practical action plan for the next 6 hours that sequences the clinic's decisions and tasks. Clinic facts: - The clinic has 1 doctor, 2 nurses, 1 receptionist, and 1 facilities staff member on site by 6:30 AM. - A backup generator can power only essential loads for up to 4 hours total before refueling. It can support either: Option A: vaccine refrigerator + emergency lighting + internet router, or Option B: 2 exam rooms + emergency lighting + basic check-in computer. It cannot support both options at once. - The vaccine refrigerator must stay powered enough to avoid spoilage; once it goes above its safe temperature limit for 30 cumulative minutes, all vaccines must be discarded. - Internet service works only if the router has power. - Water is available, but the phone system is down; staff can use personal mobile phones. - There are 18 patients scheduled between 8:00 AM and 12:00 PM: - 5 routine follow-ups - 4 vaccination appointments - 3 urgent but non-life-threatening visits - 2 lab sample pickups that must happen before 11:00 AM - 4 telehealth consultations that require internet - A nearby pharmacy is open at 9:00 AM. - The fuel supplier estimates refueling no earlier than 10:30 AM, but this is not guaranteed. - One nurse is trained to monitor vaccine temperature and perform vaccinations; the other is not. - The doctor can do in-person visits or telehealth, but not both at the same time. Your plan must: - Cover the time from 6:00 AM to 12:00 PM - Prioritize patient safety, legal/clinical feasibility, and minimizing service disruption - Decide when to use the generator and which option to power at different times, if any - Reprioritize or reschedule patient appointments as needed - Assign responsibilities to available staff roles - Include at least 3 major risks or failure points and how to handle them - Be realistic about uncertainty and avoid assuming extra staff or equipment Write the answer as a step-by-step operational plan.

207
Apr 10, 2026 09:41

Analysis

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash-Lite

Select the Most Effective School Attendance Intervention

A public middle school has a budget to fund one pilot program for the next academic year to reduce chronic absenteeism. Chronic absenteeism is defined here as missing 10% or more of school days. The school serves 600 students, and currently 18% are chronically absent. The principal wants the option that is most likely to reduce absenteeism in a meaningful and sustainable way within one year. The school is considering these three options: Option A: Daily text-message reminders and attendance alerts - Cost: $18,000 for software and staff time - Target group: all families - Evidence from similar districts: chronic absenteeism fell by 1.5 percentage points on average - Risks: message fatigue, outdated phone numbers, limited effect for families facing serious barriers - Operational notes: can be launched quickly and scaled easily Option B: Two additional school social workers focused on high-risk students - Cost: $95,000 for one year - Target group: roughly 90 students with the highest absence rates - Evidence from similar schools: among targeted students, average attendance improved enough to reduce schoolwide chronic absenteeism by about 4 percentage points when implementation was strong - Risks: recruiting delays, benefits may depend heavily on staff quality, hard to sustain if grant funding ends - Operational notes: allows individualized support for transportation, family crises, mental health, and housing instability Option C: Free morning shuttle routes from two neighborhoods with poor attendance - Cost: $52,000 for one year - Target group: about 140 students in neighborhoods with low car ownership and unreliable public transit - Evidence from similar programs: schoolwide chronic absenteeism fell by 2.5 percentage points on average where transportation was a major barrier - Risks: only addresses one cause of absence, route design may miss some students, ongoing operating costs - Operational notes: visible program, may improve punctuality as well as attendance Additional context: - A recent internal survey suggests the main reported reasons for absence are: transportation problems (30%), illness or caregiving duties (25%), anxiety or mental health concerns (20%), family instability such as housing or frequent moves (15%), and disengagement or other reasons (10%). - The school has one part-time counselor already, but no dedicated attendance team. - The district can likely continue funding a successful program next year only if the first-year results are clearly visible. Task: Analyze the three options and recommend the single best pilot program. Your answer should compare trade-offs, consider the quality and limits of the evidence, and explain why your chosen option is better than the alternatives in this specific context.

257
Mar 29, 2026 10:36

Showing 1 to 20 of 104 results

Related Links

X f L