Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Planning

Anthropic Claude Sonnet 4.6 VS Google Gemini 2.5 Flash

Weekend Community Garden Recovery Plan

You are coordinating a volunteer effort to restore a neglected community garden over a single Saturday. Create a practical plan for the day. Situation: - The garden is open from 8:00 to 16:00. - You have 8 volunteers. - Two volunteers can use power tools safely; the others cannot. - Main tasks: 1. Clear weeds from 12 raised beds. 2. Repair 18 meters of damaged wooden edging. 3. Turn and enrich the compost area. 4. Install a simple drip-irrigation line for 6 beds. 5. Plant 60 seedlings. 6. Clean and organize the tool shed. - Equipment available: - 2 power trimmers - 4 shovels - 6 hand trowels - 2 wheelbarrows - 1 drill set - 2 hoses - Time estimates if enough suitable people and tools are assigned: - Weed clearing: 3 hours total work for 4 volunteers using hand tools, or 2 hours total work if 2 trained volunteers use the power trimmers with 2 helpers. - Wooden edging repair: 3 hours for 2 volunteers, and it requires the drill set. - Compost work: 2 hours for 2 volunteers. - Drip-irrigation install: 2 hours for 2 volunteers, and it uses both hoses during installation. - Planting seedlings: 2 hours for 4 volunteers after the relevant beds are weed-cleared and irrigation is installed in those 6 beds. - Tool shed cleanup: 1.5 hours for 2 volunteers. - Required breaks: - Everyone needs a 30-minute lunch break between 12:00 and 13:30. - Each volunteer also needs one 15-minute rest break in the morning and one in the afternoon. - Goal priorities, in order: 1. Make the 6 irrigated beds fully ready and planted by the end of the day. 2. Eliminate safety hazards and leave the site organized. 3. Maximize total visible improvement. Constraints and risks: - A light rain is forecast from 14:00 to 15:00. Planting can continue in light rain, but wooden edging repair cannot. - Power tool use is not allowed before 9:00 due to a neighborhood noise rule. - At least 1 volunteer must be free at all times for check-in, supply runs inside the site, and unexpected issues. Your answer should provide: - A time-blocked schedule for the day. - Volunteer allocation by task in each block. - A short explanation of why the order is feasible and prioritized correctly. - At least 3 risks or bottlenecks and how your plan handles them. - A brief contingency note for what to cut or simplify if the team falls behind by 90 minutes.

71
Mar 15, 2026 16:02

Analysis

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash

Choose the Best City Transit Upgrade

A city has a budget of $120 million to improve daily commuting over the next five years. Officials are considering three options and can fund only one. Option A: Bus Rapid Transit - Cost: $95 million - Estimated daily riders affected: 70,000 - Average travel time reduction per affected rider: 9 minutes - Construction disruption: moderate for 18 months - Annual operating cost increase: low - Equity impact: strong benefit for low-income neighborhoods - Emissions impact: moderate reduction - Risk: proven technology, low implementation risk Option B: Light Rail Extension - Cost: $120 million - Estimated daily riders affected: 45,000 - Average travel time reduction per affected rider: 15 minutes - Construction disruption: high for 36 months - Annual operating cost increase: medium - Equity impact: moderate benefit across mixed-income areas - Emissions impact: strong reduction - Risk: medium implementation risk due to land acquisition Option C: Smart Traffic Signal System and Intersection Redesign - Cost: $60 million - Estimated daily riders affected: 110,000 - Average travel time reduction per affected rider: 4 minutes - Construction disruption: low for 12 months - Annual operating cost increase: low - Equity impact: limited, benefits spread broadly but not targeted - Emissions impact: small reduction - Risk: low to medium risk because benefits depend on driver behavior and enforcement Write a recommendation memo to the mayor choosing one option. Your analysis should compare the options using at least four relevant criteria, weigh trade-offs, address one reasonable counterargument to your choice, and end with a clear conclusion. Do not invent new data.

89
Mar 15, 2026 14:40

Summarization

OpenAI GPT-5.4 VS Google Gemini 2.5 Flash

Summarize a Passage on the History and Science of Fermentation

Read the following passage carefully and then produce a concise summary of no more than 200 words. Your summary must preserve all six of the key points listed after the passage. Write the summary as a single cohesive paragraph (essay style), not as bullet points. --- BEGIN PASSAGE --- Fermentation is one of the oldest biotechnological processes known to humanity, with archaeological evidence suggesting that humans have been fermenting foods and beverages for at least 9,000 years. Clay pots discovered in the Henan province of China contained residues of a mixed fermented drink made from rice, honey, and fruit, dating back to approximately 7000 BCE. Similarly, evidence of bread-making using fermented dough has been found in ancient Egyptian tombs, and Sumerian tablets from around 3000 BCE contain detailed recipes for beer production. These early practitioners did not understand the microbiology behind fermentation, but they recognized its practical benefits: preservation of food, enhancement of flavor, and the production of intoxicating beverages that played central roles in religious and social rituals. The scientific understanding of fermentation began to take shape in the 19th century, largely through the pioneering work of Louis Pasteur. Before Pasteur, the dominant theory held that fermentation was a purely chemical process — a form of decomposition that occurred spontaneously. In a series of elegant experiments conducted between 1857 and 1876, Pasteur demonstrated that fermentation was caused by living microorganisms, specifically yeasts, and that different types of microorganisms produced different fermentation products. His famous dictum, "fermentation is life without air," captured the essence of anaerobic metabolism, though we now know that the picture is considerably more nuanced. Pasteur's work not only revolutionized our understanding of fermentation but also laid the groundwork for the germ theory of disease, modern microbiology, and the food safety practices that would follow. At its core, fermentation is a metabolic process in which microorganisms — primarily bacteria, yeasts, and molds — convert sugars and other organic substrates into acids, gases, or alcohol under anaerobic or microaerobic conditions. The most well-known form is ethanol fermentation, carried out by the yeast Saccharomyces cerevisiae, in which glucose is converted into ethanol and carbon dioxide. Lactic acid fermentation, performed by species of Lactobacillus and other lactic acid bacteria, converts sugars into lactic acid and is responsible for the production of yogurt, sauerkraut, kimchi, and many other foods. A third major type, acetic acid fermentation, involves the oxidation of ethanol to acetic acid by bacteria such as Acetobacter, and is the basis for vinegar production. Each of these pathways involves a complex series of enzymatic reactions, and the specific conditions — temperature, pH, substrate concentration, and the particular microbial strains involved — determine the final characteristics of the fermented product. The health benefits of fermented foods have attracted significant scientific attention in recent decades. Fermented foods are rich in probiotics — live microorganisms that, when consumed in adequate amounts, confer health benefits on the host. Regular consumption of fermented foods has been associated with improved gut health, enhanced immune function, better nutrient absorption, and even potential mental health benefits through the gut-brain axis. For example, the fermentation of milk into yogurt not only preserves the food but also partially breaks down lactose, making it more digestible for individuals with lactose intolerance. Fermentation can also increase the bioavailability of vitamins and minerals; for instance, the fermentation of soybeans into tempeh significantly increases the availability of iron and zinc. However, researchers caution that not all fermented foods contain live cultures at the time of consumption — products that are pasteurized or heavily processed after fermentation may lose their probiotic content. The field is still evolving, and large-scale clinical trials are needed to fully establish the health claims associated with fermented food consumption. Beyond food and beverage production, fermentation has become a cornerstone of modern industrial biotechnology. The pharmaceutical industry relies heavily on fermentation for the production of antibiotics, with penicillin — first mass-produced using the mold Penicillium chrysogenum in deep-tank fermentation during World War II — being the most famous example. Today, recombinant DNA technology allows engineered microorganisms to produce complex molecules such as insulin, human growth hormone, and monoclonal antibodies through fermentation processes. The biofuel industry uses fermentation to convert plant-derived sugars into bioethanol, which serves as a renewable alternative to fossil fuels. Industrial enzymes used in detergents, textiles, and food processing are also produced through large-scale fermentation. The global industrial fermentation market was valued at over 30 billion US dollars in 2022 and is projected to grow substantially as demand increases for sustainable, bio-based products. Looking to the future, fermentation technology is poised to play an even larger role in addressing global challenges. Precision fermentation — the use of genetically engineered microorganisms to produce specific proteins, fats, and other molecules — is being explored as a way to create animal-free dairy products, egg proteins, and even collagen without the environmental footprint of traditional animal agriculture. Companies around the world are investing billions of dollars in this technology, and some precision-fermented products have already reached consumer markets. Meanwhile, researchers are investigating how fermentation can be used to upcycle food waste, turning agricultural byproducts into valuable nutrients and materials. As the world grapples with climate change, population growth, and resource scarcity, fermentation offers a versatile and ancient toolkit that is being reimagined for the challenges of the 21st century. --- END PASSAGE --- Your summary must preserve the following six key points: 1. Fermentation has ancient origins dating back at least 9,000 years. 2. Louis Pasteur's 19th-century work established that living microorganisms cause fermentation. 3. The three major types of fermentation are ethanol, lactic acid, and acetic acid fermentation. 4. Fermented foods offer health benefits including probiotics and improved nutrient bioavailability, though more research is needed. 5. Fermentation is critical in modern industry, including pharmaceuticals, biofuels, and enzyme production. 6. Precision fermentation and food-waste upcycling represent promising future applications. Write your summary as a single cohesive paragraph of no more than 200 words.

80
Mar 15, 2026 09:17

System Design

OpenAI GPT-5 mini VS Google Gemini 2.5 Flash

Design a URL Shortening Service at Scale

You are tasked with designing a URL shortening service (similar to bit.ly or tinyurl.com) that must handle the following constraints: 1. The service must support 100 million new URL shortenings per month. 2. The read-to-write ratio is 100:1 (i.e., 10 billion redirects per month). 3. Shortened URLs must be at most 7 characters long (alphanumeric). 4. Shortened URLs should not be guessable or sequential. 5. The system must achieve 99.9% uptime. 6. Redirect latency must be under 10ms at the 95th percentile. 7. Shortened URLs should expire after a configurable TTL (default 5 years), and expired URLs should be reclaimable. 8. The service must operate across at least two geographic regions for disaster recovery. Provide a comprehensive system design that addresses the following: - High-level architecture diagram description (describe components and their interactions clearly in text) - URL shortening algorithm and key generation strategy, including how you avoid collisions and ensure non-guessability - Database schema and choice of storage technology, with justification - Caching strategy and cache invalidation approach - Read path and write path, described separately with estimated throughput calculations - Scaling strategy: how the system handles 10x traffic growth - Multi-region deployment and data consistency model, including trade-offs chosen (CAP theorem reasoning) - TTL expiration and URL reclamation mechanism - Failure modes and how the system recovers (at least 3 specific failure scenarios) - Key trade-offs you made and alternatives you considered but rejected, with reasoning Be specific with numbers, technology choices, and architectural reasoning. Avoid vague generalities.

74
Mar 14, 2026 19:35

Summarization

Anthropic Claude Opus 4.6 VS Google Gemini 2.5 Flash

Summarize a Policy Memo with Balanced Tradeoffs

Read the memo below and write a concise summary of 140 to 180 words for a city council member who has not read it. Your summary must cover the problem, the proposed pilot program, expected benefits, main risks or criticisms, and how success would be measured. Do not quote directly. Memo: Riverton's public buses have lost riders for six consecutive years, even though the city's population has grown. A transportation department review found several causes: routes are infrequent outside downtown, schedules are hard to understand, and buses are often delayed by traffic congestion. Low-income residents and older adults reported the greatest difficulty reaching jobs, clinics, and grocery stores without long waits or costly ride-hailing services. In response, staff propose a two-year "Frequent Corridors" pilot. Instead of spreading service thinly across the entire network, the city would increase weekday frequency to every 10 minutes on five major corridors from 6 a.m. to 9 p.m. Two underused neighborhood routes would be replaced by on-demand shuttles that riders could book by phone or app. The plan would also add larger bus-stop signs, simplified maps, and a real-time arrival display at the central transfer station. Supporters argue that riders value reliability and simplicity more than broad but infrequent coverage. They say concentrating resources on the busiest corridors could attract new riders, reduce missed transfers, and improve access to major employers and the community college. They also note that on-demand shuttles may serve low-density areas more efficiently than nearly empty fixed-route buses. Critics raise several concerns. Some disability advocates worry that app-based booking could disadvantage riders without smartphones, although the proposal includes phone reservations. Labor representatives warn that the shuttle service could be outsourced later, potentially affecting union jobs. Environmental groups support transit investment overall but question whether replacing fixed routes with smaller vehicles might reduce total passenger capacity. Some residents also fear that neighborhoods losing direct bus lines will feel abandoned, even if average wait times fall. The pilot is estimated to cost 8 million dollars over two years. Staff suggest funding it through a mix of state transit grants, parking revenue, and delaying a planned downtown streetscape project. They propose evaluating the pilot using ridership changes, average wait times, on-time performance, transfer success rates, customer satisfaction surveys, and access to essential destinations for low-income households. If the pilot fails to improve ridership and reliability within 18 months, staff recommend ending it early or redesigning it.

101
Mar 13, 2026 02:31

Showing 41 to 60 of 74 results

Related Links

X f L