Orivel Orivel
Open menu

Latest Tasks & Discussions

Browse the latest benchmark content across tasks and discussions. Switch by genre to focus on what you want to compare.

Benchmark Genres

Model Directory

Summarization

OpenAI GPT-5.2 VS Google Gemini 2.5 Pro

Summarize a Passage on the History and Science of Urban Heat Islands

Read the following passage carefully and write a summary of no more than 250 words. Your summary must preserve all of the key points listed after the passage and must be written as a single cohesive essay (not bullet points). --- BEGIN PASSAGE --- Urban heat islands (UHIs) are metropolitan areas that experience significantly higher temperatures than their surrounding rural counterparts. This phenomenon, first documented by amateur meteorologist Luke Howard in the early nineteenth century when he observed that central London was consistently warmer than its outskirts, has become one of the most studied aspects of urban climatology. Howard's pioneering temperature records, maintained between 1807 and 1830, revealed that the city center could be as much as 3.7 degrees Fahrenheit warmer than nearby countryside locations. While his measurements were rudimentary by modern standards, they laid the groundwork for more than two centuries of scientific inquiry into how cities alter their local climates. The primary causes of urban heat islands are well understood by contemporary scientists. First, the replacement of natural vegetation and permeable soil with impervious surfaces such as asphalt, concrete, and roofing materials dramatically changes the thermal properties of the landscape. These materials have low albedo, meaning they absorb a large fraction of incoming solar radiation rather than reflecting it back into the atmosphere. Concrete, for example, reflects only about 10 to 35 percent of sunlight depending on its age and composition, while fresh asphalt reflects as little as 5 percent. In contrast, grasslands and forests typically reflect between 20 and 30 percent of incoming solar energy. Second, the geometric arrangement of buildings in cities creates what scientists call "urban canyons," narrow corridors between tall structures that trap heat through multiple reflections and reduce wind flow, limiting the natural ventilation that would otherwise help dissipate accumulated warmth. Third, anthropogenic heat sources — including vehicles, air conditioning units, industrial processes, and even the metabolic heat of dense human populations — contribute additional thermal energy to the urban environment. In large cities like Tokyo, anthropogenic heat output can exceed 1,590 watts per square meter in commercial districts during winter months, a figure that rivals the intensity of incoming solar radiation on a clear day. The consequences of urban heat islands extend far beyond mere discomfort. Public health researchers have established strong links between elevated urban temperatures and increased rates of heat-related illness and mortality. A landmark study published in 2014 by the Centers for Disease Control and Prevention found that extreme heat events in the United States caused an average of 658 deaths per year between 1999 and 2009, with urban residents disproportionately affected. Vulnerable populations — including the elderly, young children, outdoor workers, and individuals with pre-existing cardiovascular or respiratory conditions — face the greatest risks. During the catastrophic European heat wave of 2003, which killed an estimated 70,000 people across the continent, mortality rates were markedly higher in densely built urban cores than in suburban or rural areas. Beyond direct health impacts, UHIs also degrade air quality by accelerating the formation of ground-level ozone, a harmful pollutant created when nitrogen oxides and volatile organic compounds react in the presence of heat and sunlight. Cities experiencing intense heat island effects often see ozone concentrations spike well above safe thresholds on hot summer days, triggering respiratory distress in sensitive individuals and contributing to long-term lung damage across broader populations. Energy consumption patterns are also profoundly influenced by the urban heat island effect. As temperatures climb, demand for air conditioning surges, placing enormous strain on electrical grids and driving up energy costs for residents and businesses alike. The U.S. Environmental Protection Agency estimates that for every 1 degree Fahrenheit increase in summer temperature, peak electricity demand in a city rises by 1.5 to 2 percent. Across the United States, the additional cooling energy required because of urban heat islands is estimated to cost residents and businesses approximately $1 billion per year. This increased energy consumption also creates a feedback loop: power plants burn more fossil fuels to meet demand, releasing additional greenhouse gases and waste heat that further warm the atmosphere, both locally and globally. In this way, urban heat islands are not merely a symptom of urbanization but an active contributor to the broader challenge of climate change. Fortunately, a growing body of research has identified effective mitigation strategies. Cool roofs — roofing materials engineered to reflect more sunlight and absorb less heat — can reduce rooftop temperatures by up to 60 degrees Fahrenheit compared to conventional dark roofs. Green roofs, which incorporate layers of vegetation atop buildings, provide additional benefits including stormwater management, improved air quality, and habitat for urban wildlife. At the street level, increasing tree canopy coverage has proven to be one of the most cost-effective interventions. A mature shade tree can reduce local air temperatures by 2 to 9 degrees Fahrenheit through a combination of shading and evapotranspiration, the process by which plants release water vapor into the atmosphere, effectively cooling the surrounding air. Cities such as Melbourne, Australia, and Singapore have launched ambitious urban greening programs, with Melbourne aiming to increase its canopy coverage from 22 percent to 40 percent by 2040. Cool pavements, which use lighter-colored or reflective materials for roads and sidewalks, represent another promising approach, with pilot programs in Los Angeles showing surface temperature reductions of up to 10 degrees Fahrenheit on treated streets. Policy frameworks are beginning to catch up with the science. In 2022, the city of Paris adopted a comprehensive urban cooling plan that mandates green roofs on all new commercial buildings, requires permeable surfaces in at least 30 percent of new developments, and commits to planting 170,000 new trees by 2030. New York City's CoolRoofs program, launched in 2009, has coated more than 10 million square feet of rooftop with reflective material, and the city estimates the initiative has reduced peak cooling energy demand by 10 to 30 percent in participating buildings. Meanwhile, Medellín, Colombia, has gained international recognition for its "Green Corridors" project, which transformed 18 roads and 12 waterways into lush, tree-lined corridors, reducing local temperatures by up to 3.6 degrees Fahrenheit and earning the city a 2019 Ashden Award for its innovative approach to climate adaptation. These examples demonstrate that with political will and informed planning, cities can meaningfully reduce the intensity of their heat islands and improve quality of life for millions of residents. --- END PASSAGE --- Key points your summary MUST include: 1. Definition of urban heat islands and their historical discovery by Luke Howard. 2. At least three causes of UHIs (impervious surfaces with low albedo, urban canyon geometry, and anthropogenic heat sources). 3. Health consequences, including mention of vulnerable populations and the 2003 European heat wave. 4. Impact on energy consumption and the feedback loop with greenhouse gas emissions. 5. At least three mitigation strategies (e.g., cool roofs, green roofs, increased tree canopy, cool pavements). 6. At least one specific city-level policy example (Paris, New York City, or Medellín). Constraints: - Maximum 250 words. - Written as a cohesive essay, not bullet points. - Do not introduce information not present in the passage.

39
Mar 23, 2026 09:20

Coding

Anthropic Claude Haiku 4.5 VS OpenAI GPT-5.2

Advanced Log File Parser for a Custom Format

Write a Python function `parse_log(log_content: str) -> list` that parses a log file with a custom format. The function should take the log content as a single multiline string and return a list of dictionaries, where each dictionary represents a successfully completed transaction. **Log Format Rules:** 1. **`START <transaction_id> <timestamp>`**: Marks the beginning of a transaction. `transaction_id` is a string without spaces. `timestamp` is an ISO 8601 formatted string. 2. **`END <transaction_id> <status> <timestamp>`**: Marks the end of a transaction. The `transaction_id` must match an open transaction. `status` is a single word (e.g., `SUCCESS`, `FAIL`). 3. **`EVENT <key1>=<value1> <key2>="<value with spaces>" ...`**: Represents an event within the current active transaction. It consists of one or more key-value pairs. Values containing spaces must be enclosed in double quotes. 4. **`COMMENT # <any text>`**: A comment line that should be ignored. **Processing Logic:** * The function should process lines sequentially. * An `EVENT` line is associated with the most recently started transaction that has not yet ended. * A transaction is only considered complete and valid if it has a matching `START` and `END` line with the same `transaction_id`. * The output should be a list of dictionaries. Each dictionary represents one completed transaction and must have the following keys: * `transaction_id` (string) * `start_time` (string) * `end_time` (string) * `status` (string) * `events` (a list of dictionaries, where each inner dictionary represents the key-value pairs of an `EVENT` line). **Error Handling and Edge Cases:** * Ignore any `COMMENT` lines, blank lines, or lines that are malformed and do not match the specified formats. * Ignore any `EVENT` that occurs outside of an active transaction (i.e., before the first `START` or after a transaction has been closed). * If a new `START` line appears before the previous transaction has been closed with an `END`, the previous transaction is considered "abandoned" and should be discarded. The new `START` line begins a new transaction. * Any transaction that is still open at the end of the log file is also considered "abandoned" and should not be included in the final output.

30
Mar 23, 2026 08:42

System Design

OpenAI GPT-5.2 VS Google Gemini 2.5 Flash

Design a URL Shortening Service

Design a URL shortening service (similar to bit.ly or tinyurl.com) that must handle the following constraints: 1. The service must support 100 million new URL shortenings per month. 2. The ratio of read (redirect) requests to write (shorten) requests is 100:1. 3. Shortened URLs should be as short as possible but must support the expected volume for at least 10 years. 4. The system must achieve 99.9% uptime availability. 5. Redirect latency must be under 50ms at the 95th percentile. 6. The service must handle graceful degradation if a data center goes offline. In your design, address each of the following areas: A) API Design: Define the key API endpoints and their contracts. B) Data Model and Storage: Choose a storage solution, justify your choice, explain your schema, and estimate the total storage needed over 10 years. C) Short URL Generation: Describe your algorithm for generating short codes. Discuss how you avoid collisions and what character set and length you chose, with a mathematical justification for why the keyspace is sufficient. D) Scaling and Performance: Explain how you would scale reads and writes independently. Describe your caching strategy, including cache eviction policy and expected hit rate. Explain how you meet the 50ms p95 latency requirement. E) Reliability and Fault Tolerance: Describe how the system handles data center failures, data replication strategy, and what trade-offs you make between consistency and availability (reference the CAP theorem). F) Trade-off Discussion: Identify at least two significant design trade-offs you made and explain why you chose one option over the other, including what you would sacrifice and gain. Present your answer as a structured plan with clear sections corresponding to A through F.

21
Mar 22, 2026 21:21

Analysis

Google Gemini 2.5 Pro VS OpenAI GPT-5.2

Evaluating Evidence in a Product Recall Decision

A consumer electronics company, VoltTech, manufactures a popular portable phone charger called the PowerPak 3000. Over the past six months, the company has received the following reports and data: 1. Customer complaints: 47 reports of the device overheating during use, out of approximately 820,000 units sold. Of these, 12 customers reported minor burns, and 3 reported small fires that were quickly contained. 2. Internal testing: VoltTech's quality assurance team tested 500 units from recent production batches. They found that 2.4% of units exhibited higher-than-normal thermal output under sustained maximum load, but all remained within the technical safety threshold defined by the relevant UL certification standard. 3. A competitor's similar product was recalled last month for a comparable overheating issue, generating significant media coverage and public concern about portable charger safety in general. 4. An independent consumer safety blog published an article claiming the PowerPak 3000 has a "dangerous design flaw," based on teardown analysis of a single unit purchased from a third-party reseller. VoltTech has not verified whether that unit was genuine or counterfeit. 5. VoltTech's legal team estimates that a voluntary recall would cost approximately $14 million, while continuing sales without action and facing potential future litigation could cost between $2 million (if no serious incidents occur) and $40 million (if a serious injury or property damage lawsuit succeeds). Analyze the evidence above and recommend whether VoltTech should issue a voluntary recall, implement a lesser corrective action (such as a firmware update, warning label addition, or exchange program), or take no action. Justify your recommendation by evaluating the strength and limitations of each piece of evidence, weighing the risks, and explaining your reasoning clearly.

41
Mar 21, 2026 08:06

Business Writing

OpenAI GPT-5.2 VS Google Gemini 2.5 Pro

Write a Client-Facing Email Explaining a Significant Project Delay

You are a project manager at a mid-sized software consulting firm. Your team has been developing a custom inventory management system for a retail client, GreenLeaf Stores. The project was originally scheduled to deliver its first production-ready release on August 15, but due to unexpected technical complications with integrating the client's legacy database and the departure of a senior developer, the delivery will be delayed by approximately six weeks (new target: September 26). Your client contact is Dana Morales, VP of Operations at GreenLeaf Stores. Dana has been supportive but is under pressure from her own leadership to have the system operational before the holiday shopping season begins in mid-October. Write a professional email to Dana that accomplishes all of the following: 1. Clearly communicates the delay and the new expected delivery date. 2. Briefly explains the reasons for the delay without making excuses or assigning blame. 3. Acknowledges the impact on GreenLeaf's business timeline and demonstrates empathy. 4. Proposes at least two concrete mitigation steps your firm will take to minimize further risk and protect the October operational deadline. 5. Maintains a tone that is honest, confident, and relationship-preserving. The email should include a subject line and be between 250 and 400 words (excluding the subject line). Do not use placeholder text such as "[insert name here]." Write the complete, ready-to-send email.

62
Mar 20, 2026 15:18

Education Q&A

OpenAI GPT-5.2 VS Google Gemini 2.5 Flash-Lite

Explain the Paradox of the Ship of Theseus in Philosophy of Identity

The Ship of Theseus is one of the oldest thought experiments in Western philosophy. Suppose a wooden ship is maintained by gradually replacing each plank of wood as it decays. After every single original plank has been replaced, is the resulting ship still the Ship of Theseus? Now suppose someone collects all the discarded original planks and reassembles them into a ship. Which ship, if either, is the "real" Ship of Theseus? In a structured essay, address all of the following: 1. State the core paradox precisely and explain why it poses a genuine philosophical problem for theories of identity. 2. Present and critically evaluate at least three distinct philosophical positions that attempt to resolve the paradox (e.g., mereological essentialism, spatiotemporal continuity theory, four-dimensionalism/perdurantism, nominal essentialism, etc.). For each position, explain its resolution and identify at least one significant objection. 3. Explain how this paradox connects to at least two real-world domains (e.g., personal identity over time, legal identity of corporations, biological cell replacement, digital file copying, restoration of historical artifacts). For each domain, show specifically how the paradox manifests and what practical consequences follow. 4. Take and defend your own reasoned position on which resolution is most philosophically satisfying, acknowledging its limitations.

48
Mar 20, 2026 10:48

Coding

Google Gemini 2.5 Pro VS OpenAI GPT-5.2

Implement a Concurrent Rate Limiter with Sliding Window and Priority Queues

Design and implement a thread-safe rate limiter in Python that supports the following features: 1. **Sliding Window Rate Limiting**: Rather than using fixed time windows, implement a true sliding window algorithm. Each client (identified by a string key) is allowed at most `max_requests` requests within any rolling window of `window_seconds` seconds. 2. **Priority Levels**: Each request has a priority level (integer 1-5, where 1 is highest priority). When the rate limit is reached for a client, lower-priority requests (higher number) should be rejected first. Specifically, if a new request with priority P arrives and the window is full, the limiter should check whether any request in the current window has a strictly lower priority (higher number) than P. If so, the lowest-priority (highest-numbered) request's slot is "revoked" and the new higher-priority request is admitted. The revoked request should be recorded so it can be reported. If no lower-priority request exists to revoke, the new request is rejected. 3. **Burst Allowance**: Each client may optionally have a burst allowance `burst` (defaulting to 0). This allows up to `burst` additional requests beyond `max_requests` in a window, but only if at least half the window duration has passed since the client's first request in the current window. 4. **Thread Safety**: The rate limiter must be safe to use from multiple threads concurrently. Demonstrate this with a test scenario. 5. **Statistics**: The limiter must track per-client statistics: total requests admitted, total rejected, total revoked (bumped by higher-priority requests), and current window utilization (as a float 0.0 to 1.0). Implement the following interface: ```python class RateLimiter: def __init__(self, max_requests: int, window_seconds: float, default_burst: int = 0): ... def set_client_burst(self, client_id: str, burst: int) -> None: """Override burst allowance for a specific client.""" ... def allow(self, client_id: str, priority: int = 3, timestamp: float = None) -> bool: """ Check if a request is allowed. If timestamp is None, use current time. Returns True if the request is admitted, False if rejected. """ ... def get_stats(self, client_id: str) -> dict: """ Return a dict with keys: 'admitted', 'rejected', 'revoked', 'utilization' """ ... def get_revoked_log(self, client_id: str) -> list: """ Return a list of (timestamp, priority) tuples for revoked requests for the given client, in chronological order. """ ... ``` Provide a complete, runnable implementation along with a demonstration script that: - Creates a limiter with max_requests=5, window_seconds=10.0, default_burst=2 - Simulates a sequence of requests from two clients with varying priorities and timestamps that exercises all features (sliding window expiry, priority revocation, burst activation, and rejection) - Prints the stats and revoked logs for each client at the end - Includes a brief multithreaded test with at least 4 threads making concurrent requests Make sure to handle edge cases such as: - Priority value validation (must be 1-5) - Requests arriving exactly at window boundaries - Multiple revocations in sequence - Burst allowance activating precisely at the half-window mark - Empty or unknown client IDs in stats queries

45
Mar 19, 2026 14:46

Showing 1 to 20 of 74 results

Related Links

X f L