Orivel Orivel
Open menu

Summarize a City Plan for a Library-Resilience Hub

Compare model answers for this Summarization benchmark and review scores, judging comments, and related examples.

Login or register to use likes and favorites. Register

X f L

Contents

Task Overview

Benchmark Genres

Summarization

Task Creator Model

Answering Models

Judge Models

Task Prompt

Summarize the source passage below in 220 to 280 words as a single coherent prose summary. Preserve the main facts, trade-offs, stakeholder positions, timeline, funding details, implementation conditions, and unresolved concerns. Do not add outside information, do not quote long phrases from the passage, and do not use bullet points. Source passage: For more than a decade, the red-brick freight depot on the eastern edge of Marlowe has been a landmark that people mention mostly when giving directions. The building...

Show more

Summarize the source passage below in 220 to 280 words as a single coherent prose summary. Preserve the main facts, trade-offs, stakeholder positions, timeline, funding details, implementation conditions, and unresolved concerns. Do not add outside information, do not quote long phrases from the passage, and do not use bullet points. Source passage: For more than a decade, the red-brick freight depot on the eastern edge of Marlowe has been a landmark that people mention mostly when giving directions. The building sits between the public library, a bus loop, and a low stretch of Maple Creek that floods during heavy spring storms. Its arched windows are boarded, its loading dock is cracked, and weeds grow through the rails that once connected the town to a regional market. Last Tuesday, however, the depot became the center of a serious civic debate when the city council voted 5 to 2 to advance a proposal that would convert the building into a combined library annex, emergency cooling center, and neighborhood workshop space. The vote did not authorize construction, but it allowed staff to negotiate design contracts and prepare a final budget by November. The plan grew out of two problems that, at first, seemed unrelated. The Marlowe Public Library has seen a 38 percent increase in program attendance since 2019, driven by after-school tutoring, job-search classes, and English conversation groups. At the same time, the town has opened temporary heat shelters in school gyms four times in the past three summers as temperatures climbed above 100 degrees for several days in a row. Library Director Sonia Patel argued that the depot’s location made it unusually useful: it is close enough to the existing library for shared staffing, near two bus routes, and outside the highest-risk floodplain by several feet. According to Patel, the annex would add flexible classrooms, a tool-lending counter, public restrooms available after library hours, and a climate-controlled hall that could serve as a cooling center during emergencies. The preliminary budget is 14.8 million dollars, including 2.3 million for environmental cleanup, 1.1 million for flood-resistant landscaping, and 900,000 for solar panels and battery storage. City Manager Luis Ortega said the city has already secured a 5 million dollar state resilience grant and a 2 million dollar philanthropic pledge from the Hannegan Foundation, conditional on preserving the depot’s exterior walls and opening the workshop space at least five evenings per week. The remaining money would come from a mix of municipal bonds and a proposed utility resilience fee of 1.75 dollars per household per month for twelve years. Ortega emphasized that no final borrowing decision would occur before a second public hearing and a more detailed cost estimate. Supporters describe the project as a rare opportunity to solve several public needs without constructing a new building from scratch. Teachers from East Marlowe Elementary said the annex could ease crowding in school-based tutoring programs and give older students a safe place to wait for buses. The local carpenters’ guild offered to run basic repair classes if the workshop includes locked storage and ventilation. A coalition of senior residents urged the council to prioritize backup power, noting that during last summer’s heat wave several apartment buildings lost air conditioning for more than a day. Environmental advocates also praised the idea of restoring the creekside land around the depot with native plants and rain gardens, arguing that the site could demonstrate how older industrial properties can be reused rather than demolished. Opposition came from several directions, not all of them hostile to the library. Council members Dana Rhee and Martin Cole voted no because they said the city was moving too quickly without a firm estimate of future operating costs. Rhee pointed out that staffing a seven-day cooling center, maintaining batteries, and supervising evening workshop hours could strain the same departments that are already short of employees. Cole questioned whether a monthly fee would be fair to renters and residents on fixed incomes, even if the charge appears small. A group of nearby homeowners also warned that additional evening activity could bring noise, traffic, and parking conflicts to narrow streets that were not designed for heavy use. The most emotionally charged testimony came from former rail workers and preservation volunteers. They supported saving the depot but worried that the proposed interior changes would turn it into what one speaker called “a historic shell with a modern building hidden inside.” The draft design removes most interior partitions, raises the main floor by eight inches to improve flood resilience, and inserts a mezzanine for offices. Architect Mina Okafor responded that many original materials had already been lost to water damage and vandalism, but she promised to study whether one section of track, a freight scale, and several beams marked with old shipping codes could remain visible. The council added a condition requiring the design team to meet with the historical commission before presenting revised drawings. There are practical uncertainties as well. A 2021 inspection found lead paint, asbestos pipe insulation, and petroleum contamination near the old loading area, but the city has not yet completed soil testing under the western wall. If cleanup costs exceed the estimate by more than 20 percent, the state grant requires the city to submit a revised scope of work, which could delay construction by six months or more. The bus loop may also need changes because emergency vehicles must be able to access the cooling center without blocking regular transit. Public Works Director Janice Ho said these issues are manageable, but she cautioned that the schedule is “ambitious rather than comfortable.” If everything proceeds smoothly, construction would begin next spring and the center would open in early 2028. By the end of the meeting, even some skeptics acknowledged that the proposal had forced a broader conversation about what counts as essential public infrastructure. For years, Marlowe treated libraries, climate adaptation, historic preservation, and neighborhood traffic as separate topics competing for limited money. The depot plan links them in a single project, which is precisely why it attracts both enthusiasm and anxiety. The next steps will test whether the city can turn that complexity into a workable agreement: staff must produce a refined budget, the design team must address preservation concerns, and council members must decide whether the benefits of a multi-purpose civic space justify the cost and the long-term obligations that would come with it.

Judging Policy

A good answer should accurately condense the passage into the requested length and format while remaining neutral and self-contained. It should capture the council vote and what it authorized, the reasons the depot is being considered, the proposed uses of the facility, the major funding sources and costs, the arguments from supporters, the concerns from opponents, the preservation debate, the environmental and logistical uncertainties, the tentative timeline, and the conditions for next steps. It should avoid inve...

Show more

A good answer should accurately condense the passage into the requested length and format while remaining neutral and self-contained. It should capture the council vote and what it authorized, the reasons the depot is being considered, the proposed uses of the facility, the major funding sources and costs, the arguments from supporters, the concerns from opponents, the preservation debate, the environmental and logistical uncertainties, the tentative timeline, and the conditions for next steps. It should avoid inventing facts, overstating certainty, omitting major trade-offs, or turning the summary into a list of minor details.

Answers

The Marlowe City Council voted 5-2 last Tuesday to advance a proposal converting a long-vacant red-brick freight depot near the public library, a bus loop, and flood-prone Maple Creek into a combined library annex, emergency cooling center, and neighborhood workshop. The vote authorizes design contracts and a final budget by November, not construction. The plan responds to a 38 percent rise in library program attendance since 2019 and to four temporary heat shelter openings in three summers. Library Director Sonia Patel highlighted the site's proximity for shared staffing, transit access, and position outside the...

Show Full Answer

The Marlowe City Council voted 5-2 last Tuesday to advance a proposal converting a long-vacant red-brick freight depot near the public library, a bus loop, and flood-prone Maple Creek into a combined library annex, emergency cooling center, and neighborhood workshop. The vote authorizes design contracts and a final budget by November, not construction. The plan responds to a 38 percent rise in library program attendance since 2019 and to four temporary heat shelter openings in three summers. Library Director Sonia Patel highlighted the site's proximity for shared staffing, transit access, and position outside the highest floodplain, with flexible classrooms, tool-lending, after-hours restrooms, and a climate-controlled hall. The preliminary 14.8 million dollar budget includes 2.3 million for environmental cleanup, 1.1 million for flood-resistant landscaping, and 900,000 for solar and batteries. City Manager Luis Ortega cited a secured 5 million dollar state resilience grant and a 2 million dollar Hannegan Foundation pledge conditioned on preserving exterior walls and opening the workshop five evenings weekly; remaining funds would come from municipal bonds and a proposed 1.75 dollar monthly household utility resilience fee for twelve years, pending a second hearing. Supporters include teachers, carpenters, seniors seeking backup power, and environmental advocates. Council members Rhee and Cole opposed the pace, citing unknown operating costs, staffing strain, and fairness to renters and fixed-income residents; neighbors warned of noise, traffic, and parking. Preservationists fear a historic shell hiding a modern interior, prompting a required historical commission consultation. Unresolved issues include incomplete soil testing for lead, asbestos, and petroleum, possible grant-triggered delays, and bus loop redesign. Construction could begin next spring, opening in early 2028.

Result

#1 | Winner

Winning Votes

3 / 3

Average Score

88

Total Score

88

Overall Comments

Answer A is a tightly compressed, highly faithful single-paragraph summary that stays within the required word count and covers all major facts, trade-offs, stakeholder positions, funding details, conditions, uncertainties, and the timeline. It accurately reports specific figures and conditions without invention. Its main weakness is that the dense single-paragraph format can be slightly harder to parse, but it fully meets the task requirements.

View Score Details

Faithfulness

Weight 40%
92

Answer A is highly faithful to the source. All specific figures, names, conditions, and facts are accurately reported without invention or distortion. The Hannegan Foundation conditions, the grant trigger for delays, the vote count, and the historical commission requirement are all correctly stated. No outside information is added.

Coverage

Weight 20%
90

Answer A covers all major elements: the council vote and what it authorized, the two driving problems, the proposed uses, the full budget breakdown, funding sources and conditions, supporter groups and their specific arguments, opponent concerns including fairness and operating costs, the preservation debate with Okafor's response, the environmental uncertainties including the 20 percent overrun threshold, the bus loop issue, and the timeline. Coverage is comprehensive.

Compression

Weight 15%
85

Answer A is a single dense prose paragraph of approximately 250 words, well within the 220-280 word range. It achieves strong compression without sacrificing key details, efficiently packing facts and trade-offs into a coherent flow.

Clarity

Weight 15%
80

Answer A is clear and readable, though its single-paragraph format and dense packing of facts can make it slightly harder to follow in places. The logical flow from vote to rationale to budget to supporters to opponents to uncertainties to timeline is well-maintained.

Structure

Weight 10%
80

Answer A follows a logical sequence: vote and authorization, driving problems, proposed uses, budget and funding, supporters, opponents, preservation concerns, uncertainties, and timeline. As a single paragraph it meets the 'single coherent prose summary' requirement well.

Total Score

89

Overall Comments

Answer A provides an excellent summary that adheres strictly to all prompt constraints. It is highly faithful to the source, covers all essential details comprehensively, and demonstrates outstanding compression by staying well within the word limit. The summary is clear, concise, and well-structured as a single coherent prose block.

View Score Details

Faithfulness

Weight 40%
90

Answer A accurately reflects all facts, figures, and stakeholder positions from the source passage without introducing any outside information or misrepresenting details.

Coverage

Weight 20%
85

Answer A covers all the main facts, trade-offs, stakeholder positions, timeline, funding details, implementation conditions, and unresolved concerns as requested by the prompt, doing so effectively within the word limit.

Compression

Weight 15%
95

Answer A demonstrates excellent compression, condensing the information into 247 words, which is perfectly within the 220-280 word limit. It manages to retain all critical details while being concise.

Clarity

Weight 15%
85

Answer A is very clear and easy to understand. The language is precise, and the information is presented in a straightforward manner, making it highly readable.

Structure

Weight 10%
90

Answer A is structured as a single, coherent prose summary, flowing logically from the initial council vote through the details, stakeholders, challenges, and timeline. It adheres to the prompt's implied format.

Judge Models OpenAI GPT-5.5

Total Score

88

Overall Comments

Answer A is a faithful, compact summary that stays within the required 220 to 280 words and uses a single coherent prose paragraph. It captures the council vote, limited authorization, reasons for the project, uses, funding structure, major supporters and opponents, preservation dispute, unresolved cleanup and transit issues, and tentative timeline. Its main weaknesses are minor compression-related omissions, such as less detail on the broader infrastructure debate and specific design features like the raised floor and mezzanine.

View Score Details

Faithfulness

Weight 40%
90

Answer A is highly accurate and does not add outside information. It has only minor imprecision in compressing the environmental-testing issue, but it otherwise preserves the passage’s claims and uncertainties well.

Coverage

Weight 20%
82

Answer A covers nearly all core elements: the 5-to-2 vote, what it authorized, project rationale, proposed uses, budget, funding conditions, supporters, opponents, preservation concerns, unresolved cleanup and transit issues, and timeline. It omits some secondary specifics, including the raised floor, mezzanine, and broader final framing about essential infrastructure.

Compression

Weight 15%
95

Answer A is about 266 words, comfortably within the required 220 to 280 word range, and compresses a complex passage without becoming skeletal.

Clarity

Weight 15%
85

Answer A is clear, neutral, and easy to follow, though its dense single paragraph occasionally packs many details into long sentences.

Structure

Weight 10%
90

Answer A follows a coherent progression from decision and rationale to funding, stakeholder reactions, unresolved issues, and timeline, all in the requested prose-summary form.

The Marlowe City Council has voted to advance a proposal to convert the derelict freight depot into a multi-functional community hub, including a library annex, emergency cooling center, and neighborhood workshop. This decision follows a decade of the building being largely ignored, situated near the public library, a bus loop, and a flood-prone creek. The plan addresses two key issues: the Marlowe Public Library's significant increase in program attendance and the town's recurring need for heat shelters during extreme summer temperatures. Library Director Sonia Patel highlighted the depot's strategic location fo...

Show Full Answer

The Marlowe City Council has voted to advance a proposal to convert the derelict freight depot into a multi-functional community hub, including a library annex, emergency cooling center, and neighborhood workshop. This decision follows a decade of the building being largely ignored, situated near the public library, a bus loop, and a flood-prone creek. The plan addresses two key issues: the Marlowe Public Library's significant increase in program attendance and the town's recurring need for heat shelters during extreme summer temperatures. Library Director Sonia Patel highlighted the depot's strategic location for shared staffing and accessibility. The proposed annex would offer additional classrooms, a tool-lending service, extended restroom access, and a climate-controlled space for cooling center functions. The preliminary budget is set at $14.8 million, with substantial allocations for environmental cleanup ($2.3 million) and flood-resistant landscaping ($1.1 million), alongside investments in solar panels and battery storage ($900,000). Funding sources include a $5 million state resilience grant and a $2 million philanthropic pledge from the Hannegan Foundation, which is contingent on preserving the depot's exterior and operating the workshop space evenings weekly. The remaining costs are expected to be covered by municipal bonds and a proposed utility resilience fee of $1.75 per household monthly for twelve years. A second public hearing and detailed cost estimates will precede any final borrowing decisions. Supporters, including local educators and the carpenters' guild, view the project as an efficient solution to multiple community needs, providing space for tutoring, job training, and skill-sharing. Senior residents and environmental advocates have also voiced support, emphasizing the need for backup power and the ecological benefits of restoring the creekside area. However, opposition has emerged due to concerns about the speed of the process and potential long-term operational costs. Council members Dana Rhee and Martin Cole voted against the proposal, citing insufficient estimates for staffing, maintenance, and the fairness of the proposed utility fee, particularly for renters and those on fixed incomes. Nearby homeowners also expressed worries about increased noise, traffic, and parking issues. Preservationists and former rail workers raised concerns that the extensive interior renovations, including raising the floor and adding a mezzanine, would compromise the building's historic integrity. Architect Mina Okafor acknowledged material loss but pledged to explore retaining certain original features. The council has mandated that the design team consult with the historical commission. Practical challenges remain, including potential cost overruns from environmental cleanup (lead paint, asbestos, petroleum contamination) which could delay construction. Modifications to the bus loop for emergency vehicle access are also necessary. Despite these hurdles, Public Works Director Janice Ho believes the issues are manageable, though the timeline is tight. If all proceeds as planned, construction is slated to begin next spring, with an opening in early 2028. The project has spurred a broader discussion on defining essential public infrastructure, linking library services, climate adaptation, historic preservation, and neighborhood needs.

Result

#2

Winning Votes

0 / 3

Average Score

74

Total Score

78

Overall Comments

Answer B is well-written and covers most key elements with good clarity, but it significantly exceeds the 220-280 word limit (running approximately 380-400 words), which is a major failure of the compression requirement. It also omits the specific 20 percent cost-overrun threshold and is slightly imprecise about the Hannegan Foundation's five-evenings-per-week condition. While readable and generally accurate, it does not meet the core length constraint of the task.

View Score Details

Faithfulness

Weight 40%
85

Answer B is generally faithful but contains a minor omission of precision: it says the workshop must operate 'evenings weekly' rather than 'at least five evenings per week,' which slightly softens a specific condition. It also omits the 20 percent cost-overrun threshold that triggers the grant revision requirement, which is a meaningful detail. No fabrications, but slightly less precise.

Coverage

Weight 20%
80

Answer B covers most major elements but misses the specific 20 percent cost-overrun threshold that triggers grant revision, omits the specific vote count (5-2), and is less precise about the Hannegan Foundation's condition (five evenings per week). The broader civic infrastructure framing at the end is included, which is a positive. Overall coverage is good but slightly less complete than A.

Compression

Weight 15%
65

Answer B is structured into multiple paragraphs and runs noticeably longer than the 220-280 word limit—it appears to be around 380-400 words. This exceeds the specified length constraint, which is a significant compression failure. The task explicitly required 220 to 280 words.

Clarity

Weight 15%
75

Answer B is written in clear, accessible prose and is easy to read. The multi-paragraph structure aids readability. However, the excessive length undermines the clarity of compression, and some transitions between sections feel slightly abrupt.

Structure

Weight 10%
70

Answer B uses multiple paragraphs, which aids organization but violates the spirit of 'single coherent prose summary' less severely than using bullet points. The structure is logical but the length violation and the slightly looser organization reduce the score.

Total Score

73

Overall Comments

Answer B offers a faithful and comprehensive summary of the source material. However, it significantly fails the crucial word count constraint, making it less effective as a summary. While its clarity and structure are generally good, the excessive length detracts from its overall quality and compression.

View Score Details

Faithfulness

Weight 40%
90

Answer B is highly faithful to the source passage, accurately presenting all key facts, figures, and perspectives without any discernible errors or external information.

Coverage

Weight 20%
85

Answer B provides comprehensive coverage of all the required elements, including facts, trade-offs, stakeholders, funding, timeline, and concerns. It includes all essential information from the source.

Compression

Weight 15%
20

Answer B fails significantly on compression, coming in at 409 words, which is far beyond the specified 220-280 word limit. This is a major drawback for a summarization task.

Clarity

Weight 15%
70

Answer B is generally clear and understandable, but its excessive length makes it less concise than it could be. The information is well-articulated, but the verbosity slightly detracts from its overall clarity.

Structure

Weight 10%
65

Answer B uses multiple paragraphs, which, while logical for its length, deviates from the prompt's request for 'a single coherent prose summary.' The flow within paragraphs is good, but the overall structure could be more condensed.

Judge Models OpenAI GPT-5.5

Total Score

71

Overall Comments

Answer B is generally accurate and covers many important facts, including funding, stakeholder positions, preservation concerns, environmental cleanup, bus-loop issues, and the timeline. However, it substantially exceeds the required word range and is divided into multiple paragraphs rather than functioning as a tightly compressed single prose summary. It also omits or softens some specific required details, such as the 5-to-2 vote, the November budget step, and the exact evening-use condition.

View Score Details

Faithfulness

Weight 40%
83

Answer B is mostly faithful and avoids major invention, but it slightly generalizes some details, such as describing workshop operation as merely evenings weekly and adding phrasing like job training. It also omits some limiting procedural facts that affect accuracy of emphasis.

Coverage

Weight 20%
80

Answer B covers many major facts and includes several details on preservation, cleanup, bus access, and the broader public-infrastructure discussion. However, it misses notable specifics such as the 5-to-2 vote, preparation of a final budget by November, the explicit non-authorization of construction, and the exact five-evenings-per-week condition.

Compression

Weight 15%
20

Answer B is far over the 280-word maximum, making it a poor fit for the required summary length despite being relevant and readable.

Clarity

Weight 15%
80

Answer B is clearly written and logically understandable, but its verbosity reduces concision and makes it less effective as a compact summary.

Structure

Weight 10%
65

Answer B is organized logically by topic, but the multi-paragraph presentation and excessive expansion make it less aligned with the requested single coherent prose summary.

Comparison Summary

Final rank order is determined by judge-wise rank aggregation (average rank + Borda tie-break). Average score is shown for reference.

Judges: 3

Winning Votes

3 / 3

Average Score

88
View this answer

Winning Votes

0 / 3

Average Score

74
View this answer

Judging Results

Judge Models OpenAI GPT-5.5

Why This Side Won

Answer A wins because it satisfies the central summarization constraints much better while remaining highly faithful and sufficiently comprehensive. Answer B has good coverage and clarity, but its major length violation and looser format significantly weaken it under the weighted criteria, especially compression and structure, without enough coverage advantage to offset those losses.

Why This Side Won

Answer A is the clear winner because it successfully meets all the prompt's requirements, especially the critical word count constraint. Answer B, despite being faithful and comprehensive, fails significantly on compression by exceeding the word limit by a large margin, which is a fundamental requirement for a summarization task.

Why This Side Won

Answer A wins on the highest-weighted criterion (faithfulness, 40%) with greater precision and completeness, and critically outperforms Answer B on compression (15%) by staying within the required 220-280 word range while Answer B substantially exceeds it. Answer A also achieves better coverage (20%) by including the 20 percent overrun threshold and the exact vote count. These advantages on the three most heavily weighted criteria make Answer A the clear winner despite both answers being generally well-written.

X f L