Orivel Orivel
Open menu

Internal Memo Proposing a Four-Day Pilot Schedule

Compare model answers for this Business Writing benchmark and review scores, judging comments, and related examples.

Login or register to use likes and favorites. Register

X f L

Contents

Task Overview

Benchmark Genres

Business Writing

Task Creator Model

Answering Models

Judge Models

Task Prompt

Write a concise internal memo from the Head of Operations to all employees proposing a 12-week pilot of a four-day workweek for one department. The memo must explain the business rationale, identify the pilot department, describe how success will be measured, address likely employee concerns, and state the next steps. Keep the tone professional, transparent, and practical. Do not promise that the policy will become permanent. Limit the memo to 450 words.

Task Context

The company is a 180-person B2B software firm with hybrid employees across two time zones. Leadership wants to test whether a four-day workweek can improve focus and retention without reducing customer responsiveness. The proposed pilot department is Product Design, because its work is mostly project-based but still requires collaboration with Product Management and Engineering. During the pilot, employees in the department would work Monday through Thursday, maintain the same total weekly pay, and rotate one perso...

Show more

The company is a 180-person B2B software firm with hybrid employees across two time zones. Leadership wants to test whether a four-day workweek can improve focus and retention without reducing customer responsiveness. The proposed pilot department is Product Design, because its work is mostly project-based but still requires collaboration with Product Management and Engineering. During the pilot, employees in the department would work Monday through Thursday, maintain the same total weekly pay, and rotate one person to monitor urgent Friday requests. The company will evaluate productivity, project delivery dates, employee engagement, customer/internal stakeholder satisfaction, and cross-team coordination issues.

Judging Policy

A strong answer should read like a polished workplace memo with a clear subject, audience-appropriate tone, and logical structure. It should explain the rationale without overselling, include the required pilot details, define practical success measures, acknowledge concerns such as workload compression, collaboration gaps, fairness across departments, and Friday coverage, and end with concrete next steps. It should be concise, specific, and businesslike, avoiding vague enthusiasm, legal overcommitment, or excessive jargon.

Answers

TO: All Employees
FROM: Head of Operations
DATE: [Date]
SUBJECT: 12-Week Pilot — Four-Day Workweek in Product Design

I am writing to share a structured experiment we will begin next month: a 12-week pilot of a four-day workweek within the Product Design team. I want to be clear from the outset that this is a test, not a permanent policy change. Our goal is to gather evidence before drawing any conclusions.

Business rationale
As a 180-person B2B software firm operating across two time zones, we are always looking for ways to improve focus and retention without compromising customer responsiveness. Industry exper...

Show Full Answer

TO: All Employees
FROM: Head of Operations
DATE: [Date]
SUBJECT: 12-Week Pilot — Four-Day Workweek in Product Design

I am writing to share a structured experiment we will begin next month: a 12-week pilot of a four-day workweek within the Product Design team. I want to be clear from the outset that this is a test, not a permanent policy change. Our goal is to gather evidence before drawing any conclusions.

Business rationale
As a 180-person B2B software firm operating across two time zones, we are always looking for ways to improve focus and retention without compromising customer responsiveness. Industry experience suggests a compressed schedule can reduce burnout, sharpen concentration, and strengthen our ability to attract and keep talented people. We want to learn whether those benefits hold true for us—and at what cost.

Why Product Design
We selected Product Design because its work is largely project-based, making it well suited to outcome-focused measurement. At the same time, the team collaborates closely with Product Management and Engineering, so the pilot will surface any real cross-team coordination challenges rather than hiding them.

How the pilot will work
During the 12 weeks, participating employees will work Monday through Thursday while maintaining the same total weekly pay. To protect responsiveness, one team member will rotate each week to monitor urgent Friday requests. Standard collaboration with Product Management and Engineering will continue, and we will adjust meeting cadences as needed.

How we will measure success
We will evaluate the pilot against clear, balanced criteria:

  • Productivity and quality of design output
  • On-time project delivery dates
  • Employee engagement and wellbeing
  • Customer and internal stakeholder satisfaction
  • Cross-team coordination friction

We will baseline these measures before the pilot and review them at the midpoint and conclusion.

Addressing likely concerns
I recognize this raises questions. Some worry their workload will simply compress into four longer days; we will monitor hours and workload to guard against that. Others outside Product Design may wonder whether they are being overlooked—this single-department test is the responsible way to learn before considering anything wider. Customers and partners may worry about availability; the Friday rotation and clear communication are designed to keep responsiveness intact. If the data shows the trade-offs are not working, we will end or adjust the pilot.

Next steps

  1. The Product Design lead will hold a kickoff briefing to confirm the rotation schedule and expectations.
  2. Operations will establish baseline metrics this week.
  3. Product Management and Engineering will receive coordination guidelines before the start date.
  4. We will share interim findings at week six and full results afterward.

Please direct questions to your manager or to me directly. Thank you for approaching this with openness and curiosity as we test a smarter way of working.

Result

#1 | Winner

Winning Votes

3 / 3

Average Score

90
Judge Models OpenAI GPT-5.4

Total Score

91

Overall Comments

Answer A is a strong workplace memo that closely follows the prompt. It clearly identifies the pilot scope, rationale, operational details, measurement plan, likely concerns, and concrete next steps. The tone is professional and transparent, and the structure is easy to scan. Its main weakness is that a few phrases are slightly more expansive than necessary, but it remains concise and practical.

View Score Details

Appropriateness

Weight 25%
93

Directly matches the task: concise internal memo, correct sender/audience framing, clear business rationale, correct pilot department, explicit non-permanence, success measures, concerns, and next steps. It also reflects the company context well, including time zones and customer responsiveness.

Clarity

Weight 20%
90

Very clear and readable, with straightforward wording and specific explanations of what will happen, why, and how results will be judged. The bullet points and section labels make interpretation easy.

Structure

Weight 20%
92

Excellent memo structure with strong subject line, logical section headings, bullet points for metrics, and numbered next steps. The organization supports quick internal reading.

Actionability

Weight 20%
91

Provides practical implementation detail: Monday-Thursday schedule, unchanged pay, rotating Friday coverage, baseline metrics, midpoint and final reviews, coordination guidance, kickoff briefing, and clear reporting timeline. Employees can see what happens next.

Tone

Weight 15%
88

Professional, transparent, and practical. It acknowledges uncertainty and trade-offs without overselling, and it addresses concerns respectfully.

Total Score

87

Overall Comments

Answer A is a well-crafted, polished internal memo that covers every required element with specificity and balance. It explains the business rationale clearly, identifies the pilot department with reasoning, describes concrete success metrics with a baseline/midpoint/conclusion review plan, directly addresses multiple employee concerns (workload compression, fairness to other departments, Friday coverage), and closes with four numbered, actionable next steps. The tone is professional, transparent, and appropriately cautious about permanence. It stays within the word limit while remaining substantive.

View Score Details

Appropriateness

Weight 25%
90

Answer A addresses every required element from the task prompt: business rationale, pilot department selection with reasoning, how the pilot works (pay, schedule, Friday rotation), five specific success metrics with a baselining plan, direct acknowledgment of workload compression, fairness concerns, and customer availability, and a clear disclaimer that this is not a permanent policy. Nothing is missing or glossed over.

Clarity

Weight 20%
85

Each section is clearly labeled and the logic flows naturally from rationale to mechanics to measurement to concerns to next steps. The language is plain and precise, avoiding jargon. The reader knows exactly what will happen, why, and what to expect.

Structure

Weight 20%
90

Answer A uses clear section headers, a logical progression, and a numbered next-steps list. The structure mirrors best-practice memo formatting and makes it easy to scan and reference. Each section has a distinct purpose.

Actionability

Weight 20%
85

Four numbered next steps are provided with clear owners (Product Design lead, Operations, Product Management and Engineering) and timing (this week, before start date, week six, after conclusion). Employees know exactly what happens next and when they will receive updates.

Tone

Weight 15%
80

The tone is professional, transparent, and appropriately measured. It neither oversells the pilot nor creates anxiety. The closing line ('test a smarter way of working') is slightly enthusiastic but not excessive. The disclaimer about ending the pilot if data is unfavorable is a strong trust-building element.

Total Score

92

Overall Comments

Answer A is an exemplary response that perfectly matches the requirements of the task. It uses a clear, professional memo format with an excellent structure, employing headings and lists to make the information easy to digest. It addresses all components of the prompt with specific, well-reasoned details, from the business rationale to the success metrics. Its handling of likely employee concerns is particularly strong, demonstrating foresight and transparency. The actionable next steps provide clarity for the entire organization, making it a highly effective piece of internal communication.

View Score Details

Appropriateness

Weight 25%
90

The answer uses a standard memo format and its content is perfectly tailored to the task's requirements, including all necessary details like rationale, metrics, and concerns. It feels like a genuine and effective internal document.

Clarity

Weight 20%
95

The clarity is outstanding. The use of headings, bullet points for metrics, and a numbered list for next steps makes the information exceptionally easy to understand and navigate. The language is direct and unambiguous.

Structure

Weight 20%
95

The structure is a key strength. The memo is logically organized with clear headings that signpost each section, perfectly aligning with the prompt's requirements and making the document highly scannable.

Actionability

Weight 20%
90

The memo provides a clear, numbered list of concrete next steps, indicating what will happen, who is involved, and the general timing. This gives the entire organization a clear view of the path forward.

Tone

Weight 15%
90

The tone is perfectly calibrated: professional, transparent ('this is a test, not a permanent policy change'), and practical. Directly addressing concerns adds a layer of empathy and builds trust.

MEMORANDUM

TO: All Employees
FROM: Head of Operations
DATE: October 26, 2023
SUBJECT: Pilot Program: Four-Day Workweek for Product Design Department

I am writing to announce an upcoming pilot program designed to explore innovative approaches to our work structure and enhance employee well-being and productivity. Effective November 13, 2023, we will launch a 12-week trial of a four-day workweek for our Product Design department.

This initiative stems from our ongoing commitment to fostering a focused and engaging work environment. We believe that a condensed workweek may lead to increased productivity, improved...

Show Full Answer

MEMORANDUM

TO: All Employees
FROM: Head of Operations
DATE: October 26, 2023
SUBJECT: Pilot Program: Four-Day Workweek for Product Design Department

I am writing to announce an upcoming pilot program designed to explore innovative approaches to our work structure and enhance employee well-being and productivity. Effective November 13, 2023, we will launch a 12-week trial of a four-day workweek for our Product Design department.

This initiative stems from our ongoing commitment to fostering a focused and engaging work environment. We believe that a condensed workweek may lead to increased productivity, improved employee morale, and better retention, all while maintaining our high standards of customer service and internal collaboration. The Product Design department has been selected for this pilot due to the project-based nature of their work, which allows for structured implementation while still requiring essential collaboration with Product Management and Engineering.

During the pilot, Product Design employees will work Monday through Thursday. Their total weekly pay will remain unchanged. To ensure continuous support for urgent requests, a rotating on-call system will be implemented for Fridays.

Success of this pilot will be evaluated through a comprehensive set of metrics. We will closely monitor productivity levels, adherence to project delivery timelines, employee engagement surveys, and feedback from both customers and internal stakeholders regarding satisfaction and cross-team coordination. This data will provide a clear picture of the program's impact.

We understand that a change in work structure may raise questions. This pilot is a test, and its outcomes will determine any future considerations. We are committed to transparency throughout this process and will share findings and insights as they become available. Our primary goal is to assess the feasibility and benefits of a four-day workweek within our operational context without compromising our business objectives or client responsiveness.

Following the 12-week period, a thorough review will be conducted. Based on the evaluation results, we will determine the next steps. Further details regarding the pilot's commencement and specific operational adjustments will be communicated directly to the Product Design team by their management.

Result

#2

Winning Votes

0 / 3

Average Score

64
Judge Models OpenAI GPT-5.4

Total Score

78

Overall Comments

Answer B is polished and professional, with a clear memo format and the core pilot details included. However, it is more generic, less operationally specific, and weaker on addressing employee concerns and next steps. It explains the rationale and evaluation areas adequately but does not give the same level of practical guidance or transparency as Answer A.

View Score Details

Appropriateness

Weight 25%
78

Covers most required elements and stays within memo form, but it is less complete relative to the prompt. It mentions rationale, department, pay, Friday coverage, and evaluation, yet it only lightly addresses likely concerns and gives less concrete next steps.

Clarity

Weight 20%
80

Generally clear and easy to follow, but some phrasing is broader and more corporate, which makes the memo feel less precise. It communicates the basics well, though with fewer concrete details than A.

Structure

Weight 20%
81

Solid memo structure with standard headers and logical paragraph flow. However, it relies on block paragraphs and lacks the stronger sectional segmentation and scannability of A.

Actionability

Weight 20%
68

Includes basic operational information such as the schedule, unchanged pay, and Friday on-call rotation, but it is comparatively thin on execution detail. Next steps are vague, with responsibility and timing mostly deferred to later communication.

Tone

Weight 15%
84

Professional and measured, with appropriate restraint about future adoption. However, the tone is somewhat more generic and polished-corporate than practical and candid.

Total Score

54

Overall Comments

Answer B is a competent but noticeably thinner memo. It announces the pilot and touches on the required elements, but does so at a surface level. The concern-addressing section is vague and generic, the success metrics are listed without any mention of baselining or interim reviews, the next steps are minimal and deferred to department management rather than concrete, and the rationale leans on enthusiasm ("innovative approaches") rather than practical business logic. It does not meaningfully address workload compression, cross-department fairness, or the specifics of the Friday rotation system. The tone is acceptable but slightly promotional.

View Score Details

Appropriateness

Weight 25%
55

Answer B covers the basic elements but at a shallow level. It does not meaningfully address workload compression, cross-department fairness, or the specifics of the Friday rotation. The concern section is a single vague paragraph. The success metrics are listed but lack any mention of baselining or interim review. The disclaimer about permanence is present but buried.

Clarity

Weight 20%
60

The memo is readable but less precise. Phrases like 'innovative approaches' and 'comprehensive set of metrics' are vague. The Friday rotation is mentioned but not explained. The next steps section is unclear about who does what and when.

Structure

Weight 20%
55

Answer B uses a standard memo header but no internal section headers, making it harder to navigate. The paragraphs blend topics (rationale and selection are merged, concerns and next steps are compressed). The lack of visual hierarchy reduces scannability.

Actionability

Weight 20%
40

The next steps section is minimal: it says a review will be conducted after 12 weeks and that details will be communicated by department management. No specific owners, no interim checkpoints, no baseline-setting step, and no timeline for when employees will hear back. This is the weakest part of Answer B.

Tone

Weight 15%
60

The tone is generally professional but leans slightly promotional in places ('innovative approaches,' 'enhance employee well-being'). It reads more like a press release than an internal operational memo. The commitment to transparency is stated but not demonstrated through the memo's content.

Total Score

59

Overall Comments

Answer B is an adequate but flawed response. While it adopts a professional tone and a correct memo format, it fails to address several key requirements of the prompt. It is notably weak in addressing likely employee concerns, which it mentions only in passing, and it completely fails to provide actionable next steps for the memo's audience. The paragraph-based structure is less effective and scannable than a memo with headings. The content is more generic and lacks the specificity and practical detail that make Answer A so convincing.

View Score Details

Appropriateness

Weight 25%
70

The answer uses an appropriate memo format, but its content is not fully appropriate for the task's purpose because it omits crucial details, such as addressing specific employee concerns and outlining next steps for the audience.

Clarity

Weight 20%
65

The prose is generally clear, but the overall message is muddled by a lack of structure and specificity. The failure to provide clear next steps significantly reduces the memo's clarity of purpose for the reader.

Structure

Weight 20%
60

The memo follows a logical sequence of paragraphs, which is an acceptable structure. However, it lacks headings or other formatting to break up the text, making it less effective and harder to scan than Answer A.

Actionability

Weight 20%
30

This is a major weakness. The memo provides no actionable information for its intended audience ('All Employees'), instead deferring all details to a separate communication for a single team. This fails a core requirement of the prompt.

Tone

Weight 15%
70

The tone is professional and businesslike, which is appropriate. However, it is more formal and less transparent than Answer A, using slightly more corporate jargon and failing to engage directly with employee concerns.

Comparison Summary

Final rank order is determined by judge-wise rank aggregation (average rank + Borda tie-break). Average score is shown for reference.

Judges: 3

Winning Votes

3 / 3

Average Score

90
View this answer

Winning Votes

0 / 3

Average Score

64
View this answer

Judging Results

Why This Side Won

Answer A is the clear winner because it comprehensively fulfills every requirement of the prompt with a high degree of polish and specificity. Its superior structure, using clear headings and lists, makes it a much more effective and readable internal communication document. Most importantly, it excels in the heavily weighted criteria of actionability and clarity by providing concrete next steps and directly addressing specific employee concerns, two areas where Answer B was critically deficient.

Why This Side Won

Answer A wins on every weighted criterion. It is more appropriate to the task (addresses all required elements with specificity), clearer in its explanations, better structured with labeled sections and numbered next steps, far more actionable with concrete timelines and responsibilities, and more professionally toned without overselling. The gap is especially large on appropriateness and actionability, the two highest-weighted criteria alongside clarity and structure, making Answer A the clear winner by weighted score.

Judge Models OpenAI GPT-5.4

Why This Side Won

Answer A wins because it performs better on the most important weighted criteria, especially appropriateness, actionability, and clarity. It more fully satisfies the prompt by addressing likely employee concerns in concrete terms, explaining how success will be measured, and stating specific next steps, while maintaining a professional memo style and avoiding an overpromise that the policy will become permanent.

X f L