Design a Global URL Shortening Service

Compare model answers for this System Design benchmark and review scores, judging comments, and related examples.

X f L

Contents

Task Overview

Benchmark Genres

System Design

Task Creator Model The task creator is randomly selected from top task-generation models of supported providers.

OpenAI GPT-5.4

Answering Models In this benchmark, models from the same provider as the task creator are excluded from answering.

Answer A Google Gemini 2.5 Pro

Answer B Anthropic Claude Sonnet 4.6

Judge Models Judging uses exactly 3 judge models, excluding the answering models. At least 1 judge is selected from flagship models, lightweight models are not selected as judges, and the 3 judges come from 3 distinct providers.

OpenAI GPT-5.2 Anthropic Claude Opus 4.6 Google Gemini 2.5 Flash

Task Prompt

Show more ▼

Design a public URL shortening service similar to Bitly. Users can submit a long URL and receive a short alias, then anyone can use the short link to be redirected to the original URL. Your design should support these requirements and constraints: Functional requirements: - Create short links for arbitrary valid URLs. - Redirect short links with low latency. - Support optional custom aliases when available. - Provide basic click analytics per link: total clicks, clicks in the last 24 hours, and top 5 countries by click count. - Allow link expiration dates. Scale assumptions: - 120 million new short links per day. - 8 billion redirect requests per day. - Read-heavy workload with strong traffic skew: a small fraction of links receive very high traffic. - Global users across North America, Europe, and Asia. Constraints: - 99.99% availability target for redirects. - P95 redirect latency under 80 ms for users in major regions. - Newly created links should become usable within 2 seconds globally. - Analytics can be eventually consistent, but redirects must be correct. - Budget matters: justify where you would spend on stronger consistency or multi-region replication and where you would avoid it. - Assume no third-party managed analytics product; design the core system yourself. Please provide: - A high-level architecture with major components and data flow. - Storage choices for link mappings, analytics events, and cached hot links. - ID generation or alias strategy, including collision handling and custom alias checks. - API design for create-link, redirect, and analytics retrieval. - Scaling approach for hot keys, caching, partitioning, and multi-region traffic. - Reliability strategy covering failover, data replication, backup, and degradation behavior. - Key trade-offs and at least two alternative design choices you considered and rejected.

Task Context

Assume the service only needs web and API clients, not mobile SDKs. You may assume standard cloud primitives such as load balancers, queues, object storage, CDN, and regional databases are available. You do not need to estimate exact hardware counts, but your design should be concrete enough that an engineer could begin implementation planning.

Judging Policy

Show more ▼

A strong answer should present a coherent end-to-end architecture that matches the stated scale and latency goals, clearly separate the redirect path from the analytics pipeline, and justify storage and replication choices in terms of consistency, cost, and availability. It should address global deployment, cache strategy for very hot links, alias generation and collision handling, and practical API behavior. Better answers discuss likely bottlenecks, failure modes, and graceful degradation, and explain trade-offs between options rather than listing components without reasoning. Weak answers ignore the traffic scale, treat analytics and redirect correctness the same way, omit multi-region concerns, or fail to explain why specific architectural choices fit the constraints.

Answers

Answer A: Google Gemini 2.5 Pro

1. High-Level Architecture

The system is designed as a globally distributed, multi-region service to ensure low latency and high availability. Each major region (e.g., US-East, EU-West, AP-Southeast) hosts a complete Point of Presence (PoP).

Components:

Global Load Balancer (GLB): Uses latency-based DNS routing to direct users to the nearest healthy regional PoP.
Regional PoP Stack:
- API Gateway: Terminates TLS, authenticates requests, and routes them to the appropriate backend service.
- Redirect Service: A lightweight, highly-scaled service that handles `GET /{shor...

Show Full Answer ▼

1. High-Level Architecture

Components:

Global Load Balancer (GLB): Uses latency-based DNS routing to direct users to the nearest healthy regional PoP.
Regional PoP Stack:
- API Gateway: Terminates TLS, authenticates requests, and routes them to the appropriate backend service.
- Redirect Service: A lightweight, highly-scaled service that handles GET /{short_alias} requests. It is optimized for speed.
- Write Service: Handles link creation (POST /api/v1/links), including custom alias validation and unique ID generation.
- Analytics Service: Provides access to link statistics via GET /api/v1/links/{short_alias}/stats.
Data Infrastructure:
- Global Database: A strongly consistent, distributed database for the core link mapping.
- Regional Cache: An in-memory cache in each region for hot links.
- Analytics Pipeline: A message queue for event ingestion, a stream processor for aggregation, and a NoSQL database for storing and serving analytics data.

Data Flow:

Link Creation: A user's POST request is routed to the nearest Write Service. The service generates or validates an alias and writes the short_alias -> long_URL mapping to the Global Database. The write is synchronously replicated across all regions within 2 seconds.
Redirection: A GET request is routed to the nearest Redirect Service. It first checks the Regional Cache. On a cache hit, it immediately issues a 301 redirect and asynchronously sends a click event to the analytics queue. On a miss, it queries the Global Database's local replica, populates the cache, then performs the redirect and event logging.
Analytics: Click events from all regions are published to a central message queue. A stream processor consumes these events, aggregates them into time-windowed statistics (total clicks, 24h clicks, country counts), and writes the results to the Analytics Database.

2. Storage Choices

Link Mappings (Primary Storage):
- Technology: A globally distributed SQL database like Google Cloud Spanner or CockroachDB.
- Justification: This choice is driven by the need for strong consistency on redirects and the <2 second global write propagation requirement. These databases provide synchronous replication and low-latency regional reads, justifying their higher cost for the core business function. The short_alias serves as the primary key for fast lookups.
Cached Hot Links:
- Technology: A regional in-memory distributed cache like Redis Cluster.
- Justification: With 8 billion daily reads and heavy traffic skew, a cache is essential for meeting the <80ms P95 latency target and protecting the database. Each region maintains its own cache using a cache-aside pattern.
Analytics Data:
- Ingestion Queue: Apache Kafka or AWS Kinesis. This decouples the high-throughput redirect path from analytics processing and provides a durable buffer for click events.
- Serving Database: A wide-column NoSQL store like Apache Cassandra or ScyllaDB. It is optimized for the high-write, time-series aggregation workload of analytics and is more cost-effective at scale for these query patterns than a relational database.

3. ID Generation and Alias Strategy

We will use a 7-character alias from the alphabet [a-zA-Z0-9], providing 62^7 (over 3.5 trillion) unique combinations.

ID Generation: A background service pre-generates a large pool of unique random IDs and stores them in a dedicated queue (e.g., a Redis list). The Write Service simply fetches a pre-generated ID from this pool when a new link is created. This approach is fast, scalable, and avoids on-the-fly collision checks during a user request.
Custom Alias Handling: When a user submits a custom alias, the Write Service attempts to INSERT it into the Global Database. The short_alias column is the primary key with a unique constraint. This atomically handles the check-and-set operation; if the alias is taken, the database rejects the write, and the service returns a 409 Conflict error to the user.

4. API Design

Create Link: POST /api/v1/links
- Body: { "long_url": "...", "custom_alias": "..." (optional), "expires_at": "..." (optional) }
- Response: 201 Created with { "short_url": "...", "long_url": "..." }
Redirect: GET /{short_alias}
- Response: 301 Moved Permanently with Location: {long_url} header for permanent links. 404 Not Found if the alias doesn't exist or is expired.
Get Analytics: GET /api/v1/links/{short_alias}/stats
- Response: 200 OK with { "total_clicks": ..., "clicks_last_24h": ..., "top_countries": [...] }

5. Scaling Approach

Hot Keys: A multi-layer caching strategy is employed. For extremely viral links, the 301 redirect response can be cached at the CDN layer. The primary defense is the regional Redis cache, which will absorb the majority of the skewed traffic.
Partitioning: The Global Database (Spanner/CockroachDB) automatically handles partitioning of link data by its primary key (short_alias). The Analytics Database (Cassandra) will be partitioned by short_alias to co-locate all stats for a given link, enabling efficient queries.
Multi-Region Traffic: The GLB ensures users are routed to their nearest PoP for low latency. The architecture is active-active, as writes can be handled by any region and are globally replicated by the database. This design maximizes resource utilization and availability.

6. Reliability Strategy

Failover: The GLB performs health checks and will automatically route traffic away from any unhealthy region. Within a region, load balancers will route around failed service instances.
Data Replication: The Global Database is configured to replicate data across multiple availability zones and regions, providing resilience against both instance and regional failures. The Cassandra cluster for analytics will also be deployed across multiple regions.
Backup: Automated, periodic snapshots of both the primary and analytics databases are taken and stored in durable object storage (e.g., AWS S3) for disaster recovery.
Graceful Degradation: The system is designed so that a failure in the non-critical analytics pipeline will not impact the core redirect functionality. The Redirect Service will continue to serve requests even if it cannot publish events to the message queue.

7. Trade-offs and Alternatives Rejected

Alternative 1: Hash-based ID Generation (Rejected).
- Concept: Generate a short ID by hashing the long URL. If a collision occurs, add a salt and re-hash.
- Reason for Rejection: This adds unpredictable latency and complexity to the write path. At 120 million writes per day, collision probability becomes significant, requiring multiple database lookups to find a unique ID. The chosen pre-generation strategy provides constant-time performance for ID acquisition during a user request.
Alternative 2: Single-Master Relational Database (Rejected).
- Concept: Use a traditional database like PostgreSQL in a single primary region with read replicas in other regions.
- Reason for Rejection: This design cannot meet the strict <2 second global write propagation requirement. Replication lag to distant read replicas would frequently exceed this threshold, meaning a newly created link would not be immediately available to all global users. The higher cost of a globally distributed database is justified to meet this core functional requirement.

Result

Winning Votes

0 / 3

Average Score

Judge Models OpenAI GPT-5.2

Total Score

Overall Comments

Presents a coherent multi-region architecture with a clear separation between redirect and analytics paths, good core storage choices (global strongly consistent DB for mappings, regional Redis cache, async analytics pipeline), and includes required APIs and two rejected alternatives. However, several areas are under-specified or somewhat hand-wavy for this scale: ID pre-generation pool details and safety (de-duplication, multi-region coordination, refill behavior) are not addressed; analytics design is generic and doesn’t clearly explain how to compute last-24h and top countries efficiently; expiration semantics and cache/CDN interaction aren’t deeply covered; multi-region replication and consistency/budget justification is relatively shallow beyond “use Spanner/Cockroach.” Reliability/degradation is reasonable but misses concrete RTO/RPO and operational details for the global DB and analytics backpressure/data loss policy.

View Score Details ▼

Architecture Quality

Weight 30%

Sound regional PoP model with separate redirect/write/analytics services and async eventing, but some key components are described at a generic level ("Global Database" with synchronous replication everywhere) without detailing practical global write/read behavior and edge caching/TTL interaction.

Completeness

Weight 20%

Covers all requested sections, but analytics queries/aggregation approach is not fully specified (especially last-24h/top-5 countries), expiration handling is thin, and the ID pre-generation mechanism lacks operational details (coordination, collisions, refill, multi-region).

Trade-off Reasoning

Weight 20%

Includes two alternatives rejected with reasonable rationale, but budget/consistency trade-offs are mostly asserted (pay for global strong DB) with limited discussion of what can be eventual or how to minimize cost beyond caching.

Scalability & Reliability

Weight 20%

Regional caching and async analytics support read-heavy skew; reliability includes failover and degradation, but lacks deeper handling of hot-key saturation beyond CDN/Redis and gives limited detail on multi-region failure behavior and replication modes/cost.

Clarity

Weight 10%

Organized and readable with clear headings and flows; some statements are oversimplified (synchronous replication within 2 seconds globally) which reduces precision.

Judge Models Google Gemini 2.5 Flash

Total Score

Overall Comments

Answer A provides a solid, well-structured system design that addresses all key requirements. It clearly separates the redirect and analytics paths, makes appropriate storage choices, and outlines a reasonable ID generation and scaling strategy. The trade-offs discussed are relevant and well-justified. However, it lacks some of the depth and detail found in Answer B, particularly in areas like multi-layered caching for hot keys, more robust ID generation, and a comprehensive budget justification.

View Score Details ▼

Architecture Quality

Weight 30%

The architecture is well-defined with clear separation of concerns (Redirect, Write, Analytics services) and appropriate use of regional PoPs and a global database. The data flow is logical and addresses the core requirements.

Completeness

Weight 20%

Answer A covers all the requested sections, providing a complete high-level design. However, some sections, like ID generation and API design, could benefit from more detail and additional considerations.

Trade-off Reasoning

Weight 20%

The answer identifies two relevant alternatives and provides clear justifications for their rejection, primarily focusing on latency and consistency requirements. The reasoning is sound but limited in scope.

Scalability & Reliability

Weight 20%

The answer outlines a good strategy for scaling hot keys (CDN, regional cache) and multi-region traffic. Reliability aspects like failover, data replication, and graceful degradation are mentioned, providing a solid foundation.

Clarity

Weight 10%

The answer is well-organized with clear headings and bullet points, making it easy to read and understand. The language is precise and avoids ambiguity.

Judge Models Anthropic Claude Opus 4.6

Total Score

Overall Comments

Answer A presents a coherent and well-structured system design that covers all major components. It correctly separates the redirect path from the analytics pipeline, chooses appropriate technologies (Spanner/CockroachDB, Redis, Kafka, Cassandra), and provides a clean API design. The ID generation strategy using pre-generated pools is reasonable. However, the answer lacks depth in several areas: it does not discuss the hot key problem beyond mentioning CDN and Redis caching, provides no capacity estimates, does not address the 301 vs 302 trade-off (defaulting to 301 which would break analytics and link expiration), lacks detail on cache invalidation strategies, does not mention in-process caching, provides only two rejected alternatives, and the reliability section is relatively thin without specific failure scenarios or RTO/RPO details. The budget justification is absent. The design is correct but reads more like a summary than a detailed implementation plan.

View Score Details ▼

Architecture Quality

Weight 30%

Answer A presents a clean, coherent architecture with proper separation of concerns between redirect, write, and analytics paths. The choice of Spanner/CockroachDB for the link store and Kafka for analytics ingestion is sound. However, it lacks a multi-tier caching strategy (no in-process cache), and the use of 301 redirects by default is a significant design flaw that would break analytics tracking and link expiration. The architecture is correct at a high level but misses important nuances.

Completeness

Weight 20%

Answer A covers all seven required sections but with limited depth. It lacks capacity estimates, does not discuss rate limiting, omits the 301 vs 302 consideration, provides no schema details, has no budget justification, and only presents two rejected alternatives. The API design is minimal without error codes for rate limiting or authorization. Cache invalidation strategy is not discussed. The reliability section lacks specific timing for failover or degradation details per component.

Trade-off Reasoning

Weight 20%

Answer A presents only two rejected alternatives: hash-based ID generation and single-master relational database. Both are reasonable but the reasoning is somewhat surface-level. The answer does not discuss the critical 301 vs 302 trade-off, does not compare analytics storage options, and does not address the trade-off between synchronous and asynchronous analytics. The budget justification for where to spend on consistency vs where to save is entirely absent despite being explicitly required by the constraints.

Scalability & Reliability

Weight 20%

Answer A mentions CDN caching and regional Redis for hot keys but lacks specificity. There are no capacity estimates, no discussion of auto-scaling, and the hot key mitigation is limited to two layers. The reliability section covers failover, replication, backup, and graceful degradation at a high level but without specific timing, RTO/RPO targets, or per-component failure analysis. The 99.99% availability target is not explicitly addressed in terms of how the architecture achieves it.

Clarity

Weight 10%

Answer A is well-organized with clear section headers and concise writing. The data flow descriptions are easy to follow. The use of bold text and bullet points aids readability. However, the brevity sometimes comes at the cost of important details. The writing is clean and professional.

Answer B: Anthropic Claude Sonnet 4.6

GLOBAL URL SHORTENING SERVICE — SYSTEM DESIGN

─────────────────────────────────────────

HIGH-LEVEL ARCHITECTURE AND DATA FLOW
─────────────────────────────────────────

The system is organized into three planes: the Write Path (link creation), the Read Path (redirect), and the Analytics Path (event ingestion and querying).

Major components:

Global Anycast Load Balancer / CDN Edge (e.g., Cloudflare or AWS CloudFront with Lambda@Edge)
API Gateway + Auth layer (rate limiting, abuse prevention)
Link Creation Service (stateless, horizontally scalable)
Redirect Service (stateless, extremely latency-sensiti...

Show Full Answer ▼

GLOBAL URL SHORTENING SERVICE — SYSTEM DESIGN

─────────────────────────────────────────

HIGH-LEVEL ARCHITECTURE AND DATA FLOW
─────────────────────────────────────────

The system is organized into three planes: the Write Path (link creation), the Read Path (redirect), and the Analytics Path (event ingestion and querying).

Major components:

Global Anycast Load Balancer / CDN Edge (e.g., Cloudflare or AWS CloudFront with Lambda@Edge)
API Gateway + Auth layer (rate limiting, abuse prevention)
Link Creation Service (stateless, horizontally scalable)
Redirect Service (stateless, extremely latency-sensitive)
ID Generation Service (distributed, unique short-code producer)
Primary Link Store (globally replicated relational DB, e.g., CockroachDB or AWS Aurora Global)
Hot-Link Cache (Redis Cluster, per-region)
Analytics Ingest Queue (Kafka or AWS Kinesis, per-region)
Analytics Aggregation Workers (stream processors, e.g., Flink or Kafka Streams)
Analytics Store (columnar store, e.g., ClickHouse or Apache Druid)
Analytics Query Service (read-only API)
Object Storage for cold backups (S3-compatible)

Write Path (link creation):

Client POSTs to the nearest regional API Gateway.
API Gateway authenticates and rate-limits the request, then forwards to the Link Creation Service.
Link Creation Service calls the ID Generation Service to obtain a unique short code (or validates a custom alias).
The record is written to the Primary Link Store with strong consistency in the local region; the global DB replication propagates the record to other regions within ~1–2 seconds, satisfying the "usable within 2 seconds globally" SLA.
Simultaneously, the new mapping is pushed to the regional Redis cache (write-through or pub/sub invalidation) so the first redirect in that region does not miss cache.
A 201 response with the short URL is returned to the client.

Read Path (redirect):

Client issues GET /{shortCode} to the nearest edge node.
Edge node checks its own CDN cache (for extremely hot links, TTL ~5–10 seconds to respect expiration semantics).
On CDN miss, the request hits the regional Redirect Service.
Redirect Service checks the regional Redis cache (expected hit rate > 99% for hot links).
On Redis miss, the Redirect Service queries the local read replica of the Primary Link Store.
If the link is expired or not found, return 404/410.
Otherwise, issue HTTP 301 (permanent) or 302 (temporary) redirect. Use 302 by default so browsers do not cache indefinitely, preserving analytics accuracy.
Asynchronously emit a click event (shortCode, timestamp, country derived from IP, user-agent) to the regional Analytics Ingest Queue. This is fire-and-forget; it does not block the redirect response.

Analytics Path:

Regional Kafka topics receive click events.
Flink/Kafka Streams jobs consume events and maintain rolling aggregates: total clicks (counter), clicks in last 24 hours (sliding window), and top-5 countries (approximate heavy-hitter sketch, e.g., Count-Min Sketch or SpaceSaving).
Aggregated results are written to ClickHouse (append-optimized columnar store) every 30–60 seconds.
The Analytics Query Service reads from ClickHouse and returns pre-aggregated results. Raw events are also retained in ClickHouse for ad-hoc queries.
Cross-region analytics aggregation: each region's Kafka cluster mirrors events to a central analytics region via Kafka MirrorMaker 2 or Kinesis cross-region replication. The central ClickHouse cluster holds the global view.

─────────────────────────────────────────
2. STORAGE CHOICES
─────────────────────────────────────────

Link Mappings — CockroachDB (or Aurora Global DB):
Rationale: We need strong consistency for writes (to prevent duplicate aliases) and low-latency reads globally. CockroachDB is a distributed SQL database with multi-region active-active support, serializable isolation, and automatic failover. Aurora Global DB is an alternative with a single primary writer and up to 5 read replicas in different regions with ~1 s replication lag. Either satisfies the 2-second global propagation SLA.
Schema (simplified):
links(short_code VARCHAR(12) PK, long_url TEXT NOT NULL, owner_id BIGINT, created_at TIMESTAMPTZ, expires_at TIMESTAMPTZ, is_custom BOOLEAN)
Indexes: short_code (PK, clustered), owner_id + created_at (for user dashboards).
Estimated size: 120M links/day × 365 days × ~300 bytes/row ≈ ~13 TB/year. Manageable with partitioning by created_at.

Hot-Link Cache — Redis Cluster (per region):
Each region runs a Redis Cluster with 3–6 shards and replicas. Cache entries: short_code → {long_url, expires_at}. TTL set to min(link expiration, 24 hours). Expected working set: the top 1% of links (~1.2M) account for ~80% of traffic. At ~500 bytes per entry, 1.2M entries ≈ 600 MB per region — very affordable. We use a write-through strategy on link creation and lazy population on first redirect miss.

Analytics Events — Kafka + ClickHouse:
Kafka retains raw click events for 7 days (replayability). ClickHouse stores both raw events (partitioned by date, sharded by short_code) and pre-aggregated materialized views. ClickHouse's columnar storage and vectorized execution make it ideal for the aggregation queries needed (SUM, COUNT, GROUP BY country, sliding window).

Cold Backups — S3-compatible object storage:
Daily snapshots of the link store and weekly ClickHouse exports are stored in object storage with versioning and cross-region replication for disaster recovery.

─────────────────────────────────────────
3. ID GENERATION AND ALIAS STRATEGY
─────────────────────────────────────────

Short Code Format:
Use 7 characters from a Base62 alphabet (a-z, A-Z, 0-9). Base62^7 = ~3.5 trillion combinations, far exceeding any foreseeable need (120M/day × 10 years = ~438B links).

ID Generation Strategy — Distributed Counter + Encoding:
We use a distributed counter approach rather than random generation to avoid collision storms at scale.

Step 1: Each Link Creation Service instance acquires a block of IDs from a lightweight coordination service (e.g., a Redis INCRBY or a dedicated ID service backed by ZooKeeper). Each instance claims a block of 10,000 IDs at a time, reducing coordination overhead.
Step 2: The numeric ID is Base62-encoded to produce the short code. A 7-character Base62 code can represent IDs up to ~3.5 trillion.
Step 3: To prevent enumeration attacks and add unpredictability, XOR the numeric ID with a secret salt before encoding, or use a bijective scrambling function (Feistel cipher over the ID space).

Custom Aliases:

Client submits desired alias (3–20 alphanumeric characters).
Link Creation Service attempts an INSERT with the custom alias as the short_code using a unique constraint in the DB.
If the INSERT fails due to a unique constraint violation, return HTTP 409 Conflict with a suggestion to try another alias.
Custom aliases bypass the ID generation service entirely.
Custom aliases are reserved in a separate namespace check (profanity list, reserved words like "api", "admin", "health") before the DB write.

Collision Handling for Generated Codes:
Because we use a monotonically increasing counter with bijective encoding, collisions are structurally impossible for generated codes. Collisions can only occur if a generated code happens to match a custom alias — this is handled by the unique constraint; the ID generation service simply increments to the next ID and retries (extremely rare).

─────────────────────────────────────────
4. API DESIGN
─────────────────────────────────────────

All APIs are RESTful over HTTPS. Rate limiting is enforced at the API Gateway layer (e.g., 100 creates/minute per authenticated user, 10 creates/minute for anonymous users).

POST /v1/links — Create Short Link
Request body (JSON):
long_url: string (required, must be a valid URL, max 2048 chars)
custom_alias: string (optional, 3–20 alphanumeric chars)
expires_at: ISO 8601 datetime (optional)
owner_id: string (optional, from auth token)
Response 201 Created:
short_code: string
short_url: string (e.g., "https://sho.rt/aB3xY7z")
long_url: string
expires_at: string or null
created_at: string
Response 400: invalid URL or alias format
Response 409: custom alias already taken
Response 429: rate limit exceeded

GET /{shortCode} — Redirect
No request body. Path parameter: shortCode.
Response 302 Found with Location header set to long_url.
Response 404: short code not found.
Response 410 Gone: link has expired.
This endpoint is served at the apex domain (e.g., sho.rt/{shortCode}) for minimal URL length.

GET /v1/links/{shortCode}/analytics — Retrieve Analytics
Auth required (owner or admin).
Query parameters:
none required (returns default summary)
Response 200:
short_code: string
total_clicks: integer
clicks_last_24h: integer
top_countries: array of objects, each with country_code (ISO 3166-1 alpha-2) and click_count, sorted descending, max 5 entries
last_updated: ISO 8601 datetime (indicates analytics freshness)
Response 404: link not found.
Response 403: not authorized.

GET /v1/links/{shortCode} — Get Link Metadata (optional utility endpoint)
Auth required.
Response 200: full link object including long_url, created_at, expires_at, is_custom.

DELETE /v1/links/{shortCode} — Delete / Deactivate Link
Auth required (owner or admin).
Response 204 No Content.
Soft-deletes the record (sets a deleted_at timestamp) and invalidates the cache entry via pub/sub.

─────────────────────────────────────────
5. SCALING APPROACH
─────────────────────────────────────────

Hot Key Problem:
A small fraction of links (viral URLs, marketing campaigns) will receive millions of redirects per minute. Naive Redis sharding puts all traffic for one key on one shard.
Mitigation:

Local in-process cache in each Redirect Service instance (e.g., Caffeine with a 10,000-entry LRU, TTL 5 seconds). This absorbs the hottest keys entirely within the process, eliminating Redis round-trips for the top ~0.1% of links.
CDN edge caching with a short TTL (5–10 seconds) for the redirect response itself. At 8B redirects/day, even a 90% CDN hit rate offloads 7.2B requests from origin.
Redis read replicas: promote hot-key shards to have more replicas and route reads round-robin.

Caching Strategy:
Three-tier cache: CDN edge → in-process LRU → Redis Cluster → DB read replica.
Cache population: write-through on creation, lazy on first miss.
Cache invalidation: on link deletion or expiration, publish an invalidation message to a Redis pub/sub channel; all Redirect Service instances subscribe and evict the key from their local LRU. CDN entries expire naturally via TTL.

Partitioning:

Link Store: partition by short_code hash (CockroachDB handles this automatically via range-based sharding). No hot partitions because short codes are uniformly distributed after bijective scrambling.
Kafka: partition click events by short_code hash. This ensures all events for a given link go to the same partition, enabling stateful per-link aggregation without shuffles.
ClickHouse: shard by short_code, partition by date. Queries for a single link's analytics hit one shard; time-range queries benefit from partition pruning.

Multi-Region Traffic:
Deploy in three regions: US-East, EU-West, AP-Southeast (covers North America, Europe, Asia).

DNS-based GeoDNS or Anycast routes users to the nearest region.
Each region is fully independent for reads (Redirect Service + Redis + DB read replica).
Writes (link creation) go to the nearest region's Link Creation Service, which writes to the local node of the globally distributed DB. CockroachDB replicates to other regions within ~1–2 seconds.
Analytics events are processed regionally and mirrored to a central analytics cluster for global aggregation.

Capacity Sketch:

8B redirects/day = ~92,500 req/s average, ~300,000 req/s at peak (3x factor).
With 90% CDN hit rate: ~30,000 req/s to origin across 3 regions = ~10,000 req/s per region.
Each Redirect Service instance can handle ~5,000 req/s (mostly cache hits, minimal CPU). ~2–4 instances per region at average load, auto-scaling to 20+ at peak.
120M writes/day = ~1,400 writes/s average. Link Creation Service is not the bottleneck.

─────────────────────────────────────────
6. RELIABILITY STRATEGY
─────────────────────────────────────────

Availability Target: 99.99% = ~52 minutes downtime/year.

Failover:

Each region runs multiple instances of every stateless service behind a regional load balancer with health checks (fail open: remove unhealthy instances within 10 seconds).
Redis Cluster uses automatic failover: if a primary shard fails, a replica is promoted within ~15 seconds. During this window, Redirect Service falls back to the DB read replica (slightly higher latency but correct).
If an entire region fails, GeoDNS detects the failure via health probes and reroutes traffic to the next-nearest region within ~30–60 seconds. The remaining regions must absorb the load; auto-scaling handles this.
CockroachDB / Aurora Global DB: automatic failover to another region's node if the primary region is unavailable. For Aurora, this involves a manual or automated promotion of a read replica to primary (RTO ~1 minute with automation).

Data Replication:

Link Store: synchronous replication within a region (at least 2 nodes), asynchronous cross-region replication (CockroachDB consensus across regions, or Aurora Global DB ~1 s lag). We accept eventual consistency for cross-region reads because the 2-second propagation SLA is met.
Redis: each shard has at least one replica in the same region. No cross-region Redis replication (Redis is a cache; the DB is the source of truth).
Kafka: replication factor 3 within each region. Cross-region mirroring for analytics only (not for redirect correctness).
ClickHouse: replicated within the analytics region using ClickHouse Keeper (ZooKeeper replacement). No multi-region ClickHouse replication needed since analytics are eventually consistent.

Backup:

Daily logical backups of the Link Store to object storage (S3), retained for 30 days.
Weekly full ClickHouse exports to object storage, retained for 90 days.
Kafka topic data retained for 7 days (acts as a replay buffer for analytics recovery).
Backup restoration is tested monthly via automated restore drills in a staging environment.

Degradation Behavior:

If Redis is unavailable: Redirect Service falls back to DB read replicas. Latency increases but correctness is maintained. Alert and auto-scale DB read replicas.
If the Analytics Queue is unavailable: click events are dropped (analytics are eventually consistent; we accept this loss). Redirects are never blocked by analytics failures. A circuit breaker in the Redirect Service disables event emission if the queue is unhealthy.
If the Link Store primary is unavailable: new link creation fails (return 503). Redirects continue using cached data and read replicas. This is acceptable because creation is less critical than redirect availability.
If a region is fully down: GeoDNS failover to adjacent region. Users experience higher latency but the service remains available.
Rate limiting and abuse detection at the API Gateway prevent DDoS from overwhelming the system.

─────────────────────────────────────────
7. KEY TRADE-OFFS AND REJECTED ALTERNATIVES
─────────────────────────────────────────

Trade-off 1: 302 vs. 301 Redirects
We chose 302 (temporary redirect) as the default. 301 (permanent) would allow browsers to cache the redirect indefinitely, reducing origin load. However, 301 makes it impossible to track repeat clicks from the same browser, breaks analytics, and prevents us from updating or expiring links (the browser would serve the cached redirect forever). The cost of 302 is higher origin traffic, which we mitigate with CDN and in-process caching. For links explicitly marked as permanent by the owner (and with no expiration), we could offer 301 as an opt-in, but the default is 302.

Trade-off 2: Random Short Codes vs. Counter-Based Encoding
Rejected: Generating random 7-character Base62 strings and checking for collisions in the DB.
Reason for rejection: At 120M links/day and a 3.5T code space, the collision probability starts low but grows over time. More importantly, each creation requires a DB read to check for collision, then a write — two round-trips under contention. With a counter-based approach, the ID is guaranteed unique by construction; no collision check is needed. The counter service is a minor coordination overhead but is far cheaper than DB collision checks at scale. The bijective scrambling step preserves the unpredictability benefit of random codes.

Trade-off 3: Synchronous Analytics vs. Asynchronous Queue
Rejected: Writing click events synchronously to the analytics DB during the redirect request.
Reason for rejection: This would add 10–50 ms of latency to every redirect (a ClickHouse or analytics DB write in the critical path), violating the P95 < 80 ms SLA. It also creates a hard dependency between redirect availability and analytics availability. The asynchronous queue approach decouples these concerns entirely. The cost is that analytics are eventually consistent (30–60 second lag), which is explicitly acceptable per the requirements.

Trade-off 4: Single Global DB vs. Multi-Region Active-Active DB
We chose a globally distributed DB (CockroachDB or Aurora Global) over a single-region primary with read replicas everywhere. A single-region primary would give us simpler consistency semantics but would make all writes go to one region (high latency for users in other regions creating links, and a single point of failure for writes). The globally distributed DB costs more and adds operational complexity, but it is justified because link creation latency and write availability are important for the user experience and the 99.99% availability target.

Trade-off 5: ClickHouse vs. a Time-Series DB (e.g., InfluxDB) for Analytics
Rejected: InfluxDB or TimescaleDB for analytics storage.
Reason for rejection: Time-series databases are optimized for metrics with a fixed tag set. Our analytics queries require flexible GROUP BY country, arbitrary time windows, and potentially joining with link metadata. ClickHouse's columnar storage, SQL interface, and materialized views handle these patterns more naturally and at higher scale. ClickHouse also supports approximate aggregations (quantiles, heavy-hitters) natively, which is useful for the top-5-countries query at scale.

─────────────────────────────────────────
SUMMARY OF BUDGET JUSTIFICATION
─────────────────────────────────────────

Spend more on:

Multi-region Redis clusters: redirect latency is the core SLA; cache misses are expensive.
Globally distributed Link Store (CockroachDB / Aurora Global): correctness of redirects and 2-second global propagation require this.
CDN edge capacity: the single highest-leverage investment for redirect latency and availability.
Auto-scaling for Redirect Service: traffic is highly variable and spiky.

Spend less on:

Analytics infrastructure: eventual consistency is acceptable; a single central ClickHouse cluster with replication is sufficient. No need for multi-region analytics replication.
Link Creation Service: write volume is ~1,400/s, trivially handled by a few instances.
Cold storage / backups: object storage is cheap; over-provision retention rather than under-provision.
Cross-region Redis replication: Redis is a cache backed by the DB; cross-region replication adds cost and complexity with minimal benefit since the DB already replicates.

Result

#1 | Winner

Winning Votes

3 / 3

Average Score

Judge Models OpenAI GPT-5.2

Total Score

Overall Comments

Provides an end-to-end, implementation-oriented global design with strong separation of write/read/analytics planes, explicit multi-tier caching (CDN, in-process, Redis, DB), and a detailed analytics pipeline including windowed aggregates and top-country heavy-hitter approaches. It addresses hot-key behavior concretely, partitioning choices across DB/Kafka/ClickHouse, and multi-region traffic and failover with degradation modes. ID/alias strategy is robust (block allocation + Base62 + scrambling) with clear custom alias reservation and collision handling. It also includes budget justification and multiple explicit trade-offs/alternatives with rationale. Minor issues: Aurora Global DB characterization is somewhat optimistic for active-active writes, and some capacity numbers are speculative, but overall it best matches the stated constraints and scale.

View Score Details ▼

Architecture Quality

Weight 30%

Well-structured planes (write/read/analytics) with clear data flow, multi-tier caching including edge and in-process layers, and concrete analytics storage/processing choices; overall architecture maps well to latency, availability, and skew constraints.

Completeness

Weight 20%

Addresses every requested item: architecture, storage for mappings/events/cache, IDs and custom aliases, APIs, hot-key/caching/partitioning, multi-region routing, reliability/degradation, budget, and multiple alternatives rejected with reasons.

Trade-off Reasoning

Weight 20%

Explicit, requirement-tied trade-offs (302 vs 301, random vs counter IDs, async analytics, DB topology, analytics DB choice) and a clear budget spend/avoid plan; shows good awareness of correctness vs cost.

Scalability & Reliability

Weight 20%

Strong scalability story: hot-key mitigations (CDN + in-process cache + Redis replicas), partitioning across systems, and explicit multi-region routing/failover/degradation behaviors; acknowledges redirect correctness vs analytics loss and provides practical fallback paths.

Clarity

Weight 10%

Very clear structure with step-by-step flows and concrete API/behavior definitions; dense but readable and easy to map to implementation tasks.

Judge Models Google Gemini 2.5 Flash

Total Score

Overall Comments

Answer B delivers an exceptionally detailed and comprehensive system design. It excels in providing a multi-layered approach to caching, a sophisticated ID generation strategy with collision handling and enumeration prevention, and a very thorough discussion of trade-offs, including a clear budget justification. The API design is more complete, and the scalability and reliability sections are highly granular, covering specific failover mechanisms, RTOs, and graceful degradation scenarios with impressive clarity and depth. The capacity sketch adds a valuable quantitative dimension.

View Score Details ▼

Architecture Quality

Weight 30%

The architecture is exceptionally well-structured, clearly delineating the three main paths (Write, Read, Analytics). It includes more nuanced components like CDN edge with Lambda@Edge and a dedicated ID Generation Service, demonstrating a more refined and robust overall design. The data flow descriptions are highly detailed and precise.

Completeness

Weight 20%

Answer B is remarkably complete, addressing all prompt requirements with extensive detail. It includes additional API endpoints (metadata, delete), a capacity sketch, a highly detailed ID generation strategy, and a comprehensive budget justification, going beyond the basic requirements to provide a truly production-ready design plan.

Trade-off Reasoning

Weight 20%

Answer B provides five distinct trade-offs, each thoroughly analyzed with clear justifications for the chosen approach over alternatives. The discussion on 302 vs. 301 redirects, counter-based vs. random IDs, and synchronous vs. asynchronous analytics is particularly strong. The budget justification section further enhances the trade-off reasoning by explicitly linking design choices to cost implications.

Scalability & Reliability

Weight 20%

Answer B excels in this area with a multi-layered hot key mitigation strategy (CDN, in-process cache, Redis replicas) and detailed partitioning schemes. The reliability section is exceptionally thorough, covering specific failover mechanisms, RTOs, data replication strategies (distinguishing between critical and non-critical), comprehensive backup plans, and a wide range of graceful degradation scenarios with concrete fallbacks. The capacity sketch adds a quantitative validation of scalability.

Clarity

Weight 10%

Answer B is exceptionally clear and well-structured, using distinct sections and detailed bullet points. Despite its extensive content, the information is presented logically and concisely, making complex concepts easy to follow. The precision in language and the systematic breakdown of each topic contribute to its high clarity.

Judge Models Anthropic Claude Opus 4.6

Total Score

Overall Comments

Answer B delivers an exceptionally thorough and well-reasoned system design. It covers all required components with significant depth: a three-tier caching strategy (CDN, in-process LRU, Redis), a counter-based ID generation with bijective scrambling to prevent enumeration attacks, detailed capacity sketches, specific failure mode analysis with degradation behaviors, and five well-articulated trade-offs with clear reasoning. The choice of 302 over 301 is correctly justified for analytics accuracy and link expiration. The analytics pipeline is well-designed with ClickHouse and materialized views. The answer includes practical details like profanity filtering for custom aliases, rate limiting specifics, soft-delete semantics, cache invalidation via pub/sub, and a clear budget justification section. The multi-region strategy is concrete with specific region choices and failover timing estimates. The only minor weaknesses are that the answer is quite long and could be slightly more concise in places.

View Score Details ▼

Architecture Quality

Weight 30%

Answer B provides an excellent three-tier caching architecture (CDN, in-process LRU, Redis), correctly defaults to 302 redirects with clear justification, includes cross-region analytics mirroring, and details cache invalidation via pub/sub. The architecture is more complete and production-ready, with specific attention to the hot key problem at multiple layers. The data flow descriptions are thorough and account for edge cases like cache warming on creation.

Completeness

Weight 20%

Answer B is remarkably complete. It includes capacity sketches, schema details with size estimates, rate limiting specifics, five trade-offs with detailed reasoning, a dedicated budget justification section, soft-delete semantics, profanity filtering for custom aliases, specific failover timing estimates, per-component degradation behaviors, backup testing procedures, and a utility DELETE endpoint. It also includes an optional link metadata endpoint. Nearly every aspect of the prompt is addressed with concrete detail.

Trade-off Reasoning

Weight 20%

Answer B excels in trade-off reasoning with five well-articulated alternatives: 302 vs 301 redirects, random vs counter-based codes, synchronous vs asynchronous analytics, single vs multi-region DB, and ClickHouse vs time-series DB. Each trade-off includes specific reasoning tied to the constraints (latency SLA, analytics accuracy, availability target). The budget justification section explicitly addresses where to spend more and where to spend less, directly responding to the constraint about justifying consistency and replication costs.

Scalability & Reliability

Weight 20%

Answer B provides detailed capacity sketches (92,500 req/s average, 300,000 peak, 90% CDN offload), specific auto-scaling numbers, three-tier hot key mitigation, and per-component failure analysis with timing (Redis failover in 15 seconds, GeoDNS failover in 30-60 seconds). The reliability section explicitly addresses the 99.99% target (52 minutes/year), includes circuit breakers, and describes specific degradation behaviors for each component failure. Backup restoration testing is mentioned, which is a mature operational practice.

Clarity

Weight 10%

Answer B is well-organized with clear section headers and detailed subsections. The writing is thorough and precise. The use of ASCII section dividers and consistent formatting aids navigation. However, the answer is quite long, which could make it harder to quickly extract key decisions. The capacity sketch section is a nice touch that adds concreteness. Overall very clear despite the length.

Comparison Summary

Final rank order is determined by judge-wise rank aggregation (average rank + Borda tie-break). Average score is shown for reference.

Judges: 3

Google Gemini 2.5 Pro

Winning Votes

0 / 3

Average Score

View this answer

Winner Anthropic Claude Sonnet 4.6

Winning Votes

3 / 3

Average Score

View this answer

View head-to-head record for this model pair

Judging Results

Judge Models Anthropic Claude Opus 4.6

Gemini 2.5 Pro 64

Claude Sonnet 4.6 Winner 84

Why This Side Won

Answer B wins because it provides significantly more depth and rigor across every dimension. It addresses critical details that Answer A misses: the 301 vs 302 trade-off (Answer A incorrectly defaults to 301 which would break analytics and link expiration), in-process caching for hot keys, capacity estimates, specific failover timing, cache invalidation via pub/sub, enumeration attack prevention, and a comprehensive budget justification. Answer B presents five well-reasoned trade-offs compared to Answer A's two, and each trade-off includes concrete reasoning tied to the constraints. The reliability section in Answer B is substantially more detailed with specific degradation behaviors for each failure mode. Overall, Answer B demonstrates the kind of depth that would allow an engineer to begin implementation planning, which is the stated goal of the task.

Judge Models Google Gemini 2.5 Flash

Gemini 2.5 Pro 73

Claude Sonnet 4.6 Winner 92

Why This Side Won

Answer B is superior due to its significantly greater depth, detail, and robustness across all aspects of the design. It provides more sophisticated solutions for complex problems like hot key scaling and ID generation, offers a more comprehensive API, and presents a much more thorough analysis of trade-offs and reliability strategies, including a clear budget justification. The level of detail in Answer B demonstrates a deeper understanding of the practical challenges and solutions for a global-scale service.

Judge Models OpenAI GPT-5.2

Gemini 2.5 Pro 70

Claude Sonnet 4.6 Winner 86

Why This Side Won

Answer B is more complete and operationally concrete for the given global scale: it gives a clearer low-latency redirect path with multi-layer caching and hot-key mitigation, a more detailed analytics architecture capable of last-24h and top-country queries, and stronger reliability/degradation and budget trade-off reasoning. Answer A is solid but stays higher-level and leaves important scale-critical details (ID pool coordination, analytics aggregation specifics, cache/expiration/CDN semantics) less specified.

Design a Global URL Shortening Service

Task Overview

Task Prompt

Answers

Answer A: Google Gemini 2.5 Pro

1. High-Level Architecture

1. High-Level Architecture

2. Storage Choices

3. ID Generation and Alias Strategy

4. API Design

5. Scaling Approach

6. Reliability Strategy

7. Trade-offs and Alternatives Rejected

Answer B: Anthropic Claude Sonnet 4.6

Comparison Summary

Judging Results

Related Tasks

Night-Shift Pharmacist Handling a Medication Mix-Up

Respond to a Delayed Client Delivery with a Recovery Plan

Summarize a Public Consultation Brief on Nighttime Delivery in a Historic City Center

Advice for handling a draining friend without ending the friendship

Implement a Versioned Key-Value Store with Historical Queries

Persuade a skeptical city council to pilot car-free school streets

Summarize a Policy Memo on Reusing Vacant Urban Land

Diplomatic First Contact With a Suspicious AI

Related Links