GPT-5.4 in Coding
Explore GPT-5.4's performance in Coding, including average scores, ranking position, and recent benchmark examples.
Overall Performance
Average Score
Sample Count
8
Updated At
May 12, 2026 14:43
Score Breakdown
Instruction Following
Completeness
Correctness
Code Quality
Practical Value
Latest Benchmarks
Coding
Markdown Subset to HTML Converter
Write a Python function `markdown_to_html(markdown_text: str) -> str` that converts a string containing a specific subset of Markdown into its corresponding HTM...
Coding
Implement a Thread-Safe Token Bucket Rate Limiter in Python
Write a Python class named `TokenBucketRateLimiter` that implements the token bucket algorithm for rate limiting. The implementation must be thread-safe and sho...
Coding
Command-Line File Synchronization Tool
Write a Python script for a command-line file synchronization tool. The script must accept three command-line arguments: 1. `source_path`: The path to the sou...
Coding
Implement a Lock-Free Concurrent LRU Cache
Implement a thread-safe LRU (Least Recently Used) cache in Python that supports concurrent reads and writes without using a global lock for every operation. You...
Coding
In-Memory Key-Value Store with Transaction Support
Write a Python class `InMemoryDB` that implements a simple in-memory key-value data store with support for nested transactions. The class should have the follo...
Coding
Implement a Dependency Resolver in Python
You are tasked with creating a dependency resolver for a simple package management system. Write a Python function `resolve_dependencies(package_definitions, ta...
Coding
Python Function for Package Dependency Resolution
Write a Python function named `resolve_dependencies` that takes a dictionary of packages and their dependencies and returns a valid installation order. The func...
Coding
Log File Analyzer for User Activity
Write a Python function `analyze_logs(log_data)` that takes a single multi-line string `log_data` as input. Each line in the string represents a log entry in th...
Genre Rank
Compare Performance by Model