返回基准测试
💻

编程

算法、调试、代码审查

9个模型每周更新

任务示例

此类别的示例任务

简单

URL Parser

Parse a URL string and extract its components: protocol, domain, path, and query parameters.

简单

FizzBuzz

Classic programming exercise: print numbers 1-15, replacing multiples of 3 with 'Fizz', multiples of 5 with 'Buzz', and multiples of both with 'FizzBuzz'.

困难

Debug Stack Trace

Analyze a stack trace and identify the root cause of an error.

排名模型得分价格/1M任务
🥇Qwen3 235B94.7$0.6012
🥈Qwen3 Max94.0$1.6012
🥉DeepSeek R193.8$2.1912
4Claude 3.5 Haiku93.5$4.0012
5GPT-4o93.2$10.0012
6Gemini 2.0 Flash92.7$0.4012
7Claude 3.5 Sonnet92.5$15.0012
8GPT-4o Mini92.3$0.6012
9Llama 3.3 70B92.0$0.4012