MATH Level 5 Leaderboard
Advanced Mathematics Problem Solving
Why This Matters
Competition-level math problems that test the advanced quantitative reasoning needed for technical applications
Good Scores
25%+ is capable, 40%+ is strong, 55%+ is elite
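The thresholds above can be expressed as a small lookup. This is a minimal sketch: the function name `score_tier` and the label "below capable" for scores under 25 are assumptions for illustration, since the page does not name a tier below "capable".

```python
def score_tier(score: float) -> str:
    """Map a MATH Level 5 score (in percent) to the tiers listed above.

    The "below capable" label is an assumed name for scores under 25;
    the source only names the three tiers at 25, 40, and 55.
    """
    if score >= 55:
        return "elite"
    if score >= 40:
        return "strong"
    if score >= 25:
        return "capable"
    return "below capable"


if __name__ == "__main__":
    # Peak, average, and median scores reported on this page.
    for label, value in [("peak", 62.54), ("average", 15.06), ("median", 9.97)]:
        print(f"{label}: {value} -> {score_tier(value)}")
```

By these thresholds, both the average and the median score reported on this page fall below the "capable" tier.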
Use Cases
- Mathematical tutoring
- Engineering calculations
- Financial modeling
- Data analysis
Peak Score
62.54
Average
15.06
Models Tested
53,438
Median Score
9.97
Efficiency Leaders
Best performance per billion parameters - The smart choices
ChatWaifu_v1.4
100.0M params • Score: 10.57
Efficiency
105.74
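The efficiency figure appears to be the score divided by the parameter count in billions. The sketch below assumes that definition, and the function name `efficiency` is hypothetical. Using the rounded score shown here it gives about 105.7, so the listed 105.74 presumably comes from an unrounded score.

```python
def efficiency(score: float, params: float) -> float:
    """Return score per billion parameters.

    Assumed definition: score / (params / 1e9). The page does not
    state the formula; this is inferred from the listed values.
    """
    if params <= 0:
        raise ValueError("params must be positive")
    return score / (params / 1e9)


if __name__ == "__main__":
    # ChatWaifu_v1.4: 100.0M parameters, score 10.57 (as displayed).
    print(round(efficiency(10.57, 100.0e6), 2))
```

A metric of this shape favors very small models: a low absolute score can still rank first when the parameter count is tiny, so it is best read alongside the raw score.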
Performance by Model Size
How different size classes perform on this benchmark
medium
Avg Score: 14.23
large
Avg Score: 27.38
xlarge
Avg Score: 33.95
🏆 Open Source Champions
Top permissively licensed models
📈 Most Downloaded Models
Popularity meets performance
📄 License Analysis
Performance by license type
🔧 Framework Analysis
Performance by framework
About MATH Level 5
MATH Level 5 contains the hardest problems from the MATH dataset. They require advanced, competition-level mathematical reasoning across areas such as algebra, precalculus, geometry, and number theory.
Complete Leaderboard
Top 50 models ranked by MATH Level 5 performance
Qwen2.5-32B-Instruct
62.54
Score
Qwen2.5-Math-72B-Instruct
62.39
Score
Awqward2.5-32B-Instruct
62.31
Score