MMLU Pro Leaderboard
Massive Multitask Language Understanding - Professional Edition
Why This Matters
Tests breadth of knowledge across 14 professional disciplines, from STEM to law and psychology - critical for general-purpose AI applications
Good Scores
70%+ is strong, 80%+ is excellent, 85%+ is state-of-the-art
Use Cases
- Research assistants
- Educational tools
- Professional advisory systems
- General Q&A chatbots
Peak Score: 70.03
Average: 25.05
Models Tested: 53,438
Median Score: 26.31
Efficiency Leaders
Best performance per billion parameters - The smart choices
ChatWaifu_v1.4 • 100.0M params • Score: 27.50 • Efficiency: 274.99
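The efficiency figure above appears to be the benchmark score divided by the parameter count in billions (27.50 / 0.1 ≈ 275; the listed 274.99 suggests the true parameter count is slightly over 100M). A minimal sketch of that metric, with function and variable names of my own choosing:

```python
# Hypothetical reconstruction of the leaderboard's efficiency metric:
# benchmark score per billion parameters. Names are illustrative.
def efficiency(score: float, params_billions: float) -> float:
    """Score per billion parameters; higher means more capability per weight."""
    if params_billions <= 0:
        raise ValueError("parameter count must be positive")
    return score / params_billions

# ChatWaifu_v1.4: 27.50 at 0.1B params -> roughly 275
print(round(efficiency(27.50, 0.1), 2))
# Calme 3.2 Instruct 78b: 70.03 at 78B params -> under 1.0
print(round(efficiency(70.03, 78.0), 2))
```

This also explains why tiny models dominate the efficiency ranking even with near-average scores: dividing by 0.1B inflates any score by 10x relative to a 1B model.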
Performance by Model Size
How different size classes perform on this benchmark
xlarge
Avg Score: 45.45
🏆 Open Source Champions
Top permissively licensed models
📈 Most Downloaded Models
Popularity meets performance
📄 License Analysis
Performance by license type
🔧 Framework Analysis
Performance by framework
About MMLU Pro
MMLU Pro is an enhanced version of the MMLU benchmark: roughly 12,000 questions spanning 14 disciplines across STEM, humanities, social sciences, and law, with ten answer options per question rather than the original four. It provides a comprehensive evaluation of a model's knowledge and reasoning capabilities.
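Under the hood, an MMLU Pro score is plain multiple-choice accuracy: the fraction of questions where the model's chosen option letter matches the answer key. A minimal, self-contained sketch (the records below are illustrative stand-ins, not real benchmark items):

```python
# Minimal multiple-choice accuracy scorer, as used for benchmarks like
# MMLU Pro. Predictions and answers are option letters ("A".."J").
def accuracy(predictions: list[str], answers: list[str]) -> float:
    """Percentage of items where the predicted option matches the key."""
    if len(predictions) != len(answers):
        raise ValueError("prediction/answer lists must be the same length")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return 100.0 * correct / len(answers)

# Toy run: 7 of 10 correct -> 70.0, the same ballpark as the top scores here.
preds = ["A"] * 10
keys = ["A"] * 7 + ["B"] * 3
print(accuracy(preds, keys))
```

Real harnesses add prompt formatting and answer extraction on top, but the final number reported on this page is this ratio over the full question set.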
Test These Models Yourself
Run benchmarks on your own data with these platforms
Together.ai
Instant API access to this model
Production-ready inference API. Start free, scale to millions.
Try Free API

Modal
Run this model on serverless GPU
Deploy in seconds with $30 free credits. Pay only for what you use.
Get $30 Free Credits

RunPod
Rent GPU starting at $0.34/hour
Deploy on cloud GPU or serverless. 70% cheaper than AWS.
Start from $0.34/hr

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.
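Most hosted inference providers, Together.ai included, expose an OpenAI-compatible chat endpoint, so querying a leaderboard model comes down to sending a standard request body. A hedged sketch; the model identifier and endpoint details are assumptions, so check the provider's own docs before use:

```python
import json

# Build a standard OpenAI-compatible chat request body. The model id below
# is a Hugging Face-style name used for illustration; verify the exact id
# your provider exposes before sending anything.
def build_chat_request(model: str, question: str, max_tokens: int = 256) -> dict:
    return {
        "model": model,
        "messages": [{"role": "user", "content": question}],
        "max_tokens": max_tokens,
    }

payload = build_chat_request(
    "MaziyarPanahi/calme-3.2-instruct-78b",
    "Which law of thermodynamics forbids a perpetual motion machine of the first kind?",
)
print(json.dumps(payload, indent=2))
```

POST this JSON (with your API key in the `Authorization` header) to the provider's chat-completions URL to try a top-ranked model on your own questions.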
Complete Leaderboard
Top 50 models ranked by MMLU Pro performance
Calme 3.2 Instruct 78b • Score: 70.03
calme-3.1-instruct-78b • Score: 68.72
CalmeRys-78B-Orpo-v0.1 • Score: 66.80