Expert Knowledge

GPQA Leaderboard

Graduate-Level Google-Proof Q&A Benchmark

Why This Matters

Graduate-level scientific knowledge - essential for research and specialized domains

Good Scores

A score of 35% or higher is good (human experts score about 65%), 45% or higher is excellent, and 50% or higher is exceptional.
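As a minimal sketch, the thresholds above can be written as a small lookup. The function name `gpqa_tier` and the label used for scores under 35 are illustrative assumptions, and the sketch assumes the score is on the same 0-100 percentage scale as the thresholds.

```python
def gpqa_tier(score: float) -> str:
    """Map a GPQA score (in percent) to the tier names used above."""
    # Thresholds come from the "Good Scores" text: 35, 45, and 50 percent.
    if score >= 50:
        return "exceptional"
    if score >= 45:
        return "excellent"
    if score >= 35:
        return "good"
    # Assumption: the page does not name a tier below 35%.
    return "below good"


for s in (29.42, 35.0, 47.5, 50.0):
    print(s, gpqa_tier(s))
```

Checking the highest threshold first keeps each branch a single comparison.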

Use Cases

  • Scientific research tools
  • Technical documentation
  • Academic assistance
  • Expert systems

Peak Score: 29.42
Average: 6.57
Models Tested: 53,438
Median Score: 5.93

Performance by Model Size

[Figure: how different size classes perform on this benchmark]

📄 License Analysis

Performance by license type

License / tag                                     Top score   Models   Avg
llama                                             29.42       20447    5.68
Unknown                                           24.94       14487    7.08
license:apache-2.0                                22.26       14019    7.37
license:mit                                       20.92       2592     7.10
license:gpl-3.0                                   20.58       168      5.08
license:cc-by-nc-4.0                              14.43       1019     6.55
dataset:anthracite-org/c2_logs_16k_llama_v1.1     12.19       72       10.66
llama-factory                                     10.51       70       3.75

🔧 Framework Analysis

Performance by framework

Framework      Top score   Models   Avg
OTHER          29.42       53299    6.58
PYTORCH        8.72        116      3.62
HUGGINGFACE    0.56        23       0.56
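As a quick consistency check, the per-framework figures above can be combined into a model-count-weighted mean and compared with the overall statistics (53,438 models tested, average 6.57). This is only a sketch: the variable names are illustrative, and it assumes each model is counted under exactly one framework.

```python
# (models, average score) per framework, copied from the framework figures above.
frameworks = {
    "OTHER": (53299, 6.58),
    "PYTORCH": (116, 3.62),
    "HUGGINGFACE": (23, 0.56),
}

# Total number of models across all framework groups.
total_models = sum(n for n, _ in frameworks.values())

# Weight each group's average by its model count.
weighted_avg = sum(n * avg for n, avg in frameworks.values()) / total_models

print(total_models)            # 53438
print(round(weighted_avg, 2))  # 6.57
```

Both values agree with the headline figures, which suggests the framework groups partition the full set of tested models. The license groups do not sum to 53,438, so the same check does not apply to them.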

About GPQA

GPQA is a challenging dataset of 448 multiple-choice questions written by domain experts in biology, physics, and chemistry. Questions are designed to be difficult for laypersons but answerable by experts in the field.

Last updated: 11/18/2025

Complete Leaderboard

Top 50 models ranked by GPQA performance

Rank     Model                  Author       Tag     Score
1-24     L3.3-MS-Nevoria-70b    Steelskull   llama   29.42
25-48    L3.3-Nevoria-R1-70b    Steelskull   llama   29.19
49-50    70B-L3.3-Cirrus-x1     Sao10K       llama   26.62