Leaderboards
Group: General, Math, Chat
Task: mmlu_pro, bbh, openllm_v2_avg, ifeval, musr
| Rank | Model | Score | Metric | Evaluated | Source |
|---|---|---|---|---|---|
| 1 | nvidia/Hymba-1.5B-Base | 10.25 | score | 1/24/2026 | link |
| 2 | nvidia/AceMath-7B-RM | 1.54 | score | 1/24/2026 | link |
| 3 | nvidia/AceMath-7B-Instruct | 26.48 | score | 1/24/2026 | link |
| 4 | nvidia/AceMath-72B-RM | 1.98 | score | 1/24/2026 | link |
| 5 | nvidia/AceMath-72B-Instruct | 37.90 | score | 1/24/2026 | link |
| 6 | nvidia/AceMath-1.5B-Instruct | 11.82 | score | 1/24/2026 | link |
| 7 | nvidia/AceInstruct-7B | 35.30 | score | 1/24/2026 | link |
| 8 | nvidia/AceInstruct-72B | 43.04 | score | 1/24/2026 | link |
| 9 | nvidia/AceInstruct-1.5B | 17.49 | score | 1/24/2026 | link |
| 10 | nothingiisreal/MN-12B-Starcannon-v3 | 25.16 | score | 1/24/2026 | link |
| 11 | nothingiisreal/MN-12B-Starcannon-v2 | 23.65 | score | 1/24/2026 | link |
| 12 | nothingiisreal/L3.1-8B-Celeste-V1.5 | 30.05 | score | 1/24/2026 | link |
| 13 | notbdq/Qwen2.5-14B-Instruct-1M-GRPO-Reasoning | 42.77 | score | 1/24/2026 | link |
| 14 | noname0202/llama-math-1b-r8-512tokens-test | 8.36 | score | 1/24/2026 | link |
| 15 | noname0202/llama-math-1b-r32-test | 8.68 | score | 1/24/2026 | link |
| 16 | noname0202/llama-math-1b-r32-0to512tokens-test | 8.45 | score | 1/24/2026 | link |
| 17 | noname0202/llama-math-1b-r16-0to512tokens-test | 8.09 | score | 1/24/2026 | link |
| 18 | noname0202/gemma-2-9b-sft-jp-en-zh-v2 | 29.72 | score | 1/24/2026 | link |
| 19 | noname0202/gemma-2-9b-sft-jp-en-zh-v1 | 23.61 | score | 1/24/2026 | link |
| 20 | noname0202/gemma-2-2b-it-ties | 17.34 | score | 1/24/2026 | link |
| 21 | noname0202/Llama-3.2-4x3B-Instruct | 25.39 | score | 1/24/2026 | link |
| 22 | nlpguy/StarFusion-alpha1 | 24.34 | score | 1/24/2026 | link |
| 23 | nlpguy/StableProse | 27.43 | score | 1/24/2026 | link |
| 24 | nlpguy/Mistral-NeMo-Minitron-Upscale-v3 | 1.90 | score | 1/24/2026 | link |
| 25 | nlpguy/Mistral-NeMo-Minitron-Upscale-v2 | 10.29 | score | 1/24/2026 | link |
| 26 | nlpguy/Mistral-NeMo-Minitron-Upscale-v1 | 17.08 | score | 1/24/2026 | link |
| 27 | nlpguy/Miisce-one | 49.02 | score | 1/24/2026 | link |
| 28 | nlpguy/Lion-Lamarck-v.1.1.0 | 40.34 | score | 1/24/2026 | link |
| 29 | nlpguy/Lion-Lamarck-v.1.0.9 | 41.16 | score | 1/24/2026 | link |
| 30 | nlpguy/Lion-Lamarck-v.1.0.8 | 40.48 | score | 1/24/2026 | link |
| 31 | nisten/tqwendo-36b | 37.56 | score | 1/24/2026 | link |
| 32 | nisten/franqwenstein-35b | 51.23 | score | 1/24/2026 | link |
| 33 | nidum/Nidum-Limitless-Gemma-2B | 1.93 | score | 1/24/2026 | link |
| 34 | nhyha/merge_Qwen2.5-7B-Instruct_20241023_0314 | 39.36 | score | 1/24/2026 | link |
| 35 | nhyha/N3N_gemma-2-9b-it_20241110_2026 | 33.56 | score | 1/24/2026 | link |
| 36 | nhyha/N3N_gemma-2-9b-it_20241029_1532 | 34.69 | score | 1/24/2026 | link |
| 37 | nhyha/N3N_Llama-3.1-8B-Instruct_1028_0216 | 29.31 | score | 1/24/2026 | link |
| 38 | nhyha/N3N_Delirium-v1_1030_0227 | 35.00 | score | 1/24/2026 | link |
| 39 | ngxson/MiniThinky-v2-1B-Llama-3.2 | 1.29 | score | 1/24/2026 | link |
| 40 | ngxson/MiniThinky-1B-Llama-3.2 | 1.63 | score | 1/24/2026 | link |
| 41 | nguyentd/FinancialAdvice-Qwen2.5-7B | 30.58 | score | 1/24/2026 | link |
| 42 | newsbang/Homer-v1.0-Qwen2.5-7B | 39.27 | score | 1/24/2026 | link |
| 43 | newsbang/Homer-v1.0-Qwen2.5-72B | 57.17 | score | 1/24/2026 | link |
| 44 | newsbang/Homer-v0.5-Qwen2.5-7B | 37.44 | score | 1/24/2026 | link |
| 45 | newsbang/Homer-v0.4-Qwen2.5-7B | 37.36 | score | 1/24/2026 | link |
| 46 | newsbang/Homer-v0.3-Qwen2.5-7B | 38.40 | score | 1/24/2026 | link |
| 47 | newsbang/Homer-7B-v0.2 | 37.89 | score | 1/24/2026 | link |
| 48 | newsbang/Homer-7B-v0.1 | 38.61 | score | 1/24/2026 | link |
| 49 | netease-youdao/Confucius-o1-14B | 47.39 | score | 1/24/2026 | link |
| 50 | netcat420/qwen2.5-MFANN-7b-v1.1 | 24.98 | score | 1/24/2026 | link |
| 51 | netcat420/qwen2.5-MFANN-7b-SLERPv1.1 | 27.20 | score | 1/24/2026 | link |
| 52 | netcat420/qwen2.5-MFANN-7b-SLERP-V1.2 | 27.09 | score | 1/24/2026 | link |
| 53 | netcat420/Qwen2.5-MFANN-7b | 24.81 | score | 1/24/2026 | link |
| 54 | netcat420/Qwen2.5-DeepSeek-R1-MFANN-Slerp-7b | 7.52 | score | 1/24/2026 | link |
| 55 | netcat420/Qwen2.5-Coder-Scholar-7B-Abliterated-MFANN-Slerp-Unretrained | 27.02 | score | 1/24/2026 | link |
| 56 | netcat420/Qwen2.5-Coder-Scholar-7B-Abliterated-MFANN | 23.96 | score | 1/24/2026 | link |
| 57 | netcat420/Qwen2.5-7b-nerd-uncensored-MFANN-slerp | 1.12 | score | 1/24/2026 | link |
| 58 | netcat420/Qwen2.5-7b-MFANN-slerp | 26.85 | score | 1/24/2026 | link |
| 59 | netcat420/Qwen2.5-7B-nerd-uncensored-v0.9-MFANN | 32.26 | score | 1/24/2026 | link |
| 60 | netcat420/MFANNv0.25 | 26.03 | score | 1/24/2026 | link |
| 61 | netcat420/MFANNv0.24 | 26.09 | score | 1/24/2026 | link |
| 62 | netcat420/MFANNv0.23 | 26.53 | score | 1/24/2026 | link |
| 63 | netcat420/MFANNv0.22.1 | 26.03 | score | 1/24/2026 | link |
| 64 | netcat420/MFANNv0.21 | 22.57 | score | 1/24/2026 | link |
| 65 | netcat420/MFANNv0.20 | 24.47 | score | 1/24/2026 | link |
| 66 | netcat420/MFANNv0.19 | 16.36 | score | 1/24/2026 | link |
| 67 | netcat420/MFANN3bv1.4 | 18.95 | score | 1/24/2026 | link |
| 68 | netcat420/MFANN3bv1.3 | 14.17 | score | 1/24/2026 | link |
| 69 | netcat420/MFANN3bv1.2 | 5.00 | score | 1/24/2026 | link |
| 70 | netcat420/MFANN3bv1.1 | 1.76 | score | 1/24/2026 | link |
| 71 | netcat420/MFANN3bv0.24 | 15.02 | score | 1/24/2026 | link |
| 72 | netcat420/MFANN3bv0.23 | 15.75 | score | 1/24/2026 | link |
| 73 | netcat420/MFANN3bv0.22 | 16.86 | score | 1/24/2026 | link |
| 74 | netcat420/MFANN3bv0.21 | 15.48 | score | 1/24/2026 | link |
| 75 | netcat420/MFANN3bv0.20 | 16.67 | score | 1/24/2026 | link |
| 76 | netcat420/MFANN3bv0.19 | 16.89 | score | 1/24/2026 | link |
| 77 | netcat420/MFANN3bv0.18 | 16.67 | score | 1/24/2026 | link |
| 78 | netcat420/MFANN3bv0.15 | 16.32 | score | 1/24/2026 | link |
| 79 | netcat420/MFANN3b | 14.51 | score | 1/24/2026 | link |
| 80 | netcat420/MFANN-phigments-slerp-V3.3 | 20.03 | score | 1/24/2026 | link |
| 81 | netcat420/MFANN-phigments-slerp-V3.2 | 18.95 | score | 1/24/2026 | link |
| 82 | netcat420/MFANN-phigments-slerp-V2 | 19.08 | score | 1/24/2026 | link |
| 83 | netcat420/MFANN-llama3.1-abliterated-v2 | 27.67 | score | 1/24/2026 | link |
| 84 | netcat420/MFANN-llama3.1-abliterated-SLERP-v3.1 | 28.26 | score | 1/24/2026 | link |
| 85 | netcat420/MFANN-llama3.1-abliterated-SLERP-v3 | 28.12 | score | 1/24/2026 | link |
| 86 | netcat420/MFANN-llama3.1-Abliterated-SLERP | 21.42 | score | 1/24/2026 | link |
| 87 | netcat420/MFANN-abliterated-phi2-merge-unretrained | 5.31 | score | 1/24/2026 | link |
| 88 | netcat420/MFANN-SFT | 25.96 | score | 1/24/2026 | link |
| 89 | netcat420/MFANN-Llama3.1-Abliterated-Slerp-V3.2 | 28.08 | score | 1/24/2026 | link |
| 90 | netcat420/MFANN-Llama3.1-Abliterated-Slerp-TIES | 28.13 | score | 1/24/2026 | link |
| 91 | netcat420/MFANN-Llama3.1-Abliterated-SLERP-V5 | 27.17 | score | 1/24/2026 | link |
| 92 | netcat420/MFANN-Llama3.1-Abliterated-SLERP-V4 | 27.96 | score | 1/24/2026 | link |
| 93 | netcat420/MFANN-Llama3.1-Abliterated-SLERP-TIES-V3 | 27.67 | score | 1/24/2026 | link |
| 94 | netcat420/MFANN-Llama3.1-Abliterated-SLERP-TIES-V2 | 28.03 | score | 1/24/2026 | link |
| 95 | netcat420/Llama3.1-MFANN-8b | 19.17 | score | 1/24/2026 | link |
| 96 | netcat420/DeepSeek-R1-MFANN-TIES-unretrained-7b | 1.61 | score | 1/24/2026 | link |
| 97 | netcat420/DeepSeek-R1-Distill-Qwen-MFANN-Slerp-7b | 1.00 | score | 1/24/2026 | link |
| 98 | neopolita/loki-v0.1-virtuoso | 45.88 | score | 1/24/2026 | link |
| 99 | neopolita/jessi-v0.6-falcon3-7b-instruct | 32.85 | score | 1/24/2026 | link |
| 100 | neopolita/jessi-v0.5-falcon3-7b-instruct | 32.96 | score | 1/24/2026 | link |
Showing latest 100 models for “mmlu_pro”. Switch task or group to explore other leaderboards.