Reasoning

MuSR Leaderboard

Multistep Soft Reasoning Benchmark

Why This Matters

Multi-step logical reasoning - essential for complex problem-solving and analysis

Good Scores

55%+ is good, 65%+ is strong, 75%+ is excellent

Use Cases

  • Business analytics
  • Strategy development
  • Causal analysis
  • Complex investigations

Peak Score

38.69

Average

9.93

Models Tested

53,438

Median Score

10.15

📄 License Analysis

Performance by license type

license:apache-2.038.69
14019 modelsAvg: 11.45
Unknown38.53
14487 modelsAvg: 9.60
license:mit36.37
2592 modelsAvg: 9.99
llama25.88
20447 modelsAvg: 9.03
license:gpl-3.023.18
168 modelsAvg: 10.57
license:cc-by-nc-4.021.30
1019 modelsAvg: 12.37
dataset:anthracite-org/c2_logs_16k_llama_v1.117.56
72 modelsAvg: 11.80
dataset:LightningRodLabs/deepseek-ai_DeepSeek-R1-Distill-Llama-70B-questions-polymarket-set-1-22-zero-shot-chat-template16.45
24 modelsAvg: 16.45

🔧 Framework Analysis

Performance by framework

OTHER38.69
53299 modelsAvg: 9.93
PYTORCH15.35
116 modelsAvg: 9.57
HUGGINGFACE13.24
23 modelsAvg: 13.24

About MuSR

MuSR evaluates complex reasoning requiring multiple steps of inference and soft reasoning across diverse scenarios. Tests model's ability to chain together logical steps.

Last updated: 11/19/2025

Test These Models Yourself

Run benchmarks on your own data with these platforms

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Modal

Run this model on serverless GPU

Most Popular

Deploy in seconds with $30 free credits. Pay only for what you use.

Get $30 Free Credits

RunPod

Rent GPU starting at $0.34/hour

Best Value

Deploy on cloud GPU or serverless. 70% cheaper than AWS.

Start from $0.34/hr

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.

Complete Leaderboard

Top 50 models ranked by MuSR performance

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#4

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#5

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#6

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#7

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#8

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#9

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#10

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#11

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#12

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#13

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#14

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#15

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#16

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#17

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#18

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#19

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#20

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#21

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#22

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#23

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#24

T3Q Qwen2.5 14b V1.0 E3

JungZoonalicense:apache-2.0

38.69

Score

#25

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#26

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#27

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#28

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#29

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#30

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#31

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#32

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#33

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#34

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#35

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#36

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#37

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#38

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#39

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#40

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#41

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#42

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#43

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#44

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#45

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#46

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#47

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#48

Calme 3.2 Instruct 78b

MaziyarPanahi

38.53

Score

#49

calme-3.1-instruct-78b

MaziyarPanahi

36.50

Score

#50

calme-3.1-instruct-78b

MaziyarPanahi

36.50

Score