MuSR Leaderboard
Multistep Soft Reasoning Benchmark
Why This Matters
Multi-step logical reasoning is essential for complex problem-solving and analysis.
Good Scores
55%+ is good, 65%+ is strong, 75%+ is excellent
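The thresholds above can be sketched as a small lookup. This is a minimal illustration, not part of MuSR or any leaderboard tooling: the function name `musr_tier` and the "below good" label for scores under 55 are assumptions introduced here, and the score is assumed to be a percentage between 0 and 100.

```python
def musr_tier(score: float) -> str:
    """Map a MuSR score (percent, 0-100) to the qualitative tiers listed above.

    The "below good" label is a hypothetical name for scores under 55;
    the page itself does not name that band.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"score must be between 0 and 100, got {score}")
    if score >= 75.0:
        return "excellent"
    if score >= 65.0:
        return "strong"
    if score >= 55.0:
        return "good"
    return "below good"


if __name__ == "__main__":
    # Check each tier boundary plus one score from below the lowest threshold.
    for value in (38.69, 55.0, 65.0, 75.0):
        print(f"{value:.2f} -> {musr_tier(value)}")
```

Under this mapping a score of 38.69 lands below the "good" band, provided it is reported on the same 0-100 scale as the thresholds.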
Use Cases
- Business analytics
- Strategy development
- Causal analysis
- Complex investigations
About MuSR
MuSR evaluates complex reasoning that requires multiple steps of inference and soft reasoning across diverse scenarios. It tests a model's ability to chain logical steps together.
Complete Leaderboard
Top 50 models ranked by MuSR performance
T3Q Qwen2.5 14b V1.0 E3, score 38.69