Chatbot Arena ELO Leaderboard
Crowdsourced chatbot rankings via human preference
Why This Matters
Real user preferences - the ultimate test for conversational AI quality
Good Scores
1100+ is good, 1200+ is strong, 1250+ is top-tier
Use Cases
- •Customer service
- •Virtual assistants
- •Conversational interfaces
- •General chatbots
Peak Score
1339.00
Average
1339.00
Models Tested
24
Median Score
1339.00
Efficiency Leaders
Best performance per billion parameters - The smart choices
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Llama-3_3-Nemotron-Super-49B-v1_5
49.0B params • Score: 1339.00
Efficiency
27.33
Performance by Model Size
How different size classes perform on this benchmark
large
Avg Score: 1339.00
📈 Most Downloaded Models
Popularity meets performance
Llama-3_3-Nemotron-Super-49B-v1_5
103.6K downloads • Rank #1
Llama-3_3-Nemotron-Super-49B-v1_5
103.6K downloads • Rank #1
Llama-3_3-Nemotron-Super-49B-v1_5
103.6K downloads • Rank #1
Llama-3_3-Nemotron-Super-49B-v1_5
103.6K downloads • Rank #1
Llama-3_3-Nemotron-Super-49B-v1_5
103.6K downloads • Rank #1
📄 License Analysis
Performance by license type
🔧 Framework Analysis
Performance by framework
About Chatbot Arena ELO
Chatbot Arena ELO is a leaderboard based on over 500,000 human preference votes. Users chat with two anonymous models and vote for which response they prefer. ELO ratings are calculated from head-to-head matchups.
Test These Models Yourself
Run benchmarks on your own data with these platforms
Together.ai
Instant API access to this model
Production-ready inference API. Start free, scale to millions.
Try Free APIModal
Run this model on serverless GPU
Deploy in seconds with $30 free credits. Pay only for what you use.
Get $30 Free CreditsRunPod
Rent GPU starting at $0.34/hour
Deploy on cloud GPU or serverless. 70% cheaper than AWS.
Start from $0.34/hrDisclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.
Complete Leaderboard
Top 50 models ranked by Chatbot Arena ELO performance
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score
Llama-3_3-Nemotron-Super-49B-v1_5
1339.00
Score