Human Preference

Chatbot Arena ELO Leaderboard

Crowdsourced chatbot rankings via human preference

Why This Matters

Real user preferences - the ultimate test for conversational AI quality

Good Scores

1100+ is good, 1200+ is strong, 1250+ is top-tier

Use Cases

  • Customer service
  • Virtual assistants
  • Conversational interfaces
  • General chatbots

Peak Score

1339.00

Average

1339.00

Models Tested

24

Median Score

1339.00

Performance by Model Size

How different size classes perform on this benchmark

📄 License Analysis

Performance by license type

llama-31339.00
24 modelsAvg: 1339.00

🔧 Framework Analysis

Performance by framework

OTHER1339.00
24 modelsAvg: 1339.00

About Chatbot Arena ELO

Chatbot Arena ELO is a leaderboard based on over 500,000 human preference votes. Users chat with two anonymous models and vote for which response they prefer. ELO ratings are calculated from head-to-head matchups.

Last updated: 11/14/2025

Test These Models Yourself

Run benchmarks on your own data with these platforms

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Modal

Run this model on serverless GPU

Most Popular

Deploy in seconds with $30 free credits. Pay only for what you use.

Get $30 Free Credits

RunPod

Rent GPU starting at $0.34/hour

Best Value

Deploy on cloud GPU or serverless. 70% cheaper than AWS.

Start from $0.34/hr

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.

Complete Leaderboard

Top 50 models ranked by Chatbot Arena ELO performance

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#4

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#5

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#6

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#7

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#8

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#9

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#10

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#11

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#12

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#13

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#14

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#15

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#16

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#17

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#18

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#19

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#20

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#21

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#22

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#23

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score

#24

Llama-3_3-Nemotron-Super-49B-v1_5

nvidiallama-3

1339.00

Score