turboderp
Cat-Llama-3-70B-instruct
Llama-3-8B-Instruct-exl2
gemma-2-9b-it-exl2
GLM 4.6 Exl3 2.33bpw Opt
This is just a quick mix of the 2.25 bpw quant, with the attention, dense layers and shared experts kept at 4.0 bpw.
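As a rough sanity check, the headline bitrate of a mix like this is just the parameter-weighted average of the two precisions. The split in the sketch below (roughly 95% of the weights in the routed experts at 2.25 bpw, 5% in the attention/dense/shared-expert tensors at 4.0 bpw) is an illustrative assumption, not a measured breakdown of GLM 4.6; it simply shows how the average lands near the stated 2.33 bpw.

```python
# Weighted-average bits per weight for a mixed-precision quant.
# The 95% / 5% parameter split is an illustrative assumption, not the
# actual tensor breakdown of GLM 4.6.

def average_bpw(splits):
    """splits: list of (fraction_of_parameters, bits_per_weight)."""
    assert abs(sum(f for f, _ in splits) - 1.0) < 1e-6
    return sum(f * b for f, b in splits)

mix = [
    (0.95, 2.25),  # routed experts left at 2.25 bpw
    (0.05, 4.00),  # attention, dense layers, shared experts at 4.0 bpw
]
print(f"{average_bpw(mix):.2f} bpw")  # ~2.34 with these assumed fractions
```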
Devstral-2-123B-Instruct-2512-exl3
Mistral-Small-3.1-24B-Instruct-2503-exl3
GLM-4.5-Air-exl3
Mistral-Large-Instruct-2411-exl3
Qwen3-Next-80B-A3B-Instruct-exl3
⚠️ Requires ExLlamaV3 v0.0.7 (or v0.0.6 `dev` branch)
Available quants: 2.00, 3.00, 4.00, 5.00, 2.08, 2.27, 2.78, 3.14, 3.53, 4.06, 4.51 bits per weight
Qwen3-Next-80B-A3B-Thinking-exl3
⚠️ Requires ExLlamaV3 v0.0.7 (or v0.0.6 `dev` branch)
Available quants: 2.00, 3.00, 4.00, 5.00, 6.00, 2.08, 2.27, 2.78, 3.14, 3.53, 4.06, 4.51 bits per weight
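For repos like the two Qwen3-Next ones above, each listed bitrate is typically published as its own branch, so a single quant can be fetched with `huggingface_hub`. This is a hedged sketch: the revision string "4.00bpw" is an assumption about the branch naming, so check the repo's branch list, and note the ExLlamaV3 version requirement above still applies when loading.

```python
# Sketch: download one quant branch of an EXL3 repo with huggingface_hub.
# Assumes each bitrate lives on its own branch/revision; the exact revision
# name ("4.00bpw") is a guess -- verify it against the repo's branch list.
from huggingface_hub import snapshot_download

local_path = snapshot_download(
    repo_id="turboderp/Qwen3-Next-80B-A3B-Instruct-exl3",
    revision="4.00bpw",  # assumed branch name for the 4.00 bpw quant
    local_dir="Qwen3-Next-80B-A3B-Instruct-exl3-4.00bpw",
)
print(local_path)
```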
Qwama-0.5B-Instruct
EXAONE-4.0-32B-exl3
Available quants: 2.00, 2.50, 3.00, 3.50, 4.00, 5.00, 6.00, 8.00 (H8) bits per weight
Qwen3-8B-exl3
ERNIE-4.5-300B-A47B-Base-PT-exl3
gemma-2-27b-it-exl2
gemma-3-27b-it-exl3
Qwen3-30B-A3B-exl3
ERNIE-4.5-300B-A47B-PT-exl3
Mistral-Large-Instruct-2407-123B-exl2
SmolLM3-3B-exl3
Available quants: 2.00, 2.50, 3.00, 3.50, 4.00, 5.00, 6.00, 8.00 (H8) bits per weight
Qwen3-32B-exl3
Llama-3.1-70B-Instruct-exl2
Llama-3.1-8B-Instruct-exl2
Grok-3-reasoning-gemma3-12B-distilled-HF-exl3
EXL3 quants of Grok-3-reasoning-gemma3-12B-distilled-HF
Llama-3.1-70B-Instruct-exl3
MiniMax-M2-exl3
⚠️ Requires ExLlamaV3 v0.0.12 (or v0.0.11 `dev` branch)
Available quants: 2.00, 3.00, 4.00, 2.04, 2.27, 3.04, 3.50, 4.03 bits per weight

| Quant    | KL-div | ppl   | HumanEval@1 |
|----------|--------|-------|-------------|
| 2.00 bpw | 0.400  | 10.92 | 80.5%       |
| 2.04 bpw | 0.297  | 10.23 | 87.1%       |
| 2.27 bpw | 0.252  | 9.78  | 88.4%       |
| 3.00 bpw | 0.141  | 8.99  | 87.8%       |
| 3.04 bpw | 0.117  | 8.73  | 87.2%       |
| 3.50 bpw | 0.094  | 8.78  | 88.4%       |
| 4.00 bpw | 0.087  | 8.58  | 89.6%       |
| 4.03 bpw | 0.077  | 8.61  | 87.8%       |
| original | -      | 8.51  | 87.2%¹      |
Apertus-70B-Instruct-2509-exl3
Available quants: 2.00, 2.50, 3.00, 3.50, 4.00, 5.00, 6.00 bits per weight

| Quant   | MMLU   | 95% CI    |
|---------|--------|-----------|
| 2.0 bpw | 58.90% | +/- 1.50% |
| 2.5 bpw | 64.20% | +/- 1.46% |
| 3.0 bpw | 67.00% | +/- 1.43% |
| 3.5 bpw | 67.70% | +/- 1.43% |
| 4.0 bpw | 69.40% | +/- 1.40% |
| 5.0 bpw | 70.30% | +/- 1.39% |
| 6.0 bpw | 69.60% | +/- 1.40% |
Qwen2.5-VL-7B-Instruct-exl2
c4ai-command-r7b-12-2024-exl3
Available quants: 2.00, 2.50, 3.00, 4.00, 5.00, 6.00, 8.00 (H8) bits per weight
c4ai-command-r-08-2024-exl3
Available quants: 2.00, 2.50, 3.00, 4.00, 5.00, 6.00, 8.00 (H8) bits per weight
Qwen3-VL-235B-A22B-Thinking-exl3
Mistral-7B-Instruct-v0.3-exl3
Mistral-Nemo-Instruct-12B-exl2
GLM-4.6V-exl3
command-r-plus-103B-exl2
Mixtral-8x7B-exl2
Llama-3.1-8B-Instruct-exl3
Llama-3.2-1B-Instruct-exl3
Qwen2.5-7B-Instruct-exl3
gemma-3-27b-it-exl2
Mistral-7B-instruct-v0.3-exl2
CodeLlama-34B-instruct-exl2
Apertus-8B-Instruct-2509-exl3
Available quants: 2.00, 2.50, 3.00, 3.50, 4.00, 5.00, 6.00, 8.00 (H8) bits per weight
dbrx-instruct-exl2
Llama-3.3-Nemotron-Super-49B-v1-exl3
Llama2-7B-chat-exl2
Llama-3.2-3B-Instruct-exl2
Qwen3-0.6B-exl3
Mixtral-8x7B-instruct-exl2
turbcat-instruct-72b
command-r-v01-35B-exl2
Llama-3-70B-exl2
gemma-3-12b-it-exl2
Qwama-0.5B-Instruct-exl2
Phi-3-mini-128k-instruct-exl2
turbcat-instruct-72b-exl2
Qwen3-235B-A22B-exl3
Available quants: 2.00, 2.25, 2.50, 3.00 bits per weight
Mistral-7B-instruct-exl2
gemma-4-26B-A4B-it-exl3
gemma-4-26B-A4B-exl3
llama3-turbcat-instruct-8b-exl2
Mistral-Nemo-Base-12B-exl2
dots.llm1.inst-exl3
CodeLlama-13B-instruct-exl2
Qwen2.5-14B-Instruct-exl3
c4ai-command-r-plus-08-2024-exl3
Available quants: 2.07, 2.50, 3.00, 4.00, 5.00 bits per weight