azalahmadkhan
27 models • 0 total models in database
Sort by:
Qwen2.5-3B-Instruct-GRPO-vanilla-G-16-novllm-25pct
NaNK
—
26
0
Qwen2.5-3B-Instruct-GRPO-vanilla-G-8-novllm-25pct
NaNK
—
26
0
Qwen2.5-3B-Instruct-GRPO-vanilla-G-8-novllm-50pct
NaNK
—
22
0
Llama-3.2-3B-Instruct-GRPO-vanilla-G-8-novllm-25pct
NaNK
llama
13
0
Qwen2.5-3B-Instruct-DAPO-G-16-novllm-50pct
NaNK
—
13
0
Qwen2.5-3B-Instruct-DAPO-G-8-novllm-75pct
NaNK
—
13
0
Llama-3.2-3B-Instruct-GRPO-vanilla-G-4-novllm-25pct
NaNK
llama
13
0
Qwen2.5-3B-Instruct-DAPO-G-16-novllm-25pct
NaNK
—
13
0
Qwen2.5-3B-Instruct-DAPO-G-8-novllm-50pct
NaNK
—
13
0
Qwen2.5-3B-Instruct-DAPO-G-8-novllm-25pct
NaNK
—
13
0
Qwen2.5-3B-Instruct-DAPO-G-16-novllm-75pct
NaNK
—
11
0
Qwen2.5-3B-Instruct-GRPO-vanilla-G-8-novllm-75pct
NaNK
—
10
0
Qwen2.5-3B-Instruct-GRPO-vanilla-G-16-novllm-50pct
NaNK
—
9
0
Qwen2.5-3B-Instruct-GRPO-vanilla-G-4-novllm-50pct
NaNK
—
9
0
Qwen2.5-3B-Instruct-GRPO-vanilla-G-4-novllm-75pct
NaNK
—
8
0
Qwen2.5-3B-Instruct-GRPO-vanilla-G-4-novllm-25pct
NaNK
—
8
0
Qwen2.5-3B-Instruct-DAPO-G-8-75pct
NaNK
—
8
0
Qwen2.5-3B-Instruct-DAPO-G-8-25pct
NaNK
—
8
0
Qwen2.5-3B-Instruct-DAPO-DynamicG-75pct
NaNK
—
7
0
Llama-3.2-3B-Instruct-DAPO-G-4-25pct
NaNK
llama
7
0
Qwen2.5-3B-Instruct-DAPO-DynamicG-25pct
NaNK
—
6
0
Qwen2.5-3B-Instruct-DAPO-G-8-50pct
NaNK
—
5
0
Llama-3.2-3B-Instruct-DAPO-G-4-50pct
NaNK
llama
4
0
Qwen2.5-3B-Instruct-DAPO-DynamicG-50pct
NaNK
—
4
0
Qwen2.5-3B-Instruct-DAPO-G-16-25pct
NaNK
—
4
0
Llama-3.2-3B-Instruct-GRPO-vanilla-G-16-25pct
NaNK
llama
4
0
Llama-3.2-3B-Instruct-GRPO-vanilla-G-16-50pct
NaNK
llama
3
0