yinita

25 models • 1 total models in database

Mg 8b Cot Sft General 1024

78
1

mg_qwen3_8B_grpo_180step_mix

76
0

mg_qwen3_8B_gigpo_400step_mix

67
0

qwen3-8b-v1-lora-0812-3epochs

41
0

mg_mix_rl_60step_1023

38
0

mg_qwen3_8B_grpo_180step_mix_only_gpt_5

33
0

mg-qwen3-8b-big-dataset-0826-1epochs

14
0

qwen3-8b-big-dataset-1003-1epochs

14
0

qwen3-8b-v3-ppo-lora-0823-456

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]

11
0

mg-ppo-4o-4b-0828-mix-v1-382step

11
0

mg-ppo-4o-4b-mix-0828-v1-200step

10
0

mg-8b-cot-sft-mafia-1024

10
0

cpdc_Qwen3-14B-0627-200steps-toolcall-rl

9
0

cpdc-qwen3-8b-task2-v0616-stage1-lora-cp-sync-by-lian-dataversion2-1epoch

4
0

qwen3-8b-v2-gpt5-chat-distill-lora-0814-3epochs

4
0

qwen3-8b-v4-strategy_colonel-1003-1epochs

4
0

qwen3-8b-v2-gpt5-distill-lora-1003-3epochs

4
0

sft-best

4
0

cpdc-Qwen3-8B-grpo-v1-300step

3
0

cpdc_Qwen3-8B_grpo-0617_1318-onlytoolcall_step_100

3
0

cpdc_qwen3-14b-task2-v0620-lora-lian-v2

3
0

cpdc_rl-official-100step-qwen3-14B

3
0

mg-qwen3-8b-big-dataset-0826-1epochs_plain_0929_50step

3
0

3pipd-qwen3-8b-base_plain_0929_100step

3
0

cpdc_official-q3-8b-sft-3epoch

2
0