sthenno
tempesthenno-reasoning-exp-1216
tempesthenno-kto-0205-ckpt80
update: now checking for evaluations without chat templates This is a merge of pre-trained language models created using mergekit. This model was merged using the SCE merge method using sthenno/tempesthenno-nuslerp-0124 as a base. The following models were included in the merge: sthenno/tempesthenno-icy-0130-01 sthenno/tempesthenno-icy-0130-02 sthenno/tempesthenno-icy-0130-03 The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |39.74| |IFEval (0-Shot) |62.18| |BBH (3-Shot) |50.10| |MATH Lvl 5 (4-Shot)|37.99| |GPQA (0-shot) |19.69| |MuSR (0-shot) |19.84| |MMLU-PRO (5-shot) |48.65|
tempesthenno-sft-0314-stage1-ckpt50
This modelcard aims to be a base template for new models. It has been generated using this raw template. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]
tempesthenno-icy-0130
tempesthenno-ms-0309-001
tempesthenno-nuslerp-001
This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-dtask /Users/sthenno/models/tempesthenno--converge-breadcrumbs The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |39.94| |IFEval (0-Shot) |79.26| |BBH (3-Shot) |51.04| |MATH Lvl 5 (4-Shot)|31.72| |GPQA (0-shot) |16.44| |MuSR (0-shot) |13.88| |MMLU-PRO (5-shot) |47.30|
tempesthenno-nuslerp-0124
update: now checking the evaluations without chat templates This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-breadcrumbs /Users/sthenno/models/tempesthenno--converge-dtask The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |40.97| |IFEval (0-Shot) |70.04| |BBH (3-Shot) |49.28| |MATH Lvl 5 (4-Shot)|39.27| |GPQA (0-shot) |18.68| |MuSR (0-shot) |20.21| |MMLU-PRO (5-shot) |48.36|
tempesthenno-ppo-ckpt40
This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-dtask /Users/sthenno/models/tempesthenno--converge-breadcrumbs The following YAML configuration was used to produce this model: | Metric |Value| |-------------------|----:| |Avg. |40.55| |IFEval (0-Shot) |79.23| |BBH (3-Shot) |50.57| |MATH Lvl 5 (4-Shot)|34.21| |GPQA (0-shot) |17.00| |MuSR (0-shot) |14.56| |MMLU-PRO (5-shot) |47.69| | Metric |Value| |-------------------|----:| |Avg. |42.74| |IFEval (0-Shot) |79.23| |BBH (3-Shot) |50.57| |MATH Lvl 5 (4-Shot)|47.36| |GPQA (0-shot) |17.00| |MuSR (0-shot) |14.56| |MMLU-PRO (5-shot) |47.69|
tempesthenno-sft-0309-ckpt10
This modelcard aims to be a base template for new models. It has been generated using this raw template. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]
tempesthenno-14b-nuslerp-0111
tempesthenno-0126-ckpt150
tempesthenno-ms-0314-001
miscii-1218-ckpt350
tempesthenno-fusion-0309
This is a merge of pre-trained language models created using mergekit. This model was merged using the Arcee Fusion merge method using sthenno-com/miscii-14b-0218 as a base. The following models were included in the merge: /home/ubuntu/tmp/models/tempesthenno-sft-0309-stage1-ckpt10 The following YAML configuration was used to produce this model:
tempesthenno-sft-0314
inferno-math-stage1-ckpt1400
miscii-1020
miscii-1023-001-200
miscii-1223-exp-001
miscii-1225-19b-preset
This is a merge of pre-trained language models created using mergekit. This model was merged using the passthrough merge method. The following models were included in the merge: sthenno-com/miscii-14b-1225 The following YAML configuration was used to produce this model: