sthenno

21 models • 7 total models in database

Sort by:

tempesthenno-reasoning-exp-1216

tempesthenno-kto-0205-ckpt80

update: now checking for evaluations without chat templates This is a merge of pre-trained language models created using mergekit. This model was merged using the SCE merge method using sthenno/tempesthenno-nuslerp-0124 as a base. The following models were included in the merge: sthenno/tempesthenno-icy-0130-01 sthenno/tempesthenno-icy-0130-02 sthenno/tempesthenno-icy-0130-03 The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |39.74| |IFEval (0-Shot) |62.18| |BBH (3-Shot) |50.10| |MATH Lvl 5 (4-Shot)|37.99| |GPQA (0-shot) |19.69| |MuSR (0-shot) |19.84| |MMLU-PRO (5-shot) |48.65|

license:apache-2.0

tempesthenno-sft-0314-stage1-ckpt50

This modelcard aims to be a base template for new models. It has been generated using this raw template. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]

license:apache-2.0

tempesthenno-icy-0130

license:apache-2.0

tempesthenno-ms-0309-001

license:apache-2.0

tempesthenno-nuslerp-001

This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-dtask /Users/sthenno/models/tempesthenno--converge-breadcrumbs The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |39.94| |IFEval (0-Shot) |79.26| |BBH (3-Shot) |51.04| |MATH Lvl 5 (4-Shot)|31.72| |GPQA (0-shot) |16.44| |MuSR (0-shot) |13.88| |MMLU-PRO (5-shot) |47.30|

license:apache-2.0

tempesthenno-nuslerp-0124

update: now checking the evaluations without chat templates This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-breadcrumbs /Users/sthenno/models/tempesthenno--converge-dtask The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |40.97| |IFEval (0-Shot) |70.04| |BBH (3-Shot) |49.28| |MATH Lvl 5 (4-Shot)|39.27| |GPQA (0-shot) |18.68| |MuSR (0-shot) |20.21| |MMLU-PRO (5-shot) |48.36|

license:apache-2.0

tempesthenno-ppo-ckpt40

This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-dtask /Users/sthenno/models/tempesthenno--converge-breadcrumbs The following YAML configuration was used to produce this model: | Metric |Value| |-------------------|----:| |Avg. |40.55| |IFEval (0-Shot) |79.23| |BBH (3-Shot) |50.57| |MATH Lvl 5 (4-Shot)|34.21| |GPQA (0-shot) |17.00| |MuSR (0-shot) |14.56| |MMLU-PRO (5-shot) |47.69| | Metric |Value| |-------------------|----:| |Avg. |42.74| |IFEval (0-Shot) |79.23| |BBH (3-Shot) |50.57| |MATH Lvl 5 (4-Shot)|47.36| |GPQA (0-shot) |17.00| |MuSR (0-shot) |14.56| |MMLU-PRO (5-shot) |47.69|

license:apache-2.0

tempesthenno-sft-0309-ckpt10

This modelcard aims to be a base template for new models. It has been generated using this raw template. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]

license:apache-2.0

tempesthenno-14b-nuslerp-0111

license:apache-2.0

tempesthenno-0126-ckpt150

license:apache-2.0

tempesthenno-ms-0314-001

miscii-1218-ckpt350

license:apache-2.0

tempesthenno-fusion-0309

This is a merge of pre-trained language models created using mergekit. This model was merged using the Arcee Fusion merge method using sthenno-com/miscii-14b-0218 as a base. The following models were included in the merge: /home/ubuntu/tmp/models/tempesthenno-sft-0309-stage1-ckpt10 The following YAML configuration was used to produce this model:

license:apache-2.0

tempesthenno-sft-0314

inferno-math-stage1-ckpt1400

license:apache-2.0

miscii-1020

miscii-1023-001-200

miscii-1223-exp-001

miscii-1225-19b-preset

This is a merge of pre-trained language models created using mergekit. This model was merged using the passthrough merge method. The following models were included in the merge: sthenno-com/miscii-14b-1225 The following YAML configuration was used to produce this model:

tempesthenno-hs2-rm

license:apache-2.0