sthenno

21 models • 7 total models in database
Sort by:

tempesthenno-reasoning-exp-1216

13
2

tempesthenno-kto-0205-ckpt80

update: now checking for evaluations without chat templates This is a merge of pre-trained language models created using mergekit. This model was merged using the SCE merge method using sthenno/tempesthenno-nuslerp-0124 as a base. The following models were included in the merge: sthenno/tempesthenno-icy-0130-01 sthenno/tempesthenno-icy-0130-02 sthenno/tempesthenno-icy-0130-03 The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |39.74| |IFEval (0-Shot) |62.18| |BBH (3-Shot) |50.10| |MATH Lvl 5 (4-Shot)|37.99| |GPQA (0-shot) |19.69| |MuSR (0-shot) |19.84| |MMLU-PRO (5-shot) |48.65|

NaNK
license:apache-2.0
6
3

tempesthenno-sft-0314-stage1-ckpt50

This modelcard aims to be a base template for new models. It has been generated using this raw template. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]

NaNK
license:apache-2.0
4
4

tempesthenno-icy-0130

NaNK
license:apache-2.0
3
8

tempesthenno-ms-0309-001

NaNK
license:apache-2.0
3
5

tempesthenno-nuslerp-001

This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-dtask /Users/sthenno/models/tempesthenno--converge-breadcrumbs The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |39.94| |IFEval (0-Shot) |79.26| |BBH (3-Shot) |51.04| |MATH Lvl 5 (4-Shot)|31.72| |GPQA (0-shot) |16.44| |MuSR (0-shot) |13.88| |MMLU-PRO (5-shot) |47.30|

NaNK
license:apache-2.0
3
4

tempesthenno-nuslerp-0124

update: now checking the evaluations without chat templates This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-breadcrumbs /Users/sthenno/models/tempesthenno--converge-dtask The following YAML configuration was used to produce this model: Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |40.97| |IFEval (0-Shot) |70.04| |BBH (3-Shot) |49.28| |MATH Lvl 5 (4-Shot)|39.27| |GPQA (0-shot) |18.68| |MuSR (0-shot) |20.21| |MMLU-PRO (5-shot) |48.36|

NaNK
license:apache-2.0
3
4

tempesthenno-ppo-ckpt40

This is a merge of pre-trained language models created using mergekit. This model was merged using the NuSLERP merge method. The following models were included in the merge: /Users/sthenno/models/tempesthenno--converge-dtask /Users/sthenno/models/tempesthenno--converge-breadcrumbs The following YAML configuration was used to produce this model: | Metric |Value| |-------------------|----:| |Avg. |40.55| |IFEval (0-Shot) |79.23| |BBH (3-Shot) |50.57| |MATH Lvl 5 (4-Shot)|34.21| |GPQA (0-shot) |17.00| |MuSR (0-shot) |14.56| |MMLU-PRO (5-shot) |47.69| | Metric |Value| |-------------------|----:| |Avg. |42.74| |IFEval (0-Shot) |79.23| |BBH (3-Shot) |50.57| |MATH Lvl 5 (4-Shot)|47.36| |GPQA (0-shot) |17.00| |MuSR (0-shot) |14.56| |MMLU-PRO (5-shot) |47.69|

NaNK
license:apache-2.0
2
4

tempesthenno-sft-0309-ckpt10

This modelcard aims to be a base template for new models. It has been generated using this raw template. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]

NaNK
license:apache-2.0
2
2

tempesthenno-14b-nuslerp-0111

NaNK
license:apache-2.0
2
1

tempesthenno-0126-ckpt150

NaNK
license:apache-2.0
2
1

tempesthenno-ms-0314-001

NaNK
1
3

miscii-1218-ckpt350

NaNK
license:apache-2.0
1
2

tempesthenno-fusion-0309

This is a merge of pre-trained language models created using mergekit. This model was merged using the Arcee Fusion merge method using sthenno-com/miscii-14b-0218 as a base. The following models were included in the merge: /home/ubuntu/tmp/models/tempesthenno-sft-0309-stage1-ckpt10 The following YAML configuration was used to produce this model:

NaNK
license:apache-2.0
1
2

tempesthenno-sft-0314

NaNK
1
2

inferno-math-stage1-ckpt1400

license:apache-2.0
1
0

miscii-1020

license:mit
0
2

miscii-1023-001-200

license:mit
0
2

miscii-1223-exp-001

0
1

miscii-1225-19b-preset

This is a merge of pre-trained language models created using mergekit. This model was merged using the passthrough merge method. The following models were included in the merge: sthenno-com/miscii-14b-1225 The following YAML configuration was used to produce this model:

NaNK
0
1

tempesthenno-hs2-rm

NaNK
license:apache-2.0
0
1