Trelis

264 models • 1 total models in database

Sort by:

incorrect2874__partial2114_ties

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated. - Developed by: [More Information Needed] - Funded by [optional]: [More Information Needed] - Shared by [optional]: [More Information Needed] - Model type: [More Information Needed] - Language(s) (NLP): [More Information Needed] - License: [More Information Needed] - Finetuned from model [optional]: [More Information Needed] - Repository: [More Information Needed] - Paper [optional]: [More Information Needed] - Demo [optional]: [More Information Needed] Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). - Hardware Type: [More Information Needed] - Hours used: [More Information Needed] - Cloud Provider: [More Information Needed] - Compute Region: [More Information Needed] - Carbon Emitted: [More Information Needed]

—

Qwen3-4B_dsarc-programs-50-full-200-incorrect_20250808-134330-c5748

NaNK

—

incorrect2874__partial2114_linear

—

Llama-2-7b-chat-hf-sharded-bf16

NaNK

llama

SmolLM-135M-layer-pruned-90M-raw

llama

Qwen3-4B_dsarc-programs-correct-50_20250806-233716

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-correct-10_20250806-233707-c132

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-correct-10_20250806-233707-c176

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-correct-50_20250806-233716-c453

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-correct-50_20250806-233716-c604

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c1057

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c4228

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-50-full-200-incorrect_20250808-134330-trainercheckpoint-2874-temp

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

gpt-oss-20b_ds-arc-agi-2-partialplus-max-c1421

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-unsloth-bnb-4bit This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-correct-10_20250806-233707

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c2712

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-perfect-50-c321

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c120

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

99-instruct-v9

llama

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c75

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c100

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-parquet-programs-c24

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-arc-agi-2-training-hard-curriculum-c262

NaNK

license:apache-2.0

gpt-oss-20b-BF16_ds-arc-agi-2-reasoning-5_test-c1

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-BF16 This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-reasoning-5-c178

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-reasoning-5-c89

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c2114

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c25

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c50

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c100

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121308-100s4b4-c25

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c25

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c75

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c100

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c25

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c75

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c25

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c50

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c75

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c100

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-133320-c1

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-133320-c2

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-133320-c75

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-154450-c10

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c904

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c1808

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c2712

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c3614

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c904

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c1808

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c3614

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-partialplus-c56

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4Bdsarc-programs-50-full-200-partial20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-partialplus-c112

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-partialplus-c168

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-partialplus-c224

NaNK

license:apache-2.0

arc-1-fake-ttt-blended-c201

NaNK

license:apache-2.0

arc-1-fake-ttt-blended-c402

NaNK

license:apache-2.0

arc-1-fake-ttt-blended-c603

NaNK

license:apache-2.0

arc-1-fake-ttt-blended-c802

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-all-c148

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-all-c296

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-all-c444

NaNK

license:apache-2.0

arc-1-fake-ttt-unblended-all-c592

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-perfect-100_test-c8

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-perfect-50-c485

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-perfect-50-c970

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c1574

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-perfect-50_test-c4

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-perfect-50_test-c8

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-perfect-50_test-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-perfect-50_test-c8

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100_test-c4

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100_test-c8

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100-c8

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100-c771

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100-c1542

NaNK

license:apache-2.0

Qwen3-4B_ds-parquet-programs-c1403

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-parquet-programs-c2806

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-parquet-programs-c12

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c8

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c1403

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10_test-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10_test-c8

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c8

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c60

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-arc-agi-2-training-hard-curriculum-c131

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100-c1542_ds-arc-agi-1-refinement-finetuning-c81

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-partial-100-c1542_ds-arc-agi-1-refinement-finetuning-c162

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-20_test-c4

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-20_test-c8

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-20-c976

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-refinement-finetuning-partialplus-c552

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

gpt-oss-20b_ds-arc-agi-2-reasoning-5-c89

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-unsloth-bnb-4bit This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Meta-Llama-3-8B-Instruct-function-calling

NaNK

llama

transcribe-en_gb-spelling-v1-turbo

—

transcribe-en_us-spelling-v1-turbo

—

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c50

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c50

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c3148

NaNK

license:apache-2.0

Qwen3-4B_ds-parquet-programs_test-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-parquet-programs_test-c8

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-arc-agi-2-training-hard-curriculum-c4

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c2

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4Bds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-1-perfect-50-c642

NaNK

license:apache-2.0

test-oss-c1

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-BF16 This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets_rLoRA-32-c2

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4Bds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

smol-v7-sc-temporal_aa1

—

mpt-7b-8k-chat-sharded-bf16

NaNK

license:cc-by-nc-sa-4.0

Llama-2-7b-chat-hf-function-calling

NaNK

llama

Qwen3-4B-ds20250724_131808-20250725-132523

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250729-113936

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

smol-v7-1M_aa2

—

TinyLlama-1.1B-Chat-v0.3-AWQ

NaNK

llama

Mistral-Small-3.1-24B-Instruct-2503-touch-rugby-comprehensive-qa

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250729-115543

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250729-123351

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250729-142811

NaNK

license:apache-2.0

Soar-qwen-7b-ds20250729_114431-20250731-140526

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250801-141514

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets_rLoRA-32-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4Bds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Soar-qwen-14b-FP8-Dynamic

NaNK

—

Llama-2-7b-chat-hf-hosted-inference-8bit

NaNK

llama

Llama-2-7b-chat-hf-sharded-bf16-5GB

NaNK

llama

mamba-2.8b-slimpj-bf16

NaNK

license:apache-2.0

Qwen1.5-MLX-test

—

SmolLM-135M-Instruct-layer-pruned-90M-raw

llama

transcribe-british-spelling-v1-tiny-ctranslate2

—

TrelisSmolLM-base

llama

Soar-qwen-7b-ds20250724_131808-20250725-130403

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : julien31/Soar-qwen-7b This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250729-111804

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Mistral-7B-Instruct-v0.2-function-calling-v3

NaNK

—

Microsoft_Phi-4-FP8-Dynamic

—

whisper-large-v3-turbo-pilotgpt-unified-all-raw-no-pack-1s-merged-filtered

—

transcribe-en_us-spelling-v1-tiny-ctranslate2

license:mit

TinyLlama-1.1B-Chat-v1.0-bf16

NaNK

llama

OpenELM-450M-instruct-ORPO

—

Llama-3.2-1B-Instruct-MATH-3ep

NaNK

llama

act_so101_test

—

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c8

NaNK

license:apache-2.0

Llama-2-7b-hf-function-calling

NaNK

llama

all-MiniLM-L12-v2-ft-pairs-cosine

NaNK

—

multi-qa-MiniLM-L6-cos-v1-ft-pairs-1-epoch

NaNK

—

Llama-3.2-1B-Instruct-MATH-synthetic

NaNK

llama

SO-101-ACT

—

SO-101-ACT-beta_0.25

—

SO-101-ACT-n10

—

SO-101-ACT-n10_beta_1

—

SO-101-ACT-n10_beta_1_12500

—

SO-101-ACT-n10_k50

—

SO-101-SmolVLA-test

—

SO-101-ACT-test

—

lorge-16-jul

NaNK

license:apache-2.0

gemini_synth_10-22jul-test

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B-ds20250724_131808-20250725-142549

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250729-114617

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

gpt-oss-20b_ds-arc-agi-2-partialplus-max-c2842

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-unsloth-bnb-4bit This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

llava-v1.6-mistral-7b-PATCHED

NaNK

license:apache-2.0

Llama-2-13b-chat-hf-touch-rugby-rules-adapters

NaNK

—

all-MiniLM-L12-v2-ft-triplets-10Qs

NaNK

—

Meta-Llama-3.1-8B-Instruct-Trelis-ARC-1ep-20241013-201317-ft

NaNK

llama

Llama-3.2-1B-Instruct-ft-comprehensive-qa

NaNK

license:apache-2.0

gemma-3-4b-it-ft-touch-rugby-comprehensive-qa

NaNK

license:apache-2.0

gemini-2.5-reasoning-smol-21-jul

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen2.5-Coder-7B-Instruct-gemini_synth_50_random_split_1_training-20250723-113848

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen2.5-Coder-7B-Instruct This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

qwen-soar-sft-23-jul

NaNK

license:apache-2.0

Qwen3-4B-ds20250729_114431-20250730-172810

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c4

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4Bds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

whisper-turbo-sample-uk-english

—

mpt-7b-instruct-hosted-inference-8bit

NaNK

license:cc-by-sa-3.0

TinyLlama-1.1B-intermediate-step-480k-1T-chat-llama-style

NaNK

llama

Llama-2-13b-chat-hf-stanford-nil-policy-adapters

NaNK

base_model:meta-llama/Llama-2-13b-chat-hf

Llama-2-7b-chat-hf-6k

NaNK

llama

falcon-7b-chat-commercially-licensed-adapters

NaNK

—

falcon-7b-chat-commercial-use-4k

NaNK

—

Llama-2-7b-hf-stanford-nil-policy-ft-push-demo

NaNK

llama

mamba-2.8b-slimpj-chat-4k

NaNK

license:apache-2.0

TinyLlama-chat-ORPO-beta0.2

NaNK

llama

all-MiniLM-L12-v2-ft-Llama-3-70B

NaNK

—

all-MiniLM-L12-v2-ft-pairs-balanced

NaNK

—

all-MiniLM-L12-v2-ft-pairs-balanced-cpu

NaNK

—

all-MiniLM-L12-v2-ft-triplets-10q

NaNK

—

multi-qa-MiniLM-L6-cos-v1-ft-pairs

NaNK

—

multi-qa-MiniLM-L6-cos-v1-ft-pairs-1-epoch-scale-20

NaNK

—

multi-qa-MiniLM-L6-cos-v1-ft-pairs-2-cos-epoch-s20

NaNK

—

multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1-overlap

NaNK

—

multi-qa-MiniLM-L6-dot-v1-ft-triplets-2-cst-epoch-overlap

NaNK

—

SmolLM-360M-width-depth-pruned-to-80M

llama

Llama-3.2-3B-Instruct-MATH-ft

NaNK

llama

Llama-3.2-1B-Instruct-MATH-augmented-synthetic

NaNK

llama

Llama-3.2-1B-Instruct-MATH-synthetic-augmented

NaNK

llama

Llama-3.2-1B-Instruct-touch-rugby-synth-1epochs-20241009-122734-distilled-distilled

NaNK

llama

song-birds

—

Llama-3.2-1B-Instruct_gsm8k_rl_step2

NaNK

llama

Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr

NaNK

llama

soar-sft-21-jul

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

gemini_synth_50_random_split_1_training-23jul-1epoch

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

qwen-synth_1-train-23jul-3epoch

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B-ds20250724_131808-20250724-123014

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

ddp-demo-2

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0

Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c1

- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4Bds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

NaNK

license:apache-2.0