Trelis
Llama-2-7b-chat-hf-function-calling-v2
Llama-2-7b-chat-hf-function-calling-v3
whisper-tiny-llm-lingo
whisper-small-llm-lingo
Qwen3-4B-Thinking-2507_ds-arc-agi-2-reasoning-5-c89
TinyLlama-1.1B-intermediate-step-480k-1T-GGUF
transcribe-en_gb-spelling-v1-tiny
TinyLlama-1.1B-Chat-v0.2-GGUF
transcribe-en_gb-spelling-v1-tiny-ctranslate2
Llama-2-7b-chat-hf-function-calling-GPTQ
Qwen3-4B-Thinking-2507_ds-arc-agi-2-reasoning-5-c178
transcribe-en_us-spelling-v1-tiny
transcribe-british-spelling-v1-tiny
Qwen3-4B_ds-arc-agi-2-perfect-100_test-c4
TinyLlama-1.1B-Chat-v0.1-GGUF
incorrect2874__partial2114_ties
Qwen3-4B_dsarc-programs-50-full-200-incorrect_20250808-134330-c5748
incorrect2874__partial2114_linear
Llama-2-7b-chat-hf-sharded-bf16
SmolLM-135M-layer-pruned-90M-raw
Qwen3-4B_dsarc-programs-correct-50_20250806-233716
Qwen3-4B_dsarc-programs-correct-10_20250806-233707-c132
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-programs-correct-10_20250806-233707-c176
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-programs-correct-50_20250806-233716-c453
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-programs-correct-50_20250806-233716-c604
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c1057
Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c4228
Qwen3-4B_dsarc-programs-50-full-200-incorrect_20250808-134330-trainercheckpoint-2874-temp
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
gpt-oss-20b_ds-arc-agi-2-partialplus-max-c1421
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-unsloth-bnb-4bit This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-programs-correct-10_20250806-233707
Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c2712
Qwen3-4B_ds-arc-agi-1-perfect-50-c321
Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c120
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
99-instruct-v9
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c75
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c100
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-parquet-programs-c24
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-arc-agi-2-training-hard-curriculum-c262
gpt-oss-20b-BF16_ds-arc-agi-2-reasoning-5_test-c1
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-BF16 This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-reasoning-5-c178
Qwen3-4B_ds-arc-agi-2-reasoning-5-c89
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c2114
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c25
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c50
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-113400-100stps-c100
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121308-100s4b4-c25
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c25
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c75
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c100
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c25
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c75
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c25
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c50
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c75
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-125017-100s2e-4-c100
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-133320-c1
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-133320-c2
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-133320-c75
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-154450-c10
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c904
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c1808
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c2712
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-155856-c3614
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c904
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c1808
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-221545_cst-c3614
arc-1-fake-ttt-unblended-partialplus-c56
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-unblended-partialplus-c112
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-unblended-partialplus-c168
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-unblended-partialplus-c224
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-blended-c201
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-blended-c402
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-blended-c603
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-blended-c802
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-unblended-all-c148
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-unblended-all-c296
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-unblended-all-c444
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
arc-1-fake-ttt-unblended-all-c592
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_dsarc-programs-50-full-200-partial_20250807-211749-c3171 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-perfect-100_test-c8
Qwen3-4B_ds-arc-agi-2-perfect-50-c485
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-perfect-50-c970
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c1574
Qwen3-4B_ds-arc-agi-2-perfect-50_test-c4
Qwen3-4B_ds-arc-agi-2-perfect-50_test-c8
Qwen3-4B_ds-arc-agi-1-perfect-50_test-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-1-perfect-50_test-c8
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-1-partial-100_test-c4
Qwen3-4B_ds-arc-agi-1-partial-100_test-c8
Qwen3-4B_ds-arc-agi-1-partial-100-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-1-partial-100-c8
Qwen3-4B_ds-arc-agi-1-partial-100-c771
Qwen3-4B_ds-arc-agi-1-partial-100-c1542
Qwen3-4B_ds-parquet-programs-c1403
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-parquet-programs-c2806
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-parquet-programs-c12
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c8
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c1403
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c2806
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10_test-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10_test-c8
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c8
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-tricky-10-c60
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-arc-agi-2-training-hard-curriculum-c131
Qwen3-4B_ds-arc-agi-1-partial-100-c1542_ds-arc-agi-1-refinement-finetuning-c81
Qwen3-4B_ds-arc-agi-1-partial-100-c1542_ds-arc-agi-1-refinement-finetuning-c162
Qwen3-4B_ds-arc-agi-2-partial-20_test-c4
Qwen3-4B_ds-arc-agi-2-partial-20_test-c8
Qwen3-4B_ds-arc-agi-2-partial-20-c976
Qwen3-4B_ds-arc-agi-1-refinement-finetuning-partialplus-c552
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
gpt-oss-20b_ds-arc-agi-2-reasoning-5-c89
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-unsloth-bnb-4bit This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.
Meta-Llama-3-8B-Instruct-function-calling
transcribe-en_gb-spelling-v1-turbo
transcribe-en_us-spelling-v1-turbo
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-121606-4b4fa-c50
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_dsarc-agi-1-train-programs-best-length-filtered-250_20250811-124700-100s-pz-c50
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c3148
Qwen3-4B_ds-parquet-programs_test-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-parquet-programs_test-c8
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-arc-agi-2-training-hard-curriculum-c4
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c2
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_ds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-1-perfect-50-c642
test-oss-c1
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-BF16 This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets_rLoRA-32-c2
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_ds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
smol-v7-sc-temporal_aa1
mpt-7b-8k-chat-sharded-bf16
Llama-2-7b-chat-hf-function-calling
Qwen3-4B-ds20250724_131808-20250725-132523
Qwen3-4B-ds20250729_114431-20250729-113936
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
smol-v7-1M_aa2
TinyLlama-1.1B-Chat-v0.3-AWQ
Mistral-Small-3.1-24B-Instruct-2503-touch-rugby-comprehensive-qa
Qwen3-4B-ds20250729_114431-20250729-115543
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B-ds20250729_114431-20250729-123351
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B-ds20250729_114431-20250729-142811
Soar-qwen-7b-ds20250729_114431-20250731-140526
Qwen3-4B-ds20250729_114431-20250801-141514
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets_rLoRA-32-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_ds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Soar-qwen-14b-FP8-Dynamic
Llama-2-7b-chat-hf-hosted-inference-8bit
Llama-2-7b-chat-hf-sharded-bf16-5GB
mamba-2.8b-slimpj-bf16
Qwen1.5-MLX-test
SmolLM-135M-Instruct-layer-pruned-90M-raw
transcribe-british-spelling-v1-tiny-ctranslate2
TrelisSmolLM-base
Soar-qwen-7b-ds20250724_131808-20250725-130403
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : julien31/Soar-qwen-7b This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B-ds20250729_114431-20250729-111804
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Mistral-7B-Instruct-v0.2-function-calling-v3
Microsoft_Phi-4-FP8-Dynamic
whisper-large-v3-turbo-pilotgpt-unified-all-raw-no-pack-1s-merged-filtered
transcribe-en_us-spelling-v1-tiny-ctranslate2
TinyLlama-1.1B-Chat-v1.0-bf16
OpenELM-450M-instruct-ORPO
Llama-3.2-1B-Instruct-MATH-3ep
act_so101_test
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c8
Llama-2-7b-hf-function-calling
all-MiniLM-L12-v2-ft-pairs-cosine
multi-qa-MiniLM-L6-cos-v1-ft-pairs-1-epoch
Llama-3.2-1B-Instruct-MATH-synthetic
SO-101-ACT
SO-101-ACT-beta_0.25
SO-101-ACT-n10
SO-101-ACT-n10_beta_1
SO-101-ACT-n10_beta_1_12500
SO-101-ACT-n10_k50
SO-101-SmolVLA-test
SO-101-ACT-test
lorge-16-jul
gemini_synth_10-22jul-test
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B-ds20250724_131808-20250725-142549
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B-ds20250729_114431-20250729-114617
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
gpt-oss-20b_ds-arc-agi-2-partialplus-max-c2842
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/gpt-oss-20b-unsloth-bnb-4bit This gptoss model was trained 2x faster with Unsloth and Huggingface's TRL library.
llava-v1.6-mistral-7b-PATCHED
Llama-2-13b-chat-hf-touch-rugby-rules-adapters
all-MiniLM-L12-v2-ft-triplets-10Qs
Meta-Llama-3.1-8B-Instruct-Trelis-ARC-1ep-20241013-201317-ft
Llama-3.2-1B-Instruct-ft-comprehensive-qa
gemma-3-4b-it-ft-touch-rugby-comprehensive-qa
gemini-2.5-reasoning-smol-21-jul
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen2.5-Coder-7B-Instruct-gemini_synth_50_random_split_1_training-20250723-113848
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen2.5-Coder-7B-Instruct This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
qwen-soar-sft-23-jul
Qwen3-4B-ds20250729_114431-20250730-172810
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c4
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_ds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
whisper-turbo-sample-uk-english
mpt-7b-instruct-hosted-inference-8bit
TinyLlama-1.1B-intermediate-step-480k-1T-chat-llama-style
Llama-2-13b-chat-hf-stanford-nil-policy-adapters
Llama-2-7b-chat-hf-6k
falcon-7b-chat-commercially-licensed-adapters
falcon-7b-chat-commercial-use-4k
Llama-2-7b-hf-stanford-nil-policy-ft-push-demo
mamba-2.8b-slimpj-chat-4k
TinyLlama-chat-ORPO-beta0.2
all-MiniLM-L12-v2-ft-Llama-3-70B
all-MiniLM-L12-v2-ft-pairs-balanced
all-MiniLM-L12-v2-ft-pairs-balanced-cpu
all-MiniLM-L12-v2-ft-triplets-10q
multi-qa-MiniLM-L6-cos-v1-ft-pairs
multi-qa-MiniLM-L6-cos-v1-ft-pairs-1-epoch-scale-20
multi-qa-MiniLM-L6-cos-v1-ft-pairs-2-cos-epoch-s20
multi-qa-MiniLM-L6-dot-v1-ft-pairs-4-cst-epoch-s1-overlap
multi-qa-MiniLM-L6-dot-v1-ft-triplets-2-cst-epoch-overlap
SmolLM-360M-width-depth-pruned-to-80M
Llama-3.2-3B-Instruct-MATH-ft
Llama-3.2-1B-Instruct-MATH-augmented-synthetic
Llama-3.2-1B-Instruct-MATH-synthetic-augmented
Llama-3.2-1B-Instruct-touch-rugby-synth-1epochs-20241009-122734-distilled-distilled
song-birds
Llama-3.2-1B-Instruct_gsm8k_rl_step2
Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr
soar-sft-21-jul
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
gemini_synth_50_random_split_1_training-23jul-1epoch
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
qwen-synth_1-train-23jul-3epoch
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B-ds20250724_131808-20250724-123014
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
ddp-demo-2
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : unsloth/Qwen3-4B This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Qwen3-4B_ds-arc-agi-2-partial-100-c2806_ds-datasets-c1
- Developed by: Trelis - License: apache-2.0 - Finetuned from model : Trelis/Qwen3-4B_ds-arc-agi-2-partial-100-c2806 This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.