crestf411
L3.1-70B-sunfall-v0.6.1
L3.1-nemotron-sunfall-v0.7.0
daybreak-kunoichi-2dpo-7b-gguf
L3.1-8B-Dark-Planet-Slush
This is based on v1.1 and includes a merge with DavidAU/L3.1-Dark-Planet-SpinFire-Uncensored-8B. I did all my testing with temp 1, min-p 0.1, DRY 0.8, and enabled XTC at higher contexts. This model was merged with the TIES merge method, using meta-llama/Llama-3.1-8B as the base. The following YAML configuration was used to produce this model:
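The original merge recipe is not reproduced here; below is a minimal mergekit-style sketch of a TIES merge of this kind, assuming a hypothetical repo name for the v1.1 Slush model and placeholder density/weight values rather than the ones actually used.

```yaml
# Hypothetical mergekit TIES config sketch -- NOT the actual recipe.
# The v1.1 Slush repo name and the density/weight values are placeholders.
models:
  - model: crestf411/L3.1-8B-Slush-v1.1   # placeholder name for the v1.1 model
    parameters:
      density: 0.5
      weight: 0.5
  - model: DavidAU/L3.1-Dark-Planet-SpinFire-Uncensored-8B
    parameters:
      density: 0.5
      weight: 0.5
merge_method: ties
base_model: meta-llama/Llama-3.1-8B
parameters:
  normalize: true
dtype: bfloat16
```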
L3-70B-daybreak-abliterated-v0.4
MN-Slush-GGLD-GGUF
MN-Slush
Slush is a two-stage model trained with high LoRA dropout. Stage 1 is a pretraining continuation on the base model, aimed at boosting the model's creativity and writing capabilities. This is then merged into the instruction-tuned model, and stage 2 is a fine-tuning step on top of that to further enhance its roleplaying capabilities and/or to repair any damage caused by the stage 1 merge.

This is still early stage. As always, feedback is welcome, and begone if you demand perfection.

The second stage, like the Sunfall series, follows the SillyTavern preset (Mistral V2 & V3, though V3-Tekken works fine), so ymmv, in particular if you use some other tool and/or preset. I did all my testing with temp 1, min-p 0.1, DRY 0.8.

Stage 1 (continued pretraining)
- Target: mistralai/Mistral-Nemo-Base-2407 (resulting LoRA merged into mistralai/Mistral-Nemo-Instruct-2407)
- LoRA dropout 0.5 (motivation)
- LoRA rank 64, alpha 128 (motivation)
- LR cosine 4e-6
- LoRA+ with LR Ratio: 15
- Context size: 16384
- Gradient accumulation steps: 4
- Epochs: 1

Stage 2 (fine tune)
- Target: Stage 1 model
- LoRA dropout 0.5
- LoRA rank 32, alpha 64
- LR cosine 5e-6 (min 5e-7)
- LoRA+ with LR Ratio: 15
- Context size: 16384
- Gradient accumulation steps: 4
- Epochs: 2

This model was merged with the TIES merge method, using mistralai/Mistral-Nemo-Base-2407 as the base. The following YAML configuration was used to produce this model:
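The actual merge YAML is not included here; it would follow the same mergekit TIES pattern as the Dark Planet example above, with mistralai/Mistral-Nemo-Base-2407 as base_model. As a separate illustration of the hyperparameters listed for Stage 2, here is a hypothetical axolotl-style config sketch; the base model path, dataset path, and type are placeholders, not the real training setup.

```yaml
# Hypothetical axolotl-style sketch of the Stage 2 (fine-tune) settings above.
# Paths and dataset fields are placeholders; only the listed hyperparameters are from the card.
base_model: ./stage1-merged-model        # placeholder for the Stage 1 model
adapter: lora
lora_r: 32
lora_alpha: 64
lora_dropout: 0.5
lr_scheduler: cosine
learning_rate: 5.0e-6                    # decays toward the ~5e-7 minimum noted above
loraplus_lr_ratio: 15                    # LoRA+ with LR ratio 15
sequence_len: 16384
gradient_accumulation_steps: 4
num_epochs: 2
datasets:
  - path: ./roleplay-data.jsonl          # placeholder dataset
    type: sharegpt
```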