crestf411

26 models • 1 total models in database
Sort by:

L3.1-70B-sunfall-v0.6.1

NaNK
llama
254
8

L3.1-nemotron-sunfall-v0.7.0

NaNK
llama
238
15

daybreak-kunoichi-2dpo-7b-gguf

NaNK
208
18

L3.1-8B-Dark-Planet-Slush

This is based on v1.1 and includes a merge with DavidAU/L3.1-Dark-Planet-SpinFire-Uncensored-8B. I did all my testing with temp 1, min-p 0.1, DRY 0.8. I enabled XTC at higher contexts. This model was merged using the TIES merge method using meta-llama/Llama-3.1-8B as a base. The following YAML configuration was used to produce this model:

NaNK
llama
122
2

L3-70B-daybreak-abliterated-v0.4

NaNK
llama
67
5

MN-Slush-GGLD-GGUF

67
4

MN-Slush

Slush is a two-stage model trained with high LoRA dropout, where stage 1 is a pretraining continuation on the base model, aimed at boosting the model's creativity and writing capabilities. This is then merged into the instruction tune model, and stage 2 is a fine tuning step on top of this to further enhance its roleplaying capabilities and/or to repair any damage caused in the stage 1 merge. This is still early stage. As always, feedback is welcome, and begone if you demand perfection. The second stage, like the Sunfall series, follows the Silly Tavern preset (Mistral V2 & V3, though V3-Tekken works fine), so ymmv in particular if you use some other tool and/or preset. I did all my testing with temp 1, min-p 0.1, DRY 0.8. Stage 1 (continued pretraining) Target: mistralai/Mistral-Nemo-Base-2407 (resulting LoRA merged into mistralai/Mistral-Nemo-Instruct-2407) LoRA dropout 0.5 (motivation) LoRA rank 64, alpha 128 (motivation) LR cosine 4e-6 LoRA+ with LR Ratio: 15 Context size: 16384 Gradient accumulation steps: 4 Epochs: 1 Stage 2 (fine tune) Target: Stage 1 model LoRA dropout 0.5 LoRA rank 32, alpha 64 LR cosine 5e-6 (min 5e-7) LoRA+ with LR Ratio: 15 Context size: 16384 Gradient accumulation steps: 4 Epochs: 2 This model was merged using the TIES merge method using mistralai/Mistral-Nemo-Base-2407 as a base. The following YAML configuration was used to produce this model:

NaNK
63
33

L3-70B-daybreak-storywriter-v0.4

NaNK
llama
60
5

MN-DPT-Slush-GGUF

49
0

MS-sunfall-v0.7.0-gguf

NaNK
48
7

daybreak-kunoichi-dpo-7b-gguf

NaNK
26
4

L3.1-8B-sunfall-v0.6.1-dpo

NaNK
llama
19
7

L3.1-8B-Slush-v1.1

NaNK
llama
19
6

crestfall-echidna-v0.3-L2-13b

NaNK
16
6

L3.1-nemotron-sunfall-v0.7.0-q4_k_m

14
1

L3.1-8B-sunfall-stheno-v0.6.1

NaNK
llama
12
4

daybreak-mixtral-8x7b-v1.0-gguf

NaNK
10
3

Q2.5-32B-Slush

NaNK
7
11

MN-SlushoMix

7
2

gemma2-9B-sunfall-v0.5.2

NaNK
2
23

nemo-sunfall-v0.6.1

2
7

daybreak-kunoichi-2dpo-7b

NaNK
1
15

MS-sunfall-v0.7.0

NaNK
1
12

daybreak-kunoichi-dpo-7b

NaNK
1
1

sunfall-peft

0
15

daybreak-peft

0
6