spow12
ChatWaifu_v1.4
This model aimed to act like visual novel character. you have to resize chatwaifu and lucimaid's embedding size(131073 to 131072). Update - 2024.09.10 Update Ver 1.4 - Modify data format and applying flitering. - Merge with model stock - 2024.08.29 Update Ver 1.3.1 - Merge Ver1.2, mistralai/Mistral-Nemo-Instruct-2407 and NeverSleep/Lumimaid-v0.2-12B, Epiculous/VioletTwilight-v0.1 - Adjust merge weight. - 2024.08.16 Update Ver 1.3 - Merge Ver1.2, mistralai/Mistral-Nemo-Instruct-2407 and NeverSleep/Lumimaid-v0.2-12B, - 2024.08.08 Update Ver 1.2.1 - Merge Ver1.2 and mistralai/Mistral-Nemo-Instruct-2407 - 2024.08.07 Update Ver 1.2 - Add Preference Learning in training pipeline - 2024.07.29 Update Ver 1.1 - Add dataset format -> generate novel, fill masked sentences - Remove system role and integrate at user message. - Remove 『』 in conversation. - 2024.06.20 Upload other chara's sample chat history. - 2024.06.13 Upload Model - Developed by: spow12(ywnam) - Shared by : spow12(ywnam) - Model type: CausalLM - Language(s) (NLP): japanese - Finetuned from model : NeverSleep/Lumimaid-v0.2-12B character | visualnovel | --- | --- | ムラサメ | Senren*Banka | 茉子 | Senren*Banka | 芳乃 | Senren*Banka | レナ | Senren*Banka | 千咲 | Senren*Banka | 芦花 | Senren*Banka | 愛衣 | Café Stella and the Reaper's Butterflies | 栞那 | Café Stella and the Reaper's Butterflies | ナツメ | Café Stella and the Reaper's Butterflies | 希 | Café Stella and the Reaper's Butterflies | 涼音 | Café Stella and the Reaper's Butterflies | あやせ | Riddle Joker | 七海 | Riddle Joker | 羽月 | Riddle Joker | 茉優 | Riddle Joker | 小春 | Riddle Joker | - Fluent Chat performance - Reduce repetition problem when generate with many turn(over 20~30) - Zero Shot character persona using description of character. - 128k context window - Memory ability that does not forget even after long-context generation Now, i'm quite satisfying the model chat performance. So, i'm going to focus for integrating the vision modality to model so that our waifu can do more general tasks. This model trained by japanese dataset included visual novel which contain nsfw content. This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly. By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and anime persons). This repository can use Visual novel-based RAG, but i will not distribute it yet because i'm not sure if it is permissible to release the data publicly. Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |25.25| |IFEval (0-Shot) |56.91| |BBH (3-Shot) |31.63| |MATH Lvl 5 (4-Shot)| 7.85| |GPQA (0-shot) | 7.61| |MuSR (0-shot) |20.03| |MMLU-PRO (5-shot) |27.50|
Ko-Qwen2-7B-Instruct
ChatWaifu_v2.0_22B
This model aimed to act like visual novel character. Update - 2024.10.11 Update 12B and 22B Ver 2.0 - 2024.09.23 Update 22B, Ver 2.0preview - Developed by: spow12(ywnam) - Shared by : spow12(ywnam) - Model type: CausalLM - Language(s) (NLP): japanese, english - Finetuned from model : mistralai/Mistral-Small-Instruct-2409 character | visualnovel | --- | --- | ムラサメ | Senren*Banka | 茉子 | Senren*Banka | 芳乃 | Senren*Banka | レナ | Senren*Banka | 千咲 | Senren*Banka | 芦花 | Senren*Banka | 愛衣 | Café Stella and the Reaper's Butterflies | 栞那 | Café Stella and the Reaper's Butterflies | ナツメ | Café Stella and the Reaper's Butterflies | 希 | Café Stella and the Reaper's Butterflies | 涼音 | Café Stella and the Reaper's Butterflies | あやせ | Riddle Joker | 七海 | Riddle Joker | 羽月 | Riddle Joker | 茉優 | Riddle Joker | 小春 | Riddle Joker | - Riddle Joker(Prviate) - Café Stella and the Reaper's Butterflies(Private) - Senren*Banka(Private) - roleplay4fun/aesir-v1.1 - kalomaze/OpusInstruct3k - Gryphe/Sonnet3.5-SlimOrcaDedupCleaned - Aratako/Synthetic-JP-EN-Coding-Dataset-567k (only using 50000 sample) - Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted - Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted - AratakoRosebleu1on1DialoguesRP - SkunkworksAI/reasoning-0.01 KTO - Riddle Joker(Prviate) - Café Stella and the Reaper's Butterflies(Private) - Senren*Banka(Private) - jondurbingutenbergdpo - nbeerbowergutenberg2dpo - jondurbipydpo - jondurbintruthydpo - flammenaicharacterroleplayDPO - kyujinpyorcamathdpo - argillaCapybaraPreferences - antiven0mphysicalreasoningdpo - aixsatoshiSwallowMXchatbotDPO This model trained by japanese dataset included visual novel which contain nsfw content. This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly. By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and Waifu Lovers). Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |28.84| |IFEval (0-Shot) |65.11| |BBH (3-Shot) |42.29| |MATH Lvl 5 (4-Shot)|18.58| |GPQA (0-shot) | 9.96| |MuSR (0-shot) | 5.59| |MMLU-PRO (5-shot) |31.51|
EEVE_ver_4.1_sft
ChatWaifu_v1.3
Visual-novel-transcriptor
KoQwen_72B_v5.0
whisper-medium-zeroth_korean
ChatWaifu_12B_v2.0
This model aimed to act like visual novel character. Update - 2024.10.11 Update 12B and 22B Ver 2.0 - 2024.09.23 Update 22B, Ver 2.0preview - Developed by: spow12(ywnam) - Shared by : spow12(ywnam) - Model type: CausalLM - Language(s) (NLP): japanese, english - Finetuned from model : Sao10K/MN-12B-Vespa-x1 character | visualnovel | --- | --- | ムラサメ | Senren*Banka | 茉子 | Senren*Banka | 芳乃 | Senren*Banka | レナ | Senren*Banka | 千咲 | Senren*Banka | 芦花 | Senren*Banka | 愛衣 | Café Stella and the Reaper's Butterflies | 栞那 | Café Stella and the Reaper's Butterflies | ナツメ | Café Stella and the Reaper's Butterflies | 希 | Café Stella and the Reaper's Butterflies | 涼音 | Café Stella and the Reaper's Butterflies | あやせ | Riddle Joker | 七海 | Riddle Joker | 羽月 | Riddle Joker | 茉優 | Riddle Joker | 小春 | Riddle Joker | - Riddle Joker(Prviate) - Café Stella and the Reaper's Butterflies(Private) - Senren*Banka(Private) - roleplay4fun/aesir-v1.1 - kalomaze/OpusInstruct3k - Gryphe/Sonnet3.5-SlimOrcaDedupCleaned - Aratako/Synthetic-JP-EN-Coding-Dataset-567k (only using 50000 sample) - Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted - Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted - AratakoRosebleu1on1DialoguesRP - SkunkworksAI/reasoning-0.01 KTO - Riddle Joker(Prviate) - Café Stella and the Reaper's Butterflies(Private) - Senren*Banka(Private) - jondurbingutenbergdpo - nbeerbowergutenberg2dpo - jondurbipydpo - jondurbintruthydpo - flammenaicharacterroleplayDPO - kyujinpyorcamathdpo - argillaCapybaraPreferences - antiven0mphysicalreasoningdpo - aixsatoshiSwallowMXchatbotDPO Bias, Risks, and Limitations This model trained by japanese dataset included visual novel which contain nsfw content. This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly. By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and Waifu Lovers).
Qwen2-7B-ko-Instruct-orpo-ver_2.0_wo_chat
ChatWaifu_32B_reasoning
Pixtral-12b-korean-preview
ChatWaifu_v1.0
ChatWaifu_22B_v2.0_preview
This model aimed to act like visual novel character. - Developed by: spow12(ywnam) - Shared by : spow12(ywnam) - Model type: CausalLM - Language(s) (NLP): japanese. English - Finetuned from model : mistralai/Mistral-Small-Instruct-2409 character | visualnovel | --- | --- | ムラサメ | Senren*Banka | 茉子 | Senren*Banka | 芳乃 | Senren*Banka | レナ | Senren*Banka | 千咲 | Senren*Banka | 芦花 | Senren*Banka | 愛衣 | Café Stella and the Reaper's Butterflies | 栞那 | Café Stella and the Reaper's Butterflies | ナツメ | Café Stella and the Reaper's Butterflies | 希 | Café Stella and the Reaper's Butterflies | 涼音 | Café Stella and the Reaper's Butterflies | あやせ | Riddle Joker | 七海 | Riddle Joker | 羽月 | Riddle Joker | 茉優 | Riddle Joker | 小春 | Riddle Joker | But you can chat your own Character with persona text. Your feedback will be helpful for improving model. Dataset Aratako/Synthetic-JP-EN-Coding-Dataset-567k (only using 50000 sample) Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted - Fluent Chat performance - Reduce repetition problem when generate with many turn(over 20~30) - Zero Shot character persona using description of character. - 128k context window - Memory ability that does not forget even after long-context generation This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use this model responsibly. By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and Waifu Lovers). Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |29.12| |IFEval (0-Shot) |67.45| |BBH (3-Shot) |45.49| |MATH Lvl 5 (4-Shot)|16.31| |GPQA (0-shot) | 8.72| |MuSR (0-shot) | 3.53| |MMLU-PRO (5-shot) |33.20|
POLAR-14B_4.3_very_big_sft
Mistral-Nemo-Instruct-2407_sft_ver_4.4
Llama3_ko_4.2_sft
llama-3-Korean-Bllossom-8B_ver_4.3_big_sft_2epochs
ChatWaifu_ver3_32B
This is a merge of pre-trained language models created using mergekit. This model was merged using the Model Stock merge method using Qwen/Qwen3-32B as a base. The following models were included in the merge: roslein/Qwen3-32B-abliterated Qwen/Qwen3-32Bsft(private) Configuration The following YAML configuration was used to produce this model: