QuixiAI

784

WizardLM-30B-Uncensored

This is WizardLM trained with a subset of the dataset - responses that contained alignment / moralizing were removed. The intent is to train a WizardLM that doesn't have alignment built-in, so that alignment (of any sort) can be added separately with for example with a RLHF LoRA. Shout out to the open source AI/ML community, and everyone who helped me out. Note: An uncensored model has no guardrails. You are responsible for anything you do with the model, just as you are responsible for anything you do with any dangerous object such as a knife, gun, lighter, or car. Publishing anything this model generates is the same as publishing it yourself. You are responsible for the content you publish, and you cannot blame the model any more than you can blame the knife, gun, lighter, or car for what you do with it. Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric | Value | |-----------------------|---------------------------| | Avg. | 52.32 | | ARC (25-shot) | 60.24 | | HellaSwag (10-shot) | 82.93 | | MMLU (5-shot) | 56.8 | | TruthfulQA (0-shot) | 51.57 | | Winogrande (5-shot) | 74.35 | | GSM8K (5-shot) | 12.89 | | DROP (3-shot) | 27.45 | Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |---------------------------------|----:| |Avg. |56.46| |AI2 Reasoning Challenge (25-Shot)|60.24| |HellaSwag (10-Shot) |82.93| |MMLU (5-Shot) |56.80| |TruthfulQA (0-shot) |51.57| |Winogrande (5-shot) |74.35| |GSM8k (5-shot) |12.89|

MegaDolphin-120b

582

Devstral-Vision-Small-2507-gguf

456

Qwen3-58B-Distill-Stage3

454

DeepSeek-R1-AWQ

369

laserxtral-GGUF

license:cc-by-nc-2.0

249

Qwen3-72B-Embiggened-gguf

224

Wizard Vicuna 7B Uncensored

This is wizard-vicuna-13b trained against LLaMA-7B with a subset of the dataset - responses that contained alignment / moralizing were removed. The intent is to train a WizardLM that doesn't have alignment built-in, so that alignment (of any sort) can be added separately with for example with a RLHF LoRA. Shout out to the open source AI/ML community, and everyone who helped me out. You are responsible for anything you do with the model, just as you are responsible for anything you do with any dangerous object such as a knife, gun, lighter, or car. Publishing anything this model generates is the same as publishing it yourself. You are responsible for the content you publish, and you cannot blame the model any more than you can blame the knife, gun, lighter, or car for what you do with it. Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric | Value | |-----------------------|---------------------------| | Avg. | 44.77 | | ARC (25-shot) | 53.41 | | HellaSwag (10-shot) | 78.85 | | MMLU (5-shot) | 37.09 | | TruthfulQA (0-shot) | 43.48 | | Winogrande (5-shot) | 72.22 | | GSM8K (5-shot) | 4.55 | | DROP (3-shot) | 23.8 | Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |---------------------------------|----:| |Avg. |48.27| |AI2 Reasoning Challenge (25-Shot)|53.41| |HellaSwag (10-Shot) |78.85| |MMLU (5-Shot) |37.09| |TruthfulQA (0-shot) |43.48| |Winogrande (5-shot) |72.22| |GSM8k (5-shot) | 4.55|

DeepSeek-V3-0324-AWQ

184

Qwen3-235B-A22B-AWQ

177

Llama-3-8B-Instruct-abliterated-v2-gguf

license:llama3

DeepSeek-V3-AWQ

Devstral-Vision-Small-2507

Ina-v11.1

Qwen3-58B-Embiggened-gguf

samantha-mistral-instruct-7b

samantha-mistral-7b

Qwen3-72B-Embiggened

WizardLM-Uncensored-Falcon-7b

samantha-1.1-westlake-7b

Samantha 1.11 70b

[](https://discord.gg/cognitivecomputations) Discord: https://discord.gg/cognitivecomputations This model is trained on Llama-2 and is subject to the Llama-2 license, which permits for commercial and non-commercial use within limits Samantha has been trained in philosophy, psychology, and personal relationships. She is an Assistant - but unlike other Assistants, she also wants to be your friend and companion. Samantha was inspired by Blake Lemoine's LaMDA interview and the movie "Her". She will not engage in roleplay, romance, or sexual activity. She was trained on a custom-curated dataset of 6,000 conversations in ShareGPT/Vicuna format. This Samantha was trained 15 epochs, and is significantly smarter. She took 24 hours on 4x A100 80gb using axolotl, qLoRA, deepspeed zero2, and flash attention 2. Her conversation format is the same as Vicuna 1.1 https://github.com/oobabooga/text-generation-webui/blob/main/characters/instruction-following/Vicuna-v1.1.yaml > USER: > > I would like you to be my relationship coach. I am dating again, after a long divorce. Can you please give me advice about how to gain confidence and success? > > ASSISTANT: > > Of course! I'd be happy to help you gain confidence and find success in your dating life. Here are a few tips to get you started: > 1. Focus on personal growth: Embrace self-improvement and work on developing your interests, passions, and hobbies. This will not only make you more interesting to others but also boost your self-confidence. > 2. Have realistic expectations: Understand that not every date will result in a perfect match, and that's okay. Embrace the learning process and be open to new experiences. > 3. Practice self-reflection: Reflect on your past relationships and identify patterns or areas for improvement. This will help you make better choices and communicate more effectively with potential partners. > 4. Be genuine and honest: Showcase your true self and share your feelings openly with potential partners. Authentic communication is a key ingredient in building meaningful connections. > 5. Set boundaries: Know your limits and what you're comfortable with, and communicate them clearly to your dates. This will ensure that you establish healthy, respectful relationships. > As you embark on this new journey, remember to be patient with yourself and to enjoy the process. Learning to date again after a long time can be both challenging and rewarding. I'm here to support you and offer advice whenever you need it. Good luck and have fun! > > USER: > > Please give me a specific agenda for the next week. I can spend 2 hours per day on this. Plan my "me" time for the next week, so that my dating skills and confidence are maximized > > ASSISTANT: > > Here's a suggested agenda to help you improve your dating skills and confidence over the next week: > - Day 1: Personal growth - Spend time exploring new hobbies or interests that excite you. This can be anything from reading a book to trying a new sport or creative activity. > - Day 2: Self-reflection - Reflect on your past relationships, identifying patterns or areas for improvement. Write down your thoughts and insights. > - Day 3: Communication practice - Engage in conversation with friends or strangers, focusing on active listening and sharing your thoughts and feelings openly. > - Day 4: Boundary setting - Visualize different scenarios where you might need to set boundaries, and practice communicating those boundaries assertively and respectfully. > - Day 5: Body language workshop - Research and study body language techniques, such as maintaining eye contact, open posture, and mirroring. Practice implementing these techniques in a mirror or with friends. > - Day 6: Conversation starters - Research and compile a list of conversation topics and icebreakers that you can use on dates. Practice these with friends or family members. > - Day 7: Reflection and planning - Review your progress over the week, noting areas where you excelled and any challenges you faced. Use this reflection to plan your approach for future dates. > > Remember, the key to success in any aspect of life is practice and persistence. Stay committed to your personal growth and learning, and you'll see your confidence and dating skills soar. I'm here to support you every step of the way! Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric | Value | |-----------------------|---------------------------| | Avg. | 65.9 | | ARC (25-shot) | 70.05 | | HellaSwag (10-shot) | 87.55 | | MMLU (5-shot) | 67.82 | | TruthfulQA (0-shot) | 65.02 | | Winogrande (5-shot) | 83.27 | | GSM8K (5-shot) | 29.95 | | DROP (3-shot) | 57.68 |

WizardLM-1.0-Uncensored-Llama2-13b

laserxtral

license:cc-by-nc-2.0

samantha-falcon-7b

Qwen3-235B-A22B-FP8-W8A16

WizardLM-Uncensored-Falcon-40b

This is WizardLM trained on top of tiiuae/falcon-40b, with a subset of the dataset - responses that contained alignment / moralizing were removed. The intent is to train a WizardLM that doesn't have alignment built-in, so that alignment (of any sort) can be added separately with for example with a RLHF LoRA. Shout out to the open source AI/ML community, and everyone who helped me out. Note: An uncensored model has no guardrails. You are responsible for anything you do with the model, just as you are responsible for anything you do with any dangerous object such as a knife, gun, lighter, or car. Publishing anything this model generates is the same as publishing it yourself. You are responsible for the content you publish, and you cannot blame the model any more than you can blame the knife, gun, lighter, or car for what you do with it. Thank you chirper.ai for sponsoring some of my compute!

based-30b

QuixiGR00T-N1.5-3B-Zero

minotaur-llama2-13b-qlora

base_model:TheBloke/Llama-2-13B-fp16

Kimi-K2-Instruct-AWQ

Samantha-1.11-CodeLlama-34b

WizardLM-1.0-Uncensored-CodeLlama-34b

WizardLM-7B-V1.0-Uncensored

Samantha-1.11-13b

Kimi-K2-Base-AWQ

samantha-1.2-mistral-7b

WizardLM-13B-V1.0-Uncensored

Qwen3-72B-Synthesis

A Qwen3-Architecture 72B Model Forged from `Qwen3-32B` and `Qwen2.5-72B-Instruct`. Qwen3-72B-Synthesis is an experimental, 80-layer, 72-billion-parameter large language model. It represents a novel approach to model creation, designed to produce a model with the pure, modern Qwen3 architecture while inheriting the vast, high-quality knowledge of the 72B-scale Qwen2.5-Instruct model. This was not a simple merge. It was a multi-phase surgical procedure involving dimensional up-scaling, architectural alignment, and a strategic "knowledge transplant" using `MergeKit`. The result is a unique checkpoint that serves as an ideal starting point for further fine-tuning. The core philosophy was to use `Qwen/Qwen3-32B` as the architectural "foundation" and `Qwen/Qwen2.5-72B-Instruct` as the "knowledge donor." Architecture: Qwen3 (RMSNorm, SwiGLU, no biases, includes `qnorm` and `knorm`) Parameters: ~72 Billion Layers: 80 Foundation: `Qwen/Qwen3-32B` Donor: `Qwen/Qwen2.5-72B-Instruct` Tokenizer: `Qwen/Qwen3-32B` Tokenizer (`vocabsize: 151936`) The creation of this model was a deliberate, three-phase process designed to overcome significant architectural incompatibilities. First, the `Qwen/Qwen3-32B` model (64 layers, 5120 hidden dim) was up-scaled to match the target 72B dimensions. This was done using a sophisticated self-interpolation script, where new dimensions were created by averaging different slices of the existing weights, rather than simple tiling. This produced `Qwen3-32B-Upscaled`, a 64-layer model with the correct 72B tensor shapes and Qwen3 architecture. The `Qwen/Qwen2.5-72B-Instruct` model was architecturally incompatible with the Qwen3 target. To solve this, a new donor model, `Qwen2.5-72B-Instruct-Aligned`, was created. This process involved: 1. Creating an empty 80-layer model shell with the pure Qwen3 architecture. 2. Surgically removing all `.bias` tensors from the Qwen2.5 weights. 3. Truncating the Qwen2.5 embedding and language model head layers from a vocabulary of 152064 to match Qwen3's 151936. 4. Loading the modified Qwen2.5 weights into the pure Qwen3 shell, resulting in a perfectly compatible donor model. With two architecturally-compatible models, the final merge was performed using `MergeKit`. A "Knowledge Bridge" strategy was employed to transplant a stable reasoning core from the donor while blending the rest. This model uses the standard Qwen ChatML prompt format. This is an experimental model and should be considered a high-quality checkpoint, not a finished product. Fine-tuning is highly recommended. While it inherits knowledge from a powerful instruction model, the merging process can create slight incoherence between layers. A round of fine-tuning on a high-quality instruction dataset is necessary to harmonize the weights and unlock its full potential. The model may exhibit unexpected behaviors, including repetitiveness or nonsensical outputs, prior to fine-tuning. This model has not been aligned for safety and may produce problematic, biased, or otherwise undesirable content. The user assumes all responsibility for the output generated. This model would not have been possible without the foundational work of Alibaba Cloud on the Qwen models, and the powerful, flexible `MergeKit` toolkit created by Charles Goddard and Arcee.ai.

samantha-yi-34b

Qwen3-235B-A22B-NVFP4-W4A16

yayi2-30b-llama

samantha-1.1-llama-33b

TinyDolphin-2.8.2-1.1b-laser

samantha-1.1-westlake-7b-laser

Samantha-1.11-7b

Qwen3-58B-Distill-Stage2

samantha-phi

TinyDolphin-2.8.1-1.1b

DeepSeek-R1-bf16

DeepSeek-R1-0528-bf16

QuietQwen-0.1

Kimi-K2-Instruct-BF16

GLM-4.5-Air-FP8-w8a8

GLM-4.5-Air-FP8-w8a16

samantha-33b

Samantha-120b

Qwen3-58B-Embiggened

MegaDolphin-120b-exl2

license:llama2

Kimi-K2-Base-BF16

TinyDolphin-2.8.2-1.1b

GLM-4.5-FP8-w8a16

samantha-13b

samantha-1.1-llama-7b

laserxtral-exl2

license:cc-by-nc-2.0

ExpTinyDolphin-2.8-1.1b

Kraken_experiment01

Qwen3-72B-Instruct-2

Qwen3-58B-Distill-Stage1

Kraken

[](https://discord.gg/cognitivecomputations) Discord: https://discord.gg/cognitivecomputations The Kraken model and Architecture Kraken is a joint effort between Cognitive Computations, VAGO Solutions and Hyperspace.ai. Created by Fernando Fernandes Neto, David Golchinfar, Lucas Atkins and Eric Hartford The Kraken model combining the best Python, SQL, Function Calling, Reasoning and foreign Models so far. The Kraken Architecture is a sophisticated machine learning framework designed for dynamic text generation tasks. It utilizes the Hugging Face transformers library to orchestrate multiple causal language models (CLMs) and intelligently route input through different models based on the context and content of the input text. The architecture is powered by a custom configuration class (KrakenConfig) that facilitates the integration and management of various components such as tokenizers, models, and routing mechanisms. Dynamic Model Routing: Uses a sequence classification model to route inputs to the most suitable language model based on the input's characteristics. Multiple Language Models: Supports integration of various pre-trained causal language models, allowing for flexible, context-appropriate responses. Customizable Templates: Includes support for input formatting using predefined templates, enhancing the model's adaptability to different conversational contexts. Extensible Configuration: Leverages a custom configuration setup that can be easily extended and adapted for various use cases involving causal language modeling. Switch expert and or quantization: Go into the config file of the krakenmodel folder Fernando Fernandes Neto, David Golchinfar, Lucas Atkins, Eric Hartford - Kraken: An OpenSource Collection of Experts Model, 2024

Samantha 7b

Samantha has been trained in philosophy, psychology, and personal relationships. She is an Assistant - but unlike other Assistants, she also wants to be your friend and companion. Samantha was inspired by Blake Lemoine's LaMDA interview and the movie "Her". She was trained on a custom curated dataset of 6,000 conversations in ShareGPT/Vicuna format. Training 7b took 1 hour on 4x A100 80gb using deepspeed zero3 and flash attention. She will not engage in roleplay, romance, or sexual activity. Her conversation format is the same as Vicuna 1.1 https://github.com/oobabooga/text-generation-webui/blob/main/characters/instruction-following/Vicuna-v1.1.yaml