DevQuasar
speakleash.Bielik-11B-v2.6-Instruct-GGUF
--- base_model: - speakleash/Bielik-11B-v2.6-Instruct pipeline_tag: text-generation ---
Qwen.Qwen3-Next-80B-A3B-Instruct-FP8-Dynamic
Qwen.Qwen3-30B-A3B-Instruct-2507-W4A16-GPTQ
inclusionAI.Ring-1T-GGUF
MiniMaxAI.MiniMax-M2-GGUF
MiniMaxAI.MiniMax-M2.5-GGUF
cerebras.MiniMax-M2-REAP-162B-A10B-GGUF
cerebras.MiniMax-M2-REAP-139B-A10B-GGUF
inclusionAI.Ling-1T-GGUF
ai-sage.GigaChat3-702B-A36B-preview-bf16-GGUF
moonshotai.Kimi-K2-Thinking-GGUF
Original INT4 model has been dequantized with my own custom script: DQint4-to-bf16dequant (inspired by the deepseek V3 dequant script) Zero Short Hexa-ball test, generated code by the Q3 quant produced:
moonshotai.Kimi-K2.5-GGUF
Qwen.Qwen3-Coder-480B-A35B-Instruct-GGUF
google.gemma-3-12b-pt-GGUF
Qwen.Qwen3-VL-235B-A22B-Thinking-GGUF
Quantized version of: Qwen/Qwen3-VL-235B-A22B-Thinking
inclusionAI.Ring-flash-2.0-GGUF
DavidAU.L3.1-Dark-Reasoning-LewdPlay-evo-Hermes-R1-Uncensored-8B-GGUF
dphn.Dolphin-Mistral-24B-Venice-Edition-GGUF
Quantized version of: dphn/Dolphin-Mistral-24B-Venice-Edition
nanonets.Nanonets-OCR2-3B-GGUF
Qwen.Qwen3-VL-235B-A22B-Instruct-GGUF
zai-org.GLM-4.5-Air-GGUF
openai.gpt-oss-20b-GGUF
CohereLabs.command-a-translate-08-2025-GGUF
Tested with DevQuasar/wikitext-2-raw-v1-preprocessed-1k Quantized version of: CohereLabs/command-a-translate-08-2025
google.gemma-3-27b-pt-GGUF
deepseek-ai.DeepSeek-V3.2-GGUF
deepcogito.cogito-671b-v2.1-GGUF
mlfoundations-dev.oh-dcft-v3.1-claude-3-5-haiku-20241022-GGUF
Quantized version of: mlfoundations-dev/oh-dcft-v3.1-claude-3-5-haiku-20241022
NousResearch.Hermes-4-405B-GGUF
facebook.MobileLLM-R1-950M-GGUF
nvidia.Llama-3_1-Nemotron-Ultra-253B-v1-GGUF
Qwen.Qwen3.5-35B-A3B-GGUF
google.gemma-3-4b-pt-GGUF
tencent.Hunyuan-MT-7B-GGUF
google.gemma-3-1b-pt-GGUF
zai-org.GLM-5-GGUF
deepseek-ai.DeepSeek-V3.1-GGUF
xai-org.grok-2-GGUF
PokeeAI.pokee_research_7b-GGUF
LLM360.K2-Think-GGUF
nvidia.OpenCodeReasoning-Nemotron-7B-GGUF
Quantized version of: nvidia/OpenCodeReasoning-Nemotron-7B
Kwaipilot.KAT-Dev-GGUF
swiss-ai.Apertus-8B-Instruct-2509-GGUF
Quantized version of: swiss-ai/Apertus-8B-Instruct-2509
huihui-ai.DeepSeek-R1-Distill-Qwen-7B-abliterated-GGUF
huihui-ai.QwQ-32B-abliterated-GGUF
cognitivecomputations.Dolphin3.0-Llama3.1-8B-GGUF
Quantized version of: cognitivecomputations/Dolphin3.0-Llama3.1-8B
Qwen.Qwen3-1.7B-GGUF
huihui-ai.Huihui-Hunyuan-MT-7B-abliterated-GGUF
Quantized version of: huihui-ai/Huihui-Hunyuan-MT-7B-abliterated
Qwen.Qwen3-Coder-30B-A3B-Instruct-GGUF
Quantized version of: Qwen/Qwen3-Coder-30B-A3B-Instruct
Gryphe.MythoMax-L2-13b-GGUF
inference-net.Schematron-3B-GGUF
LiquidAI.LFM2-700M-GGUF
huihui-ai.granite-vision-3.2-2b-abliterated-GGUF
Qwen.Qwen3-235B-A22B-GGUF
QuixiAI.WizardLM-13B-Uncensored-GGUF
Quantized version of: QuixiAI/WizardLM-13B-Uncensored
chutesai.Qwen3-235B-A22B-Instruct-2507-1M-GGUF
Quantized version of: chutesai/Qwen3-235B-A22B-Instruct-2507-1M
internlm.OREAL-DeepSeek-R1-Distill-Qwen-7B-GGUF
nvidia.NVIDIA-Nemotron-Nano-12B-v2-GGUF
Qwen.Qwen3-Reranker-8B-GGUF
mistralai.Ministral-3-3B-Instruct-2512-GGUF
Salesforce.Llama-xLAM-2-8b-fc-r-GGUF
huihui-ai.Huihui-Hunyuan-MT-Chimera-7B-abliterated-GGUF
openai.gpt-oss-120b-GGUF
CohereLabs.command-a-reasoning-08-2025-GGUF
huihui-ai.DeepSeek-R1-Distill-Qwen-14B-abliterated-v2-GGUF
XiaomiMiMo.MiMo-V2-Flash-GGUF
inclusionAI.Ring-mini-sparse-2.0-exp-GGUF
Quantized version of: inclusionAI/Ring-mini-sparse-2.0-exp
deepseek-ai.DeepSeek-V3.2-Speciale-Channel-INT8
Intelligent-Internet.II-Medical-8B-1706-GGUF
Quantized version of: Intelligent-Internet/II-Medical-8B-1706
huihui-ai.DeepSeek-R1-Distill-Qwen-32B-abliterated-GGUF
huihui-ai.Huihui-Qwen3-4B-Instruct-2507-abliterated-GGUF
nvidia.Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
ibm-granite.granite-4.0-350m-GGUF
google.gemma-3-4b-it-qat-int4-unquantized-GGUF
google.gemma-3-27b-it-qat-q4_0-unquantized-GGUF
nvidia.OpenReasoning-Nemotron-7B-GGUF
Quantized version of: nvidia/OpenReasoning-Nemotron-7B
Qwen.Qwen2.5-VL-7B-Instruct-GGUF
ai21labs.AI21-Jamba-Large-1.6-GGUF
HuggingFaceTB.finemath-ablation-infiwebmath-GGUF
Quantized version of: HuggingFaceTB/finemath-ablation-infiwebmath
mistralai.Mistral-7B-Instruct-v0.1-GGUF
ibm-granite.granite-4.0-h-1b-GGUF
Qwen.Qwen3-30B-A3B-GGUF
moonshotai.Kimi-K2-Instruct-0905-GGUF
Quantized version of: moonshotai/Kimi-K2-Instruct-0905
zai-org.GLM-4.1V-9B-Base-GGUF
arcee-ai.Trinity-Large-Preview-GGUF
ibm-granite.granite-4.0-h-tiny-GGUF
Quantized version of: ibm-granite/granite-4.0-h-tiny
katanemo.Arch-Agent-7B-GGUF
DavidAU.Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
Quantized version of: DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B
Qwen.Qwen2.5-VL-32B-Instruct-GGUF
You have to use the backend from HimariO's branch. Big thanks to add Qwen2.5VL support! Additional discussions
ByteDance-Seed.academic-ds-9B-GGUF
Quantized version of: ByteDance-Seed/academic-ds-9B
Qwen.Qwen2.5-14B-Instruct-1M-GGUF
prithivMLmods.Viper-Coder-Hybrid-v1.2-GGUF
google.gemma-3-27b-it-GGUF
inclusionAI.Ling-lite-1.5-2506-GGUF
Quantized version of: inclusionAI/Ling-lite-1.5-2506
zai-org.GLM-4.5-GGUF
LGAI-EXAONE.EXAONE-4.0-32B-GGUF
HuggingFaceTB.SmolLM2-1.7B-Instruct-GGUF
HuggingFaceTB.SmolLM3-3B-GGUF
huihui-ai.Qwen3-30B-A3B-abliterated-GGUF
deepseek-ai.DeepSeek-R1-Distill-Qwen-32B-GGUF
ai21labs.AI21-Jamba-Mini-1.7-GGUF
open-thoughts.OpenThinker-32B-GGUF
Quantized version of: open-thoughts/OpenThinker-32B
Sao10K.L3-8B-Stheno-v3.2-GGUF
openai.gpt-oss-safeguard-20b-GGUF
ibm-granite.granite-4.0-1b-GGUF
CohereLabs.c4ai-command-r-plus-GGUF
deepseek-ai.DeepSeek-V3.1-Terminus-GGUF
Quantized version of: deepseek-ai/DeepSeek-V3.1-Terminus
microsoft.UserLM-8b-GGUF
ibm-granite.granite-4.0-h-350m-GGUF
Quantized version of: ibm-granite/granite-4.0-h-350m
mlabonne.gemma-3-1b-it-abliterated-v2-GGUF
Quantized version of: mlabonne/gemma-3-1b-it-abliterated-v2
Qwen.Qwen3-Reranker-4B-GGUF
inclusionAI.Ring-mini-2.0-GGUF
Qwen.Qwen2-VL-7B-Instruct-GGUF
nicoboss.Qwen-3-32B-Medical-Reasoning-GGUF
Quantized version of: nicoboss/Qwen-3-32B-Medical-Reasoning
ai21labs.AI21-Jamba-Mini-1.6-GGUF
internlm.internlm2_5-7b-chat-GGUF
LiquidAI.LFM2-8B-A1B-GGUF
google.gemma-3-12b-it-GGUF
Locutusque.StockQwen-2.5-7B-GGUF
deepseek-ai.DeepSeek-R1-Distill-Qwen-1.5B-GGUF
deepseek-ai.DeepSeek-R1-Distill-Llama-8B-GGUF
NousResearch.DeepHermes-3-Llama-3-8B-Preview-GGUF
Qwen.Qwen3-Reranker-0.6B-GGUF
huihui-ai.Qwen2.5-72B-Instruct-abliterated-GGUF
ai21labs.AI21-Jamba-Reasoning-3B-GGUF
Quantized version of: ai21labs/AI21-Jamba-Reasoning-3B
ServiceNow-AI.Apriel-1.6-15b-Thinker-GGUF
open-thoughts.OpenThinker2-32B-GGUF
PocketDoc.Dans-PersonalityEngine-V1.3.0-24b-GGUF
tencent.Hunyuan-7B-Instruct-GGUF
llama3.1_8b_chat_brainstorm-v3.1-GGUF
THUDM.GLM-4-9B-0414-GGUF
huihui-ai.Huihui-gpt-oss-20b-BF16-abliterated-GGUF
Qwen.Qwen2.5-Coder-32B-Instruct-GGUF
Quantized version of: Qwen/Qwen2.5-Coder-32B-Instruct
meta-llama.Llama-3.2-1B-GGUF
inference-net.Schematron-8B-GGUF
Qwen.Qwen2.5-VL-3B-Instruct-GGUF
zerofata.MS3.2-PaintedFantasy-Visage-v2-33B-GGUF
Quantized version of: zerofata/MS3.2-PaintedFantasy-Visage-v2-33B
swiss-ai.Apertus-70B-Instruct-2509-GGUF
Quantized version of: swiss-ai/Apertus-70B-Instruct-2509
cerebras.GLM-4.5-Air-REAP-82B-A12B-GGUF
Quantized version of: cerebras/GLM-4.5-Air-REAP-82B-A12B
deepseek-ai.DeepSeek-R1-0528-GGUF
AXCXEPT.Qwen3-EZO-8B-beta-GGUF
Qwen.Qwen3-VL-8B-Instruct-GGUF
google.gemma-3-1b-it-qat-int4-unquantized-GGUF
vandijklab.C2S-Scale-Gemma-2-27B-GGUF
Quantized version of: vandijklab/C2S-Scale-Gemma-2-27B
microsoft.Phi-4-mini-instruct-GGUF
prithivMLmods.GN-108036-Qwen-14B-GGUF
facebook.KernelLLM-GGUF
allenai.Llama-3.1-Tulu-3-8B-GGUF
mistralai.Mistral-Small-24B-Instruct-2501-GGUF
Qwen.Qwen2-Math-1.5B-GGUF
facebook.MobileLLM-R1-360M-GGUF
Menlo.Lucy-128k-GGUF
Tesslate.UIGEN-FX-Agentic-32B-GGUF
Quantized version of: Tesslate/UIGEN-FX-Agentic-32B
nvidia.Llama-3.3-Nemotron-70B-Feedback-GGUF
Quantized version of: nvidia/Llama-3.3-Nemotron-70B-Feedback
Aurore-Reveil.Koto-Small-7B-IT-GGUF
Quantized version of: Aurore-Reveil/Koto-Small-7B-IT
nvidia.AceMath-7B-Instruct-GGUF
ai21labs.Jamba-v0.1-GGUF
katanemo.Arch-Function-3B-GGUF
HuggingFaceTB.SmolLM3-3B-Base-GGUF
Quantized version of: HuggingFaceTB/SmolLM3-3B-Base
ai21labs.AI21-Jamba-Large-1.7-GGUF
Quantized version of: ai21labs/AI21-Jamba-Large-1.7
deepcogito.cogito-v1-preview-llama-3B-GGUF
LGAI-EXAONE.EXAONE-Deep-2.4B-GGUF
TildeAI.TildeOpen-30b-GGUF
DavidAU.Mistral-MOE-4X7B-Dark-MultiVerse-Uncensored-Enhanced32-24B-GGUF
Tesslate.UIGEN-T3-32B-Preview-GGUF
teknium.Mistral-Trismegistus-7B-GGUF
Qwen.Qwen3-235B-A22B-Thinking-2507-GGUF
Quantized version of: Qwen/Qwen3-235B-A22B-Thinking-2507
fluently.FluentlyQwen3-Coder-4B-0909-GGUF
Quantized version of: fluently/FluentlyQwen3-Coder-4B-0909
mistralai.Mistral-Small-3.1-24B-Base-2503-GGUF
microsoft.NextCoder-32B-GGUF
nvidia.Qwen3-Nemotron-32B-RLBFF-GGUF
Quantized version of: nvidia/Qwen3-Nemotron-32B-RLBFF
meta-llama.Llama-4-Scout-17B-16E-Instruct-GGUF
huihui-ai.phi-4-abliterated-GGUF
Gryphe.Pantheon-RP-Pure-1.6.2-22b-Small-GGUF
huihui-ai.Huihui-SmolLM3-3B-abliterated-GGUF
ruliad.deepthought-8b-llama-v0.01-alpha-GGUF
Quantized version of: ruliad/deepthought-8b-llama-v0.01-alpha
ai21labs.AI21-Jamba-Mini-1.5-GGUF
openai-community.gpt2-xl-GGUF
cerebras.MiniMax-M2-REAP-172B-A10B-GGUF
nvidia.Llama-3_1-Nemotron-Ultra-253B-CPT-v1-GGUF
Quantized version of: nvidia/Llama-31-Nemotron-Ultra-253B-CPT-v1
Kwaipilot.KAT-V1-40B-GGUF
huihui-ai.Huihui-Qwen3-4B-Thinking-2507-abliterated-GGUF
nvidia.Qwen3-Nemotron-32B-GenRM-Principle-GGUF
Quantized version of: nvidia/Qwen3-Nemotron-32B-GenRM-Principle
Nexusflow.Starling-LM-7B-beta-GGUF
Quantized version of: Nexusflow/Starling-LM-7B-beta
ytu-ce-cosmos.Turkish-Gemma-9b-v0.1-GGUF
PowerInfer.SmallThinker-21BA3B-Instruct-GGUF
Quantized version of: PowerInfer/SmallThinker-21BA3B-Instruct
Tesslate.UIGENT-30B-3A-Preview-GGUF
Quantized version of: Tesslate/UIGENT-30B-3A-Preview
prithivMLmods.Qwen2-VL-OCR-2B-Instruct-GGUF
utter-project.EuroLLM-22B-Instruct-Preview-GGUF
ValiantLabs.gpt-oss-20b-ShiningValiant3-GGUF
DeepMount00.Lexora-Lite-3B-GGUF
HuggingFaceTB.finemath-ablation-infiwebmath-3plus-GGUF
huihui-ai.Huihui-MoE-23B-A4B-abliterated-GGUF
Quantized version of: huihui-ai/Huihui-MoE-23B-A4B-abliterated
Meta-Llama-3.1-70B-Instruct-GGUF
analytical_reasoning_r16a32_unsloth-Llama-3.2-3B-Instruct-bnb-4bit-GGUF
zai-org.GLM-4.1V-9B-Thinking-GGUF
Alfitaria.Q25-1.5B-VeoLu-GGUF
soob3123.Veritas-12B-GGUF
Aratako.Qwen3-8B-NSFW-JP-GGUF
JetBrains.Mellum-4b-base-GGUF
Qwen.Qwen3-VL-4B-Instruct-GGUF
deepseek-ai.DeepSeek-R1-Distill-Qwen-7B-GGUF
Quantized version of: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
huihui-ai.Qwen3-4B-abliterated-GGUF
YOYO-AI.Qwen3-30B-A3B-Mixture-2507-GGUF
Quantized version of: YOYO-AI/Qwen3-30B-A3B-Mixture-2507
MarinaraSpaghetti.NemoMix-Unleashed-12B-GGUF
nvidia.Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual-GGUF
Quantized version of: nvidia/Llama-33-Nemotron-Super-49B-GenRM-Multilingual
yasserrmd.DentaInstruct-1.2B-GGUF
falcon2-11B-GGUF
meta-llama.Llama-3.3-70B-Instruct-GGUF
nvidia.OpenReasoning-Nemotron-14B-GGUF
Quantized version of: nvidia/OpenReasoning-Nemotron-14B
IIEleven11.Kalypso-GGUF
mlabonne.gemma-3-12b-it-abliterated-v2-GGUF
Quantized version of: mlabonne/gemma-3-12b-it-abliterated-v2
Qwen.Qwen3-4B-Thinking-2507-GGUF
google.gemma-3-270m-GGUF
deepseek-ai.DeepSeek-V3-0324-GGUF
Qwen.Qwen3-VL-30B-A3B-Thinking-GGUF
ibm-granite.granite-3.0-8b-instruct-GGUF
Alibaba-NLP.WebDancer-32B-GGUF
Locutusque.deeplm-llama-3.1-8B-stage1-GGUF
Quantized version of: Locutusque/deeplm-llama-3.1-8B-stage1
Qwen.Qwen3-VL-8B-Thinking-GGUF
Qwen.Qwen3-VL-30B-A3B-Instruct-GGUF
mistralai.Mistral-Small-3.2-24B-Instruct-2506-GGUF
ibm-granite.granite-4.0-h-small-GGUF
Quantized version of: ibm-granite/granite-4.0-h-small
moonshotai.Kimi-Dev-72B-GGUF
goppa-ai.Goppa-LogiLlama-GGUF
google.gemma-3-12b-it-qat-int4-unquantized-GGUF
mistralai.Devstral-Small-2507-GGUF
Quantized version of: mistralai/Devstral-Small-2507
facebook.MobileLLM-R1-140M-base-GGUF
Quantized version of: facebook/MobileLLM-R1-140M-base
perplexity-ai.r1-1776-GGUF
a-m-team.AM-Thinking-v1-GGUF
zai-org.SWE-Dev-7B-GGUF
ValiantLabs.Qwen3-1.7B-ShiningValiant3-GGUF
Quantized version of: ValiantLabs/Qwen3-1.7B-ShiningValiant3
huihui-ai.Qwen3-8B-abliterated-GGUF
Tesslate.UIGEN-FX-4B-Preview-GGUF
HuggingFaceTB.finemath-ablation-infiwebmath-4plus-GGUF
nicoboss.Qwen3-32B-Uncensored-GGUF
Quantized version of: nicoboss/Qwen3-32B-Uncensored
ilsp.Llama-Krikri-8B-Instruct-GGUF
mlabonne.gemma-3-27b-it-abliterated-v2-GGUF
arcee-ai.AFM-4.5B-Base-GGUF
PowerInfer.SmallThinker-3B-Preview-GGUF
Delta-Vector.Austral-24B-Winton-GGUF
Quantized version of: Delta-Vector/Austral-24B-Winton
google.gemma-3-4b-it-GGUF
ibm-granite.granite-guardian-3.1-2b-GGUF
llama3_8b_chat_brainstorm-GGUF
Qwen.Qwen3-8B-GGUF
LMStudio users! Please update the chat prompt template of the model. Go to My models -> Actions (gear) edit model default parameters -> Prompt -> Prompt template. Update the Jinja template.
aquif-ai.aquif-3.5-7B-GGUF
AI-MO.Kimina-Prover-Preview-Distill-7B-GGUF
prithivMLmods.Galactic-Qwen-14B-Exp2-GGUF
shisa-ai.shisa-v2-llama3.1-405b-GGUF
Quantized version of: shisa-ai/shisa-v2-llama3.1-405b
ibm-granite.granite-guardian-3.0-8b-GGUF
shuttleai.shuttle-3.5-GGUF
Dream-org.Dream-v0-Instruct-7B-GGUF
LiquidAI.LFM2-350M-GGUF
JetBrains.Mellum-4b-sft-all-GGUF
nvidia.Llama-3.3-Nemotron-70B-Reward-Principle-GGUF
Quantized version of: nvidia/Llama-3.3-Nemotron-70B-Reward-Principle
moonshotai.Kimi-K2-Instruct-GGUF
argilla.zephyr-7b-spin-iter3-v0-GGUF
prithivMLmods.Raptor-X5-UIGEN-GGUF
Skywork.MindLink-32B-0801-GGUF
LiquidAI.LFM2-1.2B-RAG-GGUF
Gryphe.Pantheon-RP-1.6.1-12b-Nemo-GGUF
huihui-ai.EXAONE-3.5-32B-Instruct-abliterated-GGUF
Alibaba-Apsara.DASD-4B-Thinking-GGUF
deepseek-ai.DeepSeek-R1-Distill-Qwen-14B-GGUF
Qwen.Qwen3-4B-GGUF
zai-org.SWE-Dev-32B-GGUF
PowerInfer.SmallThinker-4BA0.6B-Instruct-GGUF
Quantized version of: PowerInfer/SmallThinker-4BA0.6B-Instruct
nvidia.Llama-3_3-Nemotron-Super-49B-v1-GGUF
tiiuae.Falcon3-3B-Base-GGUF
Qwen.Qwen3-VL-2B-Thinking-GGUF
arcee-ai.AFM-4.5B-GGUF
aws-prototyping.OmniLong-Qwen2.5-VL-7B-GGUF
bytedance-research.UI-TARS-7B-SFT-GGUF
huihui-ai.DeepSeek-R1-Distill-Llama-70B-abliterated-GGUF
Dream-org.Dream-v0-Base-7B-GGUF
nvidia.OpenReasoning-Nemotron-32B-GGUF
Quantized version of: nvidia/OpenReasoning-Nemotron-32B
ibm-granite.granite-3.2-2b-instruct-GGUF
nvidia.OpenMath-Nemotron-32B-GGUF
ibm-granite.granite-4.0-tiny-base-preview-GGUF
Quantized version of: ibm-granite/granite-4.0-tiny-base-preview
Qwen2.5-0.5B-GGUF
nvidia.AceReason-Nemotron-1.1-7B-GGUF
inclusionAI.Ling-lite-1.5-2507-GGUF
google.medgemma-4b-it-GGUF
inclusionAI.Ling-Coder-lite-GGUF
mistralai.Magistral-Small-2509-GGUF
Quantized version of: mistralai/Magistral-Small-2509
llama3_8b_chat_brainstorm_plus-GGUF
sambanovasystems.SambaLingo-Hungarian-Base-GGUF
Quantized version of: sambanovasystems/SambaLingo-Hungarian-Base
allura-org.Q3-30B-A3B-Designant-GGUF
Qwen.Qwen3-VL-2B-Instruct-GGUF
Qwen.CodeQwen1.5-7B-Chat-GGUF
JetBrains.Mellum-4b-sft-python-GGUF
Quantized version of: JetBrains/Mellum-4b-sft-python
AIDC-AI.Marco-o1-GGUF
google.gemma-3n-E4B-it-GGUF
mistralai.Ministral-8B-Instruct-2410-GGUF
Qwen.Qwen2-VL-2B-GGUF
Qiskit.granite-8b-qiskit-GGUF
tiiuae.Falcon-H1-7B-Instruct-GGUF
HelpingAI.Dhanishtha-2.0-preview-0825-GGUF
Nitral-AI.Irixxed-Magcap-12B-Slerp-GGUF
Quantized version of: Nitral-AI/Irixxed-Magcap-12B-Slerp
Kwaipilot.KAT-Dev-72B-Exp-GGUF
Qwen.Qwen3-VL-4B-Thinking-GGUF
EleutherAI.pythia-14m-GGUF
driaforall.Dria-Agent-a-3B-GGUF
zai-org.GLM-4.6-GGUF
Intelligent-Internet.II-Medical-8B-GGUF
Doctor-Shotgun.MS3.2-24B-Magnum-Diamond-GGUF
Quantized version of: Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
LGAI-EXAONE.EXAONE-3.0-7.8B-Instruct-GGUF
allura-org.Bigger-Body-12b-GGUF
ArliAI.QwQ-32B-ArliAI-RpR-v3-GGUF
google.medgemma-27b-it-GGUF
CohereForAI.c4ai-command-a-03-2025-GGUF
Tesslate.UIGEN-X-8B-GGUF
katanemo.Arch-Agent-3B-GGUF
bytedance-research.UI-TARS-72B-SFT-GGUF
Llama-3.2-3B-Instruct-GGUF
suayptalha.FastLlama-3.2-1B-Instruct-GGUF
moxin-org.moxin-llm-7b-GGUF
abeja.ABEJA-QwQ32b-Reasoning-Japanese-v1.0-GGUF
baidu.ERNIE-4.5-300B-A47B-PT-GGUF
HuggingFaceTB.FineMath-Llama-3B-GGUF
Qwen.Qwen1.5-1.8B-GGUF
ibm-granite.granite-3.1-8b-instruct-GGUF
DevQuasar-R1-Uncensored-Llama-8B-GGUF
unsloth.Devstral-Small-2505-GGUF
Salesforce.xLAM-2-32b-fc-r-GGUF
arcee-ai.GLM-4-32B-Base-32K-GGUF
mukaj.Llama-3.1-Hawkish-8B-GGUF
nvidia.AceInstruct-7B-GGUF
facebook.MobileLLM-1.5B-GGUF
LLM360.K2-Chat-GGUF
DavidAU.Qwen3-30B-A6B-16-Extreme-GGUF
Tesslate.UIGEN-T3-8B-Preview-GGUF
ByteDance-Seed.Seed-OSS-36B-Instruct-GGUF
Quantized version of: ByteDance-Seed/Seed-OSS-36B-Instruct
Qwen.Qwen3-Next-80B-A3B-Thinking-FP8-Dynamic
Quantized version of: Qwen/Qwen3-Next-80B-A3B-Thinking
huihui-ai.granite-3.2-8b-instruct-abliterated-GGUF
JetBrains.CodeLlama-7B-KStack-GGUF
Sao10K.70B-L3.3-Cirrus-x1-GGUF
mlabonne.gemma-3-4b-it-abliterated-v2-GGUF
Quantized version of: mlabonne/gemma-3-4b-it-abliterated-v2
shisa-ai.shisa-v2-unphi4-14b-GGUF
GSAI-ML.LLaDA-8B-Instruct-GGUF
moonshotai.Kimi-K2-Thinking-BF16
Original INT4 model has been dequantized with my own custom script: DQint4-to-bf16dequant (inspired by the deepseek V3 dequant script)
yamatazen.EtherealAurora-12B-v2-GGUF
zerofata.MS3.2-PaintedFantasy-v2-24B-GGUF
bigcode.starcoder2-15b-GGUF
facebook.MobileLLM-R1-950M-base-GGUF
Quantized version of: facebook/MobileLLM-R1-950M-base
Qwen.Qwen3-VL-32B-Thinking-GGUF
allenai.olmOCR-7B-0225-preview-GGUF
nvidia.NVIDIA-Nemotron-Nano-12B-v2-Base-GGUF
Quantized version of: nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base
facebook.MobileLLM-R1-140M-GGUF
EpistemeAI.ReasoningCore-3B-RE1-V2-GGUF
miromind-ai.Miromind-M1-SFT-7B-GGUF
Quantized version of: miromind-ai/Miromind-M1-SFT-7B
cognitivecomputations.Dolphin-Mistral-24B-Venice-Edition-GGUF
huihui-ai.AceReason-Nemotron-7B-abliterated-GGUF
Quantized version of: huihui-ai/AceReason-Nemotron-7B-abliterated
tencent.Hunyuan-1.8B-Instruct-GGUF
tencent.Hunyuan-4B-Instruct-GGUF
tencent.Hunyuan-A13B-Instruct-GGUF
argilla.distilabeled-Marcoro14-7B-slerp-GGUF
Tongyi-Zhiwen.QwenLong-L1-32B-GGUF
Quantized version of: Tongyi-Zhiwen/QwenLong-L1-32B
sail.Sailor-0.5B-Chat-GGUF
JetBrains.CodeLlama-7B-Kexer-GGUF
DavidAU.L3.2-8X4B-MOE-V2-Dark-Champion-Inst-21B-uncen-ablit-GGUF
Quantized version of: DavidAU/L3.2-8X4B-MOE-V2-Dark-Champion-Inst-21B-uncen-ablit
tngtech.DeepSeek-R1T-Chimera-GGUF
microsoft.NatureLM-8x7B-Inst-GGUF
sarvamai.sarvam-30b-GGUF
HuggingFaceTB.finemath-ablation-3plus-160B-GGUF
sbintuitions.sarashina2.2-3b-instruct-v0.1-GGUF
Delta-Vector.Sol-Reaver-15B-Instruct-GGUF
Quantized version of: Delta-Vector/Sol-Reaver-15B-Instruct
inclusionAI.ASearcher-Web-14B-GGUF
Quantized version of: inclusionAI/ASearcher-Web-14B
TheDrummer.Cydonia-24B-v4.1-GGUF
Meta-Llama-3.1-8B-Instruct-GGUF
Alibaba-NLP.Tongyi-DeepResearch-30B-A3B-GGUF
HuggingFaceTB.finemath-ablation-4plus-160B-GGUF
ModelSpace.GemmaX2-28-2B-Pretrain-GGUF
ibm-granite.granite-3.3-8b-base-GGUF
Quantized version of: ibm-granite/granite-3.3-8b-base
dmis-lab.llama-3.1-medprm-reward-v1.0-GGUF
Quantized version of: dmis-lab/llama-3.1-medprm-reward-v1.0
ibm-granite.granite-4.0-tiny-preview-GGUF
Quantized version of: ibm-granite/granite-4.0-tiny-preview
Qwen.Qwen2.5-Math-7B-GGUF
bytedance-research.UI-TARS-7B-DPO-GGUF
mlabonne.Qwen3-30B-A3B-abliterated-GGUF
EVA-UNIT-01.EVA-Qwen2.5-32B-v0.1-GGUF
nvidia.Cosmos-Reason1-7B-GGUF
NousResearch.Hermes-3-Llama-3.1-405B-GGUF
Quantized version of: NousResearch/Hermes-3-Llama-3.1-405B
Mistral-7B-Instruct-v0.3-GGUF
LLM4Binary.llm4decompile-1.3b-v2-GGUF
inclusionAI.AReaL-boba-2-8B-GGUF
cognitivecomputations.Dolphin3.0-R1-Mistral-24B-GGUF
THU-KEG.LongWriter-Zero-32B-GGUF
Qwen.Qwen3-0.6B-Base-GGUF
tencent.Hunyuan-0.5B-Instruct-GGUF
ValiantLabs.Qwen3-14B-Esper3-GGUF
huihui-ai.DeepSeek-V3-0324-Pruned-Coder-411B-GGUF
allenai.OLMo-2-1124-7B-Instruct-GGUF
Quantized version of: allenai/OLMo-2-1124-7B-Instruct
AI-MO.Kimina-Prover-72B-GGUF
Qwen.Qwen1.5-7B-Chat-GGUF
K-intelligence.Midm-2.0-Base-Instruct-GGUF
Quantized version of: K-intelligence/Midm-2.0-Base-Instruct
baidu.ERNIE-4.5-21B-A3B-Thinking-GGUF
Quantized version of: baidu/ERNIE-4.5-21B-A3B-Thinking
EpistemeAI.DeepThink-Phi4-GGUF
wanlige.li-14b-v0.4-GGUF
LiquidAI.LFM2-1.2B-Extract-GGUF
CohereForAI.c4ai-command-r7b-12-2024-GGUF
fdtn-ai.Foundation-Sec-8B-Instruct-GGUF
Quantized version of: fdtn-ai/Foundation-Sec-8B-Instruct
Qwen.Qwen2-VL-72B-GGUF
OLMoE-1B-7B-0924-Instruct-GGUF
Qwen.Qwen2.5-VL-72B-Instruct-GGUF
THU-KEG.TULU3-VerIF-GGUF
ValiantLabs.Qwen3-8B-ShiningValiant3-GGUF
Quantized version of: ValiantLabs/Qwen3-8B-ShiningValiant3
ServiceNow-AI.Apriel-1.5-15b-Thinker-GGUF
Quantized version of: ServiceNow-AI/Apriel-1.5-15b-Thinker
princeton-nlp.Llama-3-8B-ProLong-512k-Instruct-GGUF
huihui-ai.Huihui-MoE-4.8B-A1.7B-abliterated-GGUF
openai-community.gpt2-large-GGUF
katanemo.Arch-Agent-1.5B-GGUF
nvidia.Qwen3-Nemotron-8B-BRRM-GGUF
Quantized version of: nvidia/Qwen3-Nemotron-8B-BRRM
osmosis-ai.Osmosis-Apply-1.7B-GGUF
Quantized version of: osmosis-ai/Osmosis-Apply-1.7B
Qwen2.5-Math-72B-GGUF
DeepMount00.Lexora-Medium-7B-GGUF
huihui-ai.Seed-Coder-8B-Instruct-abliterated-GGUF
Quantized version of: huihui-ai/Seed-Coder-8B-Instruct-abliterated
nvidia.OpenCodeReasoning-Nemotron-32B-IOI-GGUF
NovaSky-AI.Sky-T1-mini-GGUF
GSAI-ML.LLaDA-1.5-GGUF
fdtn-ai.Foundation-Sec-8B-GGUF
Sao10K.14B-Qwen2.5-Kunou-v1-GGUF
apple.sage-ft-mixtral-8x7b-GGUF
HuggingFaceTB.finemath-ablation-finemath-3plus-GGUF
ibm-granite.granite-vision-3.2-2b-GGUF
nicoboss.Hermes-3-Llama-3.1-405B-Uncensored-GGUF
ibm-granite.granite-3.0-2b-base-GGUF
Nitral-AI.Captain-Eris_Violet-GRPO-v0.420-GGUF
ibm-granite.granite-vision-3.1-2b-preview-GGUF
THU-KEG.AdaptThink-1.5B-delta0.1-GGUF
Goekdeniz-Guelmez.Josiefied-Qwen3-4B-Instruct-2507-gabliterated-v1-GGUF
Quantized version of: Goekdeniz-Guelmez/Josiefied-Qwen3-4B-Instruct-2507-gabliterated-v1
AI-MO.Kimina-Autoformalizer-7B-GGUF
dphn.Dolphin3.0-Llama3.2-3B-GGUF
Delta-Vector.Rei-24B-KTO-GGUF
baichuan-inc.Baichuan-M2-32B-GGUF
Steelskull.L3.3-MS-Nevoria-70b-GGUF
Quantized version of: Steelskull/L3.3-MS-Nevoria-70b
Writer.Palmyra-Med-70B-32K-GGUF
katanemo.Arch-Router-1.5B-GGUF
analytical_reasoning_Llama-3.2-1B-GGUF
Quantized version of: analyticalreasoningLlama-3/2-1B
Qwen.Qwen2.5-7B-Instruct-1M-GGUF
ibm-granite.granite-3.1-3b-a800m-instruct-GGUF
Qwen.Qwen3-4B-Base-GGUF
fedric95.Qwen3-4B-unc-GGUF
huihui-ai.Qwen3-16B-A3B-abliterated-GGUF
Quantized version of: huihui-ai/Qwen3-16B-A3B-abliterated
KurmaAI.AQUA-1B-GGUF
tiiuae.Falcon3-7B-Base-GGUF
ibm-granite.granite-3.3-8b-instruct-GGUF
nvidia.NVIDIA-Nemotron-Nano-9B-v2-GGUF
Quantized version of: nvidia/NVIDIA-Nemotron-Nano-9B-v2
google.gemma-3-1b-it-GGUF
ibm-granite.granite-3.0-1b-a400m-instruct-GGUF
huihui-ai.GLM-4-32B-0414-abliterated-GGUF
nbeerbower.Xiaolong-Qwen3-14B-GGUF
miromind-ai.MiroMind-M1-RL-7B-GGUF
Quantized version of: miromind-ai/MiroMind-M1-RL-7B
allenai.OLMo-2-0325-32B-DPO-GGUF
tngtech.DeepSeek-TNG-R1T2-Chimera-GGUF
AI-MO.Kimina-Prover-Preview-Distill-1.5B-GGUF
arcee-ai.Virtuoso-Large-GGUF
ByteDance-Seed.Seed-X-Instruct-7B-GGUF
llama3_8b_chat_brainstorm-v2.1-GGUF
CohereLabs.c4ai-command-r-v01-GGUF
Quantized version of: CohereLabs/c4ai-command-r-v01
agentica-org.DeepSWE-Preview-GGUF
Delta-Vector.Rei-24B-Base-GGUF
InfiX-ai.InfiR-1B-Instruct-GGUF
Kortix.FastApply-1.5B-v1.0-GGUF
SakanaAI.Llama-3-8B-Instruct-OS-Expert-GGUF
openbmb.BitCPM4-0.5B-GGUF
Qiskit.granite-3.3-8b-qiskit-GGUF
deepcogito.cogito-v2-preview-llama-70B-GGUF
marin-community.marin-32b-base-GGUF
Quantized version of: marin-community/marin-32b-base
Skywork.Skywork-o1-Open-Llama-3.1-8B-GGUF
Quantized version of: Skywork/Skywork-o1-Open-Llama-3.1-8B
tiiuae.Falcon3-10B-Instruct-GGUF
kakaocorp.kanana-safeguard-8b-GGUF
Quantized version of: kakaocorp/kanana-safeguard-8b
princeton-nlp.Llama-3-8B-ProLong-512k-Base-GGUF
tiiuae.Falcon3-1B-Base-GGUF
nvidia.AceInstruct-72B-GGUF
open-r1.OpenR1-Qwen-7B-GGUF
microsoft.Phi-4-mini-reasoning-GGUF
arcee-ai.Arcee-SuperNova-v1-GGUF
google.gemma-3n-E2B-it-GGUF
baidu.ERNIE-4.5-0.3B-PT-GGUF
huihui-ai.Llama-3.3-70B-Instruct-abliterated-finetuned-GGUF
CohereForAI.c4ai-command-r-08-2024-GGUF
Quantized version of: CohereForAI/c4ai-command-r-08-2024
Qwen.Qwen2.5-14B-Instruct-GGUF
Infermatic.R1-vortextic-70B-L3.3-v1-GGUF
Quantized version of: Infermatic/R1-vortextic-70B-L3.3-v1
ValiantLabs.Qwen3-4B-ShiningValiant3-GGUF
aquif-ai.aquif-3.5-3B-GGUF
AmanPriyanshu.gpt-oss-6.0b-specialized-all-pruned-moe-only-7-experts-GGUF
ModelSpace.GemmaX2-28-2B-v0.1-GGUF
S4nfs.Neeto-1.0-8b-GGUF
mistralai.Mistral-Large-3-675B-Instruct-2512-GGUF
utter-project.EuroLLM-1.7B-Instruct-GGUF
Qwen.Qwen1.5-0.5B-GGUF
Delta-Vector.Austral-24B-Base-GGUF
Quantized version of: Delta-Vector/Austral-24B-Base
K-intelligence.Midm-2.0-Mini-Instruct-GGUF
huihui-ai.Llama-3.1-Nemotron-Nano-8B-v1-abliterated-GGUF
huihui-ai.Huihui-MoE-23B-A4B-GGUF
OLMo-7B-0724-Instruct-hf-GGUF
VAGOsolutions.SauerkrautLM-v2-14b-SFT-GGUF
DavidAU.L3.1-Dark-Reasoning-Unholy-Hermes-R1-Uncensored-8B-GGUF
Quantized version of: DavidAU/L3.1-Dark-Reasoning-Unholy-Hermes-R1-Uncensored-8B
Skywork.Skywork-SWE-32B-GGUF
inclusionAI.ASearcher-Local-7B-GGUF
Quantized version of: inclusionAI/ASearcher-Local-7B