roleplaiapp

500 models

DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Q4_K_M-GGUF

NaNK
llama
956
0

Midnight-Miqu-70B-v1.5-i1-Q4_K_M-GGUF

NaNK
llama-cpp
920
0

DeepSeek-R1-Distill-Qwen-7B-Q4_K_M-GGUF

NaNK
llama-cpp
791
1

DeepSeek-R1-Distill-Qwen-1.5B-Q4_0-GGUF

NaNK
llama-cpp
770
0

DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_M-GGUF

NaNK
llama-cpp
753
1

DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_S-GGUF

NaNK
llama-cpp
735
0

Llama-3.3-70B-Instruct-Q4_K_M-GGUF

Repo: `roleplaiapp/Llama-3.3-70B-Instruct-Q4KM-GGUF` Original Model: `Llama-3.3-70B-Instruct` Organization: `meta-llama` Quantized File: `llama-3.3-70b-instruct-q4km.gguf` Quantization: `GGUF` Quantization Method: `Q4KM` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q4KM quantized version of Llama-3.3-70B-Instruct. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.
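For anyone who wants to try a file like this locally, here is a minimal sketch using `llama-cpp-python`; the local path, context size, sampling settings, and prompt are assumptions, not part of the card:

```python
# Sketch: run the downloaded GGUF quant with llama-cpp-python (assumed setup).
# Download first, e.g.:
#   huggingface-cli download roleplaiapp/Llama-3.3-70B-Instruct-Q4KM-GGUF \
#       llama-3.3-70b-instruct-q4km.gguf --local-dir .
from llama_cpp import Llama

llm = Llama(
    model_path="./llama-3.3-70b-instruct-q4km.gguf",  # assumed local path
    n_ctx=4096,       # context window; raise if memory allows
    n_gpu_layers=-1,  # offload every layer to the GPU when one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Introduce yourself in one sentence."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```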

NaNK
llama-cpp
734
2

DeepSeek-R1-Distill-Llama-70B-Q4_0-GGUF

NaNK
llama-cpp
690
1

Llama-3.1-Nemotron-70B-Instruct-HF-Q4_K_M-GGUF

NaNK
llama-cpp
680
1

Llama-3.3-70B-Instruct-Q4_0-GGUF

NaNK
llama-cpp
679
2

DeepSeek-R1-Distill-Llama-70B-Q4_K_S-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Llama-70B-Q4KS-GGUF` Original Model: `DeepSeek-R1-Distill-Llama-70B` Organization: `deepseek-ai` Quantized File: `deepseek-r1-distill-llama-70b-q4ks.gguf` Quantization: `GGUF` Quantization Method: `Q4KS` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q4KS quantized version of DeepSeek-R1-Distill-Llama-70B. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
641
0

DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Q4_K_S-GGUF

NaNK
llama
637
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q4_0-GGUF

NaNK
llama-cpp
630
0

TunnedLlama-3.1-8B_v2-Q8_0-GGUF

Repo: `roleplaiapp/TunnedLlama-3.1-8Bv2-Q80-GGUF` Original Model: `TunnedLlama-3.1-8Bv2` Quantized File: `TunnedLlama-3.1-8Bv2.Q80.gguf` Quantization: `GGUF` Quantization Method: `Q80` Overview: This is a GGUF Q80 quantized version of TunnedLlama-3.1-8Bv2. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
129
0

MN-12B-Mag-Mell-R1-Q4_K_M-GGUF

NaNK
llama-cpp
115
0

Gemma The Writer N Restless Quill 10B Uncensored IQ4 XS GGUF

Repo: `roleplaiapp/Gemma-The-Writer-N-Restless-Quill-10B-Uncensored-IQ4XS-GGUF` Original Model: `Gemma-The-Writer-N-Restless-Quill-10B-Uncensored` Quantized File: `Gemma-The-Writer-N-Restless-Quill-10B-DAU-IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of Gemma-The-Writer-N-Restless-Quill-10B-Uncensored. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
86
1

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q8_0-GGUF

NaNK
llama-cpp
81
5

Llama-3.2-3B-Instruct-uncensored-Q6_K-GGUF

NaNK
llama
69
1

Qwen2.5-7B-Instruct-Uncensored-f16-GGUF

NaNK
llama-cpp
68
0

DeepSeek-R1-Distill-Qwen-7B-Q3_K_M-GGUF

NaNK
llama-cpp
67
0

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q2_K-GGUF

NaNK
llama-cpp
67
0

DeepSeek-R1-Distill-Llama-70B-Q2_K-GGUF

NaNK
llama-cpp
59
0

DeepSeek-R1-Distill-Qwen-7B-Q4_K_S-GGUF

NaNK
llama-cpp
58
0

mistral_fp8

Mistral-Small-3.2-24B-Instruct-2506 is a minor update of Mistral-Small-3.1-24B-Instruct-2503. Small-3.2 improves in the following categories:
- Instruction following: Small-3.2 is better at following precise instructions
- Repetition errors: Small-3.2 produces fewer infinite generations or repetitive answers
- Function calling: Small-3.2's function calling template is more robust (see here and examples)

In all other categories Small-3.2 should match or slightly improve on Mistral-Small-3.1-24B-Instruct-2503. Key features: same as Mistral-Small-3.1-24B-Instruct-2503. We compare Mistral-Small-3.2-24B to Mistral-Small-3.1-24B-Instruct-2503. For more comparisons against other models of similar size, please check Mistral-Small-3.1's benchmarks.

| Model | Wildbench v2 | Arena Hard v2 | IF (internal; accuracy) |
|-------|--------------|---------------|-------------------------|
| Small 3.1 24B Instruct | 55.6% | 19.56% | 82.75% |
| Small 3.2 24B Instruct | 65.33% | 43.1% | 84.78% |

Small 3.2 reduces infinite generations by 2x on challenging, long and repetitive prompts.

| Model | Infinite generations (internal; lower is better) |
|-------|--------------------------------------------------|
| Small 3.1 24B Instruct | 2.11% |
| Small 3.2 24B Instruct | 1.29% |

| Model | MMLU | MMLU Pro (5-shot CoT) | MATH | GPQA Main (5-shot CoT) | GPQA Diamond (5-shot CoT) | MBPP Plus - Pass@5 | HumanEval Plus - Pass@5 | SimpleQA (TotalAcc) |
|-------|------|-----------------------|------|------------------------|---------------------------|--------------------|-------------------------|---------------------|
| Small 3.1 24B Instruct | 80.62% | 66.76% | 69.30% | 44.42% | 45.96% | 74.63% | 88.99% | 10.43% |
| Small 3.2 24B Instruct | 80.50% | 69.06% | 69.42% | 44.22% | 46.13% | 78.33% | 92.90% | 12.10% |

| Model | MMMU | Mathvista | ChartQA | DocVQA | AI2D |
|-------|------|-----------|---------|--------|------|
| Small 3.1 24B Instruct | 64.00% | 68.91% | 86.24% | 94.08% | 93.72% |
| Small 3.2 24B Instruct | 62.50% | 67.09% | 87.4% | 94.86% | 92.91% |

The model can be used with the following frameworks:
- `vllm` (recommended): see here
- `transformers`: see here

Note 1: We recommend using a relatively low temperature, such as `temperature=0.15`. Note 2: Make sure to add a system prompt to the model to best tailor it to your needs. If you want to use the model as a general assistant, we recommend using the one provided in the SYSTEMPROMPT.txt file. Installing vLLM should automatically install `mistral-common >= 1.6.2`. You can also make use of a ready-to-go Docker image, or one from Docker Hub. We recommend using Mistral-Small-3.2-24B-Instruct-2506 in a server/client setting. Note: Running Mistral-Small-3.2-24B-Instruct-2506 on GPU requires ~55 GB of GPU RAM in bf16 or fp16. To ping the client you can use a simple Python snippet; see the examples below. Leverage the vision capabilities of Mistral-Small-3.2-24B-Instruct-2506 to make the best choice given a scenario, go catch them all! Mistral-Small-3.2-24B-Instruct-2506 is excellent at function/tool calling tasks via vLLM. Mistral-Small-3.2-24B-Instruct-2506 will follow your instructions down to the last letter! You can also use Mistral-Small-3.2-24B-Instruct-2506 with `transformers`. To make the best use of the model with `transformers`, make sure you have `mistral-common >= 1.6.2` installed to use the tokenizer. Then load the tokenizer along with the model and generate, as sketched below.
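A rough sketch of the server/client setup described above; the server launch flags follow vLLM's Mistral support, while the port, prompts, and sampling settings are placeholder assumptions:

```python
# Sketch: query a local vLLM OpenAI-compatible server hosting the model.
# Assumed server launch:
#   vllm serve mistralai/Mistral-Small-3.2-24B-Instruct-2506 \
#       --tokenizer_mode mistral --config_format mistral --load_format mistral
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")  # default port assumed

resp = client.chat.completions.create(
    model="mistralai/Mistral-Small-3.2-24B-Instruct-2506",
    temperature=0.15,  # low temperature, as the card recommends
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},  # placeholder system prompt
        {"role": "user", "content": "Give me three short bullet points on quantization."},
    ],
)
print(resp.choices[0].message.content)
```

And a minimal `transformers` + `mistral-common` sketch for the generation step the card ends on; the dtype, token budget, and prompt are assumptions:

```python
# Sketch: load the tokenizer via mistral-common (>= 1.6.2), generate with transformers.
import torch
from transformers import AutoModelForCausalLM
from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.request import ChatCompletionRequest
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer

model_id = "mistralai/Mistral-Small-3.2-24B-Instruct-2506"

tokenizer = MistralTokenizer.from_hf_hub(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"  # ~55 GB of GPU RAM in bf16
)

request = ChatCompletionRequest(messages=[UserMessage(content="Hello! Who are you?")])
input_ids = tokenizer.encode_chat_completion(request).tokens

output = model.generate(
    torch.tensor([input_ids], device=model.device),
    max_new_tokens=256,  # assumed budget
    temperature=0.15,
    do_sample=True,
)[0]
print(tokenizer.decode(output[len(input_ids):].tolist()))
```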

NaNK
license:apache-2.0
58
0

DeepSeek-R1-Distill-Llama-70B-Q3_K_M-GGUF

NaNK
llama-cpp
57
1

DeepSeek-R1-Distill-Qwen-32B-Q4_0-GGUF

NaNK
llama-cpp
53
5

DeepSeek-R1-Distill-Qwen-14B-Q4_K_M-GGUF

NaNK
llama-cpp
52
1

DeepSeek-R1-Distill-Alpaca-FineTuned-f16-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Alpaca-FineTuned-f16-GGUF` Original Model: `DeepSeek-R1-Distill-Alpaca-FineTuned` Quantized File: `DeepSeek-R1-Distill-Alpaca-FineTuned.f16.gguf` Quantization: `GGUF` Quantization Method: `f16` Overview: This is a GGUF f16 quantized version of DeepSeek-R1-Distill-Alpaca-FineTuned. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
52
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q3_K_L-GGUF

NaNK
llama
51
0

Pathfinder-RP-12B-RU-Q6_K-GGUF

NaNK
llama-cpp
50
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q4_K_S-GGUF

NaNK
llama
50
0

DeepSeek-R1-Distill-Qwen-14B-Q5_0-GGUF

NaNK
llama-cpp
49
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-i1-Q4_K_M-GGUF

NaNK
llama
47
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q3_K_M-GGUF

NaNK
llama
47
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q6_K-GGUF

NaNK
llama
46
1

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q5_K_S-GGUF

NaNK
llama
46
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-IQ4_XS-GGUF

NaNK
llama
46
0

DeepSeek-R1-Distill-Qwen-14B-Q4_0-GGUF

NaNK
llama-cpp
45
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-i1-IQ3_M-GGUF

NaNK
llama
44
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q4_K_M-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Alpaca-FineTuned-Q4KM-GGUF` Original Model: `DeepSeek-R1-Distill-Alpaca-FineTuned` Quantized File: `DeepSeek-R1-Distill-Alpaca-FineTuned.Q4KM.gguf` Quantization: `GGUF` Quantization Method: `Q4KM` Overview: This is a GGUF Q4KM quantized version of DeepSeek-R1-Distill-Alpaca-FineTuned. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
44
0

Mistral-MOE-4X7B-Dark-MultiVerse-Uncensored-Enhanced32-24B-gguf-Q8_0-GGUF

NaNK
llama-cpp
44
0

oh-dcft-v3.1-claude-3-5-haiku-20241022-Q3_K_L-GGUF

llama-cpp
43
1

DeepSeek-R1-Distill-Alpaca-FineTuned-Q5_K_M-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Alpaca-FineTuned-Q5KM-GGUF` Original Model: `DeepSeek-R1-Distill-Alpaca-FineTuned` Quantized File: `DeepSeek-R1-Distill-Alpaca-FineTuned.Q5KM.gguf` Quantization: `GGUF` Quantization Method: `Q5KM` Overview: This is a GGUF Q5KM quantized version of DeepSeek-R1-Distill-Alpaca-FineTuned. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
43
0

DeepSeek-R1-Distill-Llama-70B-Q6_K-GGUF

NaNK
llama-cpp
42
1

DeepSeek-R1-Distill-Qwen-14B-Q3_K_M-GGUF

NaNK
llama-cpp
42
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-IQ4_XS-GGUF

Repo: `roleplaiapp/DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-IQ4XS-GGUF` Original Model: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B` Quantized File: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B.IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama
42
0

DeepSeek-R1-Distill-Llama-70B-Q3_K_S-GGUF

NaNK
llama-cpp
41
0

MistralRP-Noromaid-NSFW-Mistral-7B-Q8_0-GGUF

NaNK
llama-cpp
41
0

DeepSeek-R1-Distill-Qwen-32B-Q6_K-GGUF

NaNK
llama-cpp
40
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-i1-Q2_K-GGUF

NaNK
llama
40
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q3_K_L-GGUF

llama-cpp
40
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q3_K_M-GGUF

llama-cpp
40
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q5_K_M-GGUF

NaNK
llama
40
0

DeepSeek-R1-Distill-Llama-8B-Q4_0-GGUF

NaNK
llama-cpp
39
2

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q4_K_M-GGUF

NaNK
llama-cpp
39
2

Janus-Pro-7B-LM-Q8_0-GGUF

NaNK
llama-cpp
39
1

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-i1-Q3_K_M-GGUF

NaNK
llama
39
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q3_K_S-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Alpaca-FineTuned-Q3KS-GGUF` Original Model: `DeepSeek-R1-Distill-Alpaca-FineTuned` Quantized File: `DeepSeek-R1-Distill-Alpaca-FineTuned.Q3KS.gguf` Quantization: `GGUF` Quantization Method: `Q3KS` Overview: This is a GGUF Q3KS quantized version of DeepSeek-R1-Distill-Alpaca-FineTuned. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
39
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q6_K-GGUF

NaNK
llama
38
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q3_K_S-GGUF

Repo: `roleplaiapp/DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q3KS-GGUF` Original Model: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B` Quantized File: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B.Q3KS.gguf` Quantization: `GGUF` Quantization Method: `Q3KS` Overview: This is a GGUF Q3KS quantized version of DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama
38
0

MistralRP-Noromaid-NSFW-Mistral-7B-Q5_K_M-GGUF

NaNK
llama-cpp
38
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-IQ3_S-GGUF

NaNK
llama
38
0

Janus-Pro-7B-LM-Q4_K_M-GGUF

NaNK
llama-cpp
37
1

AceInstruct-1.5B-Q4_K_S-GGUF

NaNK
llama-cpp
37
0

DeepSeek-R1-Distill-Qwen-7B-Q3_K_L-GGUF

NaNK
llama-cpp
37
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-f16-GGUF

Repo: `roleplaiapp/DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-f16-GGUF` Original Model: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B` Quantized File: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B.f16.gguf` Quantization: `GGUF` Quantization Method: `f16` Overview: This is a GGUF f16 quantized version of DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama
37
0

DeepSeek-R1-Distill-Qwen-32B-Q4_K_M-GGUF

NaNK
llama-cpp
36
1

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q2_K-GGUF

NaNK
llama
36
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q3_K_S-GGUF

NaNK
llama
36
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-IQ3_M-GGUF

NaNK
llama
36
0

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q4_K_S-GGUF

NaNK
llama-cpp
35
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q4_K_S-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Alpaca-FineTuned-Q4KS-GGUF` Original Model: `DeepSeek-R1-Distill-Alpaca-FineTuned` Quantized File: `DeepSeek-R1-Distill-Alpaca-FineTuned.Q4KS.gguf` Quantization: `GGUF` Quantization Method: `Q4KS` Overview: This is a GGUF Q4KS quantized version of DeepSeek-R1-Distill-Alpaca-FineTuned. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
35
0

Mistral-MOE-4X7B-Dark-MultiVerse-Uncensored-Enhanced32-24B-gguf-IQ4_XS-GGUF

Repo: `roleplaiapp/Mistral-MOE-4X7B-Dark-MultiVerse-Uncensored-Enhanced32-24B-gguf-IQ4XS-GGUF` Original Model: `Mistral-MOE-4X7B-Dark-MultiVerse-Uncensored-Enhanced32-24B-gguf` Quantized File: `M-MOE-4X7B-Dark-MultiVerse-UC-E32-24B-DAU-IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of Mistral-MOE-4X7B-Dark-MultiVerse-Uncensored-Enhanced32-24B-gguf. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
35
0

saiga_nemo_12b_gguf-Q8_0-GGUF

NaNK
llama-cpp
35
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q4_K_M-GGUF

NaNK
llama
35
0

DeepSeek-R1-Distill-Qwen-14B-Q3_K_S-GGUF

NaNK
llama-cpp
34
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-IQ3_XS-GGUF

NaNK
llama
34
0

Codestral-22B-v0.1-Q4_K_M-GGUF

NaNK
llama-cpp
33
2

DeepSeek-R1-Distill-Qwen-1.5B-Q8_0-GGUF

NaNK
llama-cpp
33
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q8_0-GGUF

Repo: `roleplaiapp/DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q80-GGUF` Original Model: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B` Quantized File: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B.Q80.gguf` Quantization: `GGUF` Quantization Method: `Q80` Overview: This is a GGUF Q80 quantized version of DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama
33
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q5_K_S-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Alpaca-FineTuned-Q5KS-GGUF` Original Model: `DeepSeek-R1-Distill-Alpaca-FineTuned` Quantized File: `DeepSeek-R1-Distill-Alpaca-FineTuned.Q5KS.gguf` Quantization: `GGUF` Quantization Method: `Q5KS` Overview: This is a GGUF Q5KS quantized version of DeepSeek-R1-Distill-Alpaca-FineTuned. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
33
0

AceInstruct-1.5B-Q2_K-GGUF

NaNK
llama-cpp
32
0

DeepSeek-R1-Distill-Qwen-14B-Q2_K-GGUF

NaNK
llama-cpp
32
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q6_K-GGUF

llama-cpp
32
0

DeepSeek-R1-Distill-Alpaca-FineTuned-IQ4_XS-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Alpaca-FineTuned-IQ4XS-GGUF` Original Model: `DeepSeek-R1-Distill-Alpaca-FineTuned` Quantized File: `DeepSeek-R1-Distill-Alpaca-FineTuned.IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of DeepSeek-R1-Distill-Alpaca-FineTuned. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
32
0

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-i1-Q2_K-GGUF

NaNK
llama
32
0

DeepSeek-R1-Distill-Qwen-7B-Q4_0-GGUF

NaNK
llama-cpp
31
1

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q3_K_M-GGUF

Repo: `roleplaiapp/DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q3KM-GGUF` Original Model: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B` Quantized File: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B.Q3KM.gguf` Quantization: `GGUF` Quantization Method: `Q3KM` Overview: This is a GGUF Q3KM quantized version of DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama
31
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q8_0-GGUF

llama-cpp
31
0

Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q8_0-GGUF

NaNK
llama-cpp
29
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-i1-Q4_K_S-GGUF

NaNK
llama
29
0

DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-IQ4_XS-GGUF

NaNK
llama
29
0

DeepSeek-R1-Distill-Llama-8B-Q4_K_M-GGUF

NaNK
llama-cpp
28
1

DeepSeek-R1-Distill-Llama-8B-Q3_K_S-GGUF

NaNK
llama-cpp
28
0

DeepSeek-R1-Distill-Qwen-7B-Q6_K-GGUF

NaNK
llama-cpp
27
0

DS-R1-Distill-Q2.5-14B-Harmony_V0.1-Q3_K_M-GGUF

Repo: `roleplaiapp/DS-R1-Distill-Q2.5-14B-HarmonyV0.1-Q3KM-GGUF` Original Model: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1` Quantized File: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1.Q3KM.gguf` Quantization: `GGUF` Quantization Method: `Q3KM` Overview: This is a GGUF Q3KM quantized version of DS-R1-Distill-Q2.5-14B-HarmonyV0.1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
27
0

AceInstruct-1.5B-Q8_0-GGUF

NaNK
llama-cpp
26
1

ALIA-40b-Q2_K-GGUF

NaNK
llama-cpp
26
0

FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-IQ3_XS-GGUF

NaNK
llama-cpp
26
0

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q5_K_M-GGUF

NaNK
llama-cpp
25
0

Qwen2.5-Coder-14B-Instruct-Uncensored-Q4_K_S-GGUF

NaNK
llama-cpp
25
0

Llama-3-monika-ddlc-11.5b-v1-i1-IQ3_XS-GGUF

NaNK
llama
25
0

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q6_K-GGUF

NaNK
llama-cpp
24
1

Omni-Reasoner-2B-Q3_K_S-GGUF

Repo: `roleplaiapp/Omni-Reasoner-2B-Q3KS-GGUF` Original Model: `Omni-Reasoner-o1` Organization: `prithivMLmods` Quantized File: `omni-reasoner-2b-q3ks.gguf` Quantization: `GGUF` Quantization Method: `Q3KS` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q3KS quantized version of Omni-Reasoner-o1. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
24
0

DeepSeek-R1-Distill-Qwen-1.5B-Q6_K-GGUF

NaNK
llama-cpp
24
0

Llama-3.2-3B-Instruct-uncensored-Q2_K-GGUF

NaNK
llama
24
0

MN-12B-Mag-Mell-R1-Q8_0-GGUF

NaNK
llama-cpp
24
0

MN-12B-Mag-Mell-R1-Q5_K_M-GGUF

NaNK
llama-cpp
24
0

14B-Qwen2.5-Kunou-v1-Q3_K_L-GGUF

NaNK
llama-cpp
24
0

Dolphin3.0-Llama3.1-8B-Q4_K_S-GGUF

NaNK
llama-cpp
24
0

Omni-Reasoner-2B-Q4_K_S-GGUF

Repo: `roleplaiapp/Omni-Reasoner-2B-Q4KS-GGUF` Original Model: `Omni-Reasoner-o1` Organization: `prithivMLmods` Quantized File: `omni-reasoner-2b-q4ks.gguf` Quantization: `GGUF` Quantization Method: `Q4KS` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q4KS quantized version of Omni-Reasoner-o1. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
23
0

Omni-Reasoner-2B-Q5_K_M-GGUF

Repo: `roleplaiapp/Omni-Reasoner-2B-Q5KM-GGUF` Original Model: `Omni-Reasoner-o1` Organization: `prithivMLmods` Quantized File: `omni-reasoner-2b-q5km.gguf` Quantization: `GGUF` Quantization Method: `Q5KM` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q5KM quantized version of Omni-Reasoner-o1. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
23
0

DeepSeek-R1-Distill-Llama-8B-Q5_0-GGUF

NaNK
llama-cpp
23
0

Virtuoso-Lite-Q5_K_S-GGUF

llama-cpp
23
0

Mistral-Small-24B-Instruct-2501-Q2_K-GGUF

NaNK
llama-cpp
23
0

Reasoning-Llama-3.1-CoT-RE1-f16-GGUF

llama
23
0

DeepSeek-R1-Distill-Qwen-32B-Q5_K_M-GGUF

NaNK
llama-cpp
22
2

ReaderLM-v2-Q2_K-GGUF

NaNK
llama-cpp
22
0

Codestral-22B-v0.1-Q5_K_M-GGUF

NaNK
llama-cpp
22
0

DeepSeek-R1-Distill-Qwen-32B-Q3_K_S-GGUF

NaNK
llama-cpp
22
0

DeepSeek-R1-Distill-Qwen-14B-Q3_K_L-GGUF

NaNK
llama-cpp
22
0

DeepSeek-R1-Distill-Qwen-14B-Q4_K_S-GGUF

NaNK
llama-cpp
22
0

DeepSeek-R1-Distill-Qwen-14B-Q6_K-GGUF

NaNK
llama-cpp
22
0

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q3_K_S-GGUF

NaNK
llama-cpp
22
0

oh-dcft-v3.1-claude-3-5-haiku-20241022-Q2_K-GGUF

llama-cpp
22
0

phi-4-Q3_K_S-GGUF

NaNK
llama-cpp
21
1

SmallThinker-3B-Preview-Q4_0-GGUF

NaNK
llama-cpp
21
0

Codestral-22B-v0.1-Q6_K-GGUF

NaNK
llama-cpp
21
0

Omni-Reasoner-2B-Q4_0-GGUF

NaNK
llama-cpp
21
0

Qwen2.5-7B-Instruct-Uncensored-Q5_K_M-GGUF

NaNK
llama-cpp
21
0

SILMA-Kashif-2B-Instruct-v1.0-i1-Q3_K_S-GGUF

NaNK
llama-cpp
21
0

oh-dcft-v3.1-claude-3-5-haiku-20241022-Q8_0-GGUF

llama-cpp
21
0

Minerva-14b-V0.1-i1-IQ4_XS-GGUF

NaNK
llama-cpp
21
0

deepseek-r1-qwen-2.5-32B-ablated-Q6_K-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q6K-GGUF` Original Model: `deepseek-r1-qwen-2.5-32B-ablated` Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q6K.gguf` Quantization: `GGUF` Quantization Method: `Q6K` Overview: This is a GGUF Q6K quantized version of deepseek-r1-qwen-2.5-32B-ablated. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
21
0

DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf-Q4_K_M-GGUF

NaNK
llama-cpp
21
0

Llama3-Chinese-8B-Instruct-Q5_K_S-GGUF

NaNK
llama-cpp
21
0

Llama-3.3-70B-Instruct-Q3_K_S-GGUF

NaNK
llama
20
0

DS-R1-Distill-Q2.5-14B-Harmony_V0.1-Q6_K-GGUF

Repo: `roleplaiapp/DS-R1-Distill-Q2.5-14B-HarmonyV0.1-Q6K-GGUF` Original Model: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1` Quantized File: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1.Q6K.gguf` Quantization: `GGUF` Quantization Method: `Q6K` Overview: This is a GGUF Q6K quantized version of DS-R1-Distill-Q2.5-14B-HarmonyV0.1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
20
0

Minerva-14b-V0.1-i1-Q6_K-GGUF

NaNK
llama-cpp
20
0

Jaja-small-v1-Q8_0-GGUF

Repo: `roleplaiapp/Jaja-small-v1-Q80-GGUF` Original Model: `Jaja-small-v1` Quantized File: `Jaja-small-v1.Q80.gguf` Quantization: `GGUF` Quantization Method: `Q80` Overview: This is a GGUF Q80 quantized version of Jaja-small-v1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
20
0

Qwen2.5-Coder-14B-Instruct-Uncensored-IQ4_XS-GGUF

Repo: `roleplaiapp/Qwen2.5-Coder-14B-Instruct-Uncensored-IQ4XS-GGUF` Original Model: `Qwen2.5-Coder-14B-Instruct-Uncensored` Quantized File: `Qwen2.5-Coder-14B-Instruct-Uncensored.IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of Qwen2.5-Coder-14B-Instruct-Uncensored. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
20
0

DeepSeek-R1-Distill-Llama-8B-Q6_K-GGUF

NaNK
llama-cpp
19
1

Llama-3.3-70B-Instruct-Q5_0-GGUF

NaNK
llama-cpp
19
0

AceInstruct-72B-Q5_K_M-GGUF

NaNK
llama-cpp
19
0

14B-Qwen2.5-Kunou-v1-IQ4_XS-GGUF

NaNK
llama-cpp
19
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-Q4_K_M-GGUF

llama
19
0

Mistral-Small-24B-Instruct-2501-IQ3_XS-GGUF

Repo: `roleplaiapp/Mistral-Small-24B-Instruct-2501-IQ3XS-GGUF` Original Model: `Mistral-Small-24B-Instruct-2501` Quantized File: `Mistral-Small-24B-Instruct-2501-IQ3XS.gguf` Quantization: `GGUF` Quantization Method: `IQ3XS` Overview: This is a GGUF IQ3XS quantized version of Mistral-Small-24B-Instruct-2501. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
19
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q3_K_S-GGUF

Repo: `roleplaiapp/Llama-3.1-Nemotron-70B-Instruct-HF-Q3KS-GGUF` Original Model: `Llama-3.1-Nemotron-70B-Instruct-HF` Organization: `nvidia` Quantized File: `llama-3.1-nemotron-70b-instruct-hf-q3ks.gguf` Quantization: `GGUF` Quantization Method: `Q3KS` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q3KS quantized version of Llama-3.1-Nemotron-70B-Instruct-HF. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
18
0

Wayfarer-12B-Q5_K_M-GGUF

NaNK
llama-cpp
18
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q5_K_S-GGUF

NaNK
llama-cpp
18
0

Qwen2.5-7B-Instruct-Uncensored-IQ4_XS-GGUF

NaNK
llama-cpp
18
0

Qwen2.5-7B-Instruct-Uncensored-Q4_K_S-GGUF

NaNK
llama-cpp
18
0

DS-R1-Distill-Q2.5-14B-Harmony_V0.1-Q3_K_S-GGUF

Repo: `roleplaiapp/DS-R1-Distill-Q2.5-14B-HarmonyV0.1-Q3KS-GGUF` Original Model: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1` Quantized File: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1.Q3KS.gguf` Quantization: `GGUF` Quantization Method: `Q3KS` Overview: This is a GGUF Q3KS quantized version of DS-R1-Distill-Q2.5-14B-HarmonyV0.1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
18
0

deepseek-r1-qwen-2.5-32B-ablated-IQ4_XS-GGUF

NaNK
llama-cpp
18
0

cyberagent-DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf-Q4_K_M-GGUF

NaNK
llama-cpp
18
0

saiga_nemo_12b_gguf-Q5_K_M-GGUF

NaNK
llama-cpp
18
0

Confucius-o1-14B-Q6_K-GGUF

NaNK
llama-cpp
18
0

DeepSeek-R1-Distill-Qwen-14B-Q8_0-GGUF

NaNK
llama-cpp
17
1

ALIA-40b-Q4_0-GGUF

NaNK
llama-cpp
17
1

Llama-3.3-70B-Instruct-Q2_K-GGUF

NaNK
llama-cpp
17
0

Llama-3.3-70B-Instruct-Q6_K-GGUF

NaNK
llama-cpp
17
0

SmallThinker-3B-Preview-Q5_0-GGUF

NaNK
llama-cpp
17
0

internlm3-8b-instruct-Q5_0-GGUF

NaNK
llama-cpp
17
0

Wayfarer-12B-Q2_K-GGUF

NaNK
llama-cpp
17
0

AceInstruct-1.5B-Q3_K_S-GGUF

NaNK
llama-cpp
17
0

DeepSeek-R1-Distill-Llama-8B-Q2_K-GGUF

NaNK
llama-cpp
17
0

DeepSeek-R1-Distill-Qwen-14B-Q5_K_M-GGUF

NaNK
llama-cpp
17
0

DeepSeek-R1-Distill-Qwen-7B-Q5_K_M-GGUF

NaNK
llama-cpp
17
0

Midnight-Miqu-70B-v1.5-i1-Q3_K_L-GGUF

NaNK
llama-cpp
17
0

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q3_K_M-GGUF

NaNK
llama-cpp
17
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-Q6_K-GGUF

llama
17
0

DS-R1-Distill-Q2.5-14B-Harmony_V0.1-Q5_K_S-GGUF

Repo: `roleplaiapp/DS-R1-Distill-Q2.5-14B-HarmonyV0.1-Q5KS-GGUF` Original Model: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1` Quantized File: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1.Q5KS.gguf` Quantization: `GGUF` Quantization Method: `Q5KS` Overview: This is a GGUF Q5KS quantized version of DS-R1-Distill-Q2.5-14B-HarmonyV0.1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
17
0

DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Q2_K-GGUF

NaNK
llama
17
0

FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-Q2_K-GGUF

Repo: `roleplaiapp/FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-Q2K-GGUF` Original Model: `FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview` Quantized File: `FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-Q2K.gguf` Quantization: `GGUF` Quantization Method: `Q2K` Overview: This is a GGUF Q2K quantized version of FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
17
0

DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf-Q5_K_M-GGUF

Repo: `roleplaiapp/DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf-Q5KM-GGUF` Original Model: `DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf` Quantized File: `DeepSeek-R1-Distill-Qwen-32B-Japanese-Q5KM.gguf` Quantization: `GGUF` Quantization Method: `Q5KM` Overview: This is a GGUF Q5KM quantized version of DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
17
0

internlm3-8b-instruct-Q4_K_M-GGUF

NaNK
llama-cpp
16
1

ReaderLM-v2-Q4_K_M-GGUF

NaNK
llama-cpp
16
1

Llama-3.3-70B-Instruct-Q4_K_S-GGUF

NaNK
llama-cpp
16
0

Llama-3.3-70B-Instruct-Q8_0-GGUF

NaNK
llama-cpp
16
0

SmallThinker-3B-Preview-IQ4_NL-GGUF

NaNK
llama-cpp
16
0

QwQ-32B-Preview-Q5_0-GGUF

Repo: `roleplaiapp/QwQ-32B-Preview-Q50-GGUF` Original Model: `QwQ-32B-Preview` Organization: `Qwen` Quantized File: `qwq-32b-preview-q50.gguf` Quantization: `GGUF` Quantization Method: `Q50` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q50 quantized version of QwQ-32B-Preview. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
16
0

QwQ-32B-Preview-Q4_0-GGUF

NaNK
llama-cpp
16
0

AceInstruct-72B-Q4_0-GGUF

NaNK
llama-cpp
16
0

Codestral-22B-v0.1-Q5_0-GGUF

NaNK
llama-cpp
16
0

AceInstruct-1.5B-Q4_0-GGUF

NaNK
llama-cpp
16
0

DeepSeek-R1-Distill-Qwen-32B-Q2_K-GGUF

NaNK
llama-cpp
16
0

DeepSeek-R1-Distill-Qwen-14B-Q5_K_S-GGUF

NaNK
llama-cpp
16
0

DeepSeek-R1-Distill-Qwen-1.5B-Q5_0-GGUF

NaNK
llama-cpp
16
0

DeepSeek-R1-Distill-Llama-70B-Q5_0-GGUF

NaNK
llama-cpp
16
0

14B-Qwen2.5-Kunou-v1-Q5_K_S-GGUF

NaNK
llama-cpp
16
0

Chocolatine-2-14B-Instruct-v2.0b2-i1-Q4_K_S-GGUF

NaNK
llama-cpp
16
0

Llama-3-monika-ddlc-11.5b-v1-i1-Q4_K_S-GGUF

NaNK
llama
16
0

Qwen2.5-32B-DeepSeek-R1-Instruct-i1-Q6_K-GGUF

NaNK
llama-cpp
16
0

AceInstruct-72B-Q4_K_M-GGUF

Repo: `roleplaiapp/AceInstruct-72B-Q4KM-GGUF` Original Model: `AceInstruct-72B` Organization: `nvidia` Quantized File: `aceinstruct-72b-q4km.gguf` Quantization: `GGUF` Quantization Method: `Q4KM` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q4KM quantized version of AceInstruct-72B. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
15
1

SmallThinker-3B-Preview-Q4_K_S-GGUF

NaNK
llama-cpp
15
0

SmallThinker-3B-Preview-IQ3_M-GGUF

NaNK
llama-cpp
15
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q5_0-GGUF

NaNK
llama-cpp
15
0

DeepSeek-R1-Distill-Llama-70B-Q8_0-GGUF

NaNK
llama-cpp
15
0

Qwen2.5-7B-Instruct-Uncensored-Q4_K_M-GGUF

NaNK
llama-cpp
15
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-Q3_K_S-GGUF

llama
15
0

Virtuoso-Lite-Q4_K_M-GGUF

llama-cpp
15
0

Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q3_K_M-GGUF

NaNK
llama-cpp
15
0

Qwen2.5-Coder-14B-Instruct-Uncensored-Q5_K_M-GGUF

NaNK
llama-cpp
15
0

Qwen2.5-7B-olm-v1.4-i1-Q2_K-GGUF

Repo: `roleplaiapp/Qwen2.5-7B-olm-v1.4-i1-Q2K-GGUF` Original Model: `Qwen2.5-7B-olm-v1.4-i1` Quantized File: `Qwen2.5-7B-olm-v1.4.i1-Q2K.gguf` Quantization: `GGUF` Quantization Method: `Q2K` Overview: This is a GGUF Q2K quantized version of Qwen2.5-7B-olm-v1.4-i1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
15
0

ArxivLlama-3.1-8B-Q3_K_L-GGUF

NaNK
arxivllama
15
0

deepseek-r1-qwen-2.5-32B-ablated-Q3_K_L-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q3KL-GGUF` Original Model: `deepseek-r1-qwen-2.5-32B-ablated` Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q3KL.gguf` Quantization: `GGUF` Quantization Method: `Q3KL` Overview: This is a GGUF Q3KL quantized version of deepseek-r1-qwen-2.5-32B-ablated. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
15
0

Qwen2.5-32B-DeepSeek-R1-Instruct-i1-Q5_K_M-GGUF

NaNK
llama-cpp
15
0

Janus-Pro-7B-LM-f16-GGUF

NaNK
llama-cpp
15
0

QwQ-32B-Preview-Q8_0-GGUF

NaNK
llama-cpp
14
0

internlm3-8b-instruct-Q4_0-GGUF

NaNK
llama-cpp
14
0

ReaderLM-v2-Q5_K_M-GGUF

NaNK
llama-cpp
14
0

AceInstruct-72B-Q3_K_S-GGUF

NaNK
llama-cpp
14
0

Codestral-22B-v0.1-Q5_K_S-GGUF

NaNK
llama-cpp
14
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q5_K_M-GGUF

NaNK
llama-cpp
14
0

DeepSeek-R1-Distill-Llama-8B-Q3_K_M-GGUF

NaNK
llama-cpp
14
0

DeepSeek-R1-Distill-Qwen-7B-Q2_K-GGUF

NaNK
llama-cpp
14
0

DeepSeek-R1-Distill-Qwen-1.5B-Q2_K-GGUF

NaNK
llama-cpp
14
0

DeepSeek-R1-Distill-Qwen-1.5B-Q3_K_S-GGUF

NaNK
llama-cpp
14
0

Qwen2.5-7B-Instruct-Uncensored-Q5_K_S-GGUF

NaNK
llama-cpp
14
0

14B-Qwen2.5-Kunou-v1-Q4_K_M-GGUF

NaNK
llama-cpp
14
0

Qwen2.5-7B-olm-v1.4-i1-IQ3_XS-GGUF

NaNK
llama-cpp
14
0

cyberagent-DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf-IQ3_M-GGUF

NaNK
llama-cpp
14
0

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-IQ3_XS-GGUF

NaNK
llama-cpp
14
0

SmallThinker-3B-Preview-Q3_K_S-GGUF

NaNK
llama-cpp
13
1

SmallThinker-3B-Preview-Q4_K_M-GGUF

NaNK
llama-cpp
13
1

Qwen2.5-Coder-14B-Instruct-Uncensored-Q4_K_M-GGUF

NaNK
llama-cpp
13
1

AceInstruct-72B-Q2_K-GGUF

NaNK
llama-cpp
13
0

Codestral-22B-v0.1-Q4_0-GGUF

NaNK
llama-cpp
13
0

DeepSeek-R1-Distill-Llama-8B-Q8_0-GGUF

NaNK
llama-cpp
13
0

DeepSeek-R1-Distill-Qwen-1.5B-Q3_K_L-GGUF

NaNK
llama-cpp
13
0

14B-Qwen2.5-Kunou-v1-Q3_K_M-GGUF

NaNK
llama-cpp
13
0

GRAG-R1-14B-SFT-DE-EXP-Q6_K-GGUF

NaNK
llama-cpp
13
0

Slush-Sunfall-Rocinante-GGLD-12B-Q3_K_L-GGUF

NaNK
llama-cpp
13
0

Slush-Sunfall-Rocinante-GGLD-12B-Q5_K_S-GGUF

NaNK
llama-cpp
13
0

deepseek-r1-qwen-2.5-32B-ablated-IQ3_M-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-IQ3M-GGUF` Original Model: `deepseek-r1-qwen-2.5-32B-ablated` Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-IQ3M.gguf` Quantization: `GGUF` Quantization Method: `IQ3M` Overview: This is a GGUF IQ3M quantized version of deepseek-r1-qwen-2.5-32B-ablated. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
13
0

deepseek-r1-qwen-2.5-32B-ablated-IQ3_XS-GGUF

NaNK
llama-cpp
13
0

RuadaptQwen2.5-32B-Pro-Beta-Q2_K-GGUF

Repo: `roleplaiapp/RuadaptQwen2.5-32B-Pro-Beta-Q2K-GGUF` Original Model: `RuadaptQwen2.5-32B-Pro-Beta` Quantized File: `Q2K.gguf` Quantization: `GGUF` Quantization Method: `Q2K` Overview: This is a GGUF Q2K quantized version of RuadaptQwen2.5-32B-Pro-Beta. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
13
0

Cydonia-24B-v2a-Q2_K-GGUF

Repo: `roleplaiapp/Cydonia-24B-v2a-Q2K-GGUF` Original Model: `Cydonia-24B-v2a` Quantized File: `Cydonia-24B-v2a-Q2K.gguf` Quantization: `GGUF` Quantization Method: `Q2K` Overview: This is a GGUF Q2K quantized version of Cydonia-24B-v2a. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
13
0

Dolphin3.0-Llama3.1-8B-Q6_K-GGUF

NaNK
llama-cpp
13
0

AceInstruct-7B-Q2_K-GGUF

NaNK
llama-cpp
12
1

Qwen2.5-32B-DeepSeek-R1-Instruct-i1-IQ4_XS-GGUF

Repo: `roleplaiapp/Qwen2.5-32B-DeepSeek-R1-Instruct-i1-IQ4XS-GGUF` Original Model: `Qwen2.5-32B-DeepSeek-R1-Instruct-i1` Quantized File: `Qwen2.5-32B-DeepSeek-R1-Instruct.i1-IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of Qwen2.5-32B-DeepSeek-R1-Instruct-i1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
12
1

SmallThinker-3B-Preview-Q6_K-GGUF

NaNK
llama-cpp
12
0

QwQ-32B-Preview-Q5_K_M-GGUF

NaNK
llama-cpp
12
0

internlm3-8b-instruct-Q4_K_S-GGUF

NaNK
llama-cpp
12
0

ReaderLM-v2-Q3_K_M-GGUF

NaNK
llama-cpp
12
0

AceInstruct-7B-Q4_0-GGUF

NaNK
llama-cpp
12
0

DeepSeek-R1-Distill-Qwen-32B-Q5_K_S-GGUF

NaNK
llama-cpp
12
0

ALIA-40b-Q5_0-GGUF

NaNK
llama-cpp
12
0

Midnight-Miqu-70B-v1.5-i1-IQ3_S-GGUF

NaNK
llama-cpp
12
0

SILMA-Kashif-2B-Instruct-v1.0-i1-IQ3_M-GGUF

NaNK
llama-cpp
12
0

14B-Qwen2.5-Kunou-v1-Q3_K_S-GGUF

NaNK
llama-cpp
12
0

DS-R1-Distill-Q2.5-14B-Harmony_V0.1-Q5_K_M-GGUF

Repo: `roleplaiapp/DS-R1-Distill-Q2.5-14B-HarmonyV0.1-Q5KM-GGUF` Original Model: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1` Quantized File: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1.Q5KM.gguf` Quantization: `GGUF` Quantization Method: `Q5KM` Overview: This is a GGUF Q5KM quantized version of DS-R1-Distill-Q2.5-14B-HarmonyV0.1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
12
0

DS-R1-Distill-Q2.5-14B-Harmony_V0.1-IQ4_XS-GGUF

NaNK
llama-cpp
12
0

Llama3.2-doker-Q4_K_M-GGUF

llama-cpp
12
0

Jaja-small-v1-Q4_K_M-GGUF

Repo: `roleplaiapp/Jaja-small-v1-Q4KM-GGUF` Original Model: `Jaja-small-v1` Quantized File: `Jaja-small-v1.Q4KM.gguf` Quantization: `GGUF` Quantization Method: `Q4KM` Overview: This is a GGUF Q4KM quantized version of Jaja-small-v1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
12
0

Jaja-small-v1-f16-GGUF

llama-cpp
12
0

TunnedLlama-3.1-8B_v2-f16-GGUF

NaNK
llama-cpp
12
0

FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-f16-GGUF

NaNK
llama-cpp
12
0

Qwen2.5-32b-Erudite-Writer-i1-Q3_K_L-GGUF

Repo: `roleplaiapp/Qwen2.5-32b-Erudite-Writer-i1-Q3KL-GGUF` Original Model: `Qwen2.5-32b-Erudite-Writer-i1` Quantized File: `Qwen2.5-32b-Erudite-Writer.i1-Q3KL.gguf` Quantization: `GGUF` Quantization Method: `Q3KL` Overview: This is a GGUF Q3KL quantized version of Qwen2.5-32b-Erudite-Writer-i1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
12
0

medicine-LLM-Q3_K_L-GGUF

llama-cpp
12
0

Dolphin3.0-Llama3.1-8B-Q8_0-GGUF

NaNK
llama-cpp
12
0

Dolphin3.0-Llama3.1-8B-Q4_K_M-GGUF

NaNK
llama-cpp
12
0

Wayfarer-12B-Q4_K_M-GGUF

NaNK
llama-cpp
11
1

DeepSeek-R1-Distill-Qwen-32B-Q4_K_S-GGUF

NaNK
llama-cpp
11
1

deepseek-r1-qwen-2.5-32B-ablated-Q4_K_S-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q4KS-GGUF` Original Model: `deepseek-r1-qwen-2.5-32B-ablated` Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q4KS.gguf` Quantization: `GGUF` Quantization Method: `Q4KS` Overview: This is a GGUF Q4KS quantized version of deepseek-r1-qwen-2.5-32B-ablated. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
11
1

QwQ-32B-Preview-Q6_K-GGUF

NaNK
llama-cpp
11
0

internlm3-8b-instruct-Q3_K_S-GGUF

NaNK
llama-cpp
11
0

ReaderLM-v2-Q6_K-GGUF

NaNK
llama-cpp
11
0

Wayfarer-12B-Q4_K_S-GGUF

NaNK
llama-cpp
11
0

AceInstruct-1.5B-Q3_K_M-GGUF

NaNK
llama-cpp
11
0

AceInstruct-1.5B-Q5_K_S-GGUF

NaNK
llama-cpp
11
0

Omni-Reasoner-2B-Q2_K-GGUF

NaNK
llama-cpp
11
0

DeepSeek-R1-Distill-Qwen-32B-Q3_K_L-GGUF

NaNK
llama-cpp
11
0

DeepSeek-R1-Distill-Llama-70B-Q3_K_L-GGUF

NaNK
llama-cpp
11
0

MN-12B-Mag-Mell-R1-Q3_K_S-GGUF

NaNK
llama-cpp
11
0

Midnight-Miqu-70B-v1.5-i1-Q2_K-GGUF

Repo: `roleplaiapp/Midnight-Miqu-70B-v1.5-i1-Q2K-GGUF` Original Model: `Midnight-Miqu-70B-v1.5-i1` Quantized File: `Midnight-Miqu-70B-v1.5.i1-Q2K.gguf` Quantization: `GGUF` Quantization Method: `Q2K` Overview: This is a GGUF Q2K quantized version of Midnight-Miqu-70B-v1.5-i1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
11
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-Q3_K_M-GGUF

llama
11
0

Llama3.2-doker-Q4_K_S-GGUF

llama-cpp
11
0

Jaja-small-v1-Q6_K-GGUF

Repo: `roleplaiapp/Jaja-small-v1-Q6K-GGUF` Original Model: `Jaja-small-v1` Quantized File: `Jaja-small-v1.Q6K.gguf` Quantization: `GGUF` Quantization Method: `Q6K` Overview: This is a GGUF Q6K quantized version of Jaja-small-v1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
11
0

Mistral-Small-24B-Instruct-2501-Q8_0-GGUF

NaNK
llama-cpp
11
0

TunnedLlama-3.1-8B_v2-Q2_K-GGUF

Repo: `roleplaiapp/TunnedLlama-3.1-8Bv2-Q2K-GGUF` Original Model: `TunnedLlama-3.1-8Bv2` Quantized File: `TunnedLlama-3.1-8Bv2.Q2K.gguf` Quantization: `GGUF` Quantization Method: `Q2K` Overview: This is a GGUF Q2K quantized version of TunnedLlama-3.1-8Bv2. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
11
0

Qwen2.5-7B-olm-v1.4-i1-Q3_K_S-GGUF

Repo: `roleplaiapp/Qwen2.5-7B-olm-v1.4-i1-Q3KS-GGUF` Original Model: `Qwen2.5-7B-olm-v1.4-i1` Quantized File: `Qwen2.5-7B-olm-v1.4.i1-Q3KS.gguf` Quantization: `GGUF` Quantization Method: `Q3KS` Overview: This is a GGUF Q3KS quantized version of Qwen2.5-7B-olm-v1.4-i1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
11
0

DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Q3_K_L-GGUF

NaNK
llama
11
0

deepseek-r1-qwen-2.5-32B-ablated-Q3_K_M-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q3KM-GGUF` Original Model: `deepseek-r1-qwen-2.5-32B-ablated` Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q3KM.gguf` Quantization: `GGUF` Quantization Method: `Q3KM` Overview: This is a GGUF Q3KM quantized version of deepseek-r1-qwen-2.5-32B-ablated. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
11
0

deepseek-r1-qwen-2.5-32B-ablated-f16-GGUF

NaNK
llama-cpp
11
0

cyberagent-DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf-Q4_K_S-GGUF

NaNK
llama-cpp
11
0

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q4_K_S-GGUF

NaNK
llama-cpp
11
0

L3.3-Nevoria-R1-70b-IQ3_M-GGUF

Repo: `roleplaiapp/L3.3-Nevoria-R1-70b-IQ3M-GGUF` Original Model: `L3.3-Nevoria-R1-70b` Quantized File: `L3.3-Nevoria-R1-70b-IQ3M.gguf` Quantization: `GGUF` Quantization Method: `IQ3M` Overview: This is a GGUF IQ3M quantized version of L3.3-Nevoria-R1-70b. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
11
0

Dolphin3.0-Llama3.1-8B-Q2_K-GGUF

NaNK
llama-cpp
11
0

Dolphin3.0-Llama3.1-8B-Q5_K_S-GGUF

NaNK
llama-cpp
11
0

Crumb-13B-Q3_K_M-GGUF

NaNK
llama-cpp
11
0

Llama-3.3-70B-Instruct-Q3_K_M-GGUF

NaNK
llama-cpp
10
0

SmallThinker-3B-Preview-Q5_K_M-GGUF

NaNK
llama-cpp
10
0

SmallThinker-3B-Preview-IQ4_XS-GGUF

NaNK
llama-cpp
10
0

AceInstruct-1.5B-Q6_K-GGUF

Repo: `roleplaiapp/AceInstruct-1.5B-Q6K-GGUF` Original Model: `AceInstruct-1.5B` Organization: `nvidia` Quantized File: `aceinstruct-1.5b-q6k.gguf` Quantization: `GGUF` Quantization Method: `Q6K` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q6K quantized version of AceInstruct-1.5B. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
10
0

14B-Qwen2.5-Kunou-v1-Q8_0-GGUF

NaNK
llama-cpp
10
0

Virtuoso-Lite-Q5_K_M-GGUF

llama-cpp
10
0

GRAG-R1-14B-SFT-DE-EXP-Q4_K_M-GGUF

NaNK
llama-cpp
10
0

GRAG-R1-14B-SFT-DE-EXP-IQ4_XS-GGUF

NaNK
llama-cpp
10
0

DialoGPT-medium-anand-Q4_K_M-GGUF

llama-cpp
10
0

Mistral-Small-24B-Instruct-2501-Q4_K_M-GGUF

Repo: `roleplaiapp/Mistral-Small-24B-Instruct-2501-Q4KM-GGUF` Original Model: `Mistral-Small-24B-Instruct-2501` Quantized File: `Mistral-Small-24B-Instruct-2501-Q4KM.gguf` Quantization: `GGUF` Quantization Method: `Q4KM` Overview: This is a GGUF Q4KM quantized version of Mistral-Small-24B-Instruct-2501. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
10
0

Mistral-Small-24B-Instruct-2501-Q4_K_S-GGUF

NaNK
llama-cpp
10
0

JapMed-SLERP-IQ4_XS-GGUF

llama-cpp
10
0

Qwen2.5-7B-olm-v1.4-i1-IQ3_M-GGUF

NaNK
llama-cpp
10
0

DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Q3_K_M-GGUF

NaNK
llama
10
0

deepseek-r1-qwen-2.5-32B-ablated-Q5_K_S-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q5KS-GGUF` Original Model: `deepseek-r1-qwen-2.5-32B-ablated` Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q5KS.gguf` Quantization: `GGUF` Quantization Method: `Q5KS` Overview: This is a GGUF Q5KS quantized version of deepseek-r1-qwen-2.5-32B-ablated. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
10
0

medicine-LLM-Q3_K_S-GGUF

llama-cpp
10
0

Pantheon-RP-Pure-1.6.2-22b-Small-Q4_K_S-GGUF

NaNK
llama-cpp
10
0

roleplai-13b-v1-uncensored

NaNK
llama
9
6

ALIA-40b-Q8_0-GGUF

NaNK
llama-cpp
9
3

Dria-Agent-a-3B-Q4_K_M-GGUF

NaNK
llama-cpp
9
2

SmallThinker-3B-Preview-IQ3_XXS-GGUF

NaNK
llama-cpp
9
0

QwQ-32B-Preview-Q3_K_S-GGUF

NaNK
llama-cpp
9
0

ReaderLM-v2-Q4_0-GGUF

NaNK
llama-cpp
9
0

Dria-Agent-a-3B-Q8_0-GGUF

NaNK
llama-cpp
9
0

AceInstruct-72B-Q3_K_L-GGUF

NaNK
llama-cpp
9
0

Wayfarer-12B-Q4_0-GGUF

NaNK
llama-cpp
9
0

Wayfarer-12B-Q5_0-GGUF

NaNK
llama-cpp
9
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q3_K_M-GGUF

NaNK
llama-cpp
9
0

AceInstruct-1.5B-Q5_K_M-GGUF

Repo: `roleplaiapp/AceInstruct-1.5B-Q5KM-GGUF` Original Model: `AceInstruct-1.5B` Organization: `nvidia` Quantized File: `aceinstruct-1.5b-q5km.gguf` Quantization: `GGUF` Quantization Method: `Q5KM` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q5KM quantized version of AceInstruct-1.5B. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
9
0

Omni-Reasoner-2B-Q6_K-GGUF

NaNK
llama-cpp
9
0

Omni-Reasoner-2B-Q5_0-GGUF

Repo: `roleplaiapp/Omni-Reasoner-2B-Q50-GGUF` Original Model: `Omni-Reasoner-o1` Organization: `prithivMLmods` Quantized File: `omni-reasoner-2b-q50.gguf` Quantization: `GGUF` Quantization Method: `Q50` Use Imatrix: `False` Split Model: `False` Overview: This is a GGUF Q50 quantized version of Omni-Reasoner-o1. Quantization by: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
9
0

DeepSeek-R1-Distill-Llama-8B-Q3_K_L-GGUF

NaNK
llama-cpp
9
0

DeepSeek-R1-Distill-Llama-8B-Q5_K_S-GGUF

NaNK
llama-cpp
9
0

Midnight-Miqu-70B-v1.5-i1-Q4_K_S-GGUF

NaNK
llama-cpp
9
0

DeepSeek-R1-Distill-Qwen-32B-Uncensored-Q3_K_L-GGUF

NaNK
llama-cpp
9
0

Phi-4-ReasoningRP-Q3_K_L-GGUF

llama-cpp
9
0

Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q4_K_M-GGUF

Repo: `roleplaiapp/Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q4KM-GGUF` Original Model: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored` Quantized File: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored.Q4KM.gguf` Quantization: `GGUF` Quantization Method: `Q4KM` Overview: This is a GGUF Q4KM quantized version of Qwen2.5-14B-DeepSeek-R1-1M-Uncensored. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
9
0

Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-IQ4_XS-GGUF

Repo: `roleplaiapp/Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-IQ4XS-GGUF` Original Model: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored` Quantized File: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored.IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of Qwen2.5-14B-DeepSeek-R1-1M-Uncensored. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
9
0

GRAG-R1-14B-SFT-DE-EXP-Q3_K_L-GGUF

NaNK
llama-cpp
9
0

Jaja-small-v1-Q5_K_M-GGUF

Repo: `roleplaiapp/Jaja-small-v1-Q5KM-GGUF` Original Model: `Jaja-small-v1` Quantized File: `Jaja-small-v1.Q5KM.gguf` Quantization: `GGUF` Quantization Method: `Q5KM` Overview: This is a GGUF Q5KM quantized version of Jaja-small-v1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
9
0

Jaja-small-v1-Q5_K_S-GGUF

Repo: `roleplaiapp/Jaja-small-v1-Q5KS-GGUF` Original Model: `Jaja-small-v1` Quantized File: `Jaja-small-v1.Q5KS.gguf` Quantization: `GGUF` Quantization Method: `Q5KS` Overview: This is a GGUF Q5KS quantized version of Jaja-small-v1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
9
0

Mistral-Small-24B-Instruct-2501-IQ3_M-GGUF

Repo: `roleplaiapp/Mistral-Small-24B-Instruct-2501-IQ3M-GGUF` Original Model: `Mistral-Small-24B-Instruct-2501` Quantized File: `Mistral-Small-24B-Instruct-2501-IQ3M.gguf` Quantization: `GGUF` Quantization Method: `IQ3M` Overview: This is a GGUF IQ3M quantized version of Mistral-Small-24B-Instruct-2501. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
9
0

Mistral-Small-24B-Instruct-2501-IQ4_XS-GGUF

Repo: `roleplaiapp/Mistral-Small-24B-Instruct-2501-IQ4XS-GGUF` Original Model: `Mistral-Small-24B-Instruct-2501` Quantized File: `Mistral-Small-24B-Instruct-2501-IQ4XS.gguf` Quantization: `GGUF` Quantization Method: `IQ4XS` Overview: This is a GGUF IQ4XS quantized version of Mistral-Small-24B-Instruct-2501. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
9
0

Reasoning-Llama-3.1-CoT-RE1-IQ4_XS-GGUF

llama
9
0

ArxivLlama-3.1-8B-Q3_K_M-GGUF

NaNK
arxivllama
9
0

ArxivLlama-3.1-8B-IQ4_XS-GGUF

NaNK
arxivllama
9
0

DeepSeek-R1-Distill-Alpaca-FineTuned-Q2_K-GGUF

llama-cpp
9
0

deepseek-r1-qwen-2.5-32B-ablated-Q3_K_S-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q3KS-GGUF` Original Model: `deepseek-r1-qwen-2.5-32B-ablated` Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q3KS.gguf` Quantization: `GGUF` Quantization Method: `Q3KS` Overview: This is a GGUF Q3KS quantized version of deepseek-r1-qwen-2.5-32B-ablated. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
9
0

Qwen2.5-7B-Instruct-1M-Q4_K_S-GGUF

NaNK
llama-cpp
9
0

Qwen2.5-14B-DeepSeek-R1-1M-Q2_K-GGUF

NaNK
llama-cpp
9
0

Llama-3-monika-ddlc-11.5b-v1-i1-Q2_K-GGUF

Repo: `roleplaiapp/Llama-3-monika-ddlc-11.5b-v1-i1-Q2K-GGUF` Original Model: `Llama-3-monika-ddlc-11.5b-v1-i1` Quantized File: `Llama-3-monika-ddlc-11.5b-v1.i1-Q2K.gguf` Quantization: `GGUF` Quantization Method: `Q2K` Overview: This is a GGUF Q2K quantized version of Llama-3-monika-ddlc-11.5b-v1-i1. Quantization by: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama
9
0

YuE-s1-7B-anneal-en-cot-Q3_K_M-GGUF

NaNK
llama-cpp
9
0

L3.3-Nevoria-R1-70b-Q4_K_M-GGUF

NaNK
llama-cpp
9
0

Pantheon-RP-Pure-1.6.2-22b-Small-Q5_K_M-GGUF

NaNK
llama-cpp
9
0

Cydonia-24B-v2a-Q3_K_M-GGUF

NaNK
llama-cpp
9
0

Cydonia-24B-v2a-Q4_K_M-GGUF

NaNK
llama-cpp
9
0

Dolphin3.0-Llama3.1-8B-Q3_K_S-GGUF

NaNK
llama-cpp
9
0

Confucius-o1-14B-f16-GGUF

NaNK
llama-cpp
9
0

plato-9b-Q5_K_M-GGUF

NaNK
llama-cpp
9
0

AceInstruct-1.5B-Q4_K_M-GGUF

NaNK
llama-cpp
8
1

Mistral-Small-24B-Instruct-2501-Q3_K_M-GGUF

Repo: `roleplaiapp/Mistral-Small-24B-Instruct-2501-Q3KM-GGUF`
Original Model: `Mistral-Small-24B-Instruct-2501`
Quantized File: `Mistral-Small-24B-Instruct-2501-Q3KM.gguf`
Quantization: `GGUF`
Quantization Method: `Q3KM`
Overview: This is a GGUF Q3KM quantized version of Mistral-Small-24B-Instruct-2501.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
1

phi-4-Q3_K_M-GGUF

NaNK
llama-cpp
8
0

Llama-3.3-70B-Instruct-Q3_K_L-GGUF

NaNK
llama-cpp
8
0

Llama-3.3-70B-Instruct-Q5_K_S-GGUF

NaNK
llama-cpp
8
0

Llama-3.3-70B-Instruct-Q5_K_M-GGUF

Repo: `roleplaiapp/Llama-3.3-70B-Instruct-Q5KM-GGUF`
Original Model: `Llama-3.3-70B-Instruct`
Organization: `meta-llama`
Quantized File: `llama-3.3-70b-instruct-q5km.gguf`
Quantization: `GGUF`
Quantization Method: `Q5KM`
Use Imatrix: `False`
Split Model: `False`
Overview: This is a GGUF Q5KM quantized version of Llama-3.3-70B-Instruct.
Quantization By: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
0

AceInstruct-72B-Q3_K_M-GGUF

NaNK
llama-cpp
8
0

Codestral-22B-v0.1-Q3_K_S-GGUF

NaNK
llama-cpp
8
0

AceInstruct-7B-Q3_K_M-GGUF

NaNK
llama-cpp
8
0

AceInstruct-7B-Q5_0-GGUF

Repo: `roleplaiapp/AceInstruct-7B-Q50-GGUF`
Original Model: `AceInstruct-7B`
Organization: `nvidia`
Quantized File: `aceinstruct-7b-q50.gguf`
Quantization: `GGUF`
Quantization Method: `Q50`
Use Imatrix: `False`
Split Model: `False`
Overview: This is a GGUF Q50 quantized version of AceInstruct-7B.
Quantization By: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
0

Omni-Reasoner-2B-Q8_0-GGUF

NaNK
llama-cpp
8
0

DeepSeek-R1-Distill-Qwen-1.5B-Q3_K_M-GGUF

NaNK
llama-cpp
8
0

DeepSeek-R1-Distill-Qwen-7B-Q5_0-GGUF

NaNK
llama-cpp
8
0

DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_S-GGUF

NaNK
llama-cpp
8
0

Qwen2.5-7B-Instruct-Uncensored-Q3_K_M-GGUF

NaNK
llama-cpp
8
0

Qwen2.5-7B-Instruct-Uncensored-Q6_K-GGUF

NaNK
llama-cpp
8
0

QwQ-32B-Preview-abliterated-Q3_K_L-GGUF

NaNK
llama-cpp
8
0

SILMA-Kashif-2B-Instruct-v1.0-i1-Q2_K-GGUF

Repo: `roleplaiapp/SILMA-Kashif-2B-Instruct-v1.0-i1-Q2K-GGUF`
Original Model: `SILMA-Kashif-2B-Instruct-v1.0-i1`
Quantized File: `SILMA-Kashif-2B-Instruct-v1.0.i1-Q2K.gguf`
Quantization: `GGUF`
Quantization Method: `Q2K`
Overview: This is a GGUF Q2K quantized version of SILMA-Kashif-2B-Instruct-v1.0-i1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
0

SILMA-Kashif-2B-Instruct-v1.0-i1-Q6_K-GGUF

NaNK
llama-cpp
8
0

SILMA-Kashif-2B-Instruct-v1.0-i1-IQ4_XS-GGUF

NaNK
llama-cpp
8
0

14B-Qwen2.5-Kunou-v1-Q6_K-GGUF

NaNK
llama-cpp
8
0

q-2.5-deepseek-r1-veltha-v0.3-Q3_K_M-GGUF

llama-cpp
8
0

Lascivious-LLaMa-70B-Q4_K_S-GGUF

NaNK
llama
8
0

Virtuoso-Lite-Q2_K-GGUF

llama-cpp
8
0

Virtuoso-Lite-Q8_0-GGUF

llama-cpp
8
0

Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q2_K-GGUF

NaNK
llama-cpp
8
0

DS-R1-Distill-Q2.5-14B-Harmony_V0.1-Q8_0-GGUF

Repo: `roleplaiapp/DS-R1-Distill-Q2.5-14B-HarmonyV0.1-Q80-GGUF`
Original Model: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1`
Quantized File: `DS-R1-Distill-Q2.5-14B-HarmonyV0.1.Q80.gguf`
Quantization: `GGUF`
Quantization Method: `Q80`
Overview: This is a GGUF Q80 quantized version of DS-R1-Distill-Q2.5-14B-HarmonyV0.1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
0

MN-12B-Mimicore-Orochi-Q3_K_S-GGUF

NaNK
llama-cpp
8
0

Llama3.2-doker-Q5_K_M-GGUF

llama-cpp
8
0

Jaja-small-v1-Q2_K-GGUF

Repo: `roleplaiapp/Jaja-small-v1-Q2K-GGUF`
Original Model: `Jaja-small-v1`
Quantized File: `Jaja-small-v1.Q2K.gguf`
Quantization: `GGUF`
Quantization Method: `Q2K`
Overview: This is a GGUF Q2K quantized version of Jaja-small-v1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
8
0

Slush-Sunfall-Rocinante-GGLD-12B-IQ4_XS-GGUF

NaNK
llama-cpp
8
0

deepseek-r1-qwen-2.5-32B-ablated-Q2_K-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q2K-GGUF`
Original Model: `deepseek-r1-qwen-2.5-32B-ablated`
Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q2K.gguf`
Quantization: `GGUF`
Quantization Method: `Q2K`
Overview: This is a GGUF Q2K quantized version of deepseek-r1-qwen-2.5-32B-ablated.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
0

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q2_K-GGUF

NaNK
llama-cpp
8
0

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q4_K_M-GGUF

NaNK
llama-cpp
8
0

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q5_K_S-GGUF

NaNK
llama-cpp
8
0

FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-Q5_K_S-GGUF

Repo: `roleplaiapp/FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-Q5KS-GGUF`
Original Model: `FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview`
Quantized File: `FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-Q5KS.gguf`
Quantization: `GGUF`
Quantization Method: `Q5KS`
Overview: This is a GGUF Q5KS quantized version of FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
0

FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-IQ4_XS-GGUF

NaNK
llama-cpp
8
0

Qwen2.5-32B-DeepSeek-R1-Instruct-i1-IQ3_XS-GGUF

Repo: `roleplaiapp/Qwen2.5-32B-DeepSeek-R1-Instruct-i1-IQ3XS-GGUF`
Original Model: `Qwen2.5-32B-DeepSeek-R1-Instruct-i1`
Quantized File: `Qwen2.5-32B-DeepSeek-R1-Instruct.i1-IQ3XS.gguf`
Quantization: `GGUF`
Quantization Method: `IQ3XS`
Overview: This is a GGUF IQ3XS quantized version of Qwen2.5-32B-DeepSeek-R1-Instruct-i1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
8
0

YuE-s1-7B-anneal-en-cot-Q8_0-GGUF

NaNK
llama-cpp
8
0

RuadaptQwen2.5-32B-Pro-Beta-Q8_0-GGUF

NaNK
llama-cpp
8
0

Cydonia-24B-v2a-Q6_K-GGUF

NaNK
llama-cpp
8
0

Gemma-The-Writer-N-Restless-Quill-10B-Uncensored-Q8_0-GGUF

NaNK
llama-cpp
8
0

Confucius-o1-14B-Q8_0-GGUF

NaNK
llama-cpp
8
0

DeepSeek-R1-Distill-Qwen-7B-Q3_K_S-GGUF

NaNK
llama-cpp
7
1

SmallThinker-3B-Preview-Q2_K-GGUF

NaNK
llama-cpp
7
0

SmallThinker-3B-Preview-Q3_K_M-GGUF

NaNK
llama-cpp
7
0

QwQ-32B-Preview-Q5_K_S-GGUF

NaNK
llama-cpp
7
0

internlm3-8b-instruct-Q3_K_L-GGUF

NaNK
llama-cpp
7
0

ReaderLM-v2-Q5_K_S-GGUF

NaNK
llama-cpp
7
0

Dria-Agent-a-7B-Q8_0-GGUF

NaNK
llama-cpp
7
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q3_K_L-GGUF

NaNK
llama-cpp
7
0

AceInstruct-72B-Q8_0-GGUF

NaNK
llama-cpp
7
0

AceInstruct-7B-Q6_K-GGUF

Repo: `roleplaiapp/AceInstruct-7B-Q6K-GGUF`
Original Model: `AceInstruct-7B`
Organization: `nvidia`
Quantized File: `aceinstruct-7b-q6k.gguf`
Quantization: `GGUF`
Quantization Method: `Q6K`
Use Imatrix: `False`
Split Model: `False`
Overview: This is a GGUF Q6K quantized version of AceInstruct-7B.
Quantization By: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
7
0

Llama-3.1-Nemotron-70B-Instruct-HF-Q6_K-GGUF

NaNK
llama-cpp
7
0

DeepSeek-R1-Distill-Llama-8B-Q4_K_S-GGUF

NaNK
llama-cpp
7
0

DeepSeek-R1-Distill-Llama-8B-Q5_K_M-GGUF

NaNK
llama-cpp
7
0

DeepSeek-R1-Distill-Qwen-7B-Q8_0-GGUF

NaNK
llama-cpp
7
0

ALIA-40b-Q6_K-GGUF

NaNK
llama-cpp
7
0

MN-12B-Mag-Mell-R1-IQ4_XS-GGUF

Repo: `roleplaiapp/MN-12B-Mag-Mell-R1-IQ4XS-GGUF`
Original Model: `MN-12B-Mag-Mell-R1`
Quantized File: `MN-12B-Mag-Mell-R1.IQ4XS.gguf`
Quantization: `GGUF`
Quantization Method: `IQ4XS`
Overview: This is a GGUF IQ4XS quantized version of MN-12B-Mag-Mell-R1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
7
0

Virtuoso-Lite-Q4_K_S-GGUF

llama-cpp
7
0

Pathfinder-RP-12B-RU-Q8_0-GGUF

NaNK
llama-cpp
7
0

Pathfinder-RP-12B-RU-IQ4_XS-GGUF

NaNK
llama-cpp
7
0

Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q5_K_M-GGUF

Repo: `roleplaiapp/Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q5KM-GGUF`
Original Model: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored`
Quantized File: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored.Q5KM.gguf`
Quantization: `GGUF`
Quantization Method: `Q5KM`
Overview: This is a GGUF Q5KM quantized version of Qwen2.5-14B-DeepSeek-R1-1M-Uncensored.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
7
0

MN-12B-Mimicore-Orochi-Q3_K_L-GGUF

NaNK
llama-cpp
7
0

GRAG-R1-14B-SFT-DE-EXP-Q8_0-GGUF

NaNK
llama-cpp
7
0

GRAG-R1-14B-SFT-DE-EXP-Q3_K_S-GGUF

NaNK
llama-cpp
7
0

DialoGPT-medium-anand-Q6_K-GGUF

llama-cpp
7
0

DialoGPT-medium-anand-Q3_K_L-GGUF

llama-cpp
7
0

DialoGPT-medium-anand-f16-GGUF

Repo: `roleplaiapp/DialoGPT-medium-anand-f16-GGUF`
Original Model: `DialoGPT-medium-anand`
Quantized File: `DialoGPT-medium-anand.f16.gguf`
Quantization: `GGUF`
Quantization Method: `f16`
Overview: This is a GGUF f16 quantized version of DialoGPT-medium-anand.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
7
0

Jaja-small-v1-Q3_K_S-GGUF

Repo: `roleplaiapp/Jaja-small-v1-Q3KS-GGUF`
Original Model: `Jaja-small-v1`
Quantized File: `Jaja-small-v1.Q3KS.gguf`
Quantization: `GGUF`
Quantization Method: `Q3KS`
Overview: This is a GGUF Q3KS quantized version of Jaja-small-v1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
7
0

Mistral-Small-24B-Instruct-2501-Q3_K_L-GGUF

NaNK
llama-cpp
7
0

Reasoning-Llama-3.1-CoT-RE1-Q6_K-GGUF

Repo: `roleplaiapp/Reasoning-Llama-3.1-CoT-RE1-Q6K-GGUF`
Original Model: `Reasoning-Llama-3.1-CoT-RE1`
Quantized File: `Reasoning-Llama-3.1-CoT-RE1.Q6K.gguf`
Quantization: `GGUF`
Quantization Method: `Q6K`
Overview: This is a GGUF Q6K quantized version of Reasoning-Llama-3.1-CoT-RE1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama
7
0

DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q5_K_M-GGUF

Repo: `roleplaiapp/DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B-Q5KM-GGUF`
Original Model: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B`
Quantized File: `DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B.Q5KM.gguf`
Quantization: `GGUF`
Quantization Method: `Q5KM`
Overview: This is a GGUF Q5KM quantized version of DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama
7
0

Llama-3-monika-ddlc-11.5b-v1-i1-IQ3_S-GGUF

NaNK
llama
7
0

L3.3-Nevoria-R1-70b-Q4_K_S-GGUF

NaNK
llama-cpp
7
0

medicine-LLM-Q5_K_S-GGUF

llama-cpp
7
0

DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q4_K_M-GGUF

NaNK
llama-cpp
7
0

DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf-Q8_0-GGUF

NaNK
llama-cpp
7
0

Confucius-o1-14B-Q2_K-GGUF

NaNK
llama-cpp
7
0

plato-9b-Q4_K_S-GGUF

Repo: `roleplaiapp/plato-9b-Q4KS-GGUF`
Original Model: `plato-9b`
Quantized File: `plato-9b.Q4KS.gguf`
Quantization: `GGUF`
Quantization Method: `Q4KS`
Overview: This is a GGUF Q4KS quantized version of plato-9b.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
7
0

Janus-Pro-7B-LM-Q4_K_S-GGUF

NaNK
llama-cpp
6
3

SmallThinker-3B-Preview-Q3_K_L-GGUF

NaNK
llama-cpp
6
0

QwQ-32B-Preview-Q3_K_L-GGUF

Repo: `roleplaiapp/QwQ-32B-Preview-Q3KL-GGUF`
Original Model: `QwQ-32B-Preview`
Organization: `Qwen`
Quantized File: `qwq-32b-preview-q3kl.gguf`
Quantization: `GGUF`
Quantization Method: `Q3KL`
Use Imatrix: `False`
Split Model: `False`
Overview: This is a GGUF Q3KL quantized version of QwQ-32B-Preview.
Quantization By: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
6
0

internlm3-8b-instruct-Q5_K_S-GGUF

NaNK
llama-cpp
6
0

internlm3-8b-instruct-Q5_K_M-GGUF

NaNK
llama-cpp
6
0

internlm3-8b-instruct-Q8_0-GGUF

NaNK
llama-cpp
6
0

Dria-Agent-a-3B-Q2_K-GGUF

NaNK
llama-cpp
6
0

Dria-Agent-a-3B-Q5_K_M-GGUF

NaNK
llama-cpp
6
0

Wayfarer-12B-Q3_K_L-GGUF

Repo: `roleplaiapp/Wayfarer-12B-Q3KL-GGUF`
Original Model: `Wayfarer-12B`
Organization: `LatitudeGames`
Quantized File: `wayfarer-12b-q3kl.gguf`
Quantization: `GGUF`
Quantization Method: `Q3KL`
Use Imatrix: `False`
Split Model: `False`
Overview: This is a GGUF Q3KL quantized version of Wayfarer-12B.
Quantization By: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
6
0

Wayfarer-12B-Q5_K_S-GGUF

NaNK
llama-cpp
6
0

Wayfarer-12B-Q6_K-GGUF

NaNK
llama-cpp
6
0

Codestral-22B-v0.1-Q3_K_L-GGUF

NaNK
llama-cpp
6
0

AceInstruct-1.5B-Q3_K_L-GGUF

NaNK
llama-cpp
6
0

Omni-Reasoner-2B-Q3_K_L-GGUF

Repo: `roleplaiapp/Omni-Reasoner-2B-Q3KL-GGUF`
Original Model: `Omni-Reasoner-o1`
Organization: `prithivMLmods`
Quantized File: `omni-reasoner-2b-q3kl.gguf`
Quantization: `GGUF`
Quantization Method: `Q3KL`
Use Imatrix: `False`
Split Model: `False`
Overview: This is a GGUF Q3KL quantized version of Omni-Reasoner-o1.
Quantization By: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
6
0

Omni-Reasoner-2B-Q5_K_S-GGUF

NaNK
llama-cpp
6
0

DeepSeek-R1-Distill-Qwen-32B-Q5_0-GGUF

NaNK
llama-cpp
6
0

DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M-GGUF

NaNK
llama-cpp
6
0

Qwen2.5-7B-Instruct-Uncensored-Q3_K_L-GGUF

NaNK
llama-cpp
6
0

Qwen2.5-7B-Instruct-Uncensored-Q3_K_S-GGUF

NaNK
llama-cpp
6
0

QwQ-32B-Preview-abliterated-Q2_K-GGUF

NaNK
llama-cpp
6
0

Midnight-Miqu-70B-v1.5-i1-IQ3_M-GGUF

NaNK
llama-cpp
6
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ3_M-GGUF

llama
6
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ3_S-GGUF

llama
6
0

Phi-4-ReasoningRP-Q8_0-GGUF

llama-cpp
6
0

Virtuoso-Lite-Q6_K-GGUF

llama-cpp
6
0

Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q4_K_S-GGUF

Repo: `roleplaiapp/Qwen2.5-14B-DeepSeek-R1-1M-Uncensored-Q4KS-GGUF`
Original Model: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored`
Quantized File: `Qwen2.5-14B-DeepSeek-R1-1M-Uncensored.Q4KS.gguf`
Quantization: `GGUF`
Quantization Method: `Q4KS`
Overview: This is a GGUF Q4KS quantized version of Qwen2.5-14B-DeepSeek-R1-1M-Uncensored.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
6
0

Minerva-14b-V0.1-i1-Q4_K_M-GGUF

NaNK
llama-cpp
6
0

Llama3.2-doker-Q8_0-GGUF

llama-cpp
6
0

Llama3.2-doker-f16-GGUF

llama-cpp
6
0

DialoGPT-medium-anand-IQ4_XS-GGUF

Repo: `roleplaiapp/DialoGPT-medium-anand-IQ4XS-GGUF`
Original Model: `DialoGPT-medium-anand`
Quantized File: `DialoGPT-medium-anand.IQ4XS.gguf`
Quantization: `GGUF`
Quantization Method: `IQ4XS`
Overview: This is a GGUF IQ4XS quantized version of DialoGPT-medium-anand.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama-cpp
6
0

Mistral-Small-24B-Instruct-2501-f16-GGUF

Repo: `roleplaiapp/Mistral-Small-24B-Instruct-2501-f16-GGUF`
Original Model: `Mistral-Small-24B-Instruct-2501`
Quantized File: `Mistral-Small-24B-Instruct-2501-f16.gguf`
Quantization: `GGUF`
Quantization Method: `f16`
Overview: This is a GGUF f16 quantized version of Mistral-Small-24B-Instruct-2501.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
6
0

Reasoning-Llama-3.1-CoT-RE1-Q2_K-GGUF

Repo: `roleplaiapp/Reasoning-Llama-3.1-CoT-RE1-Q2K-GGUF`
Original Model: `Reasoning-Llama-3.1-CoT-RE1`
Quantized File: `Reasoning-Llama-3.1-CoT-RE1.Q2K.gguf`
Quantization: `GGUF`
Quantization Method: `Q2K`
Overview: This is a GGUF Q2K quantized version of Reasoning-Llama-3.1-CoT-RE1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama
6
0

Qwen2.5-Coder-14B-Instruct-Uncensored-Q8_0-GGUF

NaNK
llama-cpp
6
0

TunnedLlama-3.1-8B_v2-Q3_K_L-GGUF

Repo: `roleplaiapp/TunnedLlama-3.1-8Bv2-Q3KL-GGUF`
Original Model: `TunnedLlama-3.1-8Bv2`
Quantized File: `TunnedLlama-3.1-8Bv2.Q3KL.gguf`
Quantization: `GGUF`
Quantization Method: `Q3KL`
Overview: This is a GGUF Q3KL quantized version of TunnedLlama-3.1-8Bv2.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
6
0

Qwen2.5-1.5B-Open-R1-Distill-Q3_K_L-GGUF

NaNK
llama-cpp
6
0

Qwen2.5-7B-olm-v1.4-i1-IQ3_S-GGUF

NaNK
llama-cpp
6
0

deepseek-r1-qwen-2.5-32B-ablated-Q8_0-GGUF

Repo: `roleplaiapp/deepseek-r1-qwen-2.5-32B-ablated-Q80-GGUF`
Original Model: `deepseek-r1-qwen-2.5-32B-ablated`
Quantized File: `deepseek-r1-qwen-2.5-32B-ablated-Q80.gguf`
Quantization: `GGUF`
Quantization Method: `Q80`
Overview: This is a GGUF Q80 quantized version of deepseek-r1-qwen-2.5-32B-ablated.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
6
0

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q6_K-GGUF

NaNK
llama-cpp
6
0

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q8_0-GGUF

NaNK
llama-cpp
6
0

FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-Q3_K_L-GGUF

NaNK
llama-cpp
6
0

Qwen2.5-7B-Instruct-1M-IQ3_XS-GGUF

NaNK
llama-cpp
6
0

DeepSky-T100-Q6_K-GGUF

llama-cpp
6
0

Llama-3-monika-ddlc-11.5b-v1-i1-Q4_K_M-GGUF

NaNK
llama
6
0

Llama-3-monika-ddlc-11.5b-v1-i1-IQ4_XS-GGUF

NaNK
llama
6
0

R1-Qwen2.5-32B-Instruct-1k-Q3_K_S-GGUF

NaNK
llama-cpp
6
0

YuE-s1-7B-anneal-en-cot-Q3_K_S-GGUF

NaNK
llama-cpp
6
0

L3.3-Nevoria-R1-70b-Q8_0-GGUF

NaNK
llama-cpp
6
0

Pantheon-RP-Pure-1.6.2-22b-Small-Q8_0-GGUF

NaNK
llama-cpp
6
0

DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf-Q8_0-GGUF

NaNK
llama-cpp
6
0

saiga_nemo_12b_gguf-Q3_K_S-GGUF

NaNK
llama-cpp
6
0

saiga_nemo_12b_gguf-Q4_K_M-GGUF

NaNK
llama-cpp
6
0

Confucius-o1-14B-IQ3_M-GGUF

NaNK
llama-cpp
6
0

DeepSeek-R1-Distill-sthenno-14b-0121-Q8_0-GGUF

NaNK
llama-cpp
6
0

SJT-Moe2x7.5B-IQ4_XS-GGUF

NaNK
llama-cpp
6
0

QwQ-32B-Preview-Q4_K_M-GGUF

NaNK
llama-cpp
5
1

Dria-Agent-a-7B-Q4_K_M-GGUF

NaNK
llama-cpp
5
1

phi-4-4.0bpw-exl2

NaNK
exllama
5
0

SmallThinker-3B-Preview-Q5_K_S-GGUF

NaNK
llama-cpp
5
0

QwQ-32B-Preview-Q4_K_S-GGUF

NaNK
llama-cpp
5
0

Dria-Agent-a-7B-Q4_0-GGUF

NaNK
llama-cpp
5
0

Dria-Agent-a-7B-Q5_0-GGUF

NaNK
llama-cpp
5
0

AceInstruct-72B-Q5_0-GGUF

Repo: `roleplaiapp/AceInstruct-72B-Q50-GGUF`
Original Model: `AceInstruct-72B`
Organization: `nvidia`
Quantized File: `aceinstruct-72b-q50.gguf`
Quantization: `GGUF`
Quantization Method: `Q50`
Use Imatrix: `False`
Split Model: `True`
Overview: This is a GGUF Q50 quantized version of AceInstruct-72B.
Quantization By: I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

NaNK
llama-cpp
5
0
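
The card above is one of the few with `Split Model: True`, meaning the GGUF is sharded into several files. A minimal sketch, assuming the usual llama.cpp shard naming (`*-00001-of-0000N.gguf`; the actual filenames in the repo may differ): download the whole repo and load the first shard, and llama.cpp picks up the remaining shards from the same directory:

```python
from pathlib import Path

from huggingface_hub import snapshot_download
from llama_cpp import Llama

# Grab every shard in the repo at once.
local_dir = snapshot_download("roleplaiapp/AceInstruct-72B-Q50-GGUF")

# Point llama.cpp at the first shard; the rest are loaded automatically
# as long as they sit beside it in the same directory.
first_shard = sorted(Path(local_dir).glob("*-00001-of-*.gguf"))[0]
llm = Llama(model_path=str(first_shard), n_ctx=2048)
```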

Llama-3.1-Nemotron-70B-Instruct-HF-Q4_K_S-GGUF

NaNK
llama-cpp
5
0

DeepSeek-R1-Distill-Qwen-32B-Q3_K_M-GGUF

NaNK
llama-cpp
5
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-Q5_K_M-GGUF

llama
5
0

DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ4_XS-GGUF

llama
5
0

Reasoning-Llama-3.1-CoT-RE1-i1-Q2_K-GGUF

Repo: `roleplaiapp/Reasoning-Llama-3.1-CoT-RE1-i1-Q2K-GGUF`
Original Model: `Reasoning-Llama-3.1-CoT-RE1-i1`
Quantized File: `Reasoning-Llama-3.1-CoT-RE1.i1-Q2K.gguf`
Quantization: `GGUF`
Quantization Method: `Q2K`
Overview: This is a GGUF Q2K quantized version of Reasoning-Llama-3.1-CoT-RE1-i1.
Quantization By: I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

llama
5
0

Virtuoso-Lite-Q3_K_L-GGUF

llama-cpp
5
0

Pathfinder-RP-12B-RU-Q5_K_S-GGUF

NaNK
llama-cpp
5
0

MN-12B-Mimicore-Orochi-Q2_K-GGUF

NaNK
llama-cpp
5
0