Karsh-CAI
Qwen2.5-32B-AGI-Q4_K_M-GGUF
Peach-9B-8k-Roleplay-Q5_K_M-GGUF
Better-Qwen2-13B-Multilingual-RP-250
Mistral-Nemo-Redstone-GGUF
Peach-9B-8k-Roleplay-Q8_0-GGUF
Qwen2.5-32B-AGI-Q6_K-GGUF
Kas1o/Qwen2.5-32B-AGI-Q6_K-GGUF
This model was converted to GGUF format from `Kas1o/Qwen2.5-32B-AGI` using llama.cpp via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Use with llama.cpp
Install llama.cpp through brew (works on Mac and Linux). Note: you can also use this checkpoint directly through the usage steps listed in the llama.cpp repo.

Step 1: Clone llama.cpp from GitHub.
Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag along with other hardware-specific flags (for example, `LLAMA_CUDA=1` for Nvidia GPUs on Linux).
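A minimal sketch of the brew-based route above, using the `--hf-repo`/`--hf-file` flags of `llama-cli`; the exact `.gguf` filename below is an assumption, so substitute the actual filename from the repo's file list:

```bash
# Install llama.cpp through Homebrew (macOS and Linux)
brew install llama.cpp

# Stream the quantized weights from the Hub and run a prompt.
# The --hf-file name is a guess; check the repo for the real GGUF filename.
llama-cli --hf-repo Kas1o/Qwen2.5-32B-AGI-Q6_K-GGUF \
  --hf-file qwen2.5-32b-agi-q6_k.gguf \
  -p "The meaning to life and the universe is"
```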
llama3-8B-cn-rochat-v1-Q5_K_M-GGUF
Qwen2.5-0.5B-Instruct-Thinking-Q8_0-GGUF
OR-7B-Q5_K_M-GGUF
Qwen2.5-14B-Instruct-1M-abliterated-Q8_0-GGUF
SuZhiDiXia-7B
Mistral-Small-24B-Instruct-2501-Q8_0-GGUF
Karsh-CAI/Mistral-Small-24B-Instruct-2501-Q8_0-GGUF
This model was converted to GGUF format from `mistralai/Mistral-Small-24B-Instruct-2501` using llama.cpp via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Use with llama.cpp
Install llama.cpp through brew (works on Mac and Linux). Note: you can also use this checkpoint directly through the usage steps listed in the llama.cpp repo.

Step 1: Clone llama.cpp from GitHub.
Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag along with other hardware-specific flags (for example, `LLAMA_CUDA=1` for Nvidia GPUs on Linux).
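For the build-from-source route in Steps 1-2, a hedged sketch; the Makefile-based `LLAMA_CURL=1 make` invocation assumes an older llama.cpp checkout (newer trees build with CMake), and the serving step with its `.gguf` filename is illustrative rather than part of the card:

```bash
# Step 1: clone llama.cpp from GitHub
git clone https://github.com/ggerganov/llama.cpp

# Step 2: build with CURL support so the binaries can fetch models from the Hub
# (add hardware-specific flags such as LLAMA_CUDA=1 for Nvidia GPUs on Linux)
cd llama.cpp && LLAMA_CURL=1 make

# Illustrative serving step: the --hf-file name is assumed; substitute the
# actual GGUF filename from the repo's file list
./llama-server --hf-repo Karsh-CAI/Mistral-Small-24B-Instruct-2501-Q8_0-GGUF \
  --hf-file mistral-small-24b-instruct-2501-q8_0.gguf \
  -c 2048
```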