PrunedHub-GPT-OSS-20B-27x-Zerobias

38
llama-cpp
by
GOBA-AI-Labs
Language Model
OTHER
20B params
New
38 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
45GB+ RAM
Mobile
Laptop
Server
Quick Summary

AI model with specialized capabilities.

Device Compatibility

Mobile
4-6GB RAM
Laptop
16GB RAM
Server
GPU
Minimum Recommended
19GB+ RAM

Code Examples

Usagebash
llama-server -m PrunedHub-GPT-OSS-20B-27x-Zerobias-Q4_K_M.gguf --port 8090 -ngl 99 -c 4096
moe-streambash
# CLI inference
moe-stream PrunedHub-GPT-OSS-20B-27x-Zerobias-Q4_K_M.gguf 512 \
  --prompt "Explain quantum computing" --stream

# OpenAI-compatible HTTP server
moe-stream-server --model PrunedHub-GPT-OSS-20B-27x-Zerobias-Q4_K_M.gguf --port 11434

Deploy This Model

Production-ready deployment in minutes

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Replicate

One-click model deployment

Easiest Setup

Run models in the cloud with simple API. No DevOps required.

Deploy Now

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.