Mistral-Small-3.2-24B-Instruct-2506-awq-sym

Name: Mistral-Small-3.2-24B-Instruct-2506-awq-sym
Author: jeffcookio

4.3K

—

jeffcookio

Other

OTHER

24B params

New

4K downloads

Early-stage

Edge AI:

Mobile

Laptop

Server

54GB+ RAM

Mobile

Laptop

Server

Quick Summary

Created with `llm-compressor`'s latest changes, quantized on a GH200, works well for me with vLLM's `main` branch on my RTX 3090Ti as of 2025-07-01.

Mobile

4-6GB RAM

Laptop

16GB RAM

Server

GPU

Minimum Recommended

23GB+ RAM

Production-ready deployment in minutes

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

One-click model deployment

Easiest Setup

Run models in the cloud with simple API. No DevOps required.

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.