mit-han-lab
svdq-int4-flux.1-fill-dev
This repository has been deprecated and will be hidden in December 2025. Please use https://huggingface.co/nunchaku-tech/nunchaku-flux.1-fill-dev.

The FLUX.1 [dev] Model is licensed by Black Forest Labs Inc. under the FLUX.1 [dev] Non-Commercial License. Copyright Black Forest Labs Inc. IN NO EVENT SHALL BLACK FOREST LABS INC. BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.
nunchaku-flux.1-kontext-dev
This repository has been migrated to https://huggingface.co/nunchaku-tech/nunchaku-flux.1-kontext-dev and will be hidden in December 2025.

This repository contains Nunchaku-quantized versions of FLUX.1-Kontext-dev, capable of editing images based on text instructions. It is optimized for efficient inference with minimal loss in output quality.

- Developed by: Nunchaku Team
- Model type: image-to-image
- License: flux-1-dev-non-commercial-license
- Quantized from model: FLUX.1-Kontext-dev
- `svdq-int4r32-flux.1-kontext-dev.safetensors`: SVDQuant-quantized INT4 FLUX.1-Kontext-dev model, for non-Blackwell GPUs (pre-50-series).
- `svdq-fp4r32-flux.1-kontext-dev.safetensors`: SVDQuant-quantized NVFP4 FLUX.1-Kontext-dev model, for Blackwell GPUs (50-series).
- Inference Engine: nunchaku
- Quantization Library: deepcompressor
- Paper: SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Demo: svdquant.mit.edu
- Diffusers Usage: See flux.1-kontext-dev.py; a hedged loading sketch also follows below. Check our tutorial for more advanced usage.
- ComfyUI Usage: See nunchaku-flux.1-kontext-dev.json.

The FLUX.1 [dev] Model is licensed by Black Forest Labs Inc. under the FLUX.1 [dev] Non-Commercial License. Copyright Black Forest Labs Inc. IN NO EVENT SHALL BLACK FOREST LABS INC. BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.
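The referenced flux.1-kontext-dev.py is not reproduced here. The following is a minimal sketch of how the quantized transformer might be wired into diffusers, assuming nunchaku exposes `NunchakuFluxTransformer2dModel` and diffusers provides `FluxKontextPipeline`; repo ids, file names, and the prompt are illustrative.

```python
# Hedged sketch only -- not the repository's flux.1-kontext-dev.py.
# Assumes: nunchaku provides NunchakuFluxTransformer2dModel, diffusers provides FluxKontextPipeline.
import torch
from diffusers import FluxKontextPipeline
from diffusers.utils import load_image
from nunchaku import NunchakuFluxTransformer2dModel

# INT4 checkpoint for pre-Blackwell GPUs; swap in the fp4r32 file on 50-series cards.
transformer = NunchakuFluxTransformer2dModel.from_pretrained(
    "mit-han-lab/nunchaku-flux.1-kontext-dev"  # illustrative repo id; see the migration notice above
)
pipeline = FluxKontextPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-Kontext-dev", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

image = load_image("input.png")  # the image to edit
edited = pipeline(image=image, prompt="Make the sky look like a sunset", guidance_scale=2.5).images[0]
edited.save("kontext-edit.png")
```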
dc-ae-f32c32-sana-1.0
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

Figure 1: We address the reconstruction accuracy drop of high spatial-compression autoencoders.
Figure 2: DC-AE delivers significant training and inference speedup without performance drop.
Figure 3: DC-AE enables efficient text-to-image generation on a laptop.

We present Deep Compression Autoencoder (DC-AE), a new family of autoencoder models for accelerating high-resolution diffusion models. Existing autoencoders deliver impressive results at moderate spatial compression ratios (e.g., 8x) but fail to maintain satisfactory reconstruction accuracy at high spatial compression ratios (e.g., 64x). We address this challenge with two key techniques: (1) Residual Autoencoding, where we design our models to learn residuals based on space-to-channel transformed features to alleviate the optimization difficulty of high spatial-compression autoencoders; and (2) Decoupled High-Resolution Adaptation, an efficient three-phase training strategy that mitigates the generalization penalty of high spatial-compression autoencoders. With these designs, we raise the autoencoder's spatial compression ratio to 128 while maintaining reconstruction quality. Applying DC-AE to latent diffusion models, we achieve significant speedup without an accuracy drop. For example, on ImageNet 512x512, DC-AE provides 19.1x inference speedup and 17.9x training speedup on an H100 GPU for UViT-H while achieving a better FID than the widely used SD-VAE-f8 autoencoder.

If DC-AE is useful or relevant to your research, please kindly recognize our contributions by citing our papers:
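The `-diffusers` checkpoints in this listing (e.g., dc-ae-f32c32-sana-1.0-diffusers) are meant to load through diffusers. Below is a minimal reconstruction sketch, assuming diffusers' `AutoencoderDC` class with `.encode(...).latent` and `.decode(...).sample`; the checkpoint name, input file, and preprocessing are illustrative, not the official DC-AE script.

```python
# Minimal DC-AE round-trip sketch, assuming the diffusers AutoencoderDC integration.
import torch
from diffusers import AutoencoderDC
from PIL import Image
from torchvision.transforms.functional import to_pil_image, to_tensor

device = "cuda"
dc_ae = AutoencoderDC.from_pretrained(
    "mit-han-lab/dc-ae-f32c32-sana-1.0-diffusers", torch_dtype=torch.float32
).to(device)

# Any RGB image; 512x512 keeps both sides divisible by the 32x spatial compression factor.
image = Image.open("input.png").convert("RGB").resize((512, 512))
x = to_tensor(image).unsqueeze(0).to(device) * 2.0 - 1.0  # scale pixels to [-1, 1]

with torch.no_grad():
    latent = dc_ae.encode(x).latent      # f32c32: 32x downsampling per spatial dim, 32 latent channels
    recon = dc_ae.decode(latent).sample  # decode the latent back to image space

to_pil_image((recon[0].clamp(-1.0, 1.0) + 1.0) / 2.0).save("dc_ae_reconstruction.png")
```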
nunchaku-flux.1-dev
svdq-int4-flux.1-schnell
StreamingVLM
dc-ae-f64c128-in-1.0-diffusers
dc-ae-f32c32-mix-1.0
dc-ae-f32c32-sana-1.1-diffusers
dc-ae-f32c32-in-1.0
nunchaku-flux.1-fill-dev
This repository has been migrated to https://huggingface.co/nunchaku-tech/nunchaku-flux.1-fill-dev and will be hidden in December 2025.

This repository contains Nunchaku-quantized versions of FLUX.1-Fill-dev, capable of filling areas in existing images based on a text description. It is optimized for efficient inference with minimal loss in output quality.

- Developed by: Nunchaku Team
- Model type: image-to-image
- License: flux-1-dev-non-commercial-license
- Quantized from model: FLUX.1-Fill-dev
- `svdq-int4r32-flux.1-fill-dev.safetensors`: SVDQuant-quantized INT4 FLUX.1-Fill-dev model, for non-Blackwell GPUs (pre-50-series).
- `svdq-fp4r32-flux.1-fill-dev.safetensors`: SVDQuant-quantized NVFP4 FLUX.1-Fill-dev model, for Blackwell GPUs (50-series).
- Inference Engine: nunchaku
- Quantization Library: deepcompressor
- Paper: SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Demo: svdquant.mit.edu
- Diffusers Usage: See flux.1-fill-dev.py; a hedged loading sketch also follows below. Check our tutorial for more advanced usage.
- ComfyUI Usage: See nunchaku-flux.1-fill-dev.json.

The FLUX.1 [dev] Model is licensed by Black Forest Labs Inc. under the FLUX.1 [dev] Non-Commercial License. Copyright Black Forest Labs Inc. IN NO EVENT SHALL BLACK FOREST LABS INC. BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.
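The referenced flux.1-fill-dev.py is not reproduced here. A minimal inpainting sketch follows, assuming nunchaku's `NunchakuFluxTransformer2dModel` and diffusers' `FluxFillPipeline`; repo ids, file names, and the prompt are illustrative.

```python
# Hedged sketch only -- not the repository's flux.1-fill-dev.py.
# Assumes: nunchaku provides NunchakuFluxTransformer2dModel, diffusers provides FluxFillPipeline.
import torch
from diffusers import FluxFillPipeline
from diffusers.utils import load_image
from nunchaku import NunchakuFluxTransformer2dModel

transformer = NunchakuFluxTransformer2dModel.from_pretrained(
    "mit-han-lab/nunchaku-flux.1-fill-dev"  # illustrative; pick the int4r32 or fp4r32 file for your GPU
)
pipeline = FluxFillPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-Fill-dev", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

image = load_image("scene.png")      # image to inpaint
mask = load_image("scene_mask.png")  # white = region to fill
result = pipeline(
    prompt="a small wooden cabin",
    image=image, mask_image=mask,
    guidance_scale=30.0, num_inference_steps=50,
).images[0]
result.save("fill-dev.png")
```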
svdq-int4-flux.1-dev
This repository has been deprecated and will be hidden in December 2025. Please use https://huggingface.co/nunchaku-tech/nunchaku-flux.1-dev.

The FLUX.1 [dev] Model is licensed by Black Forest Labs Inc. under the FLUX.1 [dev] Non-Commercial License. Copyright Black Forest Labs Inc. IN NO EVENT SHALL BLACK FOREST LABS INC. BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.
vila-u-7b-256
dc-ae-f32c32-sana-1.1
dc-ae-f64c128-in-1.0
Qwen2.5-32B-Eagle-RL
nunchaku-flux.1-schnell
This repository has been migrated to https://huggingface.co/nunchaku-tech/nunchaku-flux.1-schnell and will be hidden in December 2025.

This repository contains Nunchaku-quantized versions of FLUX.1-schnell, designed to generate high-quality images from text prompts. It is optimized for efficient inference with minimal loss in output quality.

- Developed by: Nunchaku Team
- Model type: text-to-image
- License: apache-2.0
- Quantized from model: FLUX.1-schnell
- `svdq-int4r32-flux.1-schnell.safetensors`: SVDQuant-quantized INT4 FLUX.1-schnell model, for non-Blackwell GPUs (pre-50-series).
- `svdq-fp4r32-flux.1-schnell.safetensors`: SVDQuant-quantized NVFP4 FLUX.1-schnell model, for Blackwell GPUs (50-series).
- Inference Engine: nunchaku
- Quantization Library: deepcompressor
- Paper: SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Demo: svdquant.mit.edu
- Diffusers Usage: See flux.1-schnell.py; a hedged loading sketch also follows below. Check our tutorial for more advanced usage.
- ComfyUI Usage: See nunchaku-flux.1-schnell.json.
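The referenced flux.1-schnell.py is not reproduced here. A minimal text-to-image sketch follows, assuming nunchaku's `NunchakuFluxTransformer2dModel` can be passed as the transformer of diffusers' `FluxPipeline`; the repo id and prompt are illustrative.

```python
# Hedged sketch only -- not the repository's flux.1-schnell.py.
import torch
from diffusers import FluxPipeline
from nunchaku import NunchakuFluxTransformer2dModel  # assumed nunchaku entry point

transformer = NunchakuFluxTransformer2dModel.from_pretrained(
    "mit-han-lab/nunchaku-flux.1-schnell"  # illustrative repo id
)
pipeline = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-schnell", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

# schnell is a few-step, guidance-distilled model: 4 steps, guidance_scale=0.
image = pipeline(
    "a cat holding a sign that says hello world",
    num_inference_steps=4, guidance_scale=0.0,
).images[0]
image.save("schnell.png")
```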
dc-ae-f64c128-mix-1.0
nunchaku-flux.1-canny-dev
This repository has been migrated to https://huggingface.co/nunchaku-tech/nunchaku-flux.1-canny-dev and will be hidden in December 2025.

This repository contains Nunchaku-quantized versions of FLUX.1-Canny-dev, capable of generating an image from a text description while following the structure of a given input image. It is optimized for efficient inference with minimal loss in output quality.

- Developed by: Nunchaku Team
- Model type: image-to-image
- License: flux-1-dev-non-commercial-license
- Quantized from model: FLUX.1-Canny-dev
- `svdq-int4r32-flux.1-canny-dev.safetensors`: SVDQuant-quantized INT4 FLUX.1-Canny-dev model, for non-Blackwell GPUs (pre-50-series).
- `svdq-fp4r32-flux.1-canny-dev.safetensors`: SVDQuant-quantized NVFP4 FLUX.1-Canny-dev model, for Blackwell GPUs (50-series).
- Inference Engine: nunchaku
- Quantization Library: deepcompressor
- Paper: SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Demo: svdquant.mit.edu
- Diffusers Usage: See flux.1-canny-dev.py; a hedged loading sketch also follows below. Check our tutorial for more advanced usage.
- ComfyUI Usage: See nunchaku-flux.1-canny-dev.json.

The FLUX.1 [dev] Model is licensed by Black Forest Labs Inc. under the FLUX.1 [dev] Non-Commercial License. Copyright Black Forest Labs Inc. IN NO EVENT SHALL BLACK FOREST LABS INC. BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.
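The referenced flux.1-canny-dev.py is not reproduced here. A minimal structure-guided generation sketch follows, assuming nunchaku's `NunchakuFluxTransformer2dModel` and diffusers' `FluxControlPipeline`; the repo ids, control-image file, and prompt are illustrative.

```python
# Hedged sketch only -- not the repository's flux.1-canny-dev.py.
# Assumes diffusers provides FluxControlPipeline for the Canny/Depth dev variants.
import torch
from diffusers import FluxControlPipeline
from diffusers.utils import load_image
from nunchaku import NunchakuFluxTransformer2dModel

transformer = NunchakuFluxTransformer2dModel.from_pretrained(
    "mit-han-lab/nunchaku-flux.1-canny-dev"  # illustrative repo id
)
pipeline = FluxControlPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-Canny-dev", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

control = load_image("edges.png")  # a pre-computed Canny edge map of the reference image
image = pipeline(
    prompt="a robot made of brass and glass",
    control_image=control, height=1024, width=1024,
    guidance_scale=30.0, num_inference_steps=50,
).images[0]
image.save("canny-dev.png")
```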
dc-ae-f32c32-sana-1.0-diffusers
svdq-fp4-flux.1-dev
Qwen2-VL-1.5B-Instruct
nunchaku-flux.1-depth-dev
This repository has been migrated to https://huggingface.co/nunchaku-tech/nunchaku-flux.1-depth-dev and will be hidden in December 2025.

This repository contains Nunchaku-quantized versions of FLUX.1-Depth-dev, capable of generating an image from a text description while following the structure of a given input image. It is optimized for efficient inference with minimal loss in output quality.

- Developed by: Nunchaku Team
- Model type: image-to-image
- License: flux-1-dev-non-commercial-license
- Quantized from model: FLUX.1-Depth-dev
- `svdq-int4r32-flux.1-depth-dev.safetensors`: SVDQuant-quantized INT4 FLUX.1-Depth-dev model, for non-Blackwell GPUs (pre-50-series).
- `svdq-fp4r32-flux.1-depth-dev.safetensors`: SVDQuant-quantized NVFP4 FLUX.1-Depth-dev model, for Blackwell GPUs (50-series).
- Inference Engine: nunchaku
- Quantization Library: deepcompressor
- Paper: SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Demo: svdquant.mit.edu
- Diffusers Usage: See flux.1-depth-dev.py; a hedged loading sketch also follows below. Check our tutorial for more advanced usage.
- ComfyUI Usage: See nunchaku-flux.1-depth-dev.json.

The FLUX.1 [dev] Model is licensed by Black Forest Labs Inc. under the FLUX.1 [dev] Non-Commercial License. Copyright Black Forest Labs Inc. IN NO EVENT SHALL BLACK FOREST LABS INC. BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.
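Usage mirrors the Canny sketch above: only the checkpoints and the control image (a depth map instead of an edge map) change. A compact hedged sketch under the same assumptions:

```python
# Hedged sketch only -- not the repository's flux.1-depth-dev.py; same assumptions as the Canny sketch.
import torch
from diffusers import FluxControlPipeline
from diffusers.utils import load_image
from nunchaku import NunchakuFluxTransformer2dModel

transformer = NunchakuFluxTransformer2dModel.from_pretrained(
    "mit-han-lab/nunchaku-flux.1-depth-dev"  # illustrative repo id
)
pipeline = FluxControlPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-Depth-dev", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

control = load_image("depth.png")  # a pre-computed depth map, e.g., from a monocular depth estimator
image = pipeline(
    prompt="a robot made of brass and glass",
    control_image=control, height=1024, width=1024,
    guidance_scale=10.0, num_inference_steps=50,
).images[0]
image.save("depth-dev.png")
```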
Qwen2.5-7B-Eagle-RL
svdq-int4-flux.1-depth-dev
nunchaku-shuttle-jaguar
svdq-fp4-flux.1-schnell
svdq-int4-flux.1-canny-dev
dc-ae-f32c32-in-1.0-256px
svdq-flux.1-schnell-pix2pix-turbo
opt-1.3b-smoothquant
svdq-fp4-flux.1-fill-dev
dc-ae-f128c512-in-1.0
dc-ae-lite-f32c32-sana-1.1-diffusers
dc-ae-f64c128-in-1.0-uvit-h-in-512px-train2000k
svdq-int4-sana-1600m
nunchaku-sana
svdq-fp4-flux.1-depth-dev
svdq-fp4-shuttle-jaguar
dc-ae-f32c32-in-1.0-diffusers
dc-ae-f64c128-in-1.0-uvit-h-in-512px
dc-ae-f64c128-mix-1.0-diffusers
dc-ae-f128c512-mix-1.0
dc-ae-f32c32-mix-1.0-diffusers
svdq-int4-shuttle-jaguar
dc-ae-f128c512-mix-1.0-diffusers
opt-125m-smoothquant
dc-ae-f32c32-in-1.0-usit-2b-in-512px
svdq-fp4-flux.1-canny-dev
dc-ae-lite-f32c32-sana-1.1
dc-ae-f128c512-in-1.0-diffusers
Llama-3-8B-Instruct-QServe
opt-13b-smoothquant
Llama-3-8B-Instruct-QServe-g128
nunchaku-t5
This repository has been migrated to https://huggingface.co/nunchaku-tech/nunchaku-t5 and will be hidden in December 2025.

This repository contains Nunchaku-quantized versions of T5-XXL, the encoder that turns the text prompt into embeddings; the quantized encoder reduces the model's memory footprint.

- Developed by: Nunchaku Team
- Model type: text-generation
- License: apache-2.0
- Quantized from model: t5v11xxl
- `awq-int4-flux.1-t5xxl.safetensors`: AWQ-quantized W4A16 T5-XXL model for FLUX.1.
- Inference Engine: nunchaku
- Quantization Library: deepcompressor
- Paper: SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Demo: svdquant.mit.edu
- Diffusers Usage: See flux.1-dev-qencoder.py; a hedged loading sketch also follows below. Check our tutorial for more advanced usage.
- ComfyUI Usage: See nunchaku-flux.1-dev-qencoder.json.
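The referenced flux.1-dev-qencoder.py is not reproduced here. A minimal sketch follows, assuming nunchaku exposes a `NunchakuT5EncoderModel` that can stand in for FLUX's `text_encoder_2`; the repo ids and prompt are illustrative.

```python
# Hedged sketch only -- not the repository's flux.1-dev-qencoder.py.
# Assumes nunchaku provides NunchakuT5EncoderModel and NunchakuFluxTransformer2dModel.
import torch
from diffusers import FluxPipeline
from nunchaku import NunchakuFluxTransformer2dModel, NunchakuT5EncoderModel

text_encoder_2 = NunchakuT5EncoderModel.from_pretrained(
    "mit-han-lab/nunchaku-t5"  # illustrative repo id; holds awq-int4-flux.1-t5xxl.safetensors
)
transformer = NunchakuFluxTransformer2dModel.from_pretrained(
    "mit-han-lab/nunchaku-flux.1-dev"  # illustrative repo id
)
pipeline = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    transformer=transformer,
    text_encoder_2=text_encoder_2,  # quantized T5-XXL lowers the text-encoder memory footprint
    torch_dtype=torch.bfloat16,
).to("cuda")

image = pipeline(
    "a watercolor painting of a lighthouse", num_inference_steps=50, guidance_scale=3.5
).images[0]
image.save("flux-dev-qencoder.png")
```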
Llama-3-8B-Instruct-QServe-W8A8
opt-6.7b-smoothquant
opt-30b-smoothquant
vicuna-13b-v1.3-4bit-g128-awq
Mistral-7B-v0.1-QServe
Llama-3-8B-QServe-g128
dc-ae-f32c32-in-1.0-dit-xl-in-512px
Llama-2-7B-QServe-g128
Llama-2-13B-QServe-g128
dc-ae-f32c32-in-1.0-uvit-s-in-512px
dc-ae-f32c32-in-1.0-usit-h-in-512px
dc-ae-f32c32-in-1.0-sit-xl-in-512px
Llama-3-8B-Instruct-Gradient-1048k-w8a8-per-channel-kv8-per-tensor
dc-ae-f32c32-in-1.0-uvit-h-in-512px
dc-ae-f64c128-in-1.0-uvit-2b-in-512px-train2000k
Mistral-7B-v0.1-QServe-g128
Yi-34B-QServe-g128
dc-ae-f64c128-in-1.0-uvit-2b-in-512px
dc-ae-f32c32-in-1.0-dit-xl-in-512px-trainbs1024
Llama-2-7B-QServe
Llama-3-8B-QServe
vicuna-7b-v1.5-QServe
vicuna-13b-v1.5-QServe
vicuna-7b-v1.5-QServe-g128
Llama-3-8B-Instruct-Gradient-1048k-w8a8kv4-per-channel
Llama-3-8B-Instruct-Gradient-4194k-w8a8kv4-per-channel
dc-ae-f32c32-in-1.0-uvit-2b-in-512px
dc-ae-f32c32-in-1.0-sana-cls-xl-in-512px
nunchaku
This repository has been migrated to https://huggingface.co/nunchaku-tech/nunchaku and will be hidden in December 2025. This repository provides pre-built wheels for nunchaku for both Linux and Windows platforms. For detailed information about available wheels, please visit our GitHub Releases page.