moondream
8 models
moondream3-preview
—
7,953
603
moondream-2b-2025-04-14-4bit
Moondream is a small vision language model designed to run efficiently everywhere. This repository contains the 2025-04-14 4-bit release of Moondream. On an Nvidia RTX 3090, it uses 2,450 MB of VRAM and runs at 184 tokens/second. We built this version of the model with quantization-aware training, achieving a 42% reduction in memory usage with only a 0.6% drop in accuracy. There's more information about this version of the model in our release blog post. Other revisions, as well as release history, can be found here.
license:apache-2.0
3,129
57
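The figures quoted for moondream-2b-2025-04-14-4bit can be cross-checked with a little arithmetic. The sketch below assumes the 42% reduction is measured against the non-quantized model on the same RTX 3090 setup, which the description does not state explicitly; the function names are illustrative and not part of any Moondream API.

```python
# Figures quoted in the model description (Nvidia RTX 3090).
QUANTIZED_VRAM_MB = 2450      # VRAM used by the 4-bit release
MEMORY_REDUCTION = 0.42       # stated reduction in memory usage
TOKENS_PER_SECOND = 184       # stated generation speed


def implied_baseline_mb(quantized_mb: float, reduction: float) -> float:
    """Memory of the non-quantized model implied by a fractional reduction.

    Assumes: quantized = baseline * (1 - reduction).
    """
    return quantized_mb / (1.0 - reduction)


def seconds_for_tokens(n_tokens: int, tokens_per_second: float) -> float:
    """Time to generate n_tokens at a constant decoding speed."""
    return n_tokens / tokens_per_second


baseline = implied_baseline_mb(QUANTIZED_VRAM_MB, MEMORY_REDUCTION)
print(f"Implied baseline VRAM: {baseline:.0f} MB")
print(f"Implied saving: {baseline - QUANTIZED_VRAM_MB:.0f} MB")
print(f"Time for 100 tokens: {seconds_for_tokens(100, TOKENS_PER_SECOND):.2f} s")
```

Under that assumption, the non-quantized model would need roughly 4,224 MB of VRAM, so the 4-bit release saves about 1,774 MB.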
moondream2-gguf
—
524
27
moondream-2b-2025-04-14
license:apache-2.0
84
6
ft-detect-skus-coreml
—
27
0
md3p-int4
license:apache-2.0
17
3
starmie-v1
—
0
3
SegHeadRefiner
—
0
2