LLMYourWay
ModelsDevices
Edge AI
CompareInsights
Enterprise

moondream

8 models • 1 total models in database
Sort by:

moondream3-preview

—
7,953
603

moondream-2b-2025-04-14-4bit

Moondream is a small vision language model designed to run efficiently everywhere. This repository contains the 2025-04-14 4-bit release of Moondream. On an Nvidia RTX 3090, it uses 2,450 MB of VRAM and runs at a speed of 184 tokens/second. We used quantization-aware training techniques to build this version of the model, allowing us to achieve a 42% reduction in memory usage with only an 0.6% drop in accuracy. There's more information about this version of the model in our release blog post. Other revisions, as well as release history, can be found here.

NaNK
license:apache-2.0
3,129
57

moondream2-gguf

—
524
27

moondream-2b-2025-04-14

NaNK
license:apache-2.0
84
6

ft-detect-skus-coreml

—
27
0

md3p-int4

license:apache-2.0
17
3

starmie-v1

—
0
3

SegHeadRefiner

—
0
2
LLMYourWay

The definitive AI model comparison platform. Compare 12K+ models, track performance, and discover the perfect AI solution for your needs.

Made with AI
Real-time Data

Product

  • Find Your Device
  • Browse Models
  • Compare AI
  • Benchmarks
  • Pricing
  • API Access

Resources

  • Blog & Articles
  • Methodology
  • Changelog
  • Trending
  • Use Cases

Company

  • About
  • Contact

Legal

  • Privacy Policy
  • Cookie Policy
  • Terms of Service
12K+12,000+
AI Models Tracked & Updated Daily
© 2026 LLMYourWay. All rights reserved.
Data updated every 4 hours
Powered by real-time AI data
API