Qwen3VL-8B-4bit-GGUF-Jetson-Deployment

llama-cpp
by
WayBob
Image Model
OTHER
8B params
New
134 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
18GB+ RAM
Quick Summary

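An 8B-parameter Qwen3VL vision-language model, quantized to 4-bit GGUF for llama.cpp deployment on NVIDIA Jetson devices. The code example on this page uses it to classify disaster images.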

Device Compatibility

Mobile: 4-6GB RAM
Laptop: 16GB RAM
Server: GPU
Minimum Recommended: 8GB+ RAM

Code Examples

Option 3: OpenAI-Compatible API Call (Python)
import base64
import requests

# Read the image and base64-encode it so it can be embedded in a data URL
with open("disaster_image.jpg", "rb") as f:
    img_b64 = base64.b64encode(f.read()).decode()

response = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "messages": [
            {
                "role": "system",
                "content": (
                    "You are a disaster recognition expert. "
                    "When analyzing disaster images, first identify the disaster type, "
                    "then explain the key visual evidence supporting your classification. "
                    "Respond in the same language as the user."
                )
            },
            {
                "role": "user",
                "content": [
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:image/jpeg;base64,{img_b64}"}
                    },
                    {
                        "type": "text",
                        "text": "What type of disaster occurred in this image?"
                    }
                ]
            }
        ],
        "temperature": 0
    },
    timeout=300
)

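# Fail early on HTTP errors instead of raising a KeyError on an error body
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])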
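
The request above hard-codes a JPEG data URL and indexes straight into the response. As a minimal sketch of how those two steps might be factored out for reuse, the helpers below build the user message for any image type and extract the reply defensively. The names `build_image_message` and `extract_reply` are hypothetical and not part of llama.cpp or the model's repository; the message layout simply mirrors the example above.

```python
import base64
import mimetypes


def build_image_message(image_bytes: bytes, filename: str, question: str) -> dict:
    """Build one OpenAI-style user message carrying an inline image and a question.

    Hypothetical helper: the structure mirrors the request shown above.
    """
    # Guess the MIME type from the filename; fall back to JPEG if it is
    # unknown or not an image type.
    mime, _ = mimetypes.guess_type(filename)
    if mime is None or not mime.startswith("image/"):
        mime = "image/jpeg"
    img_b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": f"data:{mime};base64,{img_b64}"}},
            {"type": "text", "text": question},
        ],
    }


def extract_reply(body: dict) -> str:
    """Return the assistant text from a chat-completions response body.

    Raises ValueError if the body does not have the expected shape,
    for example when the server returned an error object instead.
    """
    try:
        return body["choices"][0]["message"]["content"]
    except (KeyError, IndexError, TypeError) as exc:
        raise ValueError(f"unexpected response shape: {body!r}") from exc
```

With these in place, the user entry in `messages` could be produced by `build_image_message(open(path, "rb").read(), path, "What type of disaster occurred in this image?")`, and the final line would become `print(extract_reply(response.json()))`.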
