Qwen3VL-8B-4bit-GGUF-Jetson-Deployment
134
llama-cpp
by
WayBob
Image Model
OTHER
8B params
New
134 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
18GB+ RAM
Mobile
Laptop
Server
Quick Summary
AI model with specialized capabilities.
Device Compatibility
Mobile
4-6GB RAM
Laptop
16GB RAM
Server
GPU
Minimum Recommended
8GB+ RAM
Code Examples
Option 3: OpenAI-Compatible API Callpython
import base64
import requests
with open("disaster_image.jpg", "rb") as f:
img_b64 = base64.b64encode(f.read()).decode()
response = requests.post(
"http://localhost:8080/v1/chat/completions",
json={
"messages": [
{
"role": "system",
"content": (
"You are a disaster recognition expert. "
"When analyzing disaster images, first identify the disaster type, "
"then explain the key visual evidence supporting your classification. "
"Respond in the same language as the user."
)
},
{
"role": "user",
"content": [
{
"type": "image_url",
"image_url": {"url": f"data:image/jpeg;base64,{img_b64}"}
},
{
"type": "text",
"text": "What type of disaster occurred in this image?"
}
]
}
],
"temperature": 0
},
timeout=300
)
print(response.json()["choices"][0]["message"]["content"])Deploy This Model
Production-ready deployment in minutes
Together.ai
Instant API access to this model
Production-ready inference API. Start free, scale to millions.
Try Free APIReplicate
One-click model deployment
Run models in the cloud with simple API. No DevOps required.
Deploy NowDisclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.