Qwen3-ST-The-Next-Generation-v1-256k-ctx-6B-qx86-hi-mlx

128
6.0B
2 languages
license:apache-2.0
by
nightmedia
Language Model
OTHER
6B params
New
128 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
14GB+ RAM
Mobile
Laptop
Server
Quick Summary

AI model with specialized capabilities.

Device Compatibility

Mobile
4-6GB RAM
Laptop
16GB RAM
Server
GPU
Minimum Recommended
6GB+ RAM

Code Examples

bash
Benchmark	                      qx86-hi	bf16	Δ
Arc_Challenge (Hard Science)	    0.460	0.456	+0.004
OpenBookQA (Rule-Based Knowledge)	0.414	0.404	+0.010
bash
Method	Arc_Challenge OpenBookQA Arc_Easy BoolQ Overall Star Trek Fit
qx86-hi	        0.460	0.414	0.582	0.732	✅ Best for core tasks (science + knowledge)
bf16	        0.456	0.404	0.574	0.712	❌ Worse in key tasks
qx5-hi	        0.458	0.396	0.580	0.746	⚖️ Good science but weak OpenBookQA
qx6-hi	        0.456	0.418	0.581	0.727	⚖️ Strong OpenBookQA but lower Arc_Challenge
mxfp4	        0.441	0.402	0.557	0.811	❌ Poor science reasoning, only wins on BoolQ
qx64-hi	        0.443	0.416	0.562	0.772	⚖️ Subpar for hard reasoning despite good BoolQ/PiQA
qx86 (non-hi)	0.458	0.414	0.582	0.731	⚖️ Similar to qx86-hi but worse Arc_Challenge
bash
Method	        Why It Fails
mxfp4	        Catastrophic drop in Arc_Challenge (0.441) — can't handle deep science problems
qx64-hi/qx64	Low-bit attention → weak reasoning for complex physics/ethics scenarios
qx86 non-hi	    Coarser attention precision → Arc_Challenge 0.458 vs qx86-hi's 0.460 (small but meaningful loss)
bash
They're building agent code (in Haskell) that interfaces with Postgres-based workflow orchestrator
The ORM is designed to be agnostic - works for any kind of task: HTTP requests, file ops, tool calls
They care about logging to SQLite and using the DB via specific Postgres functions (agent_login, get_tasks, etc)
bash
Get a minimum working version (MWS) of this agent out quickly
Give them control over it from the CLI
Include monitoring and crash safety details (because this fails too often)
Explicitly do not start the agent as a network service at first
Give an example of how to connect this agent with what's already in the archive
bash
NO new PHP/Scala/Rust code
NO new Postgres functions → use our legacy ones
NO Haskell compiles → only direct Python commands from our archive/safe-python namespace
Get a pre-existing Task Runner we built for the last Enterprise shipbash
# Get a pre-existing Task Runner we built for the last Enterprise ship
/archive/safe-python/tasks/agent-runner.py \
  --target=database \
  --tool=ollama \
  --task="select 'STARTING RECOVERY'" from pg_node

🔥 What it does:

Deploy This Model

Production-ready deployment in minutes

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Replicate

One-click model deployment

Easiest Setup

Run models in the cloud with simple API. No DevOps required.

Deploy Now

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.