GigaChat3.1-702B-A36B-bf16
750
6
license:mit
by
ai-sage
Language Model
OTHER
702B params
New
750 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
1570GB+ RAM
Mobile
Laptop
Server
Quick Summary
AI model with specialized capabilities.
Device Compatibility
Mobile
4-6GB RAM
Laptop
16GB RAM
Server
GPU
Minimum Recommended
654GB+ RAM
Code Examples
Example evaluation setupbash
export HF_ALLOW_CODE_EVAL=1
# Example: launch SGLang server for a multi-node deployment
python -m sglang.launch_server \
--model-path <path_to_model> \
--host 127.0.0.1 \
--port 30000 \
--nnodes 2 \
--node-rank <0_or_1> \
--tp 16 \
--ep 16 \
--dtype auto \
--mem-fraction-static 0.7 \
--trust-remote-code \
--allow-auto-truncate \
--speculative-algorithm EAGLE \
--speculative-num-steps 1 \
--speculative-eagle-topk 1 \
--speculative-num-draft-tokens 2 \
--dist-init-addr <master_node_ip>:50000Deploy This Model
Production-ready deployment in minutes
Together.ai
Instant API access to this model
Production-ready inference API. Start free, scale to millions.
Try Free APIReplicate
One-click model deployment
Run models in the cloud with simple API. No DevOps required.
Deploy NowDisclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.