GigaChat3.1-702B-A36B-bf16

750
6
license:mit
by
ai-sage
Language Model
OTHER
702B params
New
750 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
1570GB+ RAM
Mobile
Laptop
Server
Quick Summary

AI model with specialized capabilities.

Device Compatibility

Mobile
4-6GB RAM
Laptop
16GB RAM
Server
GPU
Minimum Recommended
654GB+ RAM

Code Examples

Example evaluation setupbash
export HF_ALLOW_CODE_EVAL=1

# Example: launch SGLang server for a multi-node deployment
python -m sglang.launch_server \
  --model-path <path_to_model> \
  --host 127.0.0.1 \
  --port 30000 \
  --nnodes 2 \
  --node-rank <0_or_1> \
  --tp 16 \
  --ep 16 \
  --dtype auto \
  --mem-fraction-static 0.7 \
  --trust-remote-code \
  --allow-auto-truncate \
  --speculative-algorithm EAGLE \
  --speculative-num-steps 1 \
  --speculative-eagle-topk 1 \
  --speculative-num-draft-tokens 2 \
  --dist-init-addr <master_node_ip>:50000

Deploy This Model

Production-ready deployment in minutes

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Replicate

One-click model deployment

Easiest Setup

Run models in the cloud with simple API. No DevOps required.

Deploy Now

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.