LLMYourWay
ModelsDevices
Edge AI
CompareInsights
Enterprise

zed-industries

2 models • 1 total models in database
Sort by:

zeta

This repository contains a fine-tuned version of Qwen2.5-Coder-7B to support edit prediction in Zed. The model has been fine-tuned using the zeta dataset. If you want to fine-tune the model yourself, you can refer to the following scripts: - DPO Fine-Tuning: View Notebook - SFT Fine-Tuning: View Notebook The dataset used for training is available at: zed-industries/zeta `vllm serve zed-industries/zeta --served-model-name zeta` - Quantization vLLM supports FP8 (8-bit floating point) weight and activation quantization using hardware acceleration on GPUs such as Nvidia H100 and AMD MI300x. - NGram Speculative Decoding configures vLLM to use speculative decoding where proposals are generated by matching n-grams in the prompt. This is a great fit for edit predictions since many of the tokens are already present in the prompt and the model is only needed to generate changes to the code file. `vllm serve zed-industries/zeta --served-model-name zeta --enable-prefix-caching --enable-chunked-prefill --quantization="fp8" --speculative-model [ngram] --ngram-prompt-lookup-max 4 --ngram-prompt-lookup-min 2 --num-speculative-tokens 8` For more insights about the model and its integration in Zed, check out the official blog post: Zed Blog - Edit Prediction

license:apache-2.0
15,648
366

zeta-2

NaNK
llama
20
18
LLMYourWay

The definitive AI model comparison platform. Compare 12K+ models, track performance, and discover the perfect AI solution for your needs.

Made with AI
Real-time Data

Product

  • Find Your Device
  • Browse Models
  • Compare AI
  • Benchmarks
  • Pricing
  • API Access

Resources

  • Blog & Articles
  • Methodology
  • Changelog
  • Trending
  • Use Cases

Company

  • About
  • Contact

Legal

  • Privacy Policy
  • Cookie Policy
  • Terms of Service
12K+12,000+
AI Models Tracked & Updated Daily
© 2026 LLMYourWay. All rights reserved.
Data updated every 4 hours
Powered by real-time AI data
API