LLMYourWay
ModelsDevices
Edge AI
CompareInsights
Enterprise

Zhaoxuan

1 models • 1 total models in database
Sort by:

PUGC Mistral DPO

This is the model checkpoint for ACL 2025 paper "Aligning Large Language Models with Implicit Preferences from User-Generated Content" (https://arxiv.org/abs/2506.04463) The model is trained from Mistral-7B-Instruct-v0.2 with DPO, using preference data harvested from user-generated content. If you find this model helpful to your research, please cite the following paper:

NaNK
license:apache-2.0
20
2
LLMYourWay

The definitive AI model comparison platform. Compare 12K+ models, track performance, and discover the perfect AI solution for your needs.

Made with AI
Real-time Data

Product

  • Find Your Device
  • Browse Models
  • Compare AI
  • Benchmarks
  • Pricing
  • API Access

Resources

  • Blog & Articles
  • Methodology
  • Changelog
  • Trending
  • Use Cases

Company

  • About
  • Contact

Legal

  • Privacy Policy
  • Cookie Policy
  • Terms of Service
12K+12,000+
AI Models Tracked & Updated Daily
© 2026 LLMYourWay. All rights reserved.
Data updated every 4 hours
Powered by real-time AI data
API