This model is a fine-tuned version of deepseek-ai/DeepSeek-V2-Lite. It has been trained using TRL. - PEFT 0.17.1 - TRL: 0.24.0 - Transformers: 4.57.1 - Pytorch: 2.8.0+cu129 - Datasets: 4.2.0 - Tokenizers: 0.22.0