Kwai-Keye
Keye-VL-8B-Preview
Keye-VL-1_5-8B
--- language: - en library_name: transformers license: apache-2.0 pipeline_tag: video-text-to-text tags: - multimodal ---
Thyme-RL
[š Home Page] [š Github Repo] [š Technique Report] [š Thyme SFT Model] [š Thyme RL Model] [š SFT Data] [š RL Data] š„ News `2025.08.15` š We are excited to introduce Thyme: Think Beyond Images. Thyme transcends traditional ``thinking with images'' paradigms by autonomously generating and executing diverse image processing and computational operations through executable code, significantly enhancing performance on high-resolution perception and complex reasoning tasks. Leveraging a novel two-stage training strategy that combines supervised fine-tuning with reinforcement learning and empowered by the innovative GRPO-ATS algorithm, Thyme achieves a sophisticated balance between reasoning exploration and code execution precision. We have provided the usage instructions, training code, and evaluation code in the GitHub repo. If you find Thyme useful in your research or applications, please cite our paper:
Keye-VL-671B-A37B
Thyme-SFT
[š Home Page] [š Github Repo] [š Technique Report] [š Thyme SFT Model] [š Thyme RL Model] [š SFT Data] [š RL Data] š„ News `2025.08.15` š We are excited to introduce Thyme: Think Beyond Images. Thyme transcends traditional ``thinking with images'' paradigms by autonomously generating and executing diverse image processing and computational operations through executable code, significantly enhancing performance on high-resolution perception and complex reasoning tasks. Leveraging a novel two-stage training strategy that combines supervised fine-tuning with reinforcement learning and empowered by the innovative GRPO-ATS algorithm, Thyme achieves a sophisticated balance between reasoning exploration and code execution precision. We have provided the usage instructions, training code, and evaluation code in the GitHub repo. If you find Thyme useful in your research or applications, please cite our paper: