jadohu
MongMong
Qwen3-14B-MASA
Description: This repository contains the model for "Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning". Official implementation: https://github.com/akatigre/MASA-RL
Qwen3-8B-MASA
Description: This repository contains the model for "Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning". Official implementation: https://github.com/akatigre/MASA-RL
Qwen3-8B-GRPO
Description: This repository contains the model for "Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning". Official implementation: https://github.com/akatigre/MASA-RL
Qwen3-14B-GRPO
Qwen3-8B-MASA-efficient
Description: This repository contains the model for "Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning". Official implementation: https://github.com/akatigre/MASA-RL