CodeGoat24

38 models • 4 total models in database

Sort by:

UnifiedReward-2.0-qwen-7b

UnifiedReward-2.0-qwen-7B We are actively gathering feedback from the community to improve our models. We welcome your input and encourage you to stay updated through our repository!! 🔥🔥🔥 We release UnifiedReward-2.0-qwen-[3b/7b/32b/72b]. This version introduces several new capabilities: >1. Pairwise scoring for image and video generation assessment on Alignment, Coherence, Style dimensions. > >2. Pointwise scoring for image and video generation assessment on Alignment, Coherence/Physics, Style dimensions. Welcome to try the latest version, and the inference code is available at `here`. `UnifiedReward-2.0-qwen-7b` is the first unified reward model based on Qwen/Qwen2.5-VL-7B-Instruct for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment. For further details, please refer to the following resources: - 📰 Paper: https://arxiv.org/pdf/2503.05236 - 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/ - 🤗 Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a - 🤗 Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede - 👋 Point of Contact: Yibin Wang | Reward Model | Method| Image Generation | Image Understanding | Video Generation | Video Understanding | :-----: | :-----: |:-----: |:-----: | :-----: | :-----: | | PickScore |Point | √ | | || | HPS | Point | √ | ||| | ImageReward | Point| √| ||| | LLaVA-Critic | Pair/Point | | √ ||| | IXC-2.5-Reward | Pair/Point | | √ ||√| | VideoScore | Point | | |√ || | LiFT | Point | | |√| | | VisionReward | Point |√ | |√|| | VideoReward | Point | | |√ || | UnifiedReward (Ours) | Pair/Point | √ | √ |√|√|

CodeGoat24

UnifiedReward-2.0-qwen-7b

UnifiedReward-7b-v1.5

UnifiedReward-qwen-7b

UnifiedReward-Think-qwen3vl-8b

UnifiedReward-2.0-qwen-32b

UnifiedReward-Think-qwen-7b

UnifiedReward-Edit-qwen3vl-8b

UnifiedReward-Think-qwen3vl-32b

UnifiedReward-2.0-qwen-3b

UnifiedReward-7b

UnifiedReward-Flex-qwen3vl-8b

UnifiedReward Edit Qwen 32b

UnifiedReward-2.0-qwen-72b

UnifiedReward Edit Qwen 7b

Wan2.1-T2V-14B-UnifiedReward-Flex-lora

FLUX.2-klein-base-9B-UnifiedReward-Flex-lora

UniGenBench EvalModel Qwen 72b V1

LLaVA-Video-7B-Qwen2-UnifiedReward-DPO

UnifiedReward-Flex-qwen3vl-2b

UnifiedReward-Flex-qwen3vl-4b

UnifiedReward-qwen-3b

UnifiedReward-Edit-qwen3vl-2b

UnifiedReward Edit Qwen 3b

UnifiedReward-Edit-qwen-72b

FLUX.1-dev-UnifiedReward-Flex

FLUX.1-dev-PrefGRPO

UnifiedReward-qwen-32b

UnifiedReward-Edit-qwen3vl-4b

UnifiedReward-Think-7b

UnifiedReward-Flex-qwen3vl-32b

UnifiedReward-Think-qwen3vl-4b

sdxl-turbo-unified-reward-dpo

UnifiedReward-0.5b

UnifiedReward-Think-qwen3vl-2b

llava-onevision-qwen2-7b-ov-unifiedreward-dpo

UnifiedReward-2.0-qwen35-9b

Wan2.2-T2V-A14B-UnifiedReward-Flex-lora

Face-diffuser