-
Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • Published • 124 -
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Paper • 2505.03318 • Published • 94 -
CodeGoat24/UnifiedReward-Think-qwen35-9b
9B • Updated • 59 -
CodeGoat24/UnifiedReward-Think-qwen35-27b
3.05M • Updated • 413
SII-Yibin Wang
CodeGoat24
AI & ML interests
I'm part of Shanghai Innovation Institute, focusing on Multimodal RL and Generation.
Recent Activity
upvoted a paper about 5 hours ago
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development authored a paper 5 days ago
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development liked a model 5 days ago
CodeGoat24/UnifiedReward-Flex-qwen35-4b