Yaorui SHI's picture

Yaorui SHI

yrshi

·

syr-cn

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

upvoted a paper 2 days ago

SOD: Step-wise On-policy Distillation for Small Language Model Agents

upvoted a paper 4 days ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

View all activity

Organizations

yrshi 's models 5

yrshi/ReMemR1-7B

8B • Updated Feb 21 • 319 • 2

yrshi/AutoRefine-Qwen2.5-7B-Instruct

Question Answering • 8B • Updated Jan 21 • 8 • 1

yrshi/AutoRefine-Qwen2.5-7B-Base

Question Answering • 8B • Updated Jan 21 • 43 • 1

yrshi/AutoRefine-Qwen2.5-3B-Base

Question Answering • 3B • Updated Jan 21 • 107 • 2

yrshi/AutoRefine-Qwen2.5-3B-Instruct

3B • Updated May 16, 2025 • 5 • 1