Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
10
122
29
Yaorui SHI
yrshi
Follow
zayidu's profile picture
OldKingMeister's profile picture
mm1106's profile picture
11 followers
·
22 following
syr-cn
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 7 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards
upvoted
a
paper
2 days ago
SOD: Step-wise On-policy Distillation for Small Language Model Agents
upvoted
a
paper
4 days ago
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
View all activity
Organizations
yrshi
's models
5
Sort: Recently updated
yrshi/ReMemR1-7B
8B
•
Updated
Feb 21
•
319
•
2
yrshi/AutoRefine-Qwen2.5-7B-Instruct
Question Answering
•
8B
•
Updated
Jan 21
•
8
•
1
yrshi/AutoRefine-Qwen2.5-7B-Base
Question Answering
•
8B
•
Updated
Jan 21
•
43
•
1
yrshi/AutoRefine-Qwen2.5-3B-Base
Question Answering
•
3B
•
Updated
Jan 21
•
107
•
2
yrshi/AutoRefine-Qwen2.5-3B-Instruct
3B
•
Updated
May 16, 2025
•
5
•
1