arxiv:2606.00408
XuQixin
Racktic
AI & ML interests
NLP, mutimodel
Recent Activity
authored a paper 8 days ago
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism upvoted a paper 10 days ago
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses