SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions Paper • 2604.08477 • Published 17 days ago • 1
Ashima/micro_top2_augmented_going_against_strong_prior_Mar19-2244 Viewer • Updated Mar 20 • 7.45k • 11