Submitted by Evgeniy Glukhov. The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation (JetBrains)
Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding (JetBrains)