view article Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents 5 days ago • 33
\$OneMillion-Bench: How Far are Language Agents from Human Experts? Paper • 2603.07980 • Published 27 days ago • 27
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use Paper • 2504.07981 • Published Apr 4, 2025 • 5
view article Article The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics 20 days ago • 24
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 24 days ago • 64
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 5 days ago • 262
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published Feb 27 • 88