41 36

Umar Azam

UmarAzam

Umar-Azam

AI & ML interests

Robotics and Simulations

Recent Activity

liked a model 23 days ago

facebook/boxer

liked a model 29 days ago

Overworld/Waypoint-1.5-1B

upvoted an article 29 days ago

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

View all activity

Organizations

None yet

liked a model 23 days ago

facebook/boxer

Object Detection • Updated 10 days ago • 51

liked a model 29 days ago

Overworld/Waypoint-1.5-1B

2B • Updated about 1 month ago • 1.89k • 38

upvoted an article 29 days ago

Article

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

lapp0, LouisCastricato, ScottieFox, shahbuland, xAesthetics

•

Apr 9

• 29

liked 4 models about 1 month ago

upvoted a paper about 2 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 146

upvoted 2 articles about 2 months ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 51

Article

Holotron-12B - High Throughput Computer Use Agent

Hcompany

•

Mar 17

• 22

liked a model about 2 months ago

allenai/MolmoBot-SPOC-DROID

Updated Mar 24 • 5

upvoted a paper about 2 months ago

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

Paper • 2603.18002 • Published Mar 18 • 13

liked a model 2 months ago

THU-SI/Spatial-TTT-nano

Updated Mar 9 • 18 • 4

upvoted a paper 2 months ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 185

liked a model 2 months ago

UWGZQ/TRASER

Video-Text-to-Text • 928k • Updated Mar 8 • 7 • 4

upvoted a paper 3 months ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 62

liked a model 3 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 220k • • 1.11k

upvoted an article 3 months ago

Article

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

nvidia

•

Feb 4

• 28

upvoted a collection 3 months ago

LingBot-VLA

Collection

Vision-Language-Action Foundation Model • 5 items • Updated Mar 9 • 14

upvoted a paper 4 months ago

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 75

Umar Azam

AI & ML interests

Recent Activity

Organizations

UmarAzam's activity

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

How I contributed a new model to the Transformers library using Codex

Holotron-12B - High Throughput Computer Use Agent

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model