zhangwenbin

ExceedZhang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

Reflexion: Language Agents with Verbal Reinforcement Learning

liked a model about 11 hours ago

google/gemma-4-31B

upvoted a paper about 13 hours ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper about 1 hour ago

Reflexion: Language Agents with Verbal Reinforcement Learning

Paper • 2303.11366 • Published Mar 20, 2023 • 6

liked a model about 11 hours ago

google/gemma-4-31B

Image-Text-to-Text • 33B • Updated about 11 hours ago • 1.63k • 120

upvoted a paper about 13 hours ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 280

liked a model about 13 hours ago

Jackrong/Qwopus3.5-27B-v3

Image-Text-to-Text • 27B • Updated about 18 hours ago • 13 • 56

upvoted an article about 14 hours ago

Article

Holo3: Breaking the Computer Use Frontier

1 day ago

•

liked a model about 15 hours ago

Hcompany/Holo3-35B-A3B

Image-Text-to-Text • 35B • Updated about 19 hours ago • 603 • 187

liked a model 2 days ago

chromadb/context-1

Text Generation • 21B • Updated 4 days ago • 2.82k • 358

upvoted a paper 5 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 11 days ago • 120

liked a model 5 days ago

GAIR/daVinci-MagiHuman

Image-to-Video • Updated 9 days ago • 670 • 288

upvoted 3 papers 5 days ago

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Paper • 2603.24533 • Published 8 days ago • 45

VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding

Paper • 2603.22285 • Published 10 days ago • 50

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published 10 days ago • 131

upvoted an article 9 days ago

Article

Introducing Storage Buckets on the Hugging Face Hub

24 days ago

•

187

liked a model 10 days ago

mistralai/Mistral-Small-4-119B-2603

119B • Updated 8 days ago • 63.1k • 344

upvoted a paper 10 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published 17 days ago • 58

liked a dataset 11 days ago

gudo7208/CAD-Coder

Viewer • Updated Jan 9 • 250k • 53 • 1

liked a model 11 days ago

Multilingual-Multimodal-NLP/IndustrialCoder

Text Generation • 32B • Updated 7 days ago • 1.52k • 50

upvoted 2 papers 11 days ago

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published 16 days ago • 248

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 16 days ago • 306

liked a model 12 days ago

HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive

Image-Text-to-Text • 35B • Updated 23 days ago • 622k • 1.15k