arxiv:2411.04923
Shehan Munasinghe
shehan97
AI & ML interests
Computer Vision, Multi-modal learning
Recent Activity
upvoted a paper about 6 hours ago
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework authored a paper 5 months ago
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in
Videos upvoted a paper 10 months ago
Sekai: A Video Dataset towards World Exploration