Shehan Munasinghe's picture

Shehan Munasinghe

shehan97

·

https://shehanmunasinghe.github.io/

AI & ML interests

Computer Vision, Multi-modal learning

Recent Activity

upvoted a paper about 6 hours ago

Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework

authored a paper 5 months ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

upvoted a paper 10 months ago

Sekai: A Video Dataset towards World Exploration

View all activity

Organizations

Papers 2

arxiv:2411.04923

arxiv:2311.13435

models 9

shehan97/molmo-video

Updated Feb 24, 2025

shehan97/lora-molmo-pixmo-video

Updated Feb 24, 2025

shehan97/lora-molmo-pixmo

Updated Feb 19, 2025

shehan97/qwen2-7b-instruct-trl-sft-ChartQA

Updated Feb 19, 2025

shehan97/lora

Updated Feb 19, 2025

shehan97/mobilevitv2-1.0-voc-deeplabv3

Image Segmentation • Updated May 2, 2023 • 567

shehan97/mobilevitv2-1.0-imagenet1k-256

Image Classification • Updated May 2, 2023 • 193

shehan97/mobilevitv2-1.5-voc-deeplabv3

Updated May 2, 2023 • 1

shehan97/mobilevitv2-2.0-imagenet1k-256

Updated May 2, 2023 • 4

datasets 0

None public yet