A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.
Ray Yang
rayruiyang
AI & ML interests
None yet
Recent Activity
Updated a dataset 1 day ago
rayruiyang/vst_500k
Upvoted a paper 5 days ago
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
Upvoted a paper 11 days ago
Utonia: Toward One Encoder for All Point Clouds
Organizations
None yet