arxiv:2605.04045
Shengqiong Wu
ChocoWu
AI & ML interests
Large Language Model, Multimodal learning, Scene graph Generation
Recent Activity
authored a paper about 9 hours ago
Audio-Visual Intelligence in Large Foundation Models upvoted a paper 5 days ago
Audio-Visual Intelligence in Large Foundation Models upvoted a paper 5 months ago
SemanticGen: Video Generation in Semantic Space