A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
View all Papers
datasets
7
JavisVerse/JavisUnd-Eval
Updated
•
46
JavisVerse/MM-PreTrain
Viewer
•
Updated
•
340k
•
77
JavisVerse/JavisInst-Omni
Viewer
•
Updated
•
91.4k
•
46
JavisVerse/AV-FineTune
Viewer
•
Updated
•
1.8M
•
20
JavisVerse/JavisBench
Viewer
•
Updated
•
22.3k
•
61
JavisVerse/JavisData-audios
Viewer
•
Updated
•
788k
•
40
JavisVerse/TAVGBench_clean
Viewer
•
Updated
•
1.58M
•
15