Rationale-aided Efficient 7B size Large Language and Vision Models. Let's enjoy it!
Byung-Kwan Lee
BK-Lee
AI & ML interests
Vision Language Models
Recent Activity
upvoted
a
paper
about 2 hours ago
SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling
upvoted
a
paper
about 16 hours ago
Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting
upvoted
a
paper
about 24 hours ago
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models