Research

rhecker's Collections


updated Oct 6, 2025

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 147

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Paper • 2510.00406 • Published Oct 1, 2025 • 65

Note: A reinforcement fine-tuning framework that uses a simulator. Looks very promising! It looks like they predict future actions/frames to generate data for reinforcement learning. They train a model on a dataset to predict images and their rewards. They work in two stages:
1. WM and policy pretraining: train a world model (WM) on an existing dataset.
2. VLA optimization through WM interaction: fine-tune the VLA using the world model, in chunks.
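
The two stages in the note can be illustrated with a toy sketch. This is not the paper's implementation: the 1-D state, the linear world model, the reward -(s - goal)^2, and the names fit_world_model, rollout_return, and finetune_policy are all assumptions made up for illustration. Simple random-search hill climbing stands in for the reinforcement learning update, and a single gain parameter stands in for the VLA policy.

```python
import random


def fit_world_model(transitions):
    """Stage 1 (toy): least-squares fit of s_next ~ w_s * s + w_a * a on offline data."""
    sss = sum(s * s for s, a, n in transitions)
    ssa = sum(s * a for s, a, n in transitions)
    saa = sum(a * a for s, a, n in transitions)
    ssn = sum(s * n for s, a, n in transitions)
    san = sum(a * n for s, a, n in transitions)
    det = sss * saa - ssa * ssa  # determinant of the 2x2 normal equations
    w_s = (ssn * saa - san * ssa) / det
    w_a = (san * sss - ssn * ssa) / det
    return w_s, w_a


def rollout_return(gain, world_model, start, goal, horizon):
    """Roll out one chunk of `horizon` actions inside the world model and sum the rewards."""
    w_s, w_a = world_model
    s, total = start, 0.0
    for _ in range(horizon):
        a = gain * (goal - s)      # hypothetical policy: proportional controller
        s = w_s * s + w_a * a      # predicted next state, no real environment involved
        total += -(s - goal) ** 2  # hypothetical reward computed on the prediction
    return total


def finetune_policy(gain, world_model, starts, goal, horizon=5, iters=200, step=0.1, seed=0):
    """Stage 2 (toy): keep a perturbed gain only if its world-model return improves."""
    rng = random.Random(seed)

    def score(g):
        return sum(rollout_return(g, world_model, s0, goal, horizon) for s0 in starts)

    best = score(gain)
    for _ in range(iters):
        candidate = gain + rng.gauss(0.0, step)
        candidate_score = score(candidate)
        if candidate_score > best:
            gain, best = candidate, candidate_score
    return gain, best


# Offline dataset from hypothetical true dynamics s_next = s + a.
data_rng = random.Random(0)
data = []
for _ in range(200):
    s, a = data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)
    data.append((s, a, s + a))

world_model = fit_world_model(data)
starts = [-1.0, 0.5, 1.0]
initial_gain = 0.1
before = sum(rollout_return(initial_gain, world_model, s0, 0.0, 5) for s0 in starts)
tuned_gain, after = finetune_policy(initial_gain, world_model, starts, goal=0.0)

print("world model weights:", world_model)
print("return before:", before, "after:", after, "tuned gain:", tuned_gain)
```

The point of the sketch is the data flow: the policy is never run against the real system during stage 2, only against the model fitted in stage 1, so the quality of the fine-tuned policy depends on how well that model predicts.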