SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Source

@misc{shukor_2025_smol,
      title={SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics},
      author={Mustafa Shukor and Dana Aubakirova and Francesco Capuano and Pepijn Kooijmans and Steven Palma and Adil Zouitine and Michel Aractingi and Caroline Pascal and Martino Russi and Andres Marafioti and Simon Alibert and Matthieu Cord and Thomas Wolf and Remi Cadene},
      year={2025},
      eprint={2506.01844},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
}

(Sorbonne University, Hugging Face) | arXiv

TL;DR

Flash Reading

References