VLA-Adapter An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Source

@misc{Wang_2025_vlaadapter,
    title={{VLA}-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model},
    author={Yihao Wang and Pengxiang Ding and Lingxiao Li and Can Cui and Zirui Ge and Xinyang Tong and Wenxuan Song and Han Zhao and Wei Zhao and Pengxu Hou and Siteng Huang and Yifan Tang and Wenhui Wang and Ru Zhang and Jianyi Liu and Donglin Wang},
    year={2025},
    eprint={2509.09372},
    archivePrefix={arXiv},
    primaryClass={cs.RO},
    url={https://arxiv.org/abs/2509.09372},
}
(Beijing University of Posts and Telecommunications, Westlake University) arXiv

TL;DR

General concept

Flash Reading

Feature extraction effect

References