SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

What is the action space of this paper?