
Review the role details and submit your application.





Description
The AI Robot Association (AIRoA) is launching a groundbreaking initiative: collecting one million hours of humanoid robot operation data with hundreds of robots, and leveraging it to train the world’s most powerful Vision-Language-Action (VLA) models.
What makes AIRoA unique is not only the unprecedented scale of real-world data and humanoid platforms, but also our commitment to making everything open and accessible. We are building a shared “robot data ecosystem” where datasets, trained models, and benchmarks are available to everyone. Researchers around the world will be able to evaluate their models on standardized humanoid robots through our open evaluation platform.
Job Description
Develop Vision-Language models, or equivalent multimodal models, with a view toward applications in the robotics domain
Fine-tune existing models, conduct evaluations, perform error analysis, and improve performance
Build training pipelines using real-world data, design evaluation metrics, and operate iterative improvement cycles
Prepare and preprocess data, and build training environments for image, video, language, and action data
Research the latest trends in technologies and academic studies, select appropriate technologies, and incorporate findings into model improvements
Establish training infrastructure, inference infrastructure, and experimental environments for real-world model deployment
Collaborate with related teams such as software engineers and robotics engineers to define requirements, design validation plans, and drive development
Submit your application for this role at AI Robot Association (AIRoA).