RIVR
Industry · Engineering
Senior AI Engineer - VLA Foundation Model
CHF 150'000 – 170'000 / year
ZÜRICH
AI-TITLEMACHINE LEARNINGDEEP LEARNINGNEURAL NETWORKREINFORCEMENT LEARNINGSUPERVISED LEARNINGGENERATIVE AIDIFFUSION MODELFOUNDATION MODELAI ENGINEERPYTORCH
Description
Amazon RIVR, an ETH Zurich spin-off acquired by Amazon, is building the next generation of safe, reliable autonomous robots for last-mile delivery.
In this role, you will develop multi-modal Vision-Language-Action (VLA) models to enable robots to autonomously generate actions from demonstrations, real-time sensor data, and natural language commands.
Responsibilities
- Develop and implement cutting-edge Vision-Language-Action (VLA) models, generalist robot transformers, and imitation learning algorithms (e.g., diffusion policies) to enable robots to autonomously execute complex tasks.
- Design, test, and refine your algorithms to meet the demands of complex real-world autonomy and navigation tasks, with a focus on spatial reasoning and generalization.
- Streamline the data collection and training workflow to efficiently expand model capabilities with new tasks and data sources.
- Collaborate with the reinforcement learning team to innovate methods that leverage both simulated and real-world data.
- Optimize and distill networks for real-time deployment on the edge (e.g. Nvidia Jetson Thor).
- Build, lead and mentor an exceptional team of software engineers.
- Provide expert guidance to product managers and executives for strategic decision-making.
- Create and maintain documentation, guidelines, and best practices to streamline knowledge sharing.
Qualifications
- Master’s degree or higher in a relevant field such as Engineering, Robotics, or Machine Learning.
- A minimum of three years of industry or research experience, with PhD experience applicable.
- Strong deep learning fundamentals including supervised learning, self-supervised learning, Transformer-based architectures, policy optimization algorithms, imitation learning, and generative AI techniques (including Diffusion Models).
- Proven experience in developing Vision-Language-Action (VLA) models or large-scale generalist robot models (e.g., RT-2, Octo, etc.).
- Strong background in robotics including autonomy, navigation.
- Experience with deploying artificial neural networks on hardware platforms.
- Ability to prototype algorithms and train deep neural networks in Python (Pytorch)
- PhD degree in Robotics, Engineering, Computer Science, Machine Learning or a similar discipline, or an equivalent amount of research experience.
- Publications at top-tier conferences.
- Experience in managing a software team.
- Ability to write production-level code in modern C++