Posted on 7/1/2025
Software Engineer - Data Infrastructure
Luma AI
Palo Alto, CA
Qualifications
- Very strong generalist Python coding skills
- 5+ years of engineering experience, including 2+ years of work experience in petabyte-level data processing
- Experience engineering large-scale systems that process and serve petabytes of data
- Deep understanding of Kubernetes, SLURM, Ray and other cluster orchestration systems
- Experience working with visual data
- Experience working closely with ML is a strong plus
Responsibilities
- You’ll be part of Luma’s applied research team and work directly on mission-critical workstreams utilizing thousands of GPUs
- Design, build and automate infrastructure for processing data across multiple clusters of thousands of GPUs
- Work with researchers to identify and implement technical data requirements, and optimize distributed loading for model training
- Work cross-functionally for diverse backend engineering needs
- Design & build performant infrastructure to manage and leverage large-scale datasets for our model training
Full Description
We are looking for people with strong backend data engineering capabilities to build highly efficient, resilient systems and pipelines for large-scale data processing. You’ll be part of Luma’s applied research team and work directly on mission-critical workstreams utilizing thousands of GPUs.
Responsibilities
• Design, build and automate infrastructure for processing data across multiple clusters of thousands of GPUs.
• Work with researchers to identify and implement technical data requirements, and optimize distributed loading for model training.
• Work cross-functionally for diverse backend engineering needs.
• Design & build performant infrastructure to manage and leverage large-scale datasets for our model training.
Experience
• Very strong generalist Python coding skills.
• 5+ years of engineering experience, including 2+ years of work experience in petabyte-level data processing.
• Experience engineering large-scale systems that process and serve petabytes of data.
• Deep understanding of Kubernetes, SLURM, Ray and other cluster orchestration systems.
• Experience working with visual data.
• Experience working closely with ML is a strong plus.