Posted on 7/1/2025

Software Engineer - Data Infrastructure

Luma AI

Palo Alto, CA

Full-time

$150K–$300K

Apply Promote

Qualifications

Very strong generalist python coding
Requirement of 5+ years of engineering, including 2+ years of work experience in petabyte-level data processing
Experience engineering large-scale systems that process and serve petabytes of data
Deep understanding of Kubernetes, SLURM, Ray and other cluster orchestration systems
Experience working with visual data
Experience working closely with ML is a strong plus

Responsibilities

You’ll be part of Luma’s applied research team and work directly on mission critical work-streams utilizing thousands of GPUs
Design, build and automate infrastructure for processing data across multiple clusters of thousands of GPUs
Work with researchers to identify and implement technical data requirements, and optimize distributed loading for model training
Work cross-functionally for diverse backend engineering needs
Design & build performant infrastructure to manage and leverage large-scale datasets for our model training

Full Description

We are looking for people with strong Backend Data Engineering capabilities to build highly efficient, resilient systems & pipelines for large-scale data processing. You’ll be part of Luma’s applied research team and work directly on mission critical work-streams utilizing thousands of GPUs.

Responsibilities

• Design, build and automate infrastructure for processing data across multiple clustersof thousands of GPUs.

• Work with researchers to identify and implement technical data requirements, and optimize distributed loading for model training.

• Work cross-functionally for diverse backend engineering needs.

• Design & build performant infrastructure to manage and leverage large-scale datasets for our model training.

Experience

• Very strong generalist python coding.

• Requirement of 5+ years of engineering, including 2+ years of work experience in petabyte-level data processing.

• Experience engineering large-scale systems that process and serve petabytes of data.

• Deep understanding of Kubernetes, SLURM, Ray and other cluster orchestration systems.

• Experience working with visual data.Experience working closely with ML is a strong plus .

Apply Promote

Subscribe to the AI Search Newsletter

Get top updates in AI to your inbox every weekend. It's free!