About the Role
We are looking for a Staff or Senior Software Engineer to join our Machine Learning Platform team in the areas of Data and Feature Infrastructure. The Machine Learning Platform team builds and operates the core platform on which all machine learning at Netflix is developed, trained, and deployed. Our mission is to enable Netflix to build, deploy, and operate thousands of ML models and continuously innovate on the member experience at scale.
This role offers an opportunity to work on critical infrastructure that powers ML at Netflix, impacting various product areas from recommendations to content creation. You will work on challenging problems involving large-scale distributed systems, data processing, and machine learning infrastructure.
What you’ll do
- Design, build, and operate scalable, reliable, and efficient batch/stream processing infrastructure to manage ML data and features.
- Develop and maintain data quality, governance, and observability tools for ML data.
- Partner with ML engineers and data scientists to understand their needs and provide tooling and best practices.
- Contribute to the overall architecture and strategy of the ML Platform.
- Mentor and grow junior engineers on the team.
About you
- 5+ years of experience in software development, with a focus on distributed systems, data infrastructure, or machine learning platforms.
- Proficiency in at least one modern programming language (e.g., Java, Scala, Python).
- Strong understanding of data processing technologies (e.g., Spark, Flink, Kafka, Presto).
- Experience with cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes) is a plus.
- Ability to design and build scalable, reliable, and maintainable software systems.
- Excellent communication and collaboration skills.
- BS or MS in Computer Science or a related field, or equivalent practical experience.