About the team:
The Machine Learning Platform (MLP) team at Netflix builds the core infrastructure that powers all ML innovation across the company. Our platform enables thousands of engineers and scientists to build, train, deploy, and monitor ML models at scale, impacting everything from content recommendations to streaming quality, and personalized experiences. We are a passionate group of engineers focused on delivering high-impact solutions that directly influence Netflix’s success.
About the role:
We’re looking for a Senior Software Engineer to help build the Machine Learning Training Platform at Netflix. In this role, you will be pivotal in shaping the future of how Netflix uses ML. You will work on cutting-edge technologies, solve complex distributed systems problems, and contribute to a platform that empowers hundreds of ML practitioners.
What you will do:
- Design and implement critical components of our ML Training Platform.
- Partner with ML engineers and scientists to understand their needs and translate them into platform features.
- Build highly scalable, reliable, and performant systems that can handle large-scale ML workloads.
- Contribute to the overall architecture and technical strategy of the ML Platform.
- Mentor and provide technical leadership to junior engineers.
Who you are:
- 7+ years of experience building scalable distributed systems.
- Strong proficiency in Java, Scala, Python, or Go.
- Experience with cloud platforms (AWS, GCP, Azure) and container technologies (Docker, Kubernetes).
- Deep understanding of distributed systems principles, data structures, and algorithms.
- Excellent communication, collaboration, and problem-solving skills.
- Experience with large-scale data processing technologies (Spark, Flink) is a plus.
- Experience with ML frameworks (TensorFlow, PyTorch) and MLOps platforms is a plus.
- Bachelor’s or Master’s degree in Computer Science or a related field, or equivalent practical experience.