As a Member of Technical Staff on the Inference team at , you will be at the forefront of building the infrastructure and tooling required to serve our cutting-edge generative AI models at scale. Our team is responsible for optimizing the performance, reliability, and cost-efficiency of our inference stack, enabling us to deliver world-class experiences to our users.
About the Role
We’re looking for an experienced and highly motivated engineer to join our Inference team. This team is central to our mission, responsible for the high-performance execution of our generative AI models. As a member of this team, you’ll tackle challenging problems at the intersection of deep learning, systems engineering, and distributed computing. You’ll have the opportunity to make a significant impact on our product by ensuring our models run efficiently and reliably for millions of users worldwide.
What You’ll Do
- Design, implement, and optimize highly efficient and scalable deep learning inference systems.
- Work on a wide range of tasks, from optimizing low-level kernel performance to designing robust distributed systems for large-scale model serving.
- Collaborate closely with research scientists and engineers to deploy state-of-the-art models into production.
- Identify and resolve performance bottlenecks across the entire inference stack.
- Stay up-to-date with the latest advancements in inference optimization techniques and hardware accelerators.
Who You Are
- Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field, or equivalent practical experience.
- 5+ years of experience in high-performance computing, deep learning inference, or systems engineering.
- Strong programming skills in Python and C++.
- Experience with deep learning frameworks such as PyTorch or TensorFlow.
- Deep understanding of computer architecture and GPU programming (CUDA).
- Experience with optimizing deep learning models for various hardware platforms (GPUs, TPUs, etc.).
- Familiarity with distributed systems and cloud platforms (AWS, GCP, Azure).
- Excellent problem-solving and communication skills.
Benefits
- Medical, Dental, and Vision insurance
- Unlimited PTO
- Flexible working hours
- Competitive salary and equity
- 401k plan with employer match
- Paid parental leave
- Gym and wellness stipend
- Regular team events and off-sites
About
is at the forefront of applying AI to creativity, building tools that empower artists and redefine what’s possible in content creation. Our mission is to build the future of storytelling through artificial intelligence. We believe that by creating powerful and intuitive AI tools, we can democratize creativity and enable anyone to tell their story, regardless of their technical expertise. We are a team of passionate researchers, engineers, and designers dedicated to pushing the boundaries of what AI can do.
is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We strongly encourage applications from individuals of all backgrounds.