The team
We’re looking for a Member of Technical Staff to join our Research Tooling & Data Platform team. This team is focused on building core infrastructure and tooling that empowers our Research team to build and iterate on new models faster. You will be building cutting-edge infrastructure in the generative AI space. The team builds and maintains data ingestion pipelines, model training infrastructure, research platforms, and the underlying tooling to support them. You’ll work closely with researchers to understand their needs, design and implement solutions, and ensure the reliability and scalability of our systems.
What you’ll do:
- Design, build, and maintain robust, scalable, and efficient data platforms and research tooling.
- Collaborate with our Research team to understand their needs and translate them into technical requirements and solutions.
- Develop and optimize data ingestion pipelines, model training infrastructure, and research platforms.
- Ensure the reliability, performance, and scalability of our systems in a fast-paced, high-growth environment.
- Contribute to the overall architecture and technical strategy of the Research Tooling & Data Platform team.
- Mentor junior engineers and contribute to a culture of technical excellence and continuous improvement.
What you’ll need:
- 8+ years of experience designing, building, and operating large-scale distributed systems and data platforms.
- Strong proficiency in at least one programming language (e.g., Python, Go, Rust, Java, C++).
- Extensive experience with cloud platforms (e.g., AWS, GCP, Azure) and containerization technologies (e.g., Docker, Kubernetes).
- Deep understanding of data warehousing, ETL processes, and big data technologies (e.g., Spark, Flink, Kafka, Snowflake).
- Experience with machine learning infrastructure and MLOps principles.
- Excellent problem-solving, communication, and collaboration skills.
- BS, MS, or PhD in Computer Science or a related technical field.
Bonus points:
- Experience with generative AI models and related technologies.
- Open-source contributions or a strong track record of building and shipping complex software projects.