AI Deployment Engineer – Codex

About the Team

The deployment team is responsible for ensuring that our models run reliably, efficiently, and at scale. We partner closely with research and product teams to integrate our cutting-edge models into production systems and build robust infrastructure to support them. Our work directly impacts users by making powerful AI tools accessible and performant.

About the Role

OpenAI is hiring an AI Deployment Engineer to work on Codex. In this role, you will deploy, optimize, and maintain large language models (LLMs) in production. Your work will range from improving inference performance and reliability to building the tools and infrastructure that integrate new models into production. The role sits at the intersection of AI research and deployment, with direct impact on the accessibility and performance of advanced AI systems.

Qualifications

  • 3+ years of experience in software engineering, with a focus on deployment, reliability, or performance optimization.
  • Experience with large-scale distributed systems and cloud platforms (e.g., AWS, Azure, GCP).
  • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch) and MLOps tools.
  • Experience with performance profiling and optimization techniques.
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Experience with CI/CD pipelines and automated testing.
  • Strong programming skills in Python or Go.
  • Excellent problem-solving and debugging skills.
  • Strong communication and collaboration skills.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
Job Type: Remote
Job Location: USA
Organization: Job Hunting U
