Our company is a mission-driven organization transforming the debt collections industry to deliver better, more equitable outcomes for consumers. We bring empathy and humanity to debt collection by marrying world-class machine learning, behavioral science, and a modern tech stack to develop products that help consumers resolve their financial obligations with respect and dignity.
As a Technical Operations Manager, you will play a critical role in ensuring the reliability, scalability, and performance of our production systems. You will lead a team of operations engineers, drive operational excellence, and collaborate closely with engineering teams to build and maintain robust infrastructure and applications.
What you’ll do:
- Own the operational stability, reliability, and performance of critical internal and customer-facing systems.
- Develop and implement proactive monitoring, alerting, and incident response processes.
- Lead investigations into complex technical issues, working cross-functionally to identify root causes and implement lasting solutions.
- Manage and optimize cloud infrastructure (AWS) for cost, performance, and security.
- Automate operational tasks and workflows to improve efficiency and reduce manual effort.
- Collaborate with engineering teams to ensure operational readiness of new features and services.
- Define and track key operational metrics (SLAs, SLOs) to measure and improve system health.
- Participate in an on-call rotation to provide 24/7 support for critical systems.
- Develop and maintain comprehensive documentation for systems, processes, and playbooks.
- Mentor and guide junior team members, fostering a culture of operational excellence.
What you’ll bring:
- 7+ years of experience in technical operations, SRE, or DevOps roles.
- Proven track record of managing and scaling production systems in a cloud environment (AWS preferred).
- Deep understanding of Linux operating systems, networking, and distributed systems.
- Proficiency in scripting languages (Python, Go, or similar) and automation tools (Terraform, Ansible, or similar).
- Experience with containerization technologies (Docker, Kubernetes).
- Familiarity with monitoring and logging tools (Datadog, Prometheus, ELK stack).
- Strong problem-solving skills and the ability to troubleshoot complex technical issues.
- Excellent communication and collaboration skills, with a focus on cross-functional teamwork.
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
Our company is an equal opportunity employer and values diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.