Join our team as a Senior/Lead Software Data Engineer and play a crucial role in building and maintaining the foundational data and pipelines that power our global road network. You’ll work on high-volume, real-time data processing, ensuring the accuracy, freshness, and completeness of our road data. This position is open to candidates located in Poland.
About the team:
The Roads team is responsible for the company’s global road network, a critical component for routing, navigation, and location services. We process petabytes of data from various sources, including GPS traces, satellite imagery, and open data initiatives, to create a highly accurate and up-to-date representation of the world’s roads. We build scalable, reliable, and efficient data pipelines and services that power millions of queries daily.
What You’ll Do:
- Design, build, and maintain robust, scalable, and highly performant data pipelines for processing petabytes of road data.
- Develop and optimize data quality checks and validation processes to ensure the accuracy and freshness of our road network.
- Work with a variety of data sources, including GPS traces, satellite imagery, and open data, to integrate and transform them into a unified road dataset.
- Collaborate with other engineering teams (e.g., routing, navigation, maps) to understand their data needs and provide solutions.
- Improve existing data infrastructure and tools to enhance efficiency, reliability, and observability.
- Mentor junior engineers and contribute to technical leadership within the team (for Lead roles).
- Participate in on-call rotations to support production systems.
What You’ll Bring:
- 5+ years of experience in data engineering, software engineering, or a related field.
- Strong proficiency in programming languages such as Python, Scala, or Java.
- Extensive experience with big data technologies (e.g., Spark, Flink, Hadoop, Kafka).
- Experience with cloud platforms (AWS, Azure, GCP) and their data services (e.g., S3, EMR, Kinesis, Dataflow).
- Solid understanding of database systems (SQL and NoSQL) and data warehousing concepts.
- Familiarity with geospatial data and tools (e.g., PostGIS, GDAL) is a plus.
- Experience with CI/CD practices and infrastructure as code (e.g., Terraform).
- Proven ability to design and implement scalable, reliable, and maintainable data solutions.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills.
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.