Site Reliability Engineer Data & Analytics Infrastructure (M/F)
Job description
As a Senior Site Reliability Engineer in the Platform Core group at Criteo, you'll play a key role in designing, building, and operating the foundational systems that power all of our mission-critical applications. Whether you're working on relational or distributed databases, key-value stores, or high-performance SDKs, your work will have broad technical impact at massive scale.
What You'll Do
- Design, build, and evolve large-scale infrastructure and database systems to meet our growing needs in performance, scalability, and resilience - where high speed and low latency are key.
- Collaborate across teams to ensure reliability and consistency, supporting developers with tools, automation, and best practices that balance autonomy and operational excellence.
- Drive innovation and transformation by migrating systems to our private cloud, automating away repetitive tasks, and exploring modern solutions to support evolving business and client needs.
- Take ownership of incidents, troubleshoot critical issues, and continuously improve systems through proactive monitoring, root cause analysis, and long-term fixes.
- Shape the future of our infrastructure by actively contributing to technical vision, sharing ideas, and delivering solutions that have a company-wide impact across hundreds of services and applications.
- Stack: C#, Java, Scala, Python, Golang, NoSQL, SQL, Kafka, Hadoop, Vertica, Druid, Memcached, Elasticsearch, Aerospike, Kubernetes, Mesos, Linux, Prometheus, Grafana, Kibana, etc.
Requirements
- You hold a master's or PhD degree in computer science or a related field, or have equivalent practical experience.
- You have 5+ years' experience in backend or SRE roles, with strong coding skills (C#, Java, Scala, Python or Golang).
- Deep knowledge of relational databases, including design, tuning, and administration - with hands-on experience in MSSQL, MariaDB, PostgreSQL; familiarity with distributed databases like CockroachDB or Yugabyte is a bonus.
- Solid understanding of distributed systems and cloud-native technologies, especially Kubernetes, and a strong interest in scalability and performance.
- Strong SRE mindset: you're passionate about automation, observability, and reliability at scale.
- Team player with excellent communication skills, a proactive attitude, and a drive to solve complex technical problems collaboratively.
Take a look at for access and insight into our engineering culture and achievements.
We understand that you might not meet every requirement listed above, or that your experience may differ a little from our specifications. If you think you can still bring value to the role, we want to hear from you.