Site Reliability Engineer
Job description
- Drive architectural change by participating in RFCs, architecture forums and platform-wide initiatives to improve reliability, scalability and efficiency
- Lead and evolve ClearScore's Kubernetes platform, designing, upgrading and optimising clusters at scale while shaping how we use Kubernetes across the company
- Troubleshoot and resolve complex production issues independently, using a deep understanding of distributed systems and containerisation to mitigate and prevent incidents
- Design and contribute to Kubernetes controllers and automation tools to improve our infrastructure and developer experience
- Enhance our AWS estate, ensuring cost efficiency, security and scalability while promoting best practices across teams
- Collaborate with developers to improve service observability, implement metrics and alerting strategies, and build meaningful dashboards for complex systems
- Build and maintain CI/CD pipelines from scratch for new use cases, manage migrations, and introduce new tooling where beneficial
- Contribute to open source projects through fixes, feedback or new tools aligned with our mission
- Mentor and guide mid-level SREs and other engineers, helping them develop deep technical expertise and operational excellence
We believe our hybrid structure offers the best of both worlds: the flexibility of remote work and the synergy of face-to-face collaboration. Our office days are carefully coordinated to maximise team interactions and learning and mentorship opportunities.
What This Means for You:
- Flexibility to manage your work and life
- Dedicated in-office days for team building and collaborative projects
- Office facilities (with plants!) designed for productive interactions
- Clear expectations and support for maintaining our hybrid schedule
We're committed to creating an inclusive environment that accommodates diverse needs while maintaining our collaborative culture. Join us in shaping the future of work!
Note: While we offer flexibility, commitment to our hybrid schedule is an important aspect of our team culture and performance expectations.
Requirements
- Expert-level Kubernetes knowledge, including experience with cluster upgrades, networking (CNI), container runtimes and troubleshooting node-level issues
- Strong AWS expertise, including architecture, networking and cost management, with an awareness of industry standards and the ability to influence adoption across teams
- Deep understanding of Linux internals, containerisation and operating system-level performance tuning
- Proficiency in at least one compiled language (for example Go, Rust or C++) and one interpreted language (for example Python or Bash)
- Proven ability to automate infrastructure, deployments and monitoring with strong scripting skills across multiple languages
- Experience designing, deploying and operating distributed systems with complex failure modes
- Strong networking fundamentals, capable of debugging complex routing or firewall issues and designing resilient architectures
- Hands-on experience with CI/CD pipelines and tooling such as Jenkins, ArgoCD or Spinnaker, including building and managing large-scale migrations
- Deep observability expertise, from instrumenting applications and building dashboards to managing large monitoring stack upgrades and integrations
Benefits & conditions
- 25 paid holidays and a "duvet day" on your birthday
- Hybrid Work Environment
- Private health and dental cover - including mental health support through Bupa
- GP office visits
- Life assurance scheme
- Up to 6% matched pension
- Regular Lunch and Learns with guest speakers
- Dog-friendly office
- Daily breakfast and free snacks
- Access to discounts via Cobens Extras
- Free sports and social clubs
- Continued investment into learning and development
- Leadership-led training
- In-house psychotherapist
- Financial coach to help you plan and achieve your goals
- No clock-watching culture
- Generous maternity and paternity plans
- Culture and inclusion representatives
- Transparent pay structure and a career growth plan
Equal Opportunities
ClearScore is committed to providing equal employment opportunities to all qualified individuals. As an equal opportunity employer, we are able to make reasonable adjustments to accommodate individuals with disabilities during the recruitment and selection process. If you require accommodation, please inform us in advance, and we will work with you to meet your needs.
Our Hybrid Model
We embrace a dynamic hybrid work environment that balances flexibility with collaborative in-person experiences. Our approach is designed to foster innovation, team connection, and individual productivity.
- Levels 1-5: Minimum 2 days per week in-office
- Level 6 and above: Minimum 3 days per week in-office