TELECOMMUTE
Job description
Our growth is driving us to strengthen our Platform SRE team, which builds, standardizes, and improves the reliability of the infrastructure hosting Scaleway's products.
Your mission will be to ensure the operational readiness of our infrastructure and to onboard product teams, in order to maintain high performance standards, drive continuous improvement, and support the deployment of product stacks across new regions.
YOUR FUTURE TEAM
We work in a collaborative and international environment where the diversity of Scalers, combined with a spirit of sharing, helps bring new projects to life every day, advancing our ambitions together.
You will be part of a team of 5 people, including your manager. The team operates in a stabilized environment with a fresh dynamic, focusing on onboarding multiple products and standardizing engineering practices. We use a mix of Scrum and Kanban methodologies and rotate the point-of-contact role for each product to keep the work engaging and mitigate "bus factor" risks.
YOUR DAILY ROUTINE
Tasks
- Build, standardize, and enhance the reliability of the platform infrastructure
- Onboard product teams and facilitate the deployment of product stacks in new regions
- Implement and manage observability tools (Grafana, Thanos, Alertmanager)
- Automate infrastructure deployment and management using GitOps processes
- Ensure configuration consistency through tools like Ansible or Salt
- Manage operational maintenance (MCO) and handle production incidents
- Define and monitor reliability metrics such as SLAs and SLOs
- Maintain clear and accurate technical documentation
- Manage security components and secrets (e.g., HashiCorp Vault)
- Participate in a weekly on-call rotation (approximately one week per month)
Recruitment process
- Manager interview to understand your technical skills and approach to the role (45 min)
- Technical interview and use case with the team to validate your expertise (1h)
- Deep-dive interview to extend the discussion and assess your fit with the team (45 min)
- Final validation with the Head of Tribe and office tour to meet your future colleagues
Requirements
- Senior-level experience in Linux system administration and infrastructure management
- Proficiency with Kubernetes (K8s) and GitOps workflows (Argo CD, Flux CD)
- Strong mastery of Infrastructure as Code (IaC) and automation (Ansible, Salt)
- Solid understanding of networking fundamentals and security protocols
- Experience managing secrets (e.g., Vault) and observability stacks (Thanos, Grafana)
- Proficiency in scripting and automation for high-availability environments
SOFT SKILLS:
- Pragmatic approach to problem-solving
- Strong listening skills and ability to collaborate across teams
- High level of precision and attention to detail
- Curiosity and a continuous improvement mindset
- Open-mindedness and a collaborative spirit