Site Reliability Engineer
Job description
- Automate deployments, monitoring, and operational processes using Infrastructure as Code (IaC) and CI/CD tools.
- Oversee system observability: metrics, logs, traces, and alerts.
- Manage critical incidents, lead resolution, and execute postmortem plans.
- Collaborate with development teams to improve performance, reliability, and security of services.
- Promote DevOps/SRE culture: automation, resilience, and continuous improvement.
- Optimize costs and efficiency in both cloud and on-premise environments.
WHY US?
Join our dynamic team of talented individuals and experience a world of growth and opportunities. Here's what we offer:
- Grow rapidly with a tailored career path and salary evaluation. 70% of our senior leaders started at entry level.
- Enhance your skills through our Tech Academy catalog, Udemy E-learning Platform, Languages Sessions, webinars, and workshops.
- Take charge of your training with an annual personal budget and company-paid certifications.
- Enjoy flexible policies, remote work options, and fantastic social benefits like transit and restaurant tickets, kindergarten support, and private health insurance.
- Benefit from our WeCare program, supporting employees in critical situations.
Requirements
- 3-5 years of experience as an SRE or DevOps engineer.
- Experience managing CI pipelines (build, test, image push).
- Strong knowledge of AWS, Azure, or GCP.
- Experience with Terraform for infrastructure provisioning.
- Solid experience with Docker & Kubernetes (cluster and add-ons management).
- Linux/Unix and scripting skills (Bash, Python, Go).
- Agile and collaborative mindset.
Nice to have
- ArgoCD and GitOps practices.
- Helm Charts.
- Monitoring and observability tools.
- Incident management/on-call experience.
- Cloud or Kubernetes certifications.
WHAT WILL YOU DO?
- Design, implement, and maintain highly available and scalable infrastructures.