Site Reliability Engineer (SRE)
Role details
Job location
Tech stack
Job description
Design, implement, and manage scalable infrastructure
Monitor and enhance system performance
Automate repetitive tasks for efficiency
Develop monitoring, alerting, and incident response systems
Perform root cause analysis and preventative maintenance
Ensure SIEM data sources remain healthy and troubleshoot logging issues
Requirements
Are you a skilled Site Reliability Engineer (SRE) with experience in maintaining scalable and reliable infrastructure? We're looking for a proactive leader with a passion for automation, incident management, and system optimization.

5+ years of SRE or similar experience
Expertise in Cloud Platforms (SIEM technologies preferred)
Proficiency in Python or Bash scripting
Hands-on experience with Infrastructure as Code (e.g., Terraform, Ansible)
Familiarity with Docker and Kubernetes
Strong problem-solving and collaboration skills