Senior Site Reliability Engineer
Role details
Job location
Tech stack
Job description
Responsibilities: Lead the deployment and management of Kubernetes environments, utilizing tools like Google Kubernetes Engine and Rancher to enhance system scalability and reliability. Implement and manage configuration systems using tools such as Saltstack, Chef, or Puppet to ensure consistent environments across development, testing, and production. Develop robust tooling and automation solutions to streamline workflow for IT operations, development teams, and fellow SREs, improving overall system efficiency and operability. Design and implement docker-based CI/CD pipelines, employing tools like CircleCI, CodeFresh, and Argo to manage the software development lifecycle effectively. Build and enforce IAM strategies to meet organizational security policies and compliance requirements in cloud environments. Proactively monitor system performance, respond to system incidents, and implement recovery protocols to minimize downtime and service disruption. Maintain comprehensive documentation on system configurations, tooling, and operational procedures. Continually report on system health and reliability metrics to stakeholders.
Requirements
Do you have experience in Scalable systems?, Do you have a Master's degree?, Educ./experience: Master's degree or foreign equivalent in Management Information Systems, Computer Science or related field, plus 2 years of post-baccalaureate experience as a DevOps Engineer, Big Data Engineer, or in a related position. Experience must include: Large-scale distributed systems and client-server architectures; Experience with at least one of the following Cloud Computing platforms: Microsoft Azure, Google Cloud, or AWS; Implementing and managing high capacity, redundant, and mission critical environments; Experience running and maintaining a 24x7 production environment; TCP/IP networking, including architecture and core technologies DNS, routing, iptables and tc.
Benefits & conditions
Pulled from the full job description
- Tuition reimbursement
- Parental leave
- Health insurance
- 401(k) matching
- Paid time off
- Vision insurance
- Health savings account, We offer a robust package of employee perks and benefits, including healthcare benefits (medical, dental and vision, EAP), competitive PTO, 401k match, parental leave, and HSA contribution match. We also provide our employees with a paid subscription to the Calm app and offer generous external learning and tuition reimbursement benefits. At AFS, we offer a hybrid work schedule for most roles that allows employees to have the flexibility of working from home and one of our primary offices.