Senior Site Reliability Engineer
Role details
Job location
Tech stack
Job description
As a Sr. Site Reliability Engineer, you will play a critical leadership role in scaling our platform while ensuring best-in-class security, reliability, and performance. You will guide the evolution of our DevOps and SRE architecture, lead by example in designing cloud-native infrastructure, and mentor engineers across teams. This role requires deep technical expertise, strong architectural judgment, and the ability to collaborate effectively across global teams., Lead strategic initiatives to scale and optimize microservice infrastructure using AWS best practices and Terraform-based infrastructure as code.
- Architect and implement the next generation of platform leveraging Kubernetes and modern cloud-native tooling.
- Partner closely with parent company's SRE team in Kuala Lumpur to ensure global alignment, consistency, and operational excellence.
- Lead design reviews and RFC processes, providing thoughtful, actionable feedback and setting a high standard for technical decision-making.
- Own the reliability of production systems, including participating in on-call rotations and driving incident remediation and postmortems.
- Promote and enforce security best practices, particularly in regulated and compliant environments.
- Drive continuous improvement of CI/CD pipelines with a focus on automation, scalability, and developer productivity.
- Mentor and coach engineers, fostering a culture of technical excellence, collaboration, and continuous learning.
Salary Range for this position is $160k - $180K
Requirements
Bachelor's degree in Computer Science, Engineering, or a related field.
- 5+ years of experience in DevOps or Site Reliability Engineering, with a strong track record of leading large-scale initiatives in AWS-based cloud environments.
- Hands-on experience architecting and operating containerized platforms using Kubernetes.
- Deep expertise in infrastructure as code, particularly Terraform.
- Strong experience with AWS services supporting microservices and containers, including EKS and ECS.
- Comprehensive understanding of CI/CD systems such as Jenkins, CircleCI, or GitHub Actions.
- Proficiency in at least one programming language, preferably Python or Go.
- Familiarity with compliance and security frameworks such as PCI, SOC 2, and NIST, with a strategic approach to exceeding regulatory requirements.
Benefits & conditions
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.