Software Engineer, Platform Infrastructure
Role details
Job location
Tech stack
Job description
- You will design, implement and maintain an active-active multi-cloud infrastructure on AWS and GCP supporting business-critical systems ensuring high availability and performance with automation, delivering systems that stays reliable and performs under stress
- Collaborate with your peers through code-reviews, ensuring best practices and aligning on technical standards to deliver consistent, high-quality solutions
- Collaborate with security teams to ensure the integrity and security of infrastructure and applications including implementing security best practices and compliance standards
- Manage individual project priorities, deadlines, and deliverables by leveraging agile methodologies and maintaining clear communication channels
- Lead incident response efforts by triaging issues effectively, collaborating closely with cross-functional teams to resolve them promptly and minimize downtime
- Implement effective incident management processes and post-incident reviews
- Identify performance bottlenecks through detailed monitoring and profiling and optimize system resources by fine tuning configurations, scaling infrastructure and addressing latency issues
- Drive continuous improvement initiatives by automating repetitive tasks, refining workflows and proactively addressing technical debt within the team, while driving enhancements across the organization
- Maintain comprehensive documentation of systems, processes, and procedures while fostering a culture of knowledge sharing and contribute to the collective learning of the team
- Participate in 24x7 on-call rotation, and be available to work with global teams in the event of critical outages
Requirements
We are seeking a talented and experienced DevOps/SRE (Site Reliability Engineering) Senior Software Engineer to join our dynamic team. The ideal candidate will have a strong background in DevOps practices, cloud infrastructure management, automation, and team leadership skills. If you have a consistent track record of architecting and building large-scale systems; enjoy solving intriguing system challenges at internet-scale; if you are innovative at heart; and have a great balance of skills in learning, organizing, building, and enjoy making an impact, this role might be a great fit for you!, * 12+ years of experience in DevOps/SRE roles
- BS Degree in Computer Science or Equivalent
- Experience with a number of the following: Kubernetes, Docker, Service Mesh such as Istio, Envoy, Linkerd, Solo & ECS
- Experience in cloud-focused software development, preferably in Go, Python, or other object-oriented programming languages
- Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation
- Experience with CI/CD automation including Gitlab pipelines and other related tools
- Solid understanding of networking, security, and compliance principles including the intricacies of multi-tenant architecture and secure network configuration in cloud environments
- Strong understanding of distributed systems, microservices architecture, and cloud-native technologies
- Strong hands-on experience with cloud platforms such as AWS, GCP or Azure
- Proven track record of implementing scalable, high-performance infrastructure solutions in a fast-paced dynamic environments
- Demonstrated ability to communicate clearly with both technical and non-technical project stakeholders, with the ability to work effectively in a cross-functional team environment
- The drive and self-motivation to understand the intricate details of a complex infrastructure environment
- Certifications in relevant technologies such as Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer, or Certified Information Systems Security Professional (CISSP) are preferred
Benefits & conditions
For California Only - The estimated annual salary for this position is between $280,000 - $380,000 annually. Compensation packages are based on factors unique to each candidate, including but not limited to skill set, certifications, and specific geographical location. This role is eligible for health insurance, equity awards, life insurance, disability benefits, parental leave, wellness benefits, and paid time off., Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Our employees can take time off work for vacation and other personal reasons to balance their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.