Site Reliability Engineer, Associate
Role details
Job location
Tech stack
Job description
- Take ownership of project priorities, deadlines and deliverables using Agile methodologies
- Understand and refine business and functional requirements.
- Hands-on approach to getting work done; this role requires a "roll up your sleeves" mentality
- Be a leader with vision and a partner in brainstorming solutions for team productivity and efficiency, guiding and motivating others
- Drive priority setting of the engineering teams
- Improve Engineering culture by encouraging continuous focus on reliability across the entire application lifecycle
- Proactive participant in architectural and design decisions
- Design and implement end-to-end monitoring solutions for Application and Infrastructure components, based on cutting-edge SLO-based telemetry tools
- Drive the engineering of capacity management and demand forecasting solutions
- Act as a culture carrier and leader, passing on SRE knowledge and best practices to the engineering team
- Drive detailed root cause investigations for production incidents with rigorous focus on issue avoidance
- Create/coordinate retros for significant incidents
- Additional core engineering functions, such as adding custom Telemetry metrics to the code base of in-scope Applications
- Anticipate new opportunities to continuously evolve the resiliency profile of scoped Applications and Infrastructure
Requirements
We're looking for an SRE with a deep understanding of SRE best practices. The role aligns with experience in Telemetry solutions and use cases: log monitoring, performance monitoring, and visualization tools like Grafana.
- Must have a B.S. / M.S. degree in Computer Science, Engineering or a related field, with 4 - 6 years of experience
- Experience with troubleshooting performance issues
- A passion for providing engineering support for highly available, performant full stack applications with a "Student of Technology" attitude
- Experience with CI/CD practices
- Passion for working in a team environment, with multi-tasking and effective communication skills
- Excellent analytical, problem-solving and debugging skills
- Demonstrated ability to build positive relationships
- Self-motivated
- Knowledge of software development methodologies (analysis, design, development, testing) and basic understanding of Agile / Scrum methodology and practices
- Good hands-on object-oriented programming experience in Java, including proficiency in object-oriented principles and design patterns
- Exposure to Infrastructure platform problem solving
Good to Have
- Understanding of Agile work environments, including knowledge of Git, CI/CD, and one scripting language such as Perl or Python
- Experience with automated configuration management tools
- Experience using AI or ML for problem solving or to improve developer experience
Benefits & conditions
To help you stay energized, engaged and inspired, we offer a wide range of employee benefits including: retirement investment and tools designed to help you in building a sound financial future; access to education reimbursement; comprehensive resources to support your physical health and emotional well-being; family support programs; and Flexible Time Off (FTO) so you can relax, recharge and be there for the people you care about.