Site Reliability Engineer III
Role details
Job location
Tech stack
Job description
As a Site Reliability Engineer III at JPMorganChase within the Data Solutions team of Corporate Sector, you will play a key role in automating, troubleshooting, and monitoring AWS-based applications and infrastructure. You will work hands-on to enhance reliability, performance, and scalability, ensuring seamless operations and continuous improvement. Your expertise will help drive the adoption of SRE best practices and deliver impactful solutions for the business., * Guides and assists others in building effective designs and achieving consensus within the team
- Collaborates with software engineers and teams to implement automated CI/CD pipelines for deployment
- Designs, develops, tests, and implements solutions to improve availability, reliability, and scalability
- Implements infrastructure, configuration, and network as code for assigned applications and platforms
- Works with technical experts, stakeholders, and team members to resolve complex issues
- Understands and applies service level indicators and objectives to proactively address potential problems
- Supports the adoption and implementation of site reliability engineering best practices
- Drives automation initiatives to reduce manual intervention and improve operational efficiency
- Troubleshoots AWS infrastructure and application issues to maintain high reliability
- Enhances observability through monitoring, alerting, and telemetry collection
Requirements
- Formal training or certification on software engineering concepts and 3+ years applied experience
- Proficient in site reliability engineering principles and their application within cloud environments
- Skilled in at least one programming language such as Python, Java/Spring Boot, or .Net
- Strong knowledge of software applications and technical processes within disciplines like Cloud or AI
- Experience with observability tools (Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.)
- Familiarity with CI/CD tools such as Jenkins, GitLab, or Terraform
- Ability to proactively identify and address technical challenges
- Demonstrates interest in learning new technologies to drive innovation
- Capable of identifying and implementing relevant solutions to meet design constraints
- Initiates and implements ideas to solve business problems
- Effectively communicates and collaborates within large teams with limited supervision
Preferred qualifications, capabilities, and skills
- Experience with AWS platform and container orchestration (EKS)
- Familiarity with troubleshooting common networking technologies and issues
- Exposure to cloud security and compliance practices
- Experience with infrastructure automation tools (Ansible, Chef, Puppet)
- Knowledge of distributed systems and microservices architecture
- Experience working in agile development environments
Benefits & conditions
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.