Senior Site Reliability Engineer

Intercontinental Exchange
Jacksonville, United States of America
5 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Jacksonville, United States of America

Tech stack

Java
.NET
PHP
Microsoft Windows
Microsoft Active Directory
Active Directory Federation Services
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Audit Trail
Cloud Computing
Cloud Database
Configuration Management
Collaborative Software
System Configuration
Continuous Integration
Linux
DevOps
Amazon DynamoDB
Perl
Identity and Access Management
Python
Scrum
Release Management
Reliability Engineering
Cloud Services
Ansible
Ruby
Security Assertion Markup Language (SAML)
Single Sign-On
Amazon Web Services (AWS)
Software Deployment
Workflow Management Systems
Scripting (Bash/Python/Go/Ruby)
Cloud Platform System
Istio
Technical Debt
Amazon Web Services (AWS)
GIT
Cloudformation
Servicebus
Amazon Web Services (AWS)
Containerization
Kubernetes
Performance Monitor
Route53
Api Design
Api Gateway
Amazon Web Services (AWS)
Terraform
Devsecops

Job description

Intercontinental Exchange, Inc. (ICE) presents a unique opportunity to work with cutting-edge technology and business challenges in the financial services sector. ICE team members work across departments and traditional boundaries to innovate and respond to industry demand. A successful candidate will be able to multitask in a dynamic team-based environment demonstrating strong problem-solving and decision-making abilities and the highest degree of professionalism.

We are seeking an experienced AWS solution design engineer/architect to join our infrastructure cloud team. The infrastructure cloud team is responsible for internal services that provide developer collaboration tools, the build and release pipeline, and shared AWS cloud services platform. The infrastructure cloud team enables engineers to build product features efficiently and confidently them into production.

As Senior SRE, you will be responsible for providing leadership, design and education across multiple teams and drive cross functional cloud transition initiatives. The role will work cross functionally across Product Engineering, SRE, Cloud Automation, Infrastructure and Release engineering to set the best practices of architecture, review deployment architecture and ensure that costs are managed and controlled in the public and private clouds., * Drive discussions regarding trade-offs, best practices and risk mitigations for cloud deployments based on hands-on, real-world experience gained deploying cloud workloads at scale

  • Design and Implement cloud frameworks and deployments by leveraging previous experience, utilizing best practices and industry standards
  • Collaborate with Product and Support teams to plan and deploy product releases
  • Work with Cloud Platform and Operations leaders to develop narratives, backlog grooming, epic planning, and overall sprint planning processes
  • Work with Engineering leadership to build shared services that meet the requirements and needs of the platform and application teams
  • Ensure services are designed with 24/7 availability and operational readiness and rigor
  • Implementation of proactive monitoring, alerting, trend analysis and self-healing systems
  • Educate and mentor team members, operations staff and other departments on AWS concepts
  • Provide technical analysis, resolve problems, and propose solutions in a 24/7 production environment
  • Monitor and research cloud technologies and stay current with trends in the industry
  • Experiment and conduct proof of concepts on emerging technology
  • Participate in an on-call rotation and identify opportunities for reducing toil and avoiding technical debt to reduce support and operations load.

Requirements

  • 7+ years of Systems/Applications automation in 24x7 Production support services environments
  • 5+ years of experience in DevOps, preferably DevSecOps, or SRE role in an AWS cloud environment.
  • 5+ years' strong experience with configuring, managing, solutioning, and architecting with AWS (Lambda, EC2, ECS, ELB, EventBridge, Kinesis, Route 53, RDS and DynamoDB, SNS, SQS, CloudTrail, API Gateway, CloudFront, VPC, TransitGW, IAM, Security Hub, Service Mesh)
  • Fluency with one or more current generation scripting language (Python/Shell/Perl/ PHP/Ruby) AND/OR Java Development and/or .NET
  • In-depth Operating System knowledge in both Windows and Linux
  • Experience with IAM, Active Directory, ADFS, SAML 2.0, and Single sign on practices.
  • Experience using configuration management tooling for application deployment and workflow orchestration.
  • Proven background of implementing continuous integration, and delivery for projects.
  • A track record of introducing automation to solve administrative and other business tasks as usual.
  • Proficiency in Terraform

Preferred skills

  • Proficiency in CloudFormation, or Ansible
  • Experience with Claude Code, Git hub copilot or similar for development
  • A history of delivering services developed in an API-first approach.
  • Coming from a system administration, network, or security background.
  • Prior experience working with environments of significant scale (thousands of servers)
  • Good to have experience in Containerization concepts like Kubernetes

Apply for this position