Site Reliability Engineer (SRE)
Robert Half
Philadelphia, United States of America
2 days ago
Role details
Contract type: Temporary contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Philadelphia, United States of America
Tech stack
Amazon Web Services (AWS)
Software as a Service
Configuration Management
Program Optimization
Computer Programming
Continuous Delivery
Continuous Integration
Linux
DevOps
Monitoring of Systems
Python
Linux System Administration
Release Management
Reliability Engineering
Cloud Services
Ansible
Ruby
Cloud Platform System
Grafana
Reliability of Systems
Infrastructure Automation Frameworks
Information Technology
Performance Monitor
Cloudwatch
Puppet
Splunk
Dynatrace
Job description
We are seeking an experienced Senior DevOps / Site Reliability Engineer (SRE) to support digital transformation initiatives focused on cloud automation, infrastructure scalability, and system reliability.
This role will play a key part in building and enhancing SRE capabilities, partnering closely with product and operations teams to deliver high-performing, scalable, and resilient systems. The ideal candidate will bring strong expertise in AWS, Linux, automation, and monitoring, along with a proactive, solutions-oriented approach.
- Lead infrastructure and automation efforts supporting digital transformation initiatives
- Design, build, and scale cloud-based infrastructure using AWS and modern DevOps practices
- Develop automation solutions using configuration management tools such as Ansible, Chef, or Puppet
- Design and implement monitoring and observability solutions to ensure system performance and availability
- Partner with cross-functional product and engineering teams to support SRE-related initiatives
- Support and enhance CI/CD pipelines and release management processes
- Maintain centralized logging systems using tools such as Splunk
- Monitor and report on system performance and reliability metrics
- Contribute to the design of scalable, highly available application infrastructure
- Support deployment processes and ensure stability of online systems
Requirements
- Bachelor's degree in Information Technology, Computer Science, or a related field (or equivalent experience)
- Experience with AWS cloud services (e.g., EC2, S3, CloudWatch)
- Strong Linux administration skills
- Experience with configuration management and automation tools (Chef, Puppet, Ansible)
- Experience building and maintaining CI/CD pipelines and release management processes
- Experience with monitoring and observability tools (e.g., Splunk, Dynatrace, Grafana)
- Strong understanding of system performance, scalability, and reliability principles
- Experience working with cloud-based applications and infrastructure
- 3+ years of programming experience in Python or Ruby
- Experience scaling cloud-based applications with a focus on automation
- Familiarity with modern DevOps tools and practices for continuous integration and continuous delivery
- Experience supporting high-traffic or customer-facing platforms
- Exposure to performance monitoring and optimization tools
- Experience collaborating with cross-functional teams in fast-paced environments
About the company
Robert Half is the world's first and largest specialized talent solutions firm that connects highly qualified job seekers to opportunities at great companies. We offer contract, temporary and permanent placement solutions for finance and accounting, technology, marketing and creative, legal, and administrative and customer support roles.