Site Reliability Engineer (SRE)

Robert Half

Philadelphia, United States of America

2 days ago

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Philadelphia, United States of America

Tech stack

Amazon Web Services (AWS)

Software as a Service

Configuration Management

Program Optimization

Computer Programming

Continuous Delivery

Continuous Integration

Linux

DevOps

Monitoring of Systems

Python

Linux System Administration

Release Management

Reliability Engineering

Cloud Services

Ansible

Ruby

Cloud Platform System

Grafana

Reliability of Systems

Infrastructure Automation Frameworks

Information Technology

Performance Monitor

CloudWatch

Puppet

Splunk

Dynatrace

Job description

We are seeking an experienced Senior DevOps / Site Reliability Engineer (SRE) to support digital transformation initiatives focused on cloud automation, infrastructure scalability, and system reliability.

This role will play a key part in building and enhancing SRE capabilities, partnering closely with product and operations teams to deliver high-performing, scalable, and resilient systems. The ideal candidate will bring strong expertise in AWS, Linux, automation, and monitoring, along with a proactive, solutions-oriented approach.

Lead infrastructure and automation efforts supporting digital transformation initiatives
Design, build, and scale cloud-based infrastructure using AWS and modern DevOps practices
Develop automation solutions using configuration management tools such as Ansible, Chef, or Puppet
Design and implement monitoring and observability solutions to ensure system performance and availability
Partner with cross-functional product and engineering teams to support SRE-related initiatives
Support and enhance CI/CD pipelines and release management processes
Maintain centralized logging systems using tools such as Splunk
Monitor and report on system performance and reliability metrics
Contribute to the design of scalable, highly available application infrastructure
Support deployment processes and ensure stability of online systems

Requirements

Bachelor's degree in Information Technology, Computer Science, or a related field (or equivalent experience)
Experience with AWS cloud services (e.g., EC2, S3, CloudWatch)
Strong Linux administration skills
Experience with configuration management and automation tools (Chef, Puppet, Ansible)
Experience building and maintaining CI/CD pipelines and release management processes
Experience with monitoring and observability tools (e.g., Splunk, Dynatrace, Grafana)
Strong understanding of system performance, scalability, and reliability principles
Experience working with cloud-based applications and infrastructure

3+ years of programming experience in Python or Ruby
Experience scaling cloud-based applications with a focus on automation
Familiarity with modern DevOps tools and practices for continuous integration and continuous delivery
Experience supporting high-traffic or customer-facing platforms
Exposure to performance monitoring and optimization tools
Experience collaborating with cross-functional teams in fast-paced environments

About the company

Robert Half is the world's first and largest specialized talent solutions firm that connects highly qualified job seekers to opportunities at great companies. We offer contract, temporary and permanent placement solutions for finance and accounting, technology, marketing and creative, legal, and administrative and customer support roles.
