DevOps Engineer
GDH Consulting
San Jose, United States of America
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Intermediate
Compensation: $83K
Job location: Remote; San Jose, United States of America
Tech stack
Systems Engineering
Cloud Computing
Software Documentation
Continuous Integration
Linux
Distributed Systems
Python
Operational Data Store
Reliability Engineering
Software Deployment
Scripting (Bash/Python/Go/Ruby)
Grafana
Reliability of Systems
Git
Infrastructure Automation Frameworks
Information Technology
Software Version Control
Docker
Programming Languages
Job description
This role involves managing the availability, performance, and scalability of backend services within a large-scale SaaS collaboration platform. As a Site Reliability Engineer, the focus is on supporting cloud and hybrid environments through automation, operational best practices, and incident management. The position requires a proactive approach to ensure service reliability and continuous improvement, with regular onsite presence required in the designated location.
- Own deployment, operation, and reliability of core collaboration services across cloud and hybrid environments
- Design, improve, and automate CI/CD pipelines and frameworks, including AI-driven deployment, monitoring, and incident response tools
- Lead complex production incident response activities, perform root cause analysis, and implement long-term reliability enhancements
- Utilize observability and operational data to assist with capacity planning, system scaling, and resource optimization
- Establish operational best practices, documentation standards, and promote a culture of reliability and accountability
- Collaborate with development teams to integrate reliability practices into software deployment processes
- Maintain and improve monitoring and alerting systems to identify and resolve issues proactively
- Drive automation initiatives to streamline operations and reduce manual intervention
- Support on-call duties, ensuring rapid response to production issues and minimizing downtime
- Continuously evaluate new technologies and methodologies to enhance system reliability and operational efficiency
Requirements
- Bachelor's degree in Computer Science, Engineering, or related field, or equivalent work experience
- Three to five years of experience in Site Reliability Engineering, Cloud Operations, or Systems Engineering roles
- Practical experience operating production services with Docker and Kubernetes in cloud or hybrid environments
- Proficiency in scripting or programming languages such as Python, Go, or Bash for automation tasks
- Experience with monitoring, observability tools, incident management, and post-incident reviews
- Strong understanding of Linux systems, networking, distributed systems, CI/CD pipelines, infrastructure as code, and version control with Git
- Excellent problem-solving skills with the ability to handle high-pressure situations effectively
- Effective communication skills to collaborate with cross-functional teams and document processes
- Availability to work in a hybrid environment with onsite presence required in San Jose three days per week
About the company
At GDH, we believe in the power of people and the importance of caring. Our culture statement, "We care about people," isn't just a tagline - it's the core of everything we do. GDH is a premier staffing and talent solutions company dedicated to helping businesses find the best talent and assisting job seekers in finding their dream jobs.
Who We Are:
GDH, founded in 2001, has grown into a leader in providing staffing solutions across various industries. We specialize in IT across several sectors, connecting top talent with leading enterprises. As a Best of Staffing firm recognized for excellence in client, employee, talent, and women's services, we pride ourselves on our commitment to quality and service.
GDH Benefits
GDH offers a range of employee benefits that are designed to promote well-being and help maintain a healthy work-life balance. These comprehensive benefits cover various aspects of an employee's life and aim to enhance their overall experience with the company. Our health benefits include three medical insurance options with access to KISx Card, Zero Card, and HealthJoy concierge services. Other plan offerings include dental, vision, life, disability, supplemental insurance, and pet insurance plans. Enjoy additional perks like holiday pay, 401(k) plan, direct deposit, an employee referral program, work-life balance benefits, a Wellbeats membership, a discounted gym membership program, and more! For more detailed information on benefits, please go to GDH's website under the tab for candidates.