Site Reliability Engineer

Randstad
Roanoke, United States of America
4 days ago

Role details

Contract type
Contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Junior
Compensation
$141K

Job location

Roanoke, United States of America

Tech stack

Java
Amazon Web Services (AWS)
Big Data
Bioinformatics
Cloud Computing
Cloud Engineering
Data Visualization
Query Languages
DevOps
Distributed Systems
Monitoring of Systems
Identity and Access Management
Python
Node.js
Systems Development Life Cycle
Reliability Engineering
Prometheus
Shell Script
Software Engineering
Datadog
Data Logging
Scripting (Bash/Python/Go/Ruby)
Grafana
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Terraform
Splunk

Job description

job summary: Demonstrated ability to apply modern monitoring tools (Datadog, Prometheus, Splunk, ...)

Ability to triage, complete root cause analysis, and be decisive under pressure

Experience managing and interpreting large datasets using query languages and visualization tools

Proficient communication skills, with the ability to reach both technical and non-technical audiences

Ability to learn new software, methods, and practices and bring them to our developers

Ability to work constructively and collaboratively with a variety of individuals and groups, both in person and virtually, and to build and maintain effective relationships

location: Westlake, Texas
job type: Contract
salary: $67 - 68 per hour
work hours: 8am to 5pm
education: Bachelors

responsibilities:

  • Bachelor's degree or higher, or equivalent experience, in a technology-related field (e.g., Engineering, Computer Science) required; Master's degree a plus
  • 5+ years of hands-on experience deploying and/or supporting highly distributed multi-tiered systems at scale
  • 1-2 years of experience in cloud development (AWS) and migration; experience building and operating highly resilient platforms in AWS cloud environments
  • 2-4 years of experience in software development with Python, Node.js, or Java, with a focus on SDLC and automation
  • Hands-on experience with container orchestration, preferably with Kubernetes
  • Experience operating and implementing distributed & highly concurrent service-based

qualifications: Ability to automate with various scripting languages (Python, shell scripting, etc.)

Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef, ...)

Solid understanding of Cloud Computing and DevOps concepts including CI/CD pipelines

Hands-on Kubernetes skills and knowledge.

Hands-on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry, Datadog, etc.)

Experience instrumenting distributed systems at scale, with systems skills in building and operating monitoring, logging, and alerting services

Proven experience in maintaining the scalability and resiliency of complex environments.

Proven experience in implementing advanced observability practices and techniques at scale.

Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.

At Randstad Digital, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact HRsupport@randstadusa.com.

Pay offered to a successful candidate will be based on several factors including the candidate's education, work experience, work location, specific job duties, certifications, etc. In addition, Randstad Digital offers a comprehensive benefits package, including: medical, prescription, dental, vision, AD&D, and life insurance offerings, short-term disability, and a 401K plan (all benefits are based on eligibility).

This posting is open for thirty (30) days.
