Site Reliability Engineer

Next Step Systems
Jessup, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Jessup, United States of America

Tech stack

Java
Apache Accumulo
Amazon Web Services (AWS)
JIRA
Bash
Information Systems
Linux
Hadoop
Hadoop Distributed File System
Python
OpenStack
Reliability Engineering
Ansible
Prometheus
Virtualization Technology
Scripting (Bash/Python/Go/Ruby)
Grafana
Kubernetes
Information Technology
Free and Open-Source Software
Docker

Job description

We are seeking a Site Reliability Engineer to work in a cloud environment platform, built with Java on Free and Open Source Software products including Kubernetes, Hadoop and Accumulo, to enable the execution of data-intensive analytics on a managed infrastructure. This position is on the Operations Team that ensures day-to-day operations stability, provides customer support, as well as knowledge in technical and troubleshooting repair expertise. The ideal candidate will have the ability to thrive in a fast-paced team environment who is self-motivated and proactively completes tasks with strong attention to detail. The candidate will be exposed to a variety of technologies depending on customer requirements. This is an on-call position, and you are expected to provide Tier 1 through Tier 3 support. The candidate should have a strong background in troubleshooting operational issues in a Linux environment. Additional knowledge of Docker, Kubernetes, Hadoop and scripting experience, One of the following certifications required: AWS Certified Developer-Associate, AWS Certified Solutions Architect-Associate, AWS Certified Solutions Architect-Professional, AWS Certified SysOps Administrator-Associate, Certified Kubernetes Administrator (CKAD), Elastic Certified Engineer, and Elastic Certified Observability Engineer.

  • Other experiences that could benefit the candidate include Prometheus, JIRA, Hadoop Distributed File System (HDFS), Virtualization, Salt/Ansible, Grafana, OpenStack, AWS, Python, and bash.

Benefits include medical insurance, retirement plan, PTO, etc.

Keywords: Annapolis Junction MD Jobs, Site Reliability Engineer, Prometheus, Jira, Hadoop Distributed File Systems, HDFS, Virtualization, Salt, Ansible, Grafana, OpenStack, AWS, Python, bash, DoD 8570 IAT Level 1, TS/SCI Security Clearance, Full Scope Polygraph Security Clearance, Maryland Recruiters, IT Jobs, Maryland Recruiting

Looking to hire a Site Reliability Engineer in Annapolis Junction, MD or in other cities? Our IT recruiting agencies and staffing companies can help.

Requirements

such as Python and bash is beneficial. A DoD 8570 IAT Level 1 or higher is required. Candidates must have an active TS/SCI with a Full Scope Polygraph security clearance. This is a 100% Onsite position and not open for Hybrid and/or Remote.

Site Reliability Engineer Qualifications:

  • Must have an active TS/SCI Full Scope Polygraph security clearance.

  • DoD 8570 IAT Level 1 or higher required.

  • 14 years of experience is required. Bachelor's Degree in Computer Science or in a related technical field is highly desired and will be considered equivalent to 2 years of experience. A Master's degree in a Technical Field will be considered equivalent to 4 years of experience. A degree in Mathematics, Information Systems, Engineering, or similar degree will be considered.

Apply for this position