Site Reliability Engineer

OneStream Software LLC
Birmingham, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 148K

Job location

Remote
Birmingham, United States of America

Tech stack

.NET
Agile Methodologies
Amazon Web Services (AWS)
JIRA
Azure
Bash
C Sharp (Programming Language)
Ubuntu (Operating System)
Command-Line Interface
Software as a Service
Cloud Computing
Cloud Computing Security
Configuration Management
Debian Linux
Linux
Github
Identity and Access Management
Integrated Development Environments
Python
Log Analysis
Openshift
Powershell
Reliability Engineering
Cloud Services
Ansible
OneStream
Prometheus
Software Engineering
SQL Databases
Datadog
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Grafana
GIT
Cloudformation
Containerization
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Production Code
Bicep
Azure
Puppet
REST
Terraform
New Relic (SaaS)
Software Version Control
Dynatrace
Azure

Job description

As a Site Reliability Engineer, you will focus on ensuring the platform and services customers rely on are reliable, performant, and highly available. If you enjoy staying at the forefront of technology and automating infrastructure deployments, then this is the job for you. This vital role within Cloud Services requires knowledge and experience designing, implementing, and monitoring scalable and secure cloud services. The employee is expected to work well in a small team and willing to share responsibilities with other team members as needed. You will interact with internal staff, managers, and customers to implement and maintain operations. A passion for technology and learning, and the ability to grow others are vital for success in this role. Primary Duties and Responsibilities Implement application/infrastructure observability solutions to ensure desired application availability, reliability, and performance. Participate in regular On-Call rotations and share details related to incidents and their resolution through post-mortem reports and regular review meetings. Proactively partner with Product and Engineering teams to identify, develop, deploy, and maintain reliable systems and services. Influence and create new designs, architectures, standards, and methods for large-scale systems. Sustain a high level of reliability for key services and automated systems. Automate processes to improve reliability, performance, and availability. Update technical documentation, workflows, and knowledge base articles. Provide feedback in pull requests and peer coding reviews. Implement codified automated solutions that build integrations between Dynatrace, Azure DevOps and Jira. Solid knowledge in focused areas of OneStream Software. Ability to mentor others in several technical areas. Understanding practical use of SOC/FedRAMP controls to assist Compliance and Security teams. Required Education and Experience, Transparency around corporate structure, salary, and benefits. Core value of customer success. Variety of project work (not industry-specific). Strong culture andcamaraderie. Multiple training opportunities.

Requirements

BS/BA in computer science, engineering, or technology-related field (or equivalent work experience). Proven work experience as a Site Reliability Engineer or in a similar role. 6+ years of cloud infrastructure and software development experience. 2+ years hands on experience of Azure Kubernetes Services (AKS) with container-based deployment skills or other platforms such as OpenShift, GKS, EKS. Advanced understanding of APM and observability tools such as Dynatrace, AppInsights, DataDog, Log Analytics, New Relic, Prometheus and Grafana. Advanced understanding of Infrastructure-as-Code (IaC) concepts and tooling (Terraform, CloudFormation templates, Bicep or ARM templates) on Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP). Deep knowledge of Configuration Management/Orchestration utilities such as Ansible, PowerShell DSC, Chef, and Puppet. Advanced understanding of cloud concepts including elasticity, security, and identity management. Well versed familiarity with Agile Development methodologies utilizing Jira or Azure DevOps Boards. 6+ years of hands-on experience with the following technologies, tools, and concepts: Automating processes using PowerShell, Bash, CLI, REST APIs, python, ARM Templates or other scripting languages. Comfortable leveraging source control tools such as Git, Azure DevOps, or GitHub. Knowledge of container orchestration platforms such as Kubernetes, OpenShift, AKS, GKS or helm. Microsoft Azure, Amazon Web Services (AWS) or Google Cloud (GCP).

Preferred Education and Experience Experience working for a cloud service provider (CSP), managed service provider (MSP), or SaaS provider. 6+ years of relevant Azure experience deploying and managing leveraging Infrastructure-as-Code (IAC) concepts. Experience with Microsoft and .NET (.NET, C#, SQL). Experience writing efficient and reliable code in a development environment. Debian, Ubuntu, Alpine or other distributions of the Linux operating systems. Deep knowledge and understanding of containerized applications, with special attention to reliability and monitoring of those containerized applications.

Knowledge, Skills, and Abilities Deal well with ambiguous/undefined problems. Ability to self-motivate and work independently. Strong organizational and prioritization skills. Ability to find and apply effective solutions to emerging problems and challenges. Strong attention to detail. Comfortable communicating with all levels of management and engineering. Ability to get up to speed quickly with modern technologies and services. Ability to multitask on a variety of projects.

About the company

OneStream is how today's Finance teams can go beyond just reporting on the past and Take Finance Further by steering the business to the future. It's the only enterprise finance platform that unifies financial and operational data, embeds AI for better decisions and productivity, and empowers the CFO to become a critical driver of business strategy and execution. Our vision is to be the operating system for modern finance, digitizing core financial functions and empowering the CFO to become a critical driver of business strategy. To learn more visit www.onestream.com.

Apply for this position