Site Reliability Engineer (contract)

Wells Fargo

Charlotte, United States of America

6 days ago

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Remote

Charlotte, United States of America

Tech stack

JavaScript

Vbscript

Application Performance Management

Systems Engineering

Confluence

JIRA

Bash

Databases

JMeter

Python

Lookup Table

Powershell

Reliability Engineering

Ansible

Data Logging

Scripting (Bash/Python/Go/Ruby)

Grafana

Containerization

Blazemeter

Kubernetes

Atlassian Tools

Splunk

Appdynamics

Docker

Job description

Help drive Site Reliability Engineering capabilities at Wells Fargo Collection Services igniting the practice, principles, and culture leading by example. Assist in training skilled engineers by growing the practice within Collection Services and partnering with peer platform embedded SRE teams
Leverage enterprise capabilities, tools, and innovation improving availability in a complex ecosystem by evolving observability, monitoring, logging, synthetic monitoring and chaos engineering
Evolve our environment introducing self-healing and autonomic capabilities solving for complex operational and systemic issues with precision including building and training models, automating cognitive processes to improve availability of products we provide to customers
Automate key SRE metrics and IT Service Operations processes including customer impact, % availability of critical business flows, SLO/SLI adherence, error budget, automate incident process for IT Service Operations through data integrating with unified communications, and alerting/notification systems

Requirements

In this contingent resource assignment, you may: Consult on complex initiatives with broad impact and large-scale planning for Infrastructure Engineering. Review and analyze complex multi-faceted, larger scale or longer-term Infrastructure Engineering challenges that require in-depth evaluation of multiple factors including intangibles or unprecedented factors. Contribute to the resolution of complex and multi-faceted situations requiring solid understanding of the function, policies, procedures, and compliance requirements that meet deliverables. Strategically collaborate and consult with client personnel. Required Qualifications: 5+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work or consulting experience, training, military experience, education., * 4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education

3+ years of experience using Observability Tools (i.e., Splunk, Grafana, etc.) for designing and managing reports, lookup tables, summary indexes. Splunk Dashboards, reports, lookup tables, and summary indexes.
4+ years of application production support experience
2+ years with one or more Agile tools used for tracking user stories or backlogs, such as Confluence or Jira

Desired Qualifications:

Site Reliability Engineering (SRE) experience
2+ years of database logging and monitoring concepts experience
2+ years of experience with Application performance, monitoring and optimization using Blazemeter, JMeter, Splunk and AppDynamics
2+ years of experience with scripting languages such as Bash, PowerShell, Python, Shell, VBScript, or JavaScript
Experience and understanding of AIOPS and related tools such as MoogSoft or Big Panda
Experience with one or more automation tools such as Ansible.
Experience with Container technologies: Kubernetes, Docker, PKS