Site Reliability Engineer (contract)

Wells Fargo
Charlotte, United States of America
6 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Remote
Charlotte, United States of America

Tech stack

JavaScript
VBScript
Application Performance Management
Systems Engineering
Confluence
JIRA
Bash
Databases
JMeter
Python
Lookup Table
PowerShell
Reliability Engineering
Ansible
Data Logging
Scripting (Bash/Python/Go/Ruby)
Grafana
Containerization
BlazeMeter
Kubernetes
Atlassian Tools
Splunk
AppDynamics
Docker

Job description

  • Help drive Site Reliability Engineering capabilities at Wells Fargo Collection Services, igniting the practice, principles, and culture by leading by example. Help train skilled engineers by growing the practice within Collection Services and partnering with peer platform-embedded SRE teams
  • Leverage enterprise capabilities, tools, and innovation to improve availability in a complex ecosystem by evolving observability, monitoring, logging, synthetic monitoring, and chaos engineering
  • Evolve our environment by introducing self-healing and autonomic capabilities that solve complex operational and systemic issues with precision, including building and training models and automating cognitive processes to improve the availability of the products we provide to customers
  • Automate key SRE metrics and IT Service Operations processes, including customer impact, percentage availability of critical business flows, SLO/SLI adherence, and error budget. Automate the incident process for IT Service Operations through data integration with unified communications and alerting/notification systems

Requirements

In this contingent resource assignment, you may:

  • Consult on complex initiatives with broad impact and large-scale planning for Infrastructure Engineering
  • Review and analyze complex, multi-faceted, larger-scale or longer-term Infrastructure Engineering challenges that require in-depth evaluation of multiple factors, including intangibles or unprecedented factors
  • Contribute to the resolution of complex and multi-faceted situations requiring solid understanding of the function, policies, procedures, and compliance requirements that meet deliverables
  • Strategically collaborate and consult with client personnel

Required Qualifications:

  • 5+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work or consulting experience, training, military experience, education
  • 4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 3+ years of experience using observability tools (e.g., Splunk, Grafana) to design and manage dashboards, reports, lookup tables, and summary indexes
  • 4+ years of application production support experience
  • 2+ years with one or more Agile tools used for tracking user stories or backlogs, such as Confluence or Jira

Desired Qualifications:

  • Site Reliability Engineering (SRE) experience
  • 2+ years of experience with database logging and monitoring concepts
  • 2+ years of experience with application performance monitoring and optimization using BlazeMeter, JMeter, Splunk, and AppDynamics
  • 2+ years of experience with scripting languages such as Bash, PowerShell, Python, Shell, VBScript, or JavaScript
  • Experience with and understanding of AIOps and related tools such as Moogsoft or BigPanda
  • Experience with one or more automation tools such as Ansible
  • Experience with container technologies: Kubernetes, Docker, PKS
