Site Reliability Engineer (SRE), VMware Infrastructure

InfoVision, Inc.
Atlanta, United States of America
3 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Tech stack

Microsoft Windows
Amazon Web Services (AWS)
Systems Engineering
Azure
Bash
Continuous Integration
DevOps
VMware ESX Servers
Monitoring of Systems
Python
Linux System Administration
Nagios
Performance Tuning
PowerShell
Reliability Engineering
Ansible
Prometheus
Storage Virtualization
Virtualization Technology
vSphere
Google Cloud Platform
Grafana
Containerization
Kubernetes
vCenter
Splunk
Dynatrace
Docker
VMware

Requirements

  • 12+ years of experience in Systems Engineering, Infrastructure Operations, or Site Reliability Engineering.
  • Strong hands-on experience with VMware technologies such as vSphere, ESXi, vCenter, vMotion, vSAN, NSX, and HA/DRS.
  • Experience with Windows and Linux server administration.
  • Strong understanding of storage, networking, virtualization, and compute infrastructure.
  • Experience with monitoring and observability tools such as Splunk, Prometheus, Grafana, Dynatrace, or Nagios.
  • Expertise in scripting and automation using PowerShell, Python, Bash, or Ansible.
  • Knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform is preferred.
  • Experience in incident management, problem management, and production support environments.
  • Familiarity with ITIL processes and SRE principles including SLIs, SLOs, and error budgets.
  • Strong troubleshooting and performance tuning skills.

Preferred qualifications

  • VMware certifications such as VCP, VCAP, or VCDX.
  • Experience with Kubernetes, Docker, or other container platforms.
  • Exposure to CI/CD tools and DevOps practices.
  • Excellent communication and stakeholder management skills.
