Site Reliability Engineer (SRE) VMware Infrastructure
InfoVision, Inc.
Atlanta, United States of America
3 days ago
Role details
Contract type: Temporary contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Atlanta, United States of America
Tech stack
Microsoft Windows
Amazon Web Services (AWS)
Systems Engineering
Azure
Bash
Continuous Integration
DevOps
VMware ESX Servers
Monitoring of Systems
Python
Linux System Administration
Nagios
Performance Tuning
PowerShell
Reliability Engineering
Ansible
Prometheus
Storage Virtualization
Virtualization Technology
vSphere
Google Cloud Platform
Grafana
Containerization
Kubernetes
vCenter
Splunk
Dynatrace
Docker
VMware
Requirements
- 12+ years of experience in Systems Engineering, Infrastructure Operations, or Site Reliability Engineering.
- Strong hands-on experience with VMware technologies such as vSphere, ESXi, vCenter, vMotion, vSAN, NSX, and HA/DRS.
- Experience with Windows and Linux server administration.
- Strong understanding of storage, networking, virtualization, and compute infrastructure.
- Experience with monitoring and observability tools such as Splunk, Prometheus, Grafana, Dynatrace, or Nagios.
- Expertise in scripting and automation using PowerShell, Python, Bash, or Ansible.
- Knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform is preferred.
- Experience in incident management, problem management, and production support environments.
- Familiarity with ITIL processes and SRE principles including SLIs, SLOs, and error budgets.
- Strong troubleshooting and performance tuning skills.
Preferred Qualifications
- VMware certifications such as VCP, VCAP, or VCDX.
- Experience with Kubernetes, Docker, or other container platforms.
- Exposure to CI/CD tools and DevOps practices.
- Excellent communication and stakeholder management skills.