Technical Lead - DR &Infrastructure (Architect Level)
Role details
Job location
Tech stack
Job description
We are seeking a senior Technical Lead (Architect Level) to lead the design, build, and execution of a scalable DR framework while also supporting ongoing operations & maintenance (O&M). This individual will play a critical role in stabilizing infrastructure, optimizing VMware environments, and ensuring the organization is fully prepared for controlled recovery scenarios.
This is a highly visible role working directly with client stakeholders to drive both strategic architecture decisions and hands-on execution., DR Strategy & Architecture:
Lead end-to-end DR design across Phoenix (primary) and Austin (DR) data centers
Define recovery objectives, sequencing, and orchestration across app, DB, and infrastructure layers
Establish workload isolation across compute, storage, and network
Drive dependency mapping across applications, databases, identity, and infrastructure
Infrastructure & Optimization:
Optimize VMware environments (HA, DRS, clusters) for resiliency and failover
Oversee SAN replication and storage alignment with DR strategy
Lead capacity planning for recovery readiness
Operations & Maintenance (O&M):
Oversee infrastructure operations across both data centers
Ensure DR readiness through testing, validation, and continuous improvement
Support incident response and recovery execution
Governance & Reporting:
Establish KPIs, risk management, and DR readiness reporting
Partner with leadership on long-term infrastructure and resiliency roadmap
Stakeholder Engagement
Align with stakeholders on DR strategy and priorities
Act as a technical lead driving both strategy and execution
Requirements
- 10+ years of experience in infrastructure engineering or architecture roles
- Expert-level experience with VMware (HA, DRS, clustering, virtualization)
- Proven experience designing and implementing enterprise Disaster Recovery strategies
- Strong background in SAN/storage architecture and replication technologies
- Experience in multi-data center environments with failover and resiliency planning
- Ability to operate at both architectural and hands-on technical levels - Experience within highly regulated or enterprise-scale environments
- Familiarity with DR automation and orchestration tools
- Cloud or hybrid infrastructure exposure (AWS, Azure, or GCP)