Senior Engineer, Platform Infrastructure
Role details
Job location
Tech stack
Job description
We are seeking an experienced Senior Engineer to serve as the technical architect and lead for our Cloud foundation. In this role, you won't be just maintaining systems; you will be defining the future of how our infrastructure evolves. You will own the scalability and reliability of our global platforms while leading the charge into Agentic Infrastructure - leveraging AI to build self-healing, autonomous systems that reduce operational toil. You will act as the bridge between high-level business strategy and deep technical execution, ensuring our Kubernetes-driven environment is not just stable, but a competitive advantage. ** Please note that this opportunity is located in New York, NY, and requires this hire to work from our office four days a week. ** What You'll Do
- Architectural Leadership: Own the long-term technical roadmap for our AWS/Kubernetes ecosystem. Design and implement multi-region, high-availability architectures that support rapid product scaling.
- AI & Agentic Systems: Proactively identify and implement opportunities to use AI/LLMs to build agentic workflows for infrastructure management (e.g., autonomous scaling, automated incident remediation, and intelligent cost optimization).
- Advanced Orchestration: Lead the evolution of our Kubernetes and Istio service mesh strategy, focusing on performance tuning, advanced traffic management, and developer self-service.
- Infrastructure as Code (IaC): Set the standard for Terraform/OpenTofu usage, building reusable modules and frameworks that empower the entire engineering organization.
- Strategic Optimization: Partner with leadership to manage cloud spend and performance metrics, turning technical efficiency into business revenue growth.
- Mentorship & Influence: Act as a force multiplier by mentoring engineers and influencing cross-functional product teams on best practices for Cloud-native development.
- Incident Response & SRE: Serve as a high-level escalation point for complex systemic issues, conducting deep-dive post-mortems that drive structural improvements rather than quick fixes., At VTS, we pride ourselves on articulating a clear and transparent philosophy around equitable, impartial compensation that will allow us to recruit and retain an exceptional team. The base salary is market-driven at the time of offer and is based on tier 1 market data. The salary for this role will range between $160,000 and $200,000 and is determined by several factors, including your skills, prior relevant experience, quality of interviews, leveling, and geography. EEO Guidelines VTS embraces diversity and equal opportunity in a serious way. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better our work will be. All your information will be kept confidential according to EEO guidelines. For more information about what we collect and how we use it, please refer to the If you have a disability or special need that requires accommodation at any time during the recruitment process, please let us know at Create a Job Alert Interested in building your career at VTS? Get future opportunities sent straight to your email. Create alert
Requirements
- Deep Technical Mastery: You have extensive experience managing large-scale production environments in AWS, with "black belt" level knowledge of Kubernetes internals, networking (VPC/Direct Connect), and Istio.
- The "AI Native" Mindset: You are passionate about the intersection of Infrastructure and AI. You don't just want to monitor a system; you want to build an agent that monitors and fixes it for you.
- Proven Track Record: You have 8+ years of experience in Infrastructure, SRE, or DevOps, with at least 2 years in a Senior-level capacity driving cross-team initiatives.
- Polyglot Engineering: You are highly proficient in Go or Python, and you're comfortable diving into the application layer (Ruby, Node.js, or Java) to debug performance bottlenecks.
- Security-First Philosophy: While this is an infra-focused role, you bake security into the CI/CD pipeline and underlying architecture by default.