Lead Product Software Engineer - Cloud Ops
Role details
Job location
Tech stack
Job description
Infrastructure Delivery & Architecture
- Own Terraform delivery for InnovateHub's Azure environments, authoring and maintaining infrastructure-as-code modules that are secure, repeatable, and compliant with SOC 2 and IRS 7216 requirements.
- Collaborate with Infrastructure Operations and Architecture and Product teams to design, review, and evolve cloud infrastructure, ensuring alignment with enterprise standards and cost while meeting the speed and flexibility InnovateHub requires.
- Design and implement Azure resource architectures (App Services, Functions, Cosmos DB, AI Services, Key Vault, networking) that support our AI product portfolio at scale
Product & Operational Health Reporting
- Build and own product and operational health dashboards using data sources such as Azure Monitor, Application Insights, and Log Analytics-giving the team and leadership clear visibility into system reliability, performance, and usage.
- Define and track SLIs/SLOs for InnovateHub products, establishing alerting strategies and incident response patterns that keep our services healthy.
- Translate telemetry and observability data into actionable insights that inform product decisions, capacity planning, and operational improvements.
Developer Enablement & Consultation
- Serve as the go-to cloud operations resource for InnovateHub developers-providing guidance, reusable templates, and hands-on consultation for infrastructure and deployment needs.
- Create and maintain infrastructure templates, CI/CD pipeline patterns, and deployment runbooks that accelerate developer self-service and reduce friction.
- Partner with developers during pair programming rotations to embed infrastructure best practices into the development workflow, including observability instrumentation, secure configuration, and cost optimization.
Cross-Team Collaboration
- Act as the liaison between InnovateHub and enterprise Infrastructure Operations and Architecture teams, ensuring cloud decisions are coordinated and knowledge flows in both directions.
- Work within InnovateHub's collaborative model where engineers understand business problems as deeply as technical solutions, partnering closely with product managers, UX, and research.
- Support cross-portfolio infrastructure needs, enabling other TAA teams to leverage patterns and platforms built by InnovateHub.
Requirements
- 5+ years building and operating cloud infrastructure (Azure strongly preferred)
- 2+ years hands-on experience with Terraform (or equivalent IaC tools) in production environments
- Strong proficiency with Azure PaaS services: App Services, Functions, Cosmos DB, Key Vault, Virtual Networks, Application Gateway, and Azure AI Services
- Deep experience with Azure Monitor, Application Insights, Log Analytics, and KQL for observability and health reporting
- Experience with CI/CD pipelines (Azure DevOps, GitHub Actions) including infrastructure deployment automation
- Experience with Agile/XP practices including TDD/BDD and pair programming
- Proficiency with AI coding tools (GitHub Copilot, Cursor, or similar) and regular use of GenAI utilities for development workflow
Technical Project Leadership:
- 3+ years leading infrastructure or platform engineering initiatives from design through production operation
- Experience working in startup-like environments or innovation teams
- Comfort working in collaborative, fast-paced Agile environments with weekly sprints and blurred (but aligned) role boundaries
Infrastructure & Operations Experience:
- Working knowledge of infrastructure security practices: network segmentation, managed identities, RBAC, secrets management, and compliance controls (SOC 2, IRS 7216 a plus)
- Experience defining SLIs/SLOs and building operational dashboards and alerting strategies
- Understanding of cost management and FinOps principles for Azure environments
Communication Skills:
Ability to explain infrastructure concepts to product-focused developers and non-technical stakeholders. Experience creating documentation, templates, and runbooks that enable self-service across teams., * Microsoft Azure AZ-104 (Azure Administrator) or AZ-305 (Azure Solutions Architect) certification is highly desired - our team values demonstrated Azure expertise and AI-102 certification is also a plus
- Experience with Azure Landing Zones, hub-spoke network topologies, or enterprise-scale Azure architectures
- Background in regulated or compliance-driven environments, particularly financial services or tax/accounting software
- Experience with containerization (Docker, AKS) and service mesh patterns
- Familiarity with AI/ML infrastructure needs: model serving, vector database operations, and RAG system infrastructure
- Experience with platform engineering, developer experience (DevEx) tooling, or internal developer platforms
We encourage applications from candidates with diverse backgrounds who have strong infrastructure skills, project leadership experience, and can adapt quickly to new challenges in a customer-focused, innovation-driven environment.
Benefits & conditions
$116,400.00 - $204,100.00 USD
This role is eligible for Bonus.
Compensation range listed is based on primary location of the position. Actual base salary offer is influenced by a wide array of factors including but not limited to skills, experience and actual hiring location. Your recruiter can share more information about the specific offer for the job location during the hiring process.