Member of Technical Staff - Data Infrastructure Manager - Microsoft AI - Copilot

Microsoft
Redmond, United States of America
15 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 304K

Job location

New York, United States of America

Tech stack

PHP
Artificial Intelligence
Amazon Web Services (AWS)
Application Release Automation
Azure
Bash
Big Data
Cloud Computing
Computer Engineering
Information Engineering
Data Infrastructure
Data Systems
DevOps
Distributed File Systems
Distributed Systems
Hadoop Distributed File System
Monitoring of Systems
Identity and Access Management
Python
Kerberos (Protocol)
Enterprise Messaging Systems
Microsoft Office
Node.js
NoSQL
OAuth
Powershell
RabbitMQ
Reliability Engineering
Software Engineering
TypeScript
AI Infrastructure
Cloud Platform System
React
Delivery Pipeline
Multi-Agent Systems
Spark
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Deployment Automation
Bare Metal
Bicep
Kafka
Build Tools
Machine Learning Operations
Virtual Agents
Terraform
Data Pipelines
Databricks

Job description

As Microsoft continues to push the boundaries of AI, we are on the lookout for passionate leaders to help us tackle the most interesting and challenging AI questions of our time. Our vision is bold and broad: to build systems that have true artificial intelligence across agents, applications, services, and infrastructure. It's also inclusive: we aim to make AI accessible to all (consumers, businesses, and developers) so that everyone can realize its benefits.

We're looking for a Data Infrastructure Manager to lead a team of talented engineers building and scaling the data infrastructure that powers Microsoft's consumer AI. This role sits at the intersection of technical leadership and people management. You'll set the technical direction for large-scale data and ML pipelines, AI agentic workflows, and intelligent systems while growing a high-performing team of ICs. If you've architected big data platforms from the ground up and are now ready to multiply your impact through others, including on some of the most exciting AI infrastructure challenges in the industry, we want to hear from you.

You'll bring:

  • Deep technical expertise in big data and distributed systems
  • A track record of leading and developing engineering talent
  • A passion for automation, observability, and operational excellence
  • The ability to translate complex technical strategy into clear, executable plans
  • Empathy, collaboration, and a growth mindset

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of Respect, Integrity, and Accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.

Responsibilities

Team Leadership & People Development

  • Hire, mentor, and develop a team of Data Infrastructure Engineers, fostering a culture of technical excellence, ownership, and continuous growth.
  • Conduct regular 1:1s, set clear goals, and provide actionable feedback to support each engineer's career development.
  • Build and sustain an inclusive, collaborative team environment aligned with Microsoft's values of Respect, Integrity, Accountability, and Inclusion.

Technical Strategy & Architecture

  • Define and drive the technical vision for a scalable, reliable, and observable Big Data Infrastructure serving mission-critical AI applications, including agentic and intelligent systems.
  • Lead technical design reviews, establish engineering standards, and ensure a clean, secure, and well-documented codebase.
  • Partner with engineers to architect data solutions across storage, compute, and analytics layers, including the pipelines and orchestration frameworks that underpin AI agent workflows, balancing long-term scalability with near-term delivery.

Platform & Operations

  • Champion DevOps and SRE best practices across the team, including automated deployments, service monitoring, and incident response.
  • Guide the team in building a self-service big data platform that empowers data engineers, researchers, and partner teams.
  • Oversee robust CI/CD pipelines and infrastructure-as-code practices using tools like Bicep, Terraform, and ARM.
  • Lead capacity planning and drive proactive resolution of bottlenecks in data pipelines and infrastructure.

Cross-Functional Collaboration

  • Act as a key technical partner to Data Engineers, Data Scientists, AI Researchers, ML Engineers, and Developers to deliver secure, seamless big data workflows.
  • Collaborate with Security teams to uphold strong infrastructure security practices (IAM, OAuth, Kerberos).
  • Represent the team in planning and prioritization discussions, translating organizational goals into actionable engineering roadmaps.

Requirements

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience.
  • Master's Degree in Computer Science or related technical field AND 10+ years of technical engineering experience OR Bachelor's Degree AND 14+ years, OR equivalent experience.
  • 5+ years in Big Data Infrastructure, DevOps, SRE, or Platform Engineering.
  • 5+ years of hands-on experience with distributed systems from bare-metal to cloud-native environments.
  • 5+ years overseeing or contributing to containerized application deployments using Kubernetes and Helm/Kustomize.
  • Solid scripting and automation fluency in Python, Bash, or PowerShell.
  • Proven track record managing CI/CD pipelines, release automation, and production incident response.
  • Hands-on expertise with modern data platforms like Databricks, including deep familiarity with relational and NoSQL databases, key-value stores, Spark compute engines, distributed file systems (e.g., HDFS, ADLS Gen2), and messaging systems (e.g., Event Hub, Kafka, RabbitMQ).
  • Proven experience with cloud-native infrastructure across Azure, AWS, or GCP.
  • Strong collaboration history with Data Engineers, Data Scientists, ML Engineers, Networking, and Security teams.
  • Experience with agentic workflow infrastructure, including orchestration frameworks (e.g., Semantic Kernel, AutoGen), retrieval pipelines, and the data infrastructure patterns that support multi-agent systems at scale.
  • Familiarity with modern web stacks: TypeScript, Node.js, React, and optionally PHP.

#MicrosoftAI #MAIDPS

Software Engineering M5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.

About the company

Microsoft is a global technology company headquartered in Redmond, Washington. Our mission is to empower every person and every organization on the planet to achieve more. We develop, license, and support a wide range of software products, services, and devices that help individuals and businesses realize their full potential.

Our flagship products include the Microsoft 365 productivity cloud, Windows operating system, Azure cloud platform, and Dynamics 365 business applications. We are also a leader in areas such as artificial intelligence, cybersecurity, developer tools, and gaming through Xbox and Game Pass.

With operations in more than 190 countries and over 220,000 employees worldwide, Microsoft is committed to responsible innovation, inclusive economic growth, and sustainability. We work closely with governments, industries, and communities to ensure that technology serves the public good and helps address some of the world’s most pressing challenges.

As we celebrate our 50th anniversary in 2025, we continue to look forward—investing in AI, cloud, and quantum computing to shape the future of work, education, and society at large scale.
