Director of Operating Systems (Windows & Linux)
Job description
As a member of the IT Enterprise Product & Platform Engineering team, the Director of Operating Systems (Windows & Linux) leads engineering for core compute operating system platforms across on-premises and hybrid cloud environments. This leader owns platform reliability, security posture, automation and self-service, and lifecycle and modernization roadmaps, and leads a high-performing organization of managers and senior engineers. The role partners closely with Architecture, Platform Engineering, Cloud, Security, End-User Computing, Networking, and Application teams to deliver scalable, resilient, cost-effective services that enable business outcomes.
Your Primary Responsibilities:
- Platform Strategy & Roadmap: Define and execute multi-year strategy for Windows and Linux OS platforms, including standard builds, lifecycle management, and retirement of legacy technologies.
- Modernization: Lead modernization initiatives such as gold image and configuration baseline standardization, OS upgrade programs, legacy domain and identity integrations (as applicable), and reduction of technical debt across fleets.
- Automation & Self-Service: Build an automation-first culture; expand infrastructure-as-code, configuration management, and automated remediation to reduce toil and improve consistency (e.g., CI/CD for images, patching, and policy enforcement).
- Hybrid Cloud Enablement: Deliver consistent OS capabilities across on-premises and cloud environments, including provisioning workflows, secure configuration, monitoring/telemetry, and operational runbooks.
- Reliability Engineering: Establish SLOs/SLIs, error budgets, and reliability practices; improve MTTR through observability, incident response improvements, and proactive problem management.
- Security & Compliance: Partner with Security and Risk teams to meet vulnerability remediation targets, hardening standards (e.g., CIS/STIG equivalents), endpoint protection integration, and audit requirements.
- Operational Excellence: Own patch and change management, capacity planning, performance management, and operational readiness; ensure clear ownership, playbooks, and escalation paths.
- Financial & Vendor Management: Manage budgets and contracts for OS-related tooling and services (e.g., monitoring, patching, automation, identity, and support agreements); drive cost optimization through standardization and cloud economics.
- Org Leadership: Lead, coach, and develop a large organization of managers and senior technical leaders; set clear goals, establish operating rhythms, and build succession plans.
- Stakeholder Partnership: Build strong relationships with application owners and infrastructure partners; translate business needs into platform capabilities and measurable outcomes.
Requirements
- Minimum of 12 years of related experience
- Bachelor's degree in Computer Science or Engineering, or equivalent practical experience; advanced degree a plus.
Talents Needed for Success:
- 12+ years of progressive experience in infrastructure/compute platform engineering and operations, including significant ownership of Windows and Linux server environments at enterprise scale.
- 7+ years of people leadership experience, including leading managers and building high-performing, multi-team organizations.
- Deep expertise in Windows Server and major Linux distributions, including troubleshooting, performance, storage/filesystems, and service lifecycle management.
- Demonstrated success driving modernization programs (e.g., large-scale OS upgrades, standard image programs, legacy platform rationalization) with measurable outcomes.
- Proven track record implementing automation and infrastructure-as-code at scale (e.g., Ansible, Puppet/Chef/Salt, PowerShell/DSC, Terraform, GitOps patterns).
- Hands-on understanding of hybrid cloud operating models and cloud-native patterns (e.g., immutable images, autoscaling, policy-as-code, secure landing zones, identity and access integration).
- Strong operational leadership: incident management, problem management, change control, capacity planning, and establishing SLO-driven reliability practices.
- Experience partnering with Security on hardening, vulnerability management, and audit/compliance requirements.
- Excellent communication skills with the ability to influence executives and drive alignment across engineering, security, and business stakeholders.
Benefits & conditions
- Competitive compensation, including base pay and annual incentive
- Comprehensive health and life insurance and well-being benefits, based on location
- Pension / Retirement benefits
- Paid Time Off and Personal/Family Care, and other leaves of absence when needed to support your physical, financial, and emotional well-being.
- DTCC offers a flexible/hybrid model of 3 days onsite and 2 days remote (onsite Tuesdays, Wednesdays and a third day unique to each team or employee).