Sr. Software Development Manager
Role details
Job location
Tech stack
Job description
We are looking for an experienced Software Development Manager to lead backend engineering and agentic AI initiatives for Rack Navigator - MSI's next-generation platform for autonomous datacenter operations.
This role combines hands-on technical leadership with team management, focused on building scalable backend systems and intelligent agents that automate infrastructure provisioning, diagnostics, and lifecycle management. You will lead a team of engineers developing AI-driven workflows that interact with real-world datacenter systems at rack scale., Technical Leadership & Architecture
- Lead the design and development of backend services, APIs, and microservices powering Rack Navigator.
- Define architecture for agentic AI systems that orchestrate infrastructure workflows (provisioning, validation, diagnostics, remediation).
- Drive design decisions for scalability, reliability, and performance across distributed infrastructure systems.
- Establish best practices for system design, observability, and fault tolerance.
Agentic AI & Infrastructure Automation
- Oversee development of AI-driven agents that interact with infrastructure control planes, telemetry systems, and hardware APIs.
- Guide the implementation of automation for rack-scale diagnostics, health monitoring, and failure analysis.
- Ensure seamless integration between AI workflows and datacenter infrastructure (compute, storage, networking).
Team Leadership & Execution
- Build, mentor, and manage a high-performing backend engineering team.
- Drive sprint planning, execution, and delivery of key platform capabilities.
- Set clear goals, provide technical direction, and foster a culture of ownership and innovation.
- Balance hands-on coding/reviews with strategic leadership responsibilities.
Platform & Engineering Excellence
- Establish coding standards, testing frameworks, and CI/CD practices.
- Drive development of reusable libraries and internal platforms for infrastructure lifecycle automation.
- Ensure high-quality, maintainable, and well-documented code.
Cross-functional Collaboration
- Partner with infrastructure, hardware, production, and AI teams to deliver end-to-end solutions.
- Align engineering efforts with product and business goals.
- Support production deployments and continuous improvement of datacenter operations.
Requirements
- Bachelor's or Master's degree in Computer Science, Engineering, or related field.
- 8+ years of software development experience, with 2+ years in a technical leadership or management role.
- Strong proficiency in Python for backend systems and automation.
- Experience building scalable backend services, APIs, and distributed systems.
- Hands-on experience with infrastructure automation (Ansible, Python, Shell).
- Solid understanding of datacenter infrastructure (compute, networking, storage).
- Strong problem-solving and debugging skills in complex environments., * Experience building or leading teams developing agentic AI systems or LLM-driven automation.
- Familiarity with modern AI orchestration frameworks and "vibe coding" environments (rapid prototyping with AI-assisted development tools).
- Experience with GPU clusters, HPC systems, or rack-scale architectures.
- Background in diagnostics, system validation, or hardware-software integration.
- Experience with DevOps practices, CI/CD pipelines, and infrastructure-as-code.
- Knowledge of networking architectures (spine-leaf, VLANs, routing).