(Senior) Infrastructure Engineer (OpenStack Neutron Specialist)
Role details
Job location
Tech stack
Job description
We're hiring an Infrastructure Engineer (OpenStack Neutron Specialist) to own and evolve the OpenStack networking layer that underpins Nscale's internal and customer-facing cloud services.
This role sits within the Infrastructure Engineering team, which is responsible for the design, implementation, operation, and continuous improvement of the infrastructure stack. You'll work closely with compute, storage, platform engineering, support, architecture, and pre-sales teams, serving as a deep subject matter expert across Neutron and associated networking technologies including OVN, Open vSwitch, routing, DHCP, metadata services, tenant isolation, and network automation.
This is a high-impact role at the heart of Nscale's cloud platform. You'll help ensure the availability, scalability, performance, and security of the networking layer, act as a 3rd/4th line escalation point for complex issues, and represent real-world operational needs in the upstream OpenStack community to help shape the future of Neutron and related projects., * Design scalable, resilient, and secure OpenStack networking platforms with a strong focus on Neutron and OVN/OVS.
- Own the architecture and day-to-day operation of virtual networking services, including L2/L3 networking, DHCP, metadata, floating IP, NAT, security groups, and tenant segmentation.
- Ensure OpenStack networking platforms adhere to security, compliance, and operational standards.
- Support upgrades, lifecycle management, and change execution across OpenStack networking services with a strong focus on service continuity.
Troubleshooting and Operational Excellence
- Troubleshoot complex control plane and data plane issues across OpenStack networking components and the underlying Linux networking stack.
- Act as a 3rd/4th line escalation point for advanced networking incidents.
- Conduct root cause analysis for critical issues and drive permanent fixes.
- Participate in on-call rotations and incident response activities for critical infrastructure services.
Automation, Performance, and Resilience
- Drive continuous improvement in network automation, provisioning, validation, monitoring, and recovery using infrastructure-as-code and configuration management tools.
- Lead performance tuning, scalability planning, and resilience improvements for network-heavy and latency-sensitive cloud workloads.
- Build and improve automation approaches that strengthen operational consistency and recovery across the platform.
- Contribute specialist input to infrastructure roadmap planning, platform standards, and solution design for customer and internal environments.
Cross-Functional and Upstream Contribution
- Work closely with compute, storage, and platform engineering teams to ensure effective integration across the broader cloud platform.
- Support pre-sales and solution design activities by providing expert guidance on cloud networking capabilities, constraints, and best practices.
- Contribute to upstream OpenStack networking communities, particularly Neutron and related projects such as OVN, through bug reports, code contributions, design discussions, testing, and reviews where appropriate.
- Track upstream roadmaps, release changes, and community direction to help shape Nscale's networking strategy, upgrade planning, and platform standards.
- Represent Nscale's operational requirements and real-world use cases in upstream discussions to help drive improvements for both the business and the broader community.
KPIs
- Availability of OpenStack networking services
- Resolution of advanced networking incidents and root cause fixes
- Network automation, provisioning, and recovery improvements
- Scalability and performance of network-heavy, latency-sensitive workloads
Requirements
Do you have experience in System administration?, * Strong experience working in large-scale OpenStack environments.
- Strong specialist knowledge of Neutron, including ML2, OVN, Open vSwitch, routing, DHCP, metadata, provider networks, tenant networks, VLAN/VXLAN/Geneve, and security groups.
- Strong Linux systems administration and troubleshooting experience.
- Strong understanding of Linux networking concepts including routing, bridging, namespaces, iptables/nftables, bonding, MTU, and packet flow analysis.
- Strong experience investigating complex network behaviour using tools such as tcpdump, iproute2, ovs/ovn tooling, logs, and metrics.
- Strong experience designing and building automation for cloud infrastructure using tools such as Ansible.
- Strong Python and Bash skills.
- Experience working with highly available production platforms and change management in mission-critical environments.
- Ability to collaborate across infrastructure, support, and architecture teams to solve complex technical problems.
- Experience contributing to or working closely with upstream open-source communities is highly desirable, particularly within OpenStack, Neutron, OVN, Open vSwitch, or related networking projects.
Benefits & conditions
At Nscale, you'll find a collaborative, supportive, and innovative environment where your contributions spark real impact. We're building something extraordinary, and we want you at the core.
Highly competitive compensation package (base + bonus + equity), with performance reviews every 12 months.
Join one of the fastest-growing AI infrastructure companies - your chance to directly shape how global AI capacity is planned and deployed.
Expect a dynamic progression plan tailored to your ambitions. Grow by leading critical cross-functional initiatives and shaping capital strategy - always with our full support.
Human-First Flexibility: We treat you as humans first. Our flexible workplace trusts Nscalers to deliver, giving you the autonomy to shape your day around life's moments.