Junior Network Engineer
Role details
Job location
Tech stack
Job description
We are seeking a Network Engineer to design, implement, and manage high-performance networks for HPC and AI infrastructure.
Candidates will work on cutting-edge technologies, including InfiniBand, optical networking, and advanced Linux-based systems, contributing to scalable, secure, and high-availability network solutions. They should also have expertise in IP routing protocols (BGP, OSPF) and network automation (Ansible, Nornir & Netmiko)., * Monitor the performance and health of InfiniBand fabrics, including switches, host adapters, and nodes, using existing tools and contribute to developing new monitoring solutions where necessary.
- Investigate & Help diagnose network connectivity issues, performance bottlenecks, and component failures.
- Collaborate with cross-functional teams to support HPC clusters and ensure smooth network operation.
- Assist with the deployment and configuration of network infrastructures, including large-scale fabric installations from initial setup to operational readiness.
- Maintain and update network documentation and workflows to align with organizational standards.
- Contribute to the requirements for deployments and guide cross-functional teams during implementation.
- Develop and implement advanced monitoring tools and strategies for network performance.
- Work with senior colleagues to research technologies to improve scalability and security.
- Work with senior colleagues to help optimization initiatives, ensuring maximum efficiency, security, and performance.
- Contribute to new network technologies into existing infrastructure.
- Troubleshoot for complex, high-impact network issues across multiple sites.
- Support with Technical Network incidents, working with other teams to resolve issues quickly.
- Work with your team to support network automation solutions using Ansible, Nornir, and Netmiko.
- Contribute ideas and initiatives to ensure network changes are repeatable, efficient, and error-free., * Contribute to network-related Linux administration, ensuring high availability, security, and performance.
- Work with an understanding of security measures, including firewalls, VPNs, and access control policies.
- HPC & InfiniBand understanding:
- Contribute to designing and implementation of HPC network architectures, focusing on InfiniBand configurations for performance-critical environments.
- Work alongside senior colleagues for integration and management of InfiniBand for high-throughput, low-latency computing systems.
- Provide technical input on HPC interconnect issues, optimizing performance across large clusters.
Requirements
-
Knowledge of InfiniBand configuration and management.
-
Familiarity with optical networking hardware and Linux system administration.
-
Have knowledge of one scripting language (e.g., Python, Bash).
-
Analytical and troubleshooting skills.
-
Ability to collaborate effectively in team environments.
-
Willingness to travel to data centers for deployments and support. 2 years of experience in network engineering, with a focus on high-performance environments is Ideal but not mandatory
-
Understanding in InfiniBand, RDMA, and advanced network architectures is a Bonus
-
Certifications (e.g., CCIE, NVIDIA DPU Certification) is a Bonus