Network Automation Engineer
Job description
We are seeking an experienced Network Automation Engineer who wants to play a leading role in designing, developing, and building scalable AI Ops platforms that deliver RAG solutions, model training, and agentic AI workflows in support of a carrier-class, global IP network.
The ideal candidate will have a strong background in development fundamentals and coding, with a solid knowledge of network technologies and platforms. The Network Automation Engineer will have a key role in shaping the future of our network management infrastructure and should be comfortable interacting with highly technical software and networking teams.
Responsibilities:
Network Automation Strategy:
- Develop and implement a comprehensive AI Ops network automation strategy aligned with organizational goals and objectives.
- Identify opportunities for automation in network provisioning, configuration management, monitoring and analysis
AI Ops:
- Provision and manage on-prem AI infrastructure and containerized environments (Docker, Kubernetes) optimized for GPU workloads and distributed compute
- Deploy and orchestrate agentic AI systems (LangChain, LangGraph, etc.)
- Implement security best practices including secrets management, model access control, data encryption, and audit logging for AI pipelines
- Monitor and optimize performance of AI workloads
Scripting and Coding:
- Develop and maintain scripts, workflows, and code to automate network tasks using Python, Ansible, and other automation tools.
- Collaborate with software development teams to integrate network automation into the CI/CD pipeline.
Network Orchestration:
- Implement network orchestration solutions to streamline the provisioning and management of network services.
Network Inventory Management:
- Develop network inventory reporting and control applications
Monitoring and Analytics:
- Develop telemetry and NetFlow-based analytics solutions for a carrier-grade network
- Integrate network automation with monitoring tools for real-time visibility and reporting.
Customer Ownership: We own our customers' success, whether an internal stakeholder or an external client. We take full accountability, anticipate their needs and create smooth experiences to build trust with every touchpoint. In an automated world, personal ownership is GTT's competitive edge.
Requirements
- 4+ years of full software development life cycle experience in a network technology environment, preferably with experience in AI/ML environments.
- Strong proficiency in Python scripting and YAML.
- Experience with network automation tools and frameworks (e.g., Ansible, Puppet, Chef).
- Good understanding of Git workflows and their integration into CI/CD pipelines (GitHub, GitLab, Jenkins).
- Server and GPU infrastructure management experience
- Solid understanding of container lifecycle management
- Foundational knowledge of network protocols, routing, switching, platforms and network inventory management.
- Network analytics experience, including NetFlow and processing of high-volume telemetry data
- Experience with NetBox customization is a plus.
- Must be results-oriented and constantly looking for new ways to improve efficiency and productivity.
- Must be able to exercise independent judgment under minimal supervision.
- Ability to assess and analyze technical situations and provide viable alternatives.
Our Commitments:
Adaptive Mindset: We meet change head-on to build the capabilities we need now. We take personal ownership of our professional development to keep pace with change and actively drive it.