Orchestration of AI Services on Telco Cloud Platforms

Inria
Canton of Rennes-4, France
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
€ 32K

Job location

Remote
Canton of Rennes-4, France

Tech stack

Artificial Intelligence
Software Documentation
Computer Programming
Distributed Systems
Python
Network Functions Virtualization
Open Source Technology
TensorFlow
Sensor Fusion
Graphics Processing Unit (GPU)
Delivery Pipeline
Containerization
Information Technology
Machine Learning Operations

Job description

  • Setup and configuration of a Kubernetes-based Telco Cloud environment (CaaS)
  • Deployment and containerisation of AI services (inference models, intelligent network functions) on a heterogeneous testbed platform
  • Development of monitoring and profiling tools (latency, energy, accuracy) for dynamic AI workloads

Track B - Algorithmic Development and Experimentation

  • Implementation and evaluation of orchestration algorithms (heuristics, optimisation, adaptive policies) for AI model placement across heterogeneous nodes (GPUs, NPUs, edge servers, cloud)
  • Design of dynamic model configuration selection policies (early-exit, compression, mixed precision) based on network conditions and available resources
  • Comparative evaluation against baselines (static deployment, greedy heuristics) under varying load and network conditions

Track C - Data Collection and Experimental Validation

  • Construction of representative evaluation scenarios: video analytics, sensor fusion, intelligent network functions (traffic prediction, anomaly detection)
  • Experimental measurement campaigns on the testbed, analysis of energy-latency-quality trade-offs
  • Contribution to the production of open-source artefacts (code, datasets, configurations) and to the writing of scientific or technical deliverables

Requirements

Education: Master's degree or PhD in computer science, networking, distributed systems, or a related field., * Strong programming skills in Python

  • Knowledge of machine learning frameworks and familiarity with model training and inference pipelines
  • Understanding of distributed systems concepts (scheduling, resource management, containerisation), * A taste for experimentation and hands-on work. You enjoy building things, running experiments, and letting measurements guide your thinking. You are not deterred by a system that does not behave as expected - you are curious about why.
  • Comfort with open-ended problems. The scope of this project will evolve. The right candidate embraces this flexibility rather than seeking rigid task definitions, and is able to self-direct their work within a broader research agenda.
  • A collaborative and communicative nature. The project involves a multi-partner national programme (PEPR NF-MUST). You will interact with researchers from different institutions and backgrounds, and you are able to share your progress, your doubts and your findings clearly and constructively.
  • Cross-disciplinary curiosity. Whether your background is closer to systems, algorithms, or networking, what matters is a genuine interest in the neighbouring fields and a willingness to build bridges between them.
  • A research-oriented mindset. You are comfortable reading technical literature, situating your work in a broader scientific context, and contributing to written outputs that go beyond code documentation.

A thesis or significant project in the areas of network function virtualisation, edge computing, machine learning systems, or distributed optimisation would be a genuine asset. What matters most is the drive to produce rigorous, reproducible, and impactful work within a stimulating and supportive research environment.

Benefits & conditions

  • Subsidized meals
  • Partial reimbursement of public transport costs
  • Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
  • Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
  • Professional equipment available (videoconferencing, loan of computer equipment, etc.)
  • Social, cultural and sports events and activities
  • Access to vocational training
  • Social security coverage

About the company

The Inria Centre at Rennes University is one of Inria's eight centres and has more than thirty research teams. The Inria Centre is a major and recognized player in the field of digital sciences. It is at the heart of a rich R&D and innovation ecosystem: highly innovative PMEs, large industrial groups, competitiveness clusters, research and higher education players, laboratories of excellence, technological research institute, etc., This position is funded within the PEPR 5G and Networks of the Future programme, a national priority research programme (France 2030) co-directed by CEA, CNRS and IMT with a total budget of €65M. The programme aims to position France at the forefront of 5G, 6G and future network technologies across the full value chain. The present work is part of the NF-MUST project (End-to-End Multi-domain Service Management Architecture of the Networks of the Future), which focuses on automating the provisioning and lifecycle management of multi-domain, multi-stakeholder services over highly heterogeneous and dynamically evolving future network infrastructures. NF-MUST covers end-to-end orchestration of coordination, cooperation and interaction functions to satisfy diverse service requests across multiple sectors, with strong emphasis on resource availability, security, performance and frugality. The project runs from May 2023 to December 2027 and involves partners including CNRS, Inria, CEA-List, Télécom Paris, Télécom SudParis, EURECOM and others. This research engineer position contributes to the NF-MUST objectives by developing orchestration mechanisms for native AI services deployed over Telco Cloud infrastructure, at the intersection of AI workload management, cloud-native networking and future network architectures. Scientific Context 5G and pre-6G networks must host heterogeneous intelligent applications - autonomous driving, augmented reality, real-time video analytics, embedded machine learning - whose requirements in terms of latency, energy, and model quality evolve dynamically. Open Telco Cloud platforms, based on Kubernetes, provide a shared infrastructure across operators for hosting cloud-native network functions and edge workloads. A major challenge remains, however: these platforms do not natively handle the specificities of AI workloads - adaptive architectures, distributed training, energy-aware inference scheduling. This position is part of a research project aimed at designing and validating intelligent orchestration mechanisms for dynamic AI models on Telco Cloud infrastructure, covering the device-edge-cloud continuum. The solutions developed will be designed to be compatible with open standards in the field (Kubernetes/CaaS) and potentially integrable into different Telco Cloud platforms (Sylva, Nephio, or other cloud-native stacks)., More than a checklist of technical skills, what will make this assignment a success is a particular mindset and a certain way of engaging with research and engineering work. The ideal candidate is someone who genuinely enjoys operating at the boundary between systems and ideas - someone who finds satisfaction not only in making things work, but in understanding why they work and what they reveal about the underlying problem. This role sits at the crossroads of distributed systems, AI, and networking: an intellectual appetite for all three, even without deep expertise in each, will go a long way.

Apply for this position