Job offer
Role details
Job location
Tech stack
Job description
Replication techniques such as CRDTs ([Balegas18], [Preguiça19], [Yu20], [Rault22], [Ignat24]) demonstrate how distributed systems can converge under concurrent updates while preserving constraints. However, they focus on raw state synchronization, whereas in many real-world scenarios, data sharing is guided by views, i.e., query-defined abstractions that determine which parts of the underlying data are exposed. Views can already be incrementally maintained in centralized relational systems, with mature approaches such as DBSP ([Budiu22]) and OpenIVM ([Battiston24]) efficiently propagating changes from base data to derived query results. Recent work has begun to extend incremental view maintenance to decentralized and eventually consistent settings ([Thomassen23]), where replicas evolve independently and must reconcile divergent states. In the context of graph databases, prototype systems have shown how property graph views can be defined, materialized, and incrementally updated over underlying storage layers ([Han24]). However, existing techniques stop short of addressing the more complex challenge of constraint-aware, view-based replication in hybrid graph-relational environments, where data is not only distributed but also governed by specific constraints and access control policies ([Angles21], [Angles23], [Clark22]).
This PhD project addresses this gap by shifting the focus from replicating raw datasets to replicating constraint-aware, incrementally evolving hybrid views representations that combine relational structure with graph connectivity, encode domain semantics, and remain composable and correct across distributed environments. This would enable semantics-preserving, collaborative data platforms for decentralized graph-relational ecosystems.
Objectives of the PhD
The overarching goal of this PhD is to establish the theoretical and system foundations for a replication framework centered on constraint-aware, incrementally evolving hybrid views, elevating views from static query results to first-class, shareable, and composable entities in distributed data systems. The project will pursue three main objectives:
- Formalizing replicated hybrid views. Define a declarative and semantic foundation for views that unify relational and graph data, support selective exposure and structural transformation, and incorporate explicit constraints. This model will enable reasoning about the correctness of views under replication.
- Designing incremental and constraint-preserving replication mechanisms. Extend replication semantics beyond raw data by introducing algorithms and protocols that propagate and merge views while preserving convergence, integrity, and domain-specific invariants. This includes defining merge operators, conflict-resolution strategies, and conditions under which replicated views remain consistent and incrementally updatable across sites.
- Building a decentralized framework for view-based data sharing. Develop a framework in which hybrid views are first-class, shareable abstractions for data exchange among independent nodes. The framework will support their reliable management across distributed infrastructures, enabling collaborative data sharing while upholding correctness guarantees.
Research Questions
The project will address three key research questions:
- How can hybrid graph-relational views be defined as replicable, constraint-aware abstractions, integrating both topology, and domain-specific requirements?
- How can such views be incrementally updated and reconciled across distributed replicas while preserving correctness and integrity?
- How can replication models make such views the unit of synchronization, ensuring convergence under concurrency?
Requirements
Candidates should hold a Master's degree in Computer Science or a related field, with a strong background in data management or distributed systems. Familiarity with graph database technologies or system prototyping is highly desirable.
Languages FRENCH
Benefits & conditions
- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
- Social security coverage
€2300 gross/month Selection process