
Robust and light-weight overlay management for decentralized learning

Team and supervisors
Department / Team: 
Team website: 
Thesis supervisor
François Taïani
Co-supervisor(s)
Name: François Taïani
Phone: +33 (0) 2 99 84 75 04
Thesis topic

A growing number of companies are extracting value from the digital data produced by our modern society using machine learning (ML) techniques. Most of these companies today rely on centralized or tightly coupled ML systems hosted in data centers or in the cloud. This concentration is problematic: it poses strong risks to user privacy, and it limits the scope of ML applications to tightly integrated datasets under unified learning models.

To address these limitations, this PhD proposes to explore an alternative approach inspired by peer-to-peer networks, in which users control their own systems and exchange only a limited amount of information to construct local machine learning models. This strategy is more amenable to preserving user privacy and to respecting the constraints that may be imposed on sensitive datasets (such as health records or personal financial data). It also holds the potential for highly scalable and robust learning systems. This project aims to study the challenges raised by this strategy in terms of distribution and overlay management.
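
To make the idea of exchanging only a limited amount of information more concrete, here is a minimal sketch of pairwise gossip averaging. It is an illustration only, not the method this project will develop. The function name `gossip_average`, the uniform random choice of peers, and the single-process simulation are all assumptions made for the example. In a real deployment each node would hold only its own parameters and contact peers over a network through an overlay. Nodes share model parameters here, never their raw data.

```python
import random


def gossip_average(local_models, rounds, seed=0):
    """Simulate pairwise gossip averaging of model parameters.

    local_models: one parameter vector (list of floats) per node.
    Hypothetical helper for illustration; assumes at least two nodes.
    """
    rng = random.Random(seed)
    # Copy the vectors so the caller's data is left untouched.
    models = [list(m) for m in local_models]
    n = len(models)
    for _ in range(rounds):
        for i in range(n):
            # Pick a peer uniformly at random among the other nodes
            # (assumption: a real overlay would restrict this choice).
            j = rng.randrange(n - 1)
            if j >= i:
                j += 1
            # Both nodes adopt the average of their two parameter vectors.
            avg = [(a + b) / 2 for a, b in zip(models[i], models[j])]
            models[i] = avg
            models[j] = list(avg)
    return models


if __name__ == "__main__":
    start = [[0.0], [4.0], [8.0], [12.0]]
    for node, params in enumerate(gossip_average(start, rounds=50)):
        print(node, params)
```

Each exchange preserves the sum of the parameters across the two nodes, so the network-wide average is unchanged while the nodes' values move closer together. Choosing which peers to contact is exactly the overlay-management question the project raises. The uniform choice above is the simplest placeholder.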

More details available here. Apply online (CV, grade transcripts, references).


Start date: 
September 2017
Keywords: 
machine learning, distributed computing, overlays, self-organisation, distributed systems, epidemic protocols
IRISA - Campus universitaire de Beaulieu, Rennes