Showing 1–11 of 11 results for author: Rouault, S

Searching in archive cs.
  1. arXiv:2209.15259  [pdf, ps, other]

    cs.LG cs.AI cs.CR

    On the Impossible Safety of Large AI Models

    Authors: El-Mahdi El-Mhamdi, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Lê-Nguyên Hoang, Rafael Pinot, Sébastien Rouault, John Stephan

    Abstract: Large AI Models (LAIMs), of which large language models are the most prominent recent example, showcase some impressive performance. However they have been empirically found to pose serious security issues. This paper systematizes our knowledge about the fundamental impossibility of building arbitrarily accurate and secure machine learning models. More precisely, we identify key challenging featur…

    Submitted 9 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 40 pages

  2. arXiv:2110.03991  [pdf, other]

    cs.LG cs.CR

    Combining Differential Privacy and Byzantine Resilience in Distributed SGD

    Authors: Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, Sebastien Rouault, John Stephan

    Abstract: Privacy and Byzantine resilience (BR) are two crucial requirements of modern-day distributed machine learning. The two concepts have been extensively studied individually but the question of how to combine them effectively remains unanswered. This paper contributes to addressing this question by studying the extent to which the distributed SGD algorithm, in the standard parameter-server architectu…

    Submitted 5 October, 2023; v1 submitted 8 October, 2021; originally announced October 2021.

  3. arXiv:2107.07334  [pdf, other]

    cs.HC cs.CR cs.CY cs.LG

    Tournesol: A quest for a large, secure and trustworthy database of reliable human judgments

    Authors: Lê-Nguyên Hoang, Louis Faucon, Aidan Jungo, Sergei Volodin, Dalia Papuc, Orfeas Liossatos, Ben Crulis, Mariame Tighanimine, Isabela Constantin, Anastasiia Kucherenko, Alexandre Maurer, Felix Grimberg, Vlad Nitu, Chris Vossen, Sébastien Rouault, El-Mahdi El-Mhamdi

    Abstract: Today's large-scale algorithms have become immensely influential, as they recommend and moderate the content that billions of humans are exposed to on a daily basis. They are the de-facto regulators of our societies' information diet, from shaping opinions on public health to organizing groups for social movements. This creates serious concerns, but also great opportunities to promote quality info…

    Submitted 29 May, 2021; originally announced July 2021.

    Comments: 27 pages, 13 figures

  4. arXiv:2102.08166  [pdf, other]

    cs.LG cs.CR cs.DC

    Differential Privacy and Byzantine Resilience in SGD: Do They Add Up?

    Authors: Rachid Guerraoui, Nirupam Gupta, Rafaël Pinot, Sébastien Rouault, John Stephan

    Abstract: This paper addresses the problem of combining Byzantine resilience with privacy in machine learning (ML). Specifically, we study if a distributed implementation of the renowned Stochastic Gradient Descent (SGD) learning algorithm is feasible with both differential privacy (DP) and $(α,f)$-Byzantine resilience. To the best of our knowledge, this is the first work to tackle this problem from a theor…

    Submitted 24 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  5. arXiv:2010.05888  [pdf, other]

    cs.LG

    Garfield: System Support for Byzantine Machine Learning

    Authors: Rachid Guerraoui, Arsany Guirguis, Jérémy Max Plassmann, Anton Alexandre Ragot, Sébastien Rouault

    Abstract: We present Garfield, a library to transparently make machine learning (ML) applications, initially built with popular (but fragile) frameworks, e.g., TensorFlow and PyTorch, Byzantine-resilient. Garfield relies on a novel object-oriented design, reducing the coding effort, and addressing the vulnerability of the shared-graph architecture followed by classical ML frameworks. Garfield encompasses va…

    Submitted 31 December, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: 31 pages; 16 figures; 2 tables

  6. arXiv:2008.00742  [pdf, other]

    cs.LG cs.DC stat.ML

    Collaborative Learning in the Jungle (Decentralized, Byzantine, Heterogeneous, Asynchronous and Nonconvex Learning)

    Authors: El-Mahdi El-Mhamdi, Sadegh Farhadkhani, Rachid Guerraoui, Arsany Guirguis, Lê Nguyên Hoang, Sébastien Rouault

    Abstract: We study Byzantine collaborative learning, where $n$ nodes seek to collectively learn from each others' local data. The data distribution may vary from one node to another. No node is trusted, and $f < n$ nodes can behave arbitrarily. We prove that collaborative learning is equivalent to a new form of agreement, which we call averaging agreement. In this problem, nodes start each with an initial v…

    Submitted 1 December, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: 34 pages, 1 figure

    Journal ref: NeurIPS 2021

  7. arXiv:2003.00010  [pdf, other]

    cs.LG cs.CR cs.DC

    Distributed Momentum for Byzantine-resilient Learning

    Authors: El-Mahdi El-Mhamdi, Rachid Guerraoui, Sébastien Rouault

    Abstract: Momentum is a variant of gradient descent that has been proposed for its benefits on convergence. In a distributed setting, momentum can be implemented either at the server or the worker side. When the aggregation rule used by the server is linear, commutativity with addition makes both deployments equivalent. Robustness and privacy are however among motivations to abandon linear aggregation rules…

    Submitted 9 March, 2020; v1 submitted 28 February, 2020; originally announced March 2020.

    Comments: Source code (for academic use only): https://github.com/LPD-EPFL/ByzantineMomentum

  8. arXiv:1905.04374  [pdf, other]

    cs.DC cs.CR cs.LG stat.ML

    Fast and Robust Distributed Learning in High Dimension

    Authors: El-Mahdi El-Mhamdi, Rachid Guerraoui, Sébastien Rouault

    Abstract: Could a gradient aggregation rule (GAR) for distributed machine learning be both robust and fast? This paper answers by the affirmative through multi-Bulyan. Given $n$ workers, $f$ of which are arbitrary malicious (Byzantine) and $m=n-f$ are not, we prove that multi-Bulyan can ensure a strong form of Byzantine resilience, as well as an ${\frac{m}{n}}$ slowdown, compared to averaging, the fastest (…

    Submitted 5 February, 2021; v1 submitted 5 May, 2019; originally announced May 2019.

    Comments: preliminary theoretical draft, complements the SysML 2019 practical paper of which the code is provided at https://github.com/LPD-EPFL/AggregaThor. arXiv admin note: text overlap with arXiv:1703.02757

  9. arXiv:1905.03853  [pdf, other]

    cs.DC cs.LG stat.ML

    Genuinely Distributed Byzantine Machine Learning

    Authors: El-Mahdi El-Mhamdi, Rachid Guerraoui, Arsany Guirguis, Lê Nguyên Hoang, Sébastien Rouault

    Abstract: Machine Learning (ML) solutions are nowadays distributed, according to the so-called server/worker architecture. One server holds the model parameters while several workers train the model. Clearly, such architecture is prone to various types of component failures, which can be all encompassed within the spectrum of a Byzantine behavior. Several approaches have been proposed recently to tolerate B…

    Submitted 2 June, 2020; v1 submitted 5 May, 2019; originally announced May 2019.

    Comments: This is a merge of arXiv:1905.03853 and arXiv:1911.07537; arXiv:1911.07537 will be retracted

  10. arXiv:1802.07927  [pdf, other]

    stat.ML cs.CR cs.DC cs.LG

    The Hidden Vulnerability of Distributed Learning in Byzantium

    Authors: El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault

    Abstract: While machine learning is going through an era of celebrated success, concerns have been raised about the vulnerability of its backbone: stochastic gradient descent (SGD). Recent approaches have been proposed to ensure the robustness of distributed SGD against adversarial (Byzantine) workers sending poisoned gradients during the training phase. Some of these approaches have been proven Byzantine-r…

    Submitted 17 July, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

    Comments: Accepted to ICML 2018 as a long talk

  11. arXiv:1707.08167  [pdf, other]

    stat.ML cs.AI cs.DC cs.LG cs.NE

    On The Robustness of a Neural Network

    Authors: El Mahdi El Mhamdi, Rachid Guerraoui, Sebastien Rouault

    Abstract: With the development of neural networks based machine learning and their usage in mission critical applications, voices are rising against the "black box" aspect of neural networks as it becomes crucial to understand their limits and capabilities. With the rise of neuromorphic hardware, it is even more critical to understand how a neural network, as a distributed system, tolerates the failu…

    Submitted 7 August, 2017; v1 submitted 25 July, 2017; originally announced July 2017.

    Comments: 36th IEEE International Symposium on Reliable Distributed Systems, 26-29 September 2017, Hong Kong, China