
Showing 1–16 of 16 results for author: Mathieu, M

  1. arXiv:2308.03526  [pdf, other]

    cs.LG cs.AI

    AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

    Authors: Michaël Mathieu, Sherjil Ozair, Srivatsan Srinivasan, Caglar Gulcehre, Shangtong Zhang, Ray Jiang, Tom Le Paine, Richard Powell, Konrad Żołna, Julian Schrittwieser, David Choi, Petko Georgiev, Daniel Toyama, Aja Huang, Roman Ring, Igor Babuschkin, Timo Ewalds, Mahyar Bordbar, Sarah Henderson, Sergio Gómez Colmenarejo, Aäron van den Oord, Wojciech Marian Czarnecki, Nando de Freitas, Oriol Vinyals

    Abstract: StarCraft II is one of the most challenging simulated reinforcement learning environments; it is partially observable, stochastic, multi-agent, and mastering StarCraft II requires strategic planning over long time horizons with real-time low-level execution. It also has an active professional competitive scene. StarCraft II is uniquely suited for advancing offline RL algorithms, both because of it…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 32 pages, 13 figures, previous version published as a NeurIPS 2021 workshop: https://openreview.net/forum?id=Np8Pumfoty

  2. arXiv:2208.12847  [pdf, other]

    eess.IV cs.CV

    Region-guided CycleGANs for Stain Transfer in Whole Slide Images

    Authors: Joseph Boyd, Irène Villa, Marie-Christine Mathieu, Eric Deutsch, Nikos Paragios, Maria Vakalopoulou, Stergios Christodoulidis

    Abstract: In whole slide imaging, commonly used staining techniques based on hematoxylin and eosin (H&E) and immunohistochemistry (IHC) stains accentuate different aspects of the tissue landscape. In the case of detecting metastases, IHC provides a distinct readout that is readily interpretable by pathologists. IHC, however, is a more expensive approach and not available at all medical centers. Virtually ge…

    Submitted 26 August, 2022; originally announced August 2022.

  3. arXiv:2107.12808  [pdf, other]

    cs.LG cs.AI cs.MA

    Open-Ended Learning Leads to Generally Capable Agents

    Authors: Open Ended Learning Team, Adam Stooke, Anuj Mahajan, Catarina Barros, Charlie Deck, Jakob Bauer, Jakub Sygnowski, Maja Trebacz, Max Jaderberg, Michael Mathieu, Nat McAleese, Nathalie Bradley-Schmieg, Nathaniel Wong, Nicolas Porcel, Roberta Raileanu, Steph Hughes-Fitt, Valentin Dalibard, Wojciech Marian Czarnecki

    Abstract: In this work we create agents that can perform well beyond a single, individual task, that exhibit much wider generalisation of behaviour to a massive, rich space of challenges. We define a universe of tasks within an environment domain and demonstrate the ability to train agents that are generally capable across this vast space and beyond. The environment is natively multi-agent, spanning the con…

    Submitted 31 July, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

  4. arXiv:1611.03383  [pdf, other]

    cs.LG stat.ML

    Disentangling factors of variation in deep representations using adversarial training

    Authors: Michael Mathieu, Junbo Zhao, Pablo Sprechmann, Aditya Ramesh, Yann LeCun

    Abstract: We introduce a conditional generative model for learning to disentangle the hidden factors of variation within a set of labeled observations, and separate them into complementary codes. One code summarizes the specified factors of variation associated with the labels. The other summarizes the remaining unspecified variability. During training, the only available source of supervision comes from ou…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Conference paper in NIPS 2016

  5. arXiv:1609.03126  [pdf, other]

    cs.LG stat.ML

    Energy-based Generative Adversarial Network

    Authors: Junbo Zhao, Michael Mathieu, Yann LeCun

    Abstract: We introduce the "Energy-based Generative Adversarial Network" model (EBGAN) which views the discriminator as an energy function that attributes low energies to the regions near the data manifold and higher energies to other regions. Similar to the probabilistic GANs, a generator is seen as being trained to produce contrastive samples with minimal energies, while the discriminator is trained to as…

    Submitted 6 March, 2017; v1 submitted 11 September, 2016; originally announced September 2016.

    Comments: Submitted to ICLR 2017

  6. arXiv:1511.05440  [pdf, other]

    cs.LG cs.CV stat.ML

    Deep multi-scale video prediction beyond mean square error

    Authors: Michael Mathieu, Camille Couprie, Yann LeCun

    Abstract: Learning to predict future images from a video sequence involves the construction of an internal representation that models the image evolution accurately, and therefore, to some degree, its content and dynamics. This is why pixel-space video prediction may be viewed as a promising avenue for unsupervised feature learning. In addition, while optical flow has been a very studied problem in computer…

    Submitted 26 February, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

  7. arXiv:1506.03011  [pdf, other]

    cs.CV

    Learning to Linearize Under Uncertainty

    Authors: Ross Goroshin, Michael Mathieu, Yann LeCun

    Abstract: Training deep feature hierarchies to solve supervised learning tasks has achieved state of the art performance on many problems in computer vision. However, a principled way in which to train such hierarchies in the unsupervised setting has remained elusive. In this work we suggest a new architecture and loss for training deep feature hierarchies that linearize the transformations observed in unla…

    Submitted 10 September, 2015; v1 submitted 9 June, 2015; originally announced June 2015.

    Comments: To appear at NIPS 2015

  8. arXiv:1506.02351  [pdf, other]

    stat.ML cs.LG cs.NE

    Stacked What-Where Auto-encoders

    Authors: Junbo Zhao, Michael Mathieu, Ross Goroshin, Yann LeCun

    Abstract: We present a novel architecture, the "stacked what-where auto-encoders" (SWWAE), which integrates discriminative and generative pathways and provides a unified approach to supervised, semi-supervised and unsupervised learning without relying on sampling during training. An instantiation of SWWAE uses a convolutional net (Convnet) (LeCun et al. (1998)) to encode the input, and employs a deconvoluti…

    Submitted 14 February, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

    Comments: Workshop track - ICLR 2016

  9. arXiv:1412.7753  [pdf, other]

    cs.NE cs.LG

    Learning Longer Memory in Recurrent Neural Networks

    Authors: Tomas Mikolov, Armand Joulin, Sumit Chopra, Michael Mathieu, Marc'Aurelio Ranzato

    Abstract: Recurrent neural network is a powerful model that learns temporal patterns in sequential data. For a long time, it was believed that recurrent networks are difficult to train using simple optimizers, such as stochastic gradient descent, due to the so-called vanishing gradient problem. In this paper, we show that learning longer term patterns in real data, such as in natural language, is perfectly…

    Submitted 16 April, 2015; v1 submitted 24 December, 2014; originally announced December 2014.

  10. arXiv:1412.7580  [pdf, ps, other]

    cs.LG cs.DC cs.NE

    Fast Convolutional Nets With fbfft: A GPU Performance Evaluation

    Authors: Nicolas Vasilache, Jeff Johnson, Michael Mathieu, Soumith Chintala, Serkan Piantino, Yann LeCun

    Abstract: We examine the performance profile of Convolutional Neural Network training on the current generation of NVIDIA Graphics Processing Units. We introduce two new Fast Fourier Transform convolution implementations: one based on NVIDIA's cuFFT library, and another based on a Facebook authored FFT implementation, fbfft, that provides significant speedups over cuFFT (over 1.5x) for whole CNNs. Both of t…

    Submitted 10 April, 2015; v1 submitted 23 December, 2014; originally announced December 2014.

    Comments: Camera ready for ICLR2015

  11. arXiv:1412.6604  [pdf, ps, other]

    cs.LG cs.CV

    Video (language) modeling: a baseline for generative models of natural videos

    Authors: MarcAurelio Ranzato, Arthur Szlam, Joan Bruna, Michael Mathieu, Ronan Collobert, Sumit Chopra

    Abstract: We propose a strong baseline model for unsupervised feature learning using video data. By learning to predict missing frames or extrapolate future frames from an input video sequence, the model discovers both spatial and temporal correlations which are useful to represent complex deformations and motion patterns. The models we propose are largely borrowed from the language modeling literature, and…

    Submitted 4 May, 2016; v1 submitted 20 December, 2014; originally announced December 2014.

  12. arXiv:1412.0233  [pdf, other]

    cs.LG

    The Loss Surfaces of Multilayer Networks

    Authors: Anna Choromanska, Mikael Henaff, Michael Mathieu, Gérard Ben Arous, Yann LeCun

    Abstract: We study the connection between the highly non-convex loss function of a simple model of the fully-connected feed-forward neural network and the Hamiltonian of the spherical spin-glass model under the assumptions of: i) variable independence, ii) redundancy in network parametrization, and iii) uniformity. These assumptions enable us to explain the complexity of the fully decoupled neural network t…

    Submitted 21 January, 2015; v1 submitted 30 November, 2014; originally announced December 2014.

  13. arXiv:1404.7195  [pdf, other]

    cs.LG

    Fast Approximation of Rotations and Hessians matrices

    Authors: Michael Mathieu, Yann LeCun

    Abstract: A new method to represent and approximate rotation matrices is introduced. The method represents approximations of a rotation matrix $Q$ with linearithmic complexity, i.e. with $\frac{1}{2}n\lg(n)$ rotations over pairs of coordinates, arranged in an FFT-like fashion. The approximation is "learned" using gradient descent. It allows to represent symmetric matrices $H$ as $QDQ^T$ where $D$ is a diago…

    Submitted 28 April, 2014; originally announced April 2014.

  14. arXiv:1312.6229  [pdf, ps, other]

    cs.CV

    OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

    Authors: Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, Yann LeCun

    Abstract: We present an integrated framework for using Convolutional Networks for classification, localization and detection. We show how a multiscale and sliding window approach can be efficiently implemented within a ConvNet. We also introduce a novel deep learning approach to localization by learning to predict object boundaries. Bounding boxes are then accumulated rather than suppressed in order to incr…

    Submitted 23 February, 2014; v1 submitted 21 December, 2013; originally announced December 2013.

  15. arXiv:1312.5851  [pdf, other]

    cs.CV cs.LG cs.NE

    Fast Training of Convolutional Networks through FFTs

    Authors: Michael Mathieu, Mikael Henaff, Yann LeCun

    Abstract: Convolutional networks are one of the most widely employed architectures in computer vision and machine learning. In order to leverage their ability to learn complex functions, large amounts of data are required for training. Training a large convolutional network to produce state-of-the-art results can take weeks, even when using modern GPUs. Producing labels using a trained network can also be c…

    Submitted 6 March, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

  16. arXiv:1104.1546  [pdf, other]

    cs.RO

    Physical Simulation of Inarticulate Robots

    Authors: Guillaume Claret, Michaël Mathieu, David Naccache, Guillaume Seguin

    Abstract: In this note we study the structure and the behavior of inarticulate robots. We introduce a robot that moves by successive revolvings. The robot's structure is analyzed, simulated and discussed in detail.

    Submitted 8 April, 2011; originally announced April 2011.