Showing 1–12 of 12 results for author: Senior, A

Searching in archive cs.
  1. arXiv:2310.05900  [pdf, other]

    quant-ph cs.LG

    Learning to Decode the Surface Code with a Recurrent, Transformer-Based Neural Network

    Authors: Johannes Bausch, Andrew W Senior, Francisco J H Heras, Thomas Edlich, Alex Davies, Michael Newman, Cody Jones, Kevin Satzinger, Murphy Yuezhen Niu, Sam Blackwell, George Holland, Dvir Kafri, Juan Atalaya, Craig Gidney, Demis Hassabis, Sergio Boixo, Hartmut Neven, Pushmeet Kohli

    Abstract: Quantum error-correction is a prerequisite for reliable quantum computation. Towards this goal, we present a recurrent, transformer-based neural network which learns to decode the surface code, the leading quantum error-correction code. Our decoder outperforms state-of-the-art algorithmic decoders on real-world data from Google's Sycamore quantum processor for distance 3 and 5 surface codes. On di…

    Submitted 9 October, 2023; originally announced October 2023.

    MSC Class: 81P73; 68T07 ACM Class: I.2.0; J.2

  2. arXiv:2106.14108  [pdf, other]

    cs.CE eess.IV

    Inferring a Continuous Distribution of Atom Coordinates from Cryo-EM Images using VAEs

    Authors: Dan Rosenbaum, Marta Garnelo, Michal Zielinski, Charlie Beattie, Ellen Clancy, Andrea Huber, Pushmeet Kohli, Andrew W. Senior, John Jumper, Carl Doersch, S. M. Ali Eslami, Olaf Ronneberger, Jonas Adler

    Abstract: Cryo-electron microscopy (cryo-EM) has revolutionized experimental protein structure determination. Despite advances in high resolution reconstruction, a majority of cryo-EM experiments provide either a single state of the studied macromolecule, or a relatively small number of its conformations. This reduces the effectiveness of the technique for proteins with flexible regions, which are known to…

    Submitted 26 June, 2021; originally announced June 2021.

  3. arXiv:1906.00398  [pdf, other]

    cs.LG cs.SI stat.ML

    Cost-sensitive Boosting Pruning Trees for depression detection on Twitter

    Authors: Lei Tong, Zhihua Liu, Zheheng Jiang, Feixiang Zhou, Long Chen, Jialin Lyu, Xiangrong Zhang, Qianni Zhang, Abdul Sadka Senior, Yinhai Wang, Ling Li, Huiyu Zhou

    Abstract: Depression is one of the most common mental health disorders, and a large number of depressed people commit suicide each year. Potential depression sufferers usually do not consult psychological doctors because they feel ashamed or are unaware of any depression, which may result in severe delay of diagnosis and treatment. In the meantime, evidence shows that social media data provides valuable clu…

    Submitted 21 January, 2022; v1 submitted 2 June, 2019; originally announced June 2019.

    Comments: 15 pages, 7 figures, accepted by IEEE Transactions on Affective Computing

  4. Deep Audio-Visual Speech Recognition

    Authors: Triantafyllos Afouras, Joon Son Chung, Andrew Senior, Oriol Vinyals, Andrew Zisserman

    Abstract: The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world problem - unconstrained natural language sentences, and in the wild videos. Our key contributions are: (1) we compare two models for lip reading, on…

    Submitted 22 December, 2018; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: Accepted for publication by IEEE Transactions on Pattern Analysis and Machine Intelligence

  5. arXiv:1807.05162  [pdf, other]

    cs.CV cs.LG

    Large-Scale Visual Speech Recognition

    Authors: Brendan Shillingford, Yannis Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Ben Coppin, Ben Laurie, Andrew Senior, Nando de Freitas

    Abstract: This work presents a scalable solution to open-vocabulary visual speech recognition. To achieve this, we constructed the largest existing visual speech recognition dataset, consisting of pairs of text and video clips of faces speaking (3,886 hours of video). In tandem, we designed and trained an integrated lipreading system, consisting of a video processing pipeline that maps raw video to stable v…

    Submitted 1 October, 2018; v1 submitted 13 July, 2018; originally announced July 2018.

  6. Lip Reading Sentences in the Wild

    Authors: Joon Son Chung, Andrew Senior, Oriol Vinyals, Andrew Zisserman

    Abstract: The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world problem - unconstrained natural language sentences, and in the wild videos. Our key contributions are: (1) a 'Watch, Listen, Attend and Spell' (WL…

    Submitted 30 January, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

  7. arXiv:1610.09027  [pdf, other]

    cs.LG

    Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

    Authors: Jack W Rae, Jonathan J Hunt, Tim Harley, Ivo Danihelka, Andrew Senior, Greg Wayne, Alex Graves, Timothy P Lillicrap

    Abstract: Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows --- limiting their applicability to real-world domains. Here, we present an end-to-end differentiable memory…

    Submitted 27 October, 2016; originally announced October 2016.

    Comments: in 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain

  8. arXiv:1609.03499  [pdf, other]

    cs.SD cs.LG

    WaveNet: A Generative Model for Raw Audio

    Authors: Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, Koray Kavukcuoglu

    Abstract: This paper introduces WaveNet, a deep neural network for generating raw audio waveforms. The model is fully probabilistic and autoregressive, with the predictive distribution for each audio sample conditioned on all previous ones; nonetheless we show that it can be efficiently trained on data with tens of thousands of samples per second of audio. When applied to text-to-speech, it yields state-of-…

    Submitted 19 September, 2016; v1 submitted 12 September, 2016; originally announced September 2016.

  9. arXiv:1603.06744  [pdf, other]

    cs.CL cs.NE

    Latent Predictor Networks for Code Generation

    Authors: Wang Ling, Edward Grefenstette, Karl Moritz Hermann, Tomáš Kočiský, Andrew Senior, Fumin Wang, Phil Blunsom

    Abstract: Many language generation tasks require the production of text conditioned on both structured and unstructured inputs. We present a novel neural network architecture which generates an output sequence conditioned on an arbitrary number of input functions. Crucially, our approach allows both the choice of conditioning context and the granularity of generation, for example characters or tokens, to be…

    Submitted 8 June, 2016; v1 submitted 22 March, 2016; originally announced March 2016.

  10. arXiv:1507.06947  [pdf, other]

    cs.CL cs.LG cs.NE stat.ML

    Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition

    Authors: Haşim Sak, Andrew Senior, Kanishka Rao, Françoise Beaufays

    Abstract: We have recently shown that deep Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) outperform feed forward deep neural networks (DNNs) as acoustic models for speech recognition. More recently, we have shown that the performance of sequence trained context dependent (CD) hidden Markov model (HMM) acoustic models using such LSTM RNNs can be equaled by sequence trained phone models initi…

    Submitted 24 July, 2015; originally announced July 2015.

    Comments: To be published in the INTERSPEECH 2015 proceedings

  11. Holographic Graph Neuron: a Bio-Inspired Architecture for Pattern Processing

    Authors: Denis Kleyko, Evgeny Osipov, Alexander Senior, Asad I. Khan, Y. Ahmet Şekercioğlu

    Abstract: This article proposes the use of Vector Symbolic Architectures for implementing Hierarchical Graph Neuron, an architecture for memorizing patterns of generic sensor stimuli. The adoption of a Vector Symbolic representation ensures a one-layered design for the approach, while maintaining the previously reported properties and performance characteristics of Hierarchical Graph Neuron, and also improv…

    Submitted 15 January, 2015; originally announced January 2015.

    Comments: 9 pages, 13 figures

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems 28 (2017) 1250 - 1262

  12. arXiv:1402.1128  [pdf, other]

    cs.NE cs.CL cs.LG stat.ML

    Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition

    Authors: Haşim Sak, Andrew Senior, Françoise Beaufays

    Abstract: Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designed to address the vanishing and exploding gradient problems of conventional RNNs. Unlike feedforward neural networks, RNNs have cyclic connections making them powerful for modeling sequences. They have been successfully used for sequence labeling and sequence prediction tasks, such as handwriting rec…

    Submitted 5 February, 2014; originally announced February 2014.