Skip to main content

Showing 1–9 of 9 results for author: Fortunato, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2212.12794  [pdf, other

    cs.LG physics.ao-ph

    GraphCast: Learning skillful medium-range global weather forecasting

    Authors: Remi Lam, Alvaro Sanchez-Gonzalez, Matthew Willson, Peter Wirnsberger, Meire Fortunato, Ferran Alet, Suman Ravuri, Timo Ewalds, Zach Eaton-Rosen, Weihua Hu, Alexander Merose, Stephan Hoyer, George Holland, Oriol Vinyals, Jacklynn Stott, Alexander Pritzel, Shakir Mohamed, Peter Battaglia

    Abstract: Global medium-range weather forecasting is critical to decision-making across many social and economic domains. Traditional numerical weather prediction uses increased compute resources to improve forecast accuracy, but cannot directly use historical weather data to improve the underlying model. We introduce a machine learning-based method called "GraphCast", which can be trained directly from rea… ▽ More

    Submitted 4 August, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    Comments: GraphCast code and trained weights are available at: https://github.com/deepmind/graphcast

  3. arXiv:2210.00612  [pdf, other

    cs.LG cs.CE

    MultiScale MeshGraphNets

    Authors: Meire Fortunato, Tobias Pfaff, Peter Wirnsberger, Alexander Pritzel, Peter Battaglia

    Abstract: In recent years, there has been a growing interest in using machine learning to overcome the high cost of numerical simulation, with some learned models achieving impressive speed-ups over classical solvers whilst maintaining accuracy. However, these methods are usually tested at low-resolution settings, and it remains to be seen whether they can scale to the costly high-resolution simulations tha… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Journal ref: 2nd AI4Science Workshop at the 39th International Conference on Machine Learning (ICML), 2022

  4. arXiv:2010.03409  [pdf, other

    cs.LG cs.CE

    Learning Mesh-Based Simulation with Graph Networks

    Authors: Tobias Pfaff, Meire Fortunato, Alvaro Sanchez-Gonzalez, Peter W. Battaglia

    Abstract: Mesh-based simulations are central to modeling complex physical systems in many disciplines across science and engineering. Mesh representations support powerful numerical integration methods and their resolution can be adapted to strike favorable trade-offs between accuracy and efficiency. However, high-dimensional scientific simulations are very expensive to run, and solvers and parameters must… ▽ More

    Submitted 18 June, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Journal ref: International Conference on Learning Representations (ICLR), 2021

  5. arXiv:1910.13406  [pdf, other

    cs.LG cs.AI stat.ML

    Generalization of Reinforcement Learners with Working and Episodic Memory

    Authors: Meire Fortunato, Melissa Tan, Ryan Faulkner, Steven Hansen, Adrià Puigdomènech Badia, Gavin Buttimore, Charlie Deck, Joel Z Leibo, Charles Blundell

    Abstract: Memory is an important aspect of intelligence and plays a role in many deep reinforcement learning models. However, little progress has been made in understanding when specific memory systems help more than others and how well they generalize. The field also has yet to see a prevalent consistent and rigorous approach for evaluating agent performance on holdout data. In this paper, we aim to develo… ▽ More

    Submitted 18 February, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019. Equal contribution of first 4 authors

    Journal ref: 33rd Conference on Neural Information Processing Systems (Neurips 2019)

  6. arXiv:1905.13469  [pdf, other

    cs.LG cs.AI cs.NE

    Interval timing in deep reinforcement learning agents

    Authors: Ben Deverett, Ryan Faulkner, Meire Fortunato, Greg Wayne, Joel Z. Leibo

    Abstract: The measurement of time is central to intelligent behavior. We know that both animals and artificial agents can successfully use temporal dependencies to select actions. In artificial agents, little work has directly addressed (1) which architectural components are necessary for successful development of this ability, (2) how this timing ability comes to be represented in the units and actions of… ▽ More

    Submitted 7 December, 2019; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: 11 pages, 7 figures

  7. arXiv:1706.10295  [pdf, other

    cs.LG stat.ML

    Noisy Networks for Exploration

    Authors: Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

    Abstract: We introduce NoisyNet, a deep reinforcement learning agent with parametric noise added to its weights, and show that the induced stochasticity of the agent's policy can be used to aid efficient exploration. The parameters of the noise are learned with gradient descent along with the remaining network weights. NoisyNet is straightforward to implement and adds little computational overhead. We find… ▽ More

    Submitted 9 July, 2019; v1 submitted 30 June, 2017; originally announced June 2017.

    Comments: ICLR 2018

  8. arXiv:1704.02798  [pdf, other

    cs.LG stat.ML

    Bayesian Recurrent Neural Networks

    Authors: Meire Fortunato, Charles Blundell, Oriol Vinyals

    Abstract: In this work we explore a straightforward variational Bayes scheme for Recurrent Neural Networks. Firstly, we show that a simple adaptation of truncated backpropagation through time can yield good quality uncertainty estimates and superior regularisation at only a small extra computational cost during training, also reducing the amount of parameters by 80\%. Secondly, we demonstrate how a novel ki… ▽ More

    Submitted 9 May, 2019; v1 submitted 10 April, 2017; originally announced April 2017.

    Comments: 12th Women in Machine Learning Workshop (WiML 2017), co-located with the 31st Conference on Neural Information Processing Systems (NeurIPS 2017), Long Beach, CA, USA

  9. arXiv:1506.03134  [pdf, other

    stat.ML cs.CG cs.LG cs.NE

    Pointer Networks

    Authors: Oriol Vinyals, Meire Fortunato, Navdeep Jaitly

    Abstract: We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by existent approaches such as sequence-to-sequence and Neural Turing Machines, because the number of target classes in each step of the output depends on the length of t… ▽ More

    Submitted 2 January, 2017; v1 submitted 9 June, 2015; originally announced June 2015.