Skip to main content

Showing 1–17 of 17 results for author: Fernando, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2309.16797  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

    Authors: Chrisantha Fernando, Dylan Banarse, Henryk Michalewski, Simon Osindero, Tim Rocktäschel

    Abstract: Popular prompt strategies like Chain-of-Thought Prompting can dramatically improve the reasoning abilities of Large Language Models (LLMs) in various domains. However, such hand-crafted prompt-strategies are often sub-optimal. In this paper, we present Promptbreeder, a general-purpose self-referential self-improvement mechanism that evolves and adapts prompts for a given domain. Driven by an LLM,… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  3. arXiv:2304.02801  [pdf, other

    cs.RO cs.AI

    End-to-end Manipulator Calligraphy Planning via Variational Imitation Learning

    Authors: Fangping Xie, Pierre Le Meur, Charith Fernando

    Abstract: Planning from demonstrations has shown promising results with the advances of deep neural networks. One of the most popular real-world applications is automated handwriting using a robotic manipulator. Classically it is simplified as a two-dimension problem. This representation is suitable for elementary drawings, but it is not sufficient for Japanese calligraphy or complex work of art where the o… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 5 pages, 4 figures

  4. arXiv:2205.03146  [pdf, other

    cs.CV cs.AI

    CLIP-CLOP: CLIP-Guided Collage and Photomontage

    Authors: Piotr Mirowski, Dylan Banarse, Mateusz Malinowski, Simon Osindero, Chrisantha Fernando

    Abstract: The unabated mystique of large-scale neural networks, such as the CLIP dual image-and-text encoder, popularized automatically generated art. Increasingly more sophisticated generators enhanced the artworks' realism and visual appearance, and creative prompt engineering enabled stylistic expression. Guided by an artist-in-the-loop ideal, we design a gradient-based generator to produce collages. It… ▽ More

    Submitted 24 July, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: 5 pages, 7 figures, published at the International Conference on Computational Creativity (ICCC) 2022 as Short Paper: Demo

  5. arXiv:2105.00162  [pdf, other

    cs.AI cs.NE

    Generative Art Using Neural Visual Grammars and Dual Encoders

    Authors: Chrisantha Fernando, S. M. Ali Eslami, Jean-Baptiste Alayrac, Piotr Mirowski, Dylan Banarse, Simon Osindero

    Abstract: Whilst there are perhaps only a few scientific methods, there seem to be almost as many artistic methods as there are artists. Artistic processes appear to inhabit the highest order of open-endedness. To begin to understand some of the processes of art making it is helpful to try to automate them even partially. In this paper, a novel algorithm for producing generative art is described which allow… ▽ More

    Submitted 3 May, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

  6. arXiv:2010.02820  [pdf, other

    cs.AI cs.LG cs.NE

    From Language Games to Drawing Games

    Authors: Chrisantha Fernando, Daria Zenkova, Stanislav Nikolov, Simon Osindero

    Abstract: We attempt to automate various artistic processes by inventing a set of drawing games, analogous to the approach taken by emergent language research in inventing communication games. A critical difference is that drawing games demand much less effort from the receiver than do language games. Artists must work with pre-trained viewers who spend little time learning artist specific representational… ▽ More

    Submitted 10 December, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

  7. arXiv:1910.07395  [pdf, other

    cs.CV

    Offline handwritten mathematical symbol recognition utilising deep learning

    Authors: Azadeh Nazemi, Niloofar Tavakolian, Donal Fitzpatrick, Chandrik a Fernando, Ching Y. Suen

    Abstract: This paper describes an approach for offline recognition of handwritten mathematical symbols. The process of symbol recognition in this paper includes symbol segmentation and accurate classification for over 300 classes. Many multidimensional mathematical symbols need both horizontal and vertical projection to be segmented. However, some symbols do not permit to be projected and stop segmentation,… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    ACM Class: I.4.6; I.5.4

  8. arXiv:1811.05931  [pdf, other

    cs.MA

    Evolving intrinsic motivations for altruistic behavior

    Authors: Jane X. Wang, Edward Hughes, Chrisantha Fernando, Wojciech M. Czarnecki, Edgar A. Duenez-Guzman, Joel Z. Leibo

    Abstract: Multi-agent cooperation is an important feature of the natural world. Many tasks involve individual incentives that are misaligned with the common good, yet a wide range of organisms from bacteria to insects and humans are able to overcome their differences and collaborate. Therefore, the emergence of cooperative behavior amongst self-interested individuals is an important question for the fields… ▽ More

    Submitted 11 March, 2019; v1 submitted 14 November, 2018; originally announced November 2018.

    Comments: 10 pages, 6 figures. In Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019)

  9. arXiv:1806.07917  [pdf, other

    cs.NE cs.AI cs.LG

    Meta-Learning by the Baldwin Effect

    Authors: Chrisantha Thomas Fernando, Jakub Sygnowski, Simon Osindero, Jane Wang, Tom Schaul, Denis Teplyashin, Pablo Sprechmann, Alexander Pritzel, Andrei A. Rusu

    Abstract: The scope of the Baldwin effect was recently called into question by two papers that closely examined the seminal work of Hinton and Nowlan. To this date there has been no demonstration of its necessity in empirically challenging tasks. Here we show that the Baldwin effect is capable of evolving few-shot supervised and reinforcement learning mechanisms, by shaping the hyperparameters and the initi… ▽ More

    Submitted 22 June, 2018; v1 submitted 6 June, 2018; originally announced June 2018.

  10. arXiv:1711.09846  [pdf, other

    cs.LG cs.NE

    Population Based Training of Neural Networks

    Authors: Max Jaderberg, Valentin Dalibard, Simon Osindero, Wojciech M. Czarnecki, Jeff Donahue, Ali Razavi, Oriol Vinyals, Tim Green, Iain Dunning, Karen Simonyan, Chrisantha Fernando, Koray Kavukcuoglu

    Abstract: Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In this work we present \emph{Population Based Training (PBT)}, a simple asynchronous optimisation algorithm which effectively utilises a fixed computational budget… ▽ More

    Submitted 28 November, 2017; v1 submitted 27 November, 2017; originally announced November 2017.

  11. arXiv:1711.00436  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Hierarchical Representations for Efficient Architecture Search

    Authors: Hanxiao Liu, Karen Simonyan, Oriol Vinyals, Chrisantha Fernando, Koray Kavukcuoglu

    Abstract: We explore efficient neural architecture search methods and show that a simple yet powerful evolutionary algorithm can discover new architectures with excellent performance. Our approach combines a novel hierarchical genetic representation scheme that imitates the modularized design pattern commonly adopted by human experts, and an expressive search space that supports complex topologies. Our algo… ▽ More

    Submitted 22 February, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: Accepted as a conference paper at ICLR 2018

  12. arXiv:1701.08734  [pdf, other

    cs.NE cs.LG

    PathNet: Evolution Channels Gradient Descent in Super Neural Networks

    Authors: Chrisantha Fernando, Dylan Banarse, Charles Blundell, Yori Zwols, David Ha, Andrei A. Rusu, Alexander Pritzel, Daan Wierstra

    Abstract: For artificial general intelligence (AGI) it would be efficient if multiple users trained the same giant neural network, permitting parameter reuse, without catastrophic forgetting. PathNet is a first step in this direction. It is a neural network algorithm that uses agents embedded in the neural network whose task is to discover which parts of the network to re-use for new tasks. Agents are pathw… ▽ More

    Submitted 30 January, 2017; originally announced January 2017.

  13. arXiv:1606.02580  [pdf, other

    cs.NE cs.CV cs.LG

    Convolution by Evolution: Differentiable Pattern Producing Networks

    Authors: Chrisantha Fernando, Dylan Banarse, Malcolm Reynolds, Frederic Besse, David Pfau, Max Jaderberg, Marc Lanctot, Daan Wierstra

    Abstract: In this work we introduce a differentiable version of the Compositional Pattern Producing Network, called the DPPN. Unlike a standard CPPN, the topology of a DPPN is evolved but the weights are learned. A Lamarckian algorithm, that combines evolution and learning, produces DPPNs to reconstruct an image. Our main result is that DPPNs can be evolved/trained to compress the weights of a denoising aut… ▽ More

    Submitted 8 June, 2016; originally announced June 2016.

  14. arXiv:1604.04153  [pdf, other

    cs.NE

    Learning to Generate Genotypes with Neural Networks

    Authors: Alexander W. Churchill, Siddharth Sigtia, Chrisantha Fernando

    Abstract: Neural networks and evolutionary computation have a rich intertwined history. They most commonly appear together when an evolutionary algorithm optimises the parameters and topology of a neural network for reinforcement learning problems, or when a neural network is applied as a surrogate fitness function to aid the evolutionary optimisation of expensive fitness functions. In this paper we take a… ▽ More

    Submitted 14 April, 2016; originally announced April 2016.

  15. arXiv:1404.1614  [pdf, other

    cs.NE cs.LG

    A Denoising Autoencoder that Guides Stochastic Search

    Authors: Alexander W. Churchill, Siddharth Sigtia, Chrisantha Fernando

    Abstract: An algorithm is described that adaptively learns a non-linear mutation distribution. It works by training a denoising autoencoder (DA) online at each generation of a genetic algorithm to reconstruct a slowly decaying memory of the best genotypes so far. A compressed hidden layer forces the autoencoder to learn hidden features in the training set that can be used to accelerate search on novel probl… ▽ More

    Submitted 6 April, 2014; originally announced April 2014.

    Comments: Submitted to Parallel Problem Solving from Nature 2014

  16. arXiv:1303.7201  [pdf

    cs.AI

    Design for a Darwinian Brain: Part 2. Cognitive Architecture

    Authors: Chrisantha Fernando, Vera Vasas

    Abstract: The accumulation of adaptations in an open-ended manner during lifetime learning is a holy grail in reinforcement learning, intrinsic motivation, artificial curiosity, and developmental robotics. We present a specification for a cognitive architecture that is capable of specifying an unlimited range of behaviors. We then give examples of how it can stochastically explore an interesting space of ad… ▽ More

    Submitted 28 March, 2013; originally announced March 2013.

    Comments: Submitted as Part 2 to Living Machines 2013, Natural History Museum, London. Code available on github as it is being developed to implement the cognitive architecture above, here... https://github.com/ctf20/DarwinianNeurodynamics

  17. arXiv:1303.7200  [pdf

    cs.AI q-bio.NC

    Design for a Darwinian Brain: Part 1. Philosophy and Neuroscience

    Authors: Chrisantha Fernando

    Abstract: Physical symbol systems are needed for open-ended cognition. A good way to understand physical symbol systems is by comparison of thought to chemistry. Both have systematicity, productivity and compositionality. The state of the art in cognitive architectures for open-ended cognition is critically assessed. I conclude that a cognitive architecture that evolves symbol structures in the brain is a p… ▽ More

    Submitted 28 March, 2013; originally announced March 2013.

    Comments: Darwinian Neurodynamics. Submitted as a two part paper to Living Machines 2013 Natural History Museum, London