Skip to main content

Showing 1–50 of 82 results for author: Hoffmann, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.06876  [pdf, other

    cs.AI cs.RO

    Decision-Focused Learning to Predict Action Costs for Planning

    Authors: Jayanta Mandi, Marco Foschini, Daniel Holler, Sylvie Thiebaux, Jorg Hoffmann, Tias Guns

    Abstract: In many automated planning applications, action costs can be hard to specify. An example is the time needed to travel through a certain road segment, which depends on many factors, such as the current weather conditions. A natural way to address this issue is to learn to predict these parameters based on input features (e.g., weather forecasts) and use the predicted action costs in automated plann… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  2. arXiv:2406.03995  [pdf, other

    eess.SY cs.AI

    AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control

    Authors: Rudolf Reiter, Andrea Ghezzi, Katrin Baumgärtner, Jasper Hoffmann, Robert D. McAllister, Moritz Diehl

    Abstract: \Ac{MPC} and \ac{RL} are two powerful control strategies with, arguably, complementary advantages. In this work, we show how actor-critic \ac{RL} techniques can be leveraged to improve the performance of \ac{MPC}. The \ac{RL} critic is used as an approximation of the optimal value function, and an actor roll-out provides an initial guess for primal variables of the \ac{MPC}. A parallel control arc… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.02598  [pdf, other

    cs.LG

    UDUC: An Uncertainty-driven Approach for Learning-based Robust Control

    Authors: Yuan Zhang, Jasper Hoffmann, Joschka Boedecker

    Abstract: Learning-based techniques have become popular in both model predictive control (MPC) and reinforcement learning (RL). Probabilistic ensemble (PE) models offer a promising approach for modelling system dynamics, showcasing the ability to capture uncertainty and scalability in high-dimensional control scenarios. However, PE models are susceptible to mode collapse, resulting in non-robust control whe… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  4. arXiv:2404.18863  [pdf, other

    cs.RO math.OC

    PlanNetX: Learning an Efficient Neural Network Planner from MPC for Longitudinal Control

    Authors: Jasper Hoffmann, Diego Fernandez, Julien Brosseit, Julian Bernhard, Klemens Esterle, Moritz Werling, Michael Karg, Joschka Boedecker

    Abstract: Model predictive control (MPC) is a powerful, optimization-based approach for controlling dynamical systems. However, the computational complexity of online optimization can be problematic on embedded devices. Especially, when we need to guarantee fixed control frequencies. Thus, previous work proposed to reduce the computational burden using imitation learning (IL) approximating the MPC policy by… ▽ More

    Submitted 22 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 6th Annual Learning for Dynamics & Control Conference (L4DC 2024)

  5. arXiv:2403.10704  [pdf, other

    cs.LG cs.AI cs.CL

    PERL: Parameter Efficient Reinforcement Learning from Human Feedback

    Authors: Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has proven to be a strong method to align Pretrained Large Language Models (LLMs) with human preferences. But training models with RLHF is computationally expensive, and an overall complex process. In this work, we study RLHF where the underlying models are trained using the parameter efficient method of Low-Rank Adaptation (LoRA) introduced by Hu… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  6. arXiv:2403.08904  [pdf, other

    cs.CL

    Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics

    Authors: Tyler A. Chang, Katrin Tomanek, Jessica Hoffmann, Nithum Thain, Erin van Liemt, Kathleen Meier-Hellstern, Lucas Dixon

    Abstract: We explore a strategy to handle controversial topics in LLM-based chatbots based on Wikipedia's Neutral Point of View (NPOV) principle: acknowledge the absence of a single true answer and surface multiple perspectives. We frame this as retrieval augmented generation, where perspectives are retrieved from a knowledge base and the LLM is tasked with generating a fluent and faithful response from the… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  7. arXiv:2402.02992  [pdf, other

    cs.LG cs.AI cs.CL

    Decoding-time Realignment of Language Models

    Authors: Tianlin Liu, Shangmin Guo, Leonardo Bianco, Daniele Calandriello, Quentin Berthet, Felipe Llinares, Jessica Hoffmann, Lucas Dixon, Michal Valko, Mathieu Blondel

    Abstract: Aligning language models with human preferences is crucial for reducing errors and biases in these models. Alignment techniques, such as reinforcement learning from human feedback (RLHF), are typically cast as optimizing a tradeoff between human preference rewards and a proximity regularization term that encourages staying close to the unaligned model. Selecting an appropriate level of regularizat… ▽ More

    Submitted 24 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  8. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2311.09830  [pdf, other

    cs.AI cs.CL

    AutoPlanBench: Automatically generating benchmarks for LLM planners from PDDL

    Authors: Katharina Stein, Daniel Fišer, Jörg Hoffmann, Alexander Koller

    Abstract: LLMs are being increasingly used for planning-style tasks, but their capabilities for planning and reasoning are poorly understood. We present AutoPlanBench, a novel method for automatically converting planning benchmarks written in PDDL into textual descriptions and offer a benchmark dataset created with our method. We show that while the best LLM planners do well on some planning tasks, others r… ▽ More

    Submitted 9 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  10. arXiv:2309.08042  [pdf, other

    cs.CV cs.AI

    Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved

    Authors: Yao Sun, Anna Kruspe, Liqiu Meng, Yifan Tian, Eike J Hoffmann, Stefan Auer, Xiao Xiang Zhu

    Abstract: Crowdsourced platforms provide huge amounts of street-view images that contain valuable building information. This work addresses the challenges in applying Scene Text Recognition (STR) in crowdsourced street-view images for building attribute mapping. We use Flickr images, particularly examining texts on building facades. A Berlin Flickr dataset is created, and pre-trained STR models are used for… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  11. arXiv:2309.01261  [pdf, other

    cs.PL cs.DC cs.LO

    Worst-Case Input Generation for Concurrent Programs under Non-Monotone Resource Metrics

    Authors: Long Pham, Jan Hoffmann

    Abstract: Worst-case input generation aims to automatically generate inputs that exhibit the worst-case performance of programs. It has several applications, and can, for example, detect vulnerabilities to denial-of-service attacks. However, it is non-trivial to generate worst-case inputs for concurrent programs, particularly for resources like memory where the peak cost depends on how processes are schedul… ▽ More

    Submitted 23 July, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

  12. arXiv:2308.15470  [pdf, other

    cs.LG

    Policy composition in reinforcement learning via multi-objective policy optimization

    Authors: Shruti Mishra, Ankit Anand, Jordan Hoffmann, Nicolas Heess, Martin Riedmiller, Abbas Abdolmaleki, Doina Precup

    Abstract: We enable reinforcement learning agents to learn successful behavior policies by utilizing relevant pre-existing teacher policies. The teacher policies are introduced as objectives, in addition to the task objective, in a multi-objective policy optimization setting. Using the Multi-Objective Maximum a Posteriori Policy Optimization algorithm (Abdolmaleki et al. 2020), we show that teacher policies… ▽ More

    Submitted 30 August, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

  13. arXiv:2308.02041  [pdf

    cs.CY cs.AI

    Regulating AI: Applying insights from behavioural economics and psychology to the application of article 5 of the EU AI Act

    Authors: Huixin Zhong, Eamonn O'Neill, Janina A. Hoffmann

    Abstract: Article 5 of the European Union's Artificial Intelligence Act is intended to regulate AI use to prevent potentially harmful consequences. Nevertheless, applying this legislation practically is likely to be challenging because of ambiguously used terminologies and because it fails to specify which manipulation techniques may be invoked by AI, potentially leading to significant harm. This paper aims… ▽ More

    Submitted 25 February, 2024; v1 submitted 24 July, 2023; originally announced August 2023.

    Comments: This paper was accepted for publication by AAAI 2024 paper on December of 2023

  14. arXiv:2305.17300  [pdf, other

    cs.NE cs.AI cs.LG

    Exploiting Large Neuroimaging Datasets to Create Connectome-Constrained Approaches for more Robust, Efficient, and Adaptable Artificial Intelligence

    Authors: Erik C. Johnson, Brian S. Robinson, Gautam K. Vallabha, Justin Joyce, Jordan K. Matelsky, Raphael Norman-Tenazas, Isaac Western, Marisel Villafañe-Delgado, Martha Cervantes, Michael S. Robinette, Arun V. Reddy, Lindsey Kitchell, Patricia K. Rivlin, Elizabeth P. Reilly, Nathan Drenkow, Matthew J. Roos, I-Jeng Wang, Brock A. Wester, William R. Gray-Roncal, Joan A. Hoffmann

    Abstract: Despite the progress in deep learning networks, efficient learning at the edge (enabling adaptable, low-complexity machine learning solutions) remains a critical need for defense and commercial applications. We envision a pipeline to utilize large neuroimaging datasets, including maps of the brain which capture neuron and synapse connectivity, to improve machine learning approaches. We have pursue… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 11 pages, 4 figures

  15. arXiv:2304.13627  [pdf, ps, other

    cs.PL cs.LO

    Automatic Amortized Resource Analysis with Regular Recursive Types

    Authors: Jessie Grosen, David M. Kahn, Jan Hoffmann

    Abstract: The goal of automatic resource bound analysis is to statically infer symbolic bounds on the resource consumption of the evaluation of a program. A longstanding challenge for automatic resource analysis is the inference of bounds that are functions of complex custom data structures. This article builds on type-based automatic amortized resource analysis (AARA) to address this challenge. AARA is bas… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 15 pages, 5 figures; to be published in LICS'23

  16. arXiv:2302.06541  [pdf, other

    cs.CL

    Towards Agile Text Classifiers for Everyone

    Authors: Maximilian Mozes, Jessica Hoffmann, Katrin Tomanek, Muhamed Kouate, Nithum Thain, Ann Yuan, Tolga Bolukbasi, Lucas Dixon

    Abstract: Text-based safety classifiers are widely used for content moderation and increasingly to tune generative language model behavior - a topic of growing concern for the safety of digital assistants and chatbots. However, different policies require different classifiers, and safety policies themselves improve from iteration and adaptation. This paper introduces and evaluates methods for agile text cla… ▽ More

    Submitted 21 October, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Findings of EMNLP 2023

  17. arXiv:2212.01607  [pdf, other

    cs.RO eess.SY

    A Hierarchical Approach for Strategic Motion Planning in Autonomous Racing

    Authors: Rudolf Reiter, Jasper Hoffmann, Joschka Boedecker, Moritz Diehl

    Abstract: We present an approach for safe trajectory planning, where a strategic task related to autonomous racing is learned sample-efficient within a simulation environment. A high-level policy, represented as a neural network, outputs a reward specification that is used within the cost function of a parametric nonlinear model predictive controller (NMPC). By including constraints and vehicle kinematics… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

  18. arXiv:2211.00543  [pdf

    cs.CV

    Geo-Information Harvesting from Social Media Data

    Authors: Xiao Xiang Zhu, Yuanyuan Wang, Mrinalini Kochupillai, Martin Werner, Matthias Häberle, Eike Jens Hoffmann, Hannes Taubenböck, Devis Tuia, Alex Levering, Nathan Jacobs, Anna Kruspe, Karam Abdulahhad

    Abstract: As unconventional sources of geo-information, massive imagery and text messages from open platforms and social media form a temporally quasi-seamless, spatially multi-perspective stream, but with unknown and diverse quality. Due to its complementarity to remote sensing data, geo-information from these sources offers promising perspectives, but harvesting is not trivial due to its data characterist… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: Accepted for publication IEEE Geoscience and Remote Sensing Magazine

  19. arXiv:2206.06054  [pdf, other

    cs.LG cs.SE

    Specifying and Testing $k$-Safety Properties for Machine-Learning Models

    Authors: Maria Christakis, Hasan Ferit Eniser, Jörg Hoffmann, Adish Singla, Valentin Wüstholz

    Abstract: Machine-learning models are becoming increasingly prevalent in our lives, for instance assisting in image-classification or decision-making tasks. Consequently, the reliability of these models is of critical importance and has resulted in the development of numerous approaches for validating and verifying their robustness and fairness. However, beyond such specific properties, it is challenging to… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  20. arXiv:2204.08524  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    So2Sat POP -- A Curated Benchmark Data Set for Population Estimation from Space on a Continental Scale

    Authors: Sugandha Doda, Yuanyuan Wang, Matthias Kahl, Eike Jens Hoffmann, Kim Ouan, Hannes Taubenböck, Xiao Xiang Zhu

    Abstract: Obtaining a dynamic population distribution is key to many decision-making processes such as urban planning, disaster management and most importantly helping the government to better allocate socio-technical supply. For the aspiration of these objectives, good population data is essential. The traditional method of collecting population data through the census is expensive and tedious. In recent y… ▽ More

    Submitted 10 November, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  21. arXiv:2203.15556  [pdf, other

    cs.CL cs.LG

    Training Compute-Optimal Large Language Models

    Authors: Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George van den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre

    Abstract: We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly undertrained, a consequence of the recent focus on scaling language models whilst keeping the amount of training data constant. By training over 400 language models ranging from 70 million to over 16 billion… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  22. arXiv:2203.09361  [pdf, other

    cs.AI cs.CC cs.LO

    Expressivity of Planning with Horn Description Logic Ontologies (Technical Report)

    Authors: Stefan Borgwardt, Jörg Hoffmann, Alisa Kovtunova, Markus Krötzsch, Bernhard Nebel, Marcel Steinmetz

    Abstract: State constraints in AI Planning globally restrict the legal environment states. Standard planning languages make closed-domain and closed-world assumptions. Here we address open-world state constraints formalized by planning over a description logic (DL) ontology. Previously, this combination of DL and planning has been investigated for the light-weight DL DL-Lite. Here we propose a novel compila… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 16 pages with appendix

    MSC Class: 68 ACM Class: I.2.4; I.2.8

  23. arXiv:2202.07315  [pdf, other

    cs.CV

    Using Social Media Images for Building Function Classification

    Authors: Eike Jens Hoffmann, Karam Abdulahhad, Xiao Xiang Zhu

    Abstract: Urban land use on a building instance level is crucial geo-information for many applications, yet difficult to obtain. An intuitive approach to close this gap is predicting building functions from ground level imagery. Social media image platforms contain billions of images, with a large variety of motifs including but not limited to street perspectives. To cope with this issue this study proposes… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  24. arXiv:2202.01169  [pdf, other

    cs.CL cs.LG

    Unified Scaling Laws for Routed Language Models

    Authors: Aidan Clark, Diego de las Casas, Aurelia Guy, Arthur Mensch, Michela Paganini, Jordan Hoffmann, Bogdan Damoc, Blake Hechtman, Trevor Cai, Sebastian Borgeaud, George van den Driessche, Eliza Rutherford, Tom Hennigan, Matthew Johnson, Katie Millican, Albin Cassirer, Chris Jones, Elena Buchatskaya, David Budden, Laurent Sifre, Simon Osindero, Oriol Vinyals, Jack Rae, Erich Elsen, Koray Kavukcuoglu , et al. (1 additional authors not shown)

    Abstract: The performance of a language model has been shown to be effectively modeled as a power-law in its parameter count. Here we study the scaling behaviors of Routing Networks: architectures that conditionally use only a subset of their parameters while processing an input. For these models, parameter count and computational requirement form two independent axes along which an increase leads to better… ▽ More

    Submitted 9 February, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: Fixing typos and affiliation clarity

  25. arXiv:2201.03288  [pdf, other

    eess.IV cs.CV

    A statistical shape model for radiation-free assessment and classification of craniosynostosis

    Authors: Matthias Schaufelberger, Reinald Peter Kühle, Andreas Wachter, Frederic Weichel, Niclas Hagen, Friedemann Ringwald, Urs Eisenmann, Jürgen Hoffmann, Michael Engel, Christian Freudlsperger, Werner Nahm

    Abstract: The assessment of craniofacial deformities requires patient data which is sparsely available. Statistical shape models provide realistic and synthetic data enabling comparisons of existing methods on a common dataset. We build the first publicly available statistical 3D head model of craniosynostosis patients and the first model focusing on infants younger than 1.5 years. We further present a sh… ▽ More

    Submitted 28 March, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  26. arXiv:2112.11446  [pdf, other

    cs.CL cs.AI

    Scaling Language Models: Methods, Analysis & Insights from Training Gopher

    Authors: Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor , et al. (55 additional authors not shown)

    Abstract: Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales -- from models with tens of millions of parameters up to a 280 billion parameter model called Gop… ▽ More

    Submitted 21 January, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: 120 pages

  27. arXiv:2112.04426  [pdf, other

    cs.CL cs.LG

    Improving language models by retrieving from trillions of tokens

    Authors: Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan , et al. (3 additional authors not shown)

    Abstract: We enhance auto-regressive language models by conditioning on document chunks retrieved from a large corpus, based on local similarity with preceding tokens. With a $2$ trillion token database, our Retrieval-Enhanced Transformer (RETRO) obtains comparable performance to GPT-3 and Jurassic-1 on the Pile, despite using 25$\times$ fewer parameters. After fine-tuning, RETRO performance translates to d… ▽ More

    Submitted 7 February, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Fix incorrect reported numbers in Table 14

  28. arXiv:2111.00607  [pdf, other

    cs.CL

    A Systematic Investigation of Commonsense Knowledge in Large Language Models

    Authors: Xiang Lorraine Li, Adhiguna Kuncoro, Jordan Hoffmann, Cyprien de Masson d'Autume, Phil Blunsom, Aida Nematzadeh

    Abstract: Language models (LMs) trained on large amounts of data have shown impressive performance on many NLP tasks under the zero-shot and few-shot setup. Here we aim to better understand the extent to which such models learn commonsense knowledge -- a critical component of many NLP applications. We conduct a systematic and rigorous zero-shot and few-shot commonsense evaluation of large pre-trained LMs, w… ▽ More

    Submitted 31 October, 2022; v1 submitted 31 October, 2021; originally announced November 2021.

    Comments: Accepted to EMNLP 2022

  29. arXiv:2108.12251  [pdf, other

    cs.SI

    Changes in Twitter geolocations: Insights and suggestions for future usage

    Authors: Anna Kruspe, Matthias Häberle, Eike J. Hoffmann, Samyo Rode-Hasinger, Karam Abdulahhad, Xiao Xiang Zhu

    Abstract: Twitter data has become established as a valuable source of data for various application scenarios in the past years. For many such applications, it is necessary to know where Twitter posts (tweets) were sent from or what location they refer to. Researchers have frequently used exact coordinates provided in a small percentage of tweets, but Twitter removed the option to share these coordinates in… ▽ More

    Submitted 22 September, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

  30. arXiv:2107.01820  [pdf

    cs.LG cs.AI q-bio.QM stat.ML

    An Explainable AI System for the Diagnosis of High Dimensional Biomedical Data

    Authors: Alfred Ultsch, Jörg Hoffmann, Maximilian Röhnert, Malte Von Bonin, Uta Oelschlägel, Cornelia Brendel, Michael C. Thrun

    Abstract: Typical state of the art flow cytometry data samples consists of measures of more than 100.000 cells in 10 or more features. AI systems are able to diagnose such data with almost the same accuracy as human experts. However, there is one central challenge in such systems: their decisions have far-reaching consequences for the health and life of people, and therefore, the decisions of AI systems nee… ▽ More

    Submitted 1 March, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: 29 pages, 5 figure, 5 tables, data available at https://data.mendeley.com/datasets/jk4dt6wprv/1

    MSC Class: 68T05 ACM Class: I.2; I.5

  31. arXiv:2106.13936  [pdf, ps, other

    cs.PL

    Automatic Amortized Resource Analysis with the Quantum Physicist's Method

    Authors: David M Kahn, Jan Hoffmann

    Abstract: We present a novel method for working with the physicist's method of amortized resource analysis, which we call the quantum physicist's method. These principles allow for more precise analyses of resources that are not monotonically consumed, like stack. This method takes its name from its two major features, worldviews and resource tunneling, which behave analogously to quantum superposition and… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

  32. arXiv:2106.12182  [pdf, other

    cs.LG cs.CV stat.ML

    Fairness for Image Generation with Uncertain Sensitive Attributes

    Authors: Ajil Jalal, Sushrut Karmalkar, Jessica Hoffmann, Alexandros G. Dimakis, Eric Price

    Abstract: This work tackles the issue of fairness in the context of generative procedures, such as image super-resolution, which entail different definitions from the standard classification setting. Moreover, while traditional group fairness definitions are typically defined with respect to specified protected groups -- camouflaging the fact that these groupings are artificial and carry historical and poli… ▽ More

    Submitted 2 July, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

  33. arXiv:2104.03598  [pdf, other

    cs.PL

    Sound Probabilistic Inference via Guide Types

    Authors: Di Wang, Jan Hoffmann, Thomas Reps

    Abstract: Probabilistic programming languages aim to describe and automate Bayesian modeling and inference. Modern languages support programmable inference, which allows users to customize inference algorithms by incorporating guide programs to improve inference performance. For Bayesian inference to be sound, guide programs must be compatible with model programs. One pervasive but challenging condition for… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

  34. arXiv:2103.16105  [pdf, ps, other

    cs.PL

    Expected-Cost Analysis for Probabilistic Programs and Semantics-Level Adaption of Optional Stopping Theorems

    Authors: Di Wang, Jan Hoffmann, Thomas Reps

    Abstract: In this article, we present a semantics-level adaption of the Optional Stopping Theorem, sketch an expected-cost analysis as its application, and survey different variants of the Optional Stopping Theorem that have been used in static analysis of probabilistic programs.

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2001.10150

  35. arXiv:2011.09705  [pdf, other

    cs.AI

    Iterative Planning with Plan-Space Explanations: A Tool and User Study

    Authors: Rebecca Eifler, Jörg Hoffmann

    Abstract: In a variety of application settings, the user preference for a planning task - the precise optimization objective - is difficult to elicit. One possible remedy is planning as an iterative process, allowing the user to iteratively refine and modify example plans. A key step to support such a process are explanations, answering user questions about the current plan. In particular, a relevant kind o… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: Proceedings of the International Workshop of Explainable AI Planning (XAIP'20), at ICAPS'20

  36. arXiv:2011.09037  [pdf, other

    cs.PL

    Probabilistic Resource-Aware Session Types

    Authors: Ankush Das, Di Wang, Jan Hoffmann

    Abstract: Session types guarantee that message-passing processes adhere to predefined communication protocols. Prior work on session types has focused on deterministic languages but many message-passing systems, such as Markov chains and randomized distributed algorithms, are probabilistic. To model and analyze such systems, this article introduces probabilistic session types and explores their application… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: technical report

  37. arXiv:2010.16353  [pdf, other

    cs.PL

    Typable Fragments of Polynomial Automatic Amortized Resource Analysis

    Authors: Long Pham, Jan Hoffmann

    Abstract: Being a fully automated technique for resource analysis, automatic amortized resource analysis (AARA) can fail in returning worst-case cost bounds of programs, fundamentally due to the undecidability of resource analysis. For programmers who are unfamiliar with the technical details of AARA, it is difficult to predict whether a program can be successfully analyzed in AARA. Motivated by this proble… ▽ More

    Submitted 30 October, 2020; originally announced October 2020.

    Comments: This is the full version of our CSL 2021 paper

    MSC Class: 68N15 ACM Class: D.3.1; F.3.1

  38. arXiv:2010.03982  [pdf, other

    cs.CL

    Generating Instructions at Different Levels of Abstraction

    Authors: Arne Köhn, Julia Wichlacz, Álvaro Torralba, Daniel Höller, Jörg Hoffmann, Alexander Koller

    Abstract: When generating technical instructions, it is often convenient to describe complex objects in the world at different levels of abstraction. A novice user might need an object explained piece by piece, while for an expert, talking about the complex object (e.g. a wall or railing) directly may be more succinct and efficient. We show how to generate building instructions at different levels of abstra… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted COLING 2020 long paper

  39. arXiv:2008.08262  [pdf, other

    cs.SI cs.CY physics.soc-ph stat.AP

    Quarantines as a Targeted Immunization Strategy

    Authors: Jessica Hoffmann, Matt Jordan, Constantine Caramanis

    Abstract: In the context of the recent COVID-19 outbreak, quarantine has been used to "flatten the curve" and slow the spread of the disease. In this paper, we show that this is not the only benefit of quarantine for the mitigation of an SIR epidemic spreading on a graph. Indeed, human contact networks exhibit a powerlaw structure, which means immunizing nodes at random is extremely ineffective at slowing t… ▽ More

    Submitted 20 February, 2021; v1 submitted 19 August, 2020; originally announced August 2020.

  40. arXiv:2008.00766  [pdf, other

    cs.LG cs.AI

    Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version

    Authors: Timo P. Gros, Daniel Höller, Jörg Hoffmann, Verena Wolf

    Abstract: Learning-based approaches for solving large sequential decision making problems have become popular in recent years. The resulting agents perform differently and their characteristics depend on those of the underlying learning approach. Here, we consider a benchmark planning problem from the reinforcement learning domain, the Racetrack, to investigate the properties of agents derived from differen… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Extended Version of the Conference Paper published in the Proceedings of the 17th International Conference on Quantitative Evaluation of SysTems (QEST)

  41. arXiv:2006.16233  [pdf, ps, other

    cs.PL

    Liquid Resource Types

    Authors: Tristan Knoth, Di Wang, Adam Reynolds, Jan Hoffmann, Nadia Polikarpova

    Abstract: This article presents liquid resource types, a technique for automatically verifying the resource consumption of functional programs. Existing resource analysis techniques trade automation for flexibility -- automated techniques are restricted to relatively constrained families of resource bounds, while more expressive proof techniques admitting value-dependent bounds rely on handwritten proofs. L… ▽ More

    Submitted 1 July, 2020; v1 submitted 29 June, 2020; originally announced June 2020.

  42. arXiv:2006.14010  [pdf, ps, other

    cs.PL

    Raising Expectations: Automating Expected Cost Analysis with Types

    Authors: Di Wang, David M Kahn, Jan Hoffmann

    Abstract: This article presents a type-based analysis for deriving upper bounds on the expected execution cost of probabilistic programs. The analysis is naturally compositional, parametric in the cost model, and supports higher order functions and inductive data types. The derived bounds are multivariate polynomials that are functions of data structures. Bound inference is enabled by local type rules that… ▽ More

    Submitted 21 September, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

  43. arXiv:2006.07360  [pdf, other

    cs.LG stat.ML

    AlgebraNets

    Authors: Jordan Hoffmann, Simon Schmitt, Simon Osindero, Karen Simonyan, Erich Elsen

    Abstract: Neural networks have historically been built layerwise from the set of functions in ${f: \mathbb{R}^n \to \mathbb{R}^m }$, i.e. with activations and weights/parameters represented by real numbers, $\mathbb{R}$. Our work considers a richer set of objects for activations and weights, and undertakes a comprehensive study of alternative algebras as number representations by studying their performance… ▽ More

    Submitted 16 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  44. arXiv:2005.04074  [pdf, other

    cs.LG cs.SI stat.ML

    Adversarial Graph Embeddings for Fair Influence Maximization over Social Networks

    Authors: Moein Khajehnejad, Ahmad Asgharian Rezaei, Mahmoudreza Babaei, Jessica Hoffmann, Mahdi Jalili, Adrian Weller

    Abstract: Influence maximization is a widely studied topic in network science, where the aim is to reach the maximum possible number of nodes, while only targeting a small initial set of individuals. It has critical applications in many fields, including viral marketing, information propagation, news dissemination, and vaccinations. However, the objective does not usually take into account whether the final… ▽ More

    Submitted 10 May, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: In Proc. of the 29th International Joint Conference on Artificial Intelligence (IJCAI'20), 2020

  45. arXiv:2002.09519  [pdf, ps, other

    cs.PL

    Exponential Automatic Amortized Resource Analysis

    Authors: David M Kahn, Jan Hoffmann

    Abstract: Automatic amortized resource analysis (AARA) is a type-based technique for inferring concrete (non-asymptotic) bounds on a program's resource usage. Existing work on AARA has focused on bounds that are polynomial in the sizes of the inputs. This paper presents and extension of AARA to exponential bounds that preserves the benefits of the technique, such as compositionality and efficient type infer… ▽ More

    Submitted 5 March, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

  46. arXiv:2001.10150  [pdf, other

    cs.PL

    Central Moment Analysis for Cost Accumulators in Probabilistic Programs

    Authors: Di Wang, Jan Hoffmann, Thomas Reps

    Abstract: For probabilistic programs, it is usually not possible to automatically derive exact information about their properties, such as the distribution of states at a given program point. Instead, one can attempt to derive approximations, such as upper bounds on tail probabilities. Such bounds can be obtained via concentration inequalities, which rely on the moments of a distribution, such as the expect… ▽ More

    Submitted 8 April, 2021; v1 submitted 27 January, 2020; originally announced January 2020.

  47. arXiv:1909.10893  [pdf, other

    cs.LG cs.AI stat.ML

    Recurrent Independent Mechanisms

    Authors: Anirudh Goyal, Alex Lamb, Jordan Hoffmann, Shagun Sodhani, Sergey Levine, Yoshua Bengio, Bernhard Schölkopf

    Abstract: Learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes which only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the… ▽ More

    Submitted 17 November, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

  48. arXiv:1909.00949  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.comp-ph stat.ML

    Data-Driven Approach to Encoding and Decoding 3-D Crystal Structures

    Authors: Jordan Hoffmann, Louis Maestrati, Yoshihide Sawada, Jian Tang, Jean Michel Sellier, Yoshua Bengio

    Abstract: Generative models have achieved impressive results in many domains including image and text generation. In the natural sciences, generative models have led to rapid progress in automated drug discovery. Many of the current methods focus on either 1-D or 2-D representations of typically small, drug-like molecules. However, many molecules require 3-D descriptors and exceed the chemical complexity of… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

  49. arXiv:1908.01000  [pdf, other

    cs.LG cs.AI stat.ML

    InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization

    Authors: Fan-Yun Sun, Jordan Hoffmann, Vikas Verma, Jian Tang

    Abstract: This paper studies learning the representations of whole graphs in both unsupervised and semi-supervised scenarios. Graph-level representations are critical in a variety of real-world applications such as predicting the properties of molecules and community analysis in social networks. Traditional graph kernel based methods are simple, yet effective for obtaining fixed-length representations for g… ▽ More

    Submitted 17 January, 2020; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: ICLR 2020 (spotlight)

  50. arXiv:1906.07159  [pdf, other

    cs.SI cs.LG stat.ML

    vGraph: A Generative Model for Joint Community Detection and Node Representation Learning

    Authors: Fan-Yun Sun, Meng Qu, Jordan Hoffmann, Chin-Wei Huang, Jian Tang

    Abstract: This paper focuses on two fundamental tasks of graph analysis: community detection and node representation learning, which capture the global and local structures of graphs, respectively. In the current literature, these two tasks are usually independently studied while they are actually highly correlated. We propose a probabilistic generative model called vGraph to learn community membership and… ▽ More

    Submitted 17 September, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: Accepted Paper at NeurIPS 2019