Skip to main content

Showing 1–9 of 9 results for author: Mathiasen, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.04030  [pdf, other

    cs.LG

    Reducing the Cost of Quantum Chemical Data By Backpropagating Through Density Functional Theory

    Authors: Alexander Mathiasen, Hatem Helal, Paul Balanca, Adam Krzywaniak, Ali Parviz, Frederik Hvilshøj, Blazej Banaszewski, Carlo Luschi, Andrew William Fitzgibbon

    Abstract: Density Functional Theory (DFT) accurately predicts the quantum chemical properties of molecules, but scales as $O(N_{\text{electrons}}^3)$. Schütt et al. (2019) successfully approximate DFT 1000x faster with Neural Networks (NN). Arguably, the biggest problem one faces when scaling to larger molecules is the cost of DFT labels. For example, it took years to create the PCQ dataset (Nakata & Shimaz… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  2. arXiv:2311.01135  [pdf, other

    cs.LG physics.chem-ph

    Generating QM1B with PySCF$_{\text{IPU}}$

    Authors: Alexander Mathiasen, Hatem Helal, Kerstin Klaser, Paul Balanca, Josef Dean, Carlo Luschi, Dominique Beaini, Andrew Fitzgibbon, Dominic Masters

    Abstract: The emergence of foundation models in Computer Vision and Natural Language Processing have resulted in immense progress on downstream tasks. This progress was enabled by datasets with billions of training examples. Similar benefits are yet to be unlocked for quantum chemistry, where the potential of deep learning is constrained by comparatively small datasets with 100k to 20M training examples. Th… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 15 pages, 7 figures. NeurIPS 2023 Track Datasets and Benchmarks

    ACM Class: I.2.6; J.2

  3. arXiv:2009.14554  [pdf, other

    cs.LG stat.ML

    One Reflection Suffice

    Authors: Alexander Mathiasen, Frederik Hvilshøj

    Abstract: Orthogonal weight matrices are used in many areas of deep learning. Much previous work attempt to alleviate the additional computational resources it requires to constrain weight matrices to be orthogonal. One popular approach utilizes *many* Householder reflections. The only practical drawback is that many reflections cause low GPU utilization. We mitigate this final drawback by proving that *one… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  4. arXiv:2009.14075  [pdf, other

    cs.LG stat.ML

    Backpropagating through Fréchet Inception Distance

    Authors: Alexander Mathiasen, Frederik Hvilshøj

    Abstract: The Fréchet Inception Distance (FID) has been used to evaluate hundreds of generative models. We introduce FastFID, which can efficiently train generative models with FID as a loss function. Using FID as an additional loss for Generative Adversarial Networks improves their FID.

    Submitted 14 April, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

  5. arXiv:2009.13977  [pdf, other

    cs.LG stat.ML

    What if Neural Networks had SVDs?

    Authors: Alexander Mathiasen, Frederik Hvilshøj, Jakob Rødsgaard Jørgensen, Anshul Nasery, Davide Mottin

    Abstract: Various Neural Networks employ time-consuming matrix operations like matrix inversion. Many such matrix operations are faster to compute given the Singular Value Decomposition (SVD). Previous work allows using the SVD in Neural Networks without computing it. In theory, the techniques can speed up matrix operations, however, in practice, they are not fast enough. We present an algorithm that is fas… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  6. arXiv:2006.01491  [pdf, other

    cs.PL cs.CC

    The Fine-Grained and Parallel Complexity of Andersen's Pointer Analysis

    Authors: Anders Alnor Mathiasen, Andreas Pavlogiannis

    Abstract: Pointer analysis is one of the fundamental problems in static program analysis. Given a set of pointers, the task is to produce a useful over-approximation of the memory locations that each pointer may point-to at runtime. The most common formulation is Andersen's Pointer Analysis (APA), defined as an inclusion-based set of $m$ pointer constraints over a set of $n$ pointers. Existing algorithms so… ▽ More

    Submitted 14 October, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

  7. arXiv:1909.12518  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Margin-Based Generalization Lower Bounds for Boosted Classifiers

    Authors: Allan Grønlund, Lior Kamma, Kasper Green Larsen, Alexander Mathiasen, Jelani Nelson

    Abstract: Boosting is one of the most successful ideas in machine learning. The most well-accepted explanations for the low generalization error of boosting algorithms such as AdaBoost stem from margin theory. The study of margins in the context of boosting algorithms was initiated by Schapire, Freund, Bartlett and Lee (1998) and has inspired numerous boosting algorithms and generalization bounds. To date,… ▽ More

    Submitted 7 May, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

  8. arXiv:1901.10789  [pdf, other

    cs.LG cs.DS stat.ML

    Optimal Minimal Margin Maximization with Boosting

    Authors: Allan Grønlund, Kasper Green Larsen, Alexander Mathiasen

    Abstract: Boosting algorithms produce a classifier by iteratively combining base hypotheses. It has been observed experimentally that the generalization error keeps improving even after achieving zero training error. One popular explanation attributes this to improvements in margins. A common goal in a long line of research, is to maximize the smallest margin using as few base hypotheses as possible, culmin… ▽ More

    Submitted 30 January, 2019; originally announced January 2019.

  9. arXiv:1701.07204  [pdf, other

    cs.DS cs.AI cs.LG

    Fast Exact k-Means, k-Medians and Bregman Divergence Clustering in 1D

    Authors: Allan Grønlund, Kasper Green Larsen, Alexander Mathiasen, Jesper Sindahl Nielsen, Stefan Schneider, Mingzhou Song

    Abstract: The $k$-Means clustering problem on $n$ points is NP-Hard for any dimension $d\ge 2$, however, for the 1D case there exists exact polynomial time algorithms. Previous literature reported an $O(kn^2)$ time dynamic programming algorithm that uses $O(kn)$ space. It turns out that the problem has been considered under a different name more than twenty years ago. We present all the existing work that h… ▽ More

    Submitted 25 April, 2018; v1 submitted 25 January, 2017; originally announced January 2017.