Skip to main content

Showing 1–50 of 84 results for author: Barrett, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.07088  [pdf, other

    cs.AI cs.LO eess.SY

    Safe and Reliable Training of Learning-Based Aerospace Controllers

    Authors: Udayan Mandal, Guy Amir, Haoze Wu, Ieva Daukantas, Fletcher Lee Newell, Umberto Ravaioli, Baoluo Meng, Michael Durling, Kerianne Hobbs, Milan Ganai, Tobey Shim, Guy Katz, Clark Barrett

    Abstract: In recent years, deep reinforcement learning (DRL) approaches have generated highly successful controllers for a myriad of complex domains. However, the opaque nature of these models limits their applicability in aerospace systems and safety-critical domains, in which a single mistake can have dire consequences. In this paper, we present novel advancements in both the training and verification of… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 10 pages, 3 figures

  2. arXiv:2406.18912  [pdf, ps, other

    math.LO cs.LO

    The nonexistence of unicorns and many-sorted Löwenheim-Skolem theorems

    Authors: Benjamin Przybocki, Guilherme Toledo, Yoni Zohar, Clark Barrett

    Abstract: Stable infiniteness, strong finite witnessability, and smoothness are model-theoretic properties relevant to theory combination in satisfiability modulo theories. Theories that are strongly finitely witnessable and smooth are called strongly polite and can be effectively combined with other theories. Toledo, Zohar, and Barrett conjectured that stably infinite and strongly finitely witnessable theo… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: To appear in FM24

  3. arXiv:2406.15882  [pdf, ps, other

    cs.LO math.CT

    Equivalence Hypergraphs: E-Graphs for Monoidal Theories

    Authors: Dan R. Ghica, Chris Barrett, Aleksei Tiurin

    Abstract: The technique of equipping graphs with an equivalence relation, called equality saturation, has recently proved both powerful and practical in program optimisation, particularly for satisfiability modulo theory solvers. We give a categorical semantics to these structures, called e-graphs, in terms of Cartesian categories enriched over a semilattice. We show how this semantics can be generalised to… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  4. arXiv:2405.14058  [pdf, other

    cs.AI cs.LG eess.SY

    Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates

    Authors: Udayan Mandal, Guy Amir, Haoze Wu, Ieva Daukantas, Fletcher Lee Newell, Umberto J. Ravaioli, Baoluo Meng, Michael Durling, Milan Ganai, Tobey Shim, Guy Katz, Clark Barrett

    Abstract: Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the ``black box'' nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certificates, which are learned functio… ▽ More

    Submitted 14 August, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: To appear in FMCAD 2024

  5. The Relational Machine Calculus

    Authors: Chris Barrett, Daniel Castle, Willem Heijltjes

    Abstract: This paper presents the Relational Machine Calculus (RMC): a simple, foundational model of first-order relational programming. The RMC originates from the Functional Machine Calculus (FMC), which generalizes the lambda-calculus and its standard call-by-name stack machine in two directions. One, "locations", introduces multiple stacks, which enable effect operators to be encoded into the abstractio… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: LICS paper, 15 pages excluding references

    MSC Class: 03B70 ACM Class: D.3.1; D.3.2; F.4.1; F.1.1; I.2.3

  6. arXiv:2405.06624  [pdf, other

    cs.AI

    Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

    Authors: David "davidad" Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua Tenenbaum

    Abstract: Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these appro… ▽ More

    Submitted 8 July, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  7. arXiv:2405.06127  [pdf, other

    cs.LO cs.AR

    Efficiently Synthesizing Lowest Cost Rewrite Rules for Instruction Selection

    Authors: Ross Daly, Caleb Donovick, Caleb Terrill, Jackson Melchert, Priyanka Raina, Clark Barrett, Pat Hanrahan

    Abstract: Compiling programs to an instruction set architecture (ISA) requires a set of rewrite rules that map patterns consisting of compiler instructions to patterns consisting of ISA instructions. We synthesize such rules by constructing SMT queries, whose solutions represent two functionally equivalent programs. These two programs are interpreted as an instruction selection rewrite rule. Existing work i… ▽ More

    Submitted 17 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: 9 pages, 2 figures, in submission to FMCAD24, fixed small bugs in results generation, improved baseline comparison, improved background and writing clarity

  8. arXiv:2405.03057  [pdf, ps, other

    cs.LO

    Verifying SQL Queries using Theories of Tables and Relations

    Authors: Mudathir Mohamed, Andrew Reynolds, Cesare Tinelli, Clark Barrett

    Abstract: We present a number of first- and second-order extensions to SMT theories specifically aimed at representing and analyzing SQL queries with join, projection, and selection operations. We support reasoning about SQL queries with either bag or set semantics for database tables. We provide the former via an extension of a theory of finite bags and the latter via an extension of the theory of finite r… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  9. arXiv:2404.18988  [pdf, other

    cs.CL

    Markovian Agents for Informative Language Modeling

    Authors: Scott Viteri, Max Lamparth, Peter Chatain, Clark Barrett

    Abstract: Chain-of-Thought (CoT) reasoning could in principle enable a deeper understanding of a language model's (LM) internal reasoning. However, prior work suggests that LMs can answer questions similarly despite changes in their CoT, suggesting that those models are not truly using the CoT. We propose an reinforcement learning technique to produce CoTs that are sufficient alone for predicting future tex… ▽ More

    Submitted 22 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 21 pages, 8 figures

  10. arXiv:2404.16122  [pdf, ps, other

    cs.LO

    Generalized Optimization Modulo Theories

    Authors: Nestan Tsiskaridze, Clark Barrett, Cesare Tinelli

    Abstract: Optimization Modulo Theories (OMT) has emerged as an important extension of the highly successful Satisfiability Modulo Theories (SMT) paradigm. The OMT problem requires solving an SMT problem with the restriction that the solution must be optimal with respect to a given objective function. We introduce a generalization of the OMT problem where, in particular, objective functions can range over pa… ▽ More

    Submitted 28 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  11. arXiv:2401.14461  [pdf, other

    cs.AI cs.LG cs.LO

    Marabou 2.0: A Versatile Formal Analyzer of Neural Networks

    Authors: Haoze Wu, Omri Isac, Aleksandar Zeljić, Teruhiro Tagomori, Matthew Daggitt, Wen Kokke, Idan Refaeli, Guy Amir, Kyle Julian, Shahaf Bassan, Pei Huang, Ori Lahav, Min Wu, Min Zhang, Ekaterina Komendantskaya, Guy Katz, Clark Barrett

    Abstract: This paper serves as a comprehensive system description of version 2.0 of the Marabou framework for formal analysis of neural networks. We discuss the tool's architectural design and highlight the major features and components introduced since its initial release.

    Submitted 20 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Condensed version accepted at CAV'24

  12. arXiv:2401.04827  [pdf

    cs.IT

    The site linkage spectrum of data arrays

    Authors: Christopher Barrett, Andrei Bura, Fenix Huang, Christian Reidys

    Abstract: A new perspective is introduced regarding the analysis of Multiple Sequence Alignments (MSA), representing aligned data defined over a finite alphabet of symbols. The framework is designed to produce a block decomposition of an MSA, where each block is comprised of sequences exhibiting a certain site-coherence. The key component of this framework is an information theoretical potential defined on… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 15 pages, 2 figures

    MSC Class: 94D99 (Primary)

  13. arXiv:2312.12679  [pdf, other

    cs.LG cs.AI cs.LO

    Towards Efficient Verification of Quantized Neural Networks

    Authors: Pei Huang, Haoze Wu, Yuting Yang, Ieva Daukantas, Min Wu, Yedi Zhang, Clark Barrett

    Abstract: Quantization replaces floating point arithmetic with integer arithmetic in deep neural network models, providing more efficient on-device inference with less power and memory. In this work, we propose a framework for formally verifying properties of quantized neural networks. Our baseline technique is based on integer linear programming which guarantees both soundness and completeness. We then sho… ▽ More

    Submitted 27 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: This paper has been accepted by AAAI2024

  14. arXiv:2312.07104  [pdf, other

    cs.AI cs.PL

    SGLang: Efficient Execution of Structured Language Model Programs

    Authors: Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng

    Abstract: Large language models (LLMs) are increasingly used for complex tasks that require multiple generation calls, advanced prompting techniques, control flow, and structured inputs/outputs. However, efficient systems are lacking for programming and executing these applications. We introduce SGLang, a system for efficient execution of complex language model programs. SGLang consists of a frontend langua… ▽ More

    Submitted 5 June, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  15. arXiv:2310.17807  [pdf, other

    cs.AI cs.LG cs.SE

    Clover: Closed-Loop Verifiable Code Generation

    Authors: Chuyue Sun, Ying Sheng, Oded Padon, Clark Barrett

    Abstract: The use of large language models for code generation is a rapidly growing trend in software development. However, without effective methods for ensuring the correctness of generated code, this trend could lead to any number of undesirable outcomes. In this paper, we lay out a vision for addressing this challenge: the Clover paradigm, short for Closed-Loop Verifiable Code Generation, which reduces… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  16. arXiv:2310.04870  [pdf, other

    cs.FL cs.AI cs.LG cs.LO

    Lemur: Integrating Large Language Models in Automated Program Verification

    Authors: Haoze Wu, Clark Barrett, Nina Narodytska

    Abstract: The demonstrated code-understanding capability of LLMs raises the question of whether they can be used for automated program verification, a task that demands high-level abstract reasoning about program properties that is challenging for verification tools. We propose a general methodology to combine the power of LLMs and automated reasoners for automated program verification. We formally describe… ▽ More

    Submitted 24 April, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR'24

  17. Identifying and Mitigating the Security Risks of Generative AI

    Authors: Clark Barrett, Brad Boyd, Elie Burzstein, Nicholas Carlini, Brad Chen, Jihye Choi, Amrita Roy Chowdhury, Mihai Christodorescu, Anupam Datta, Soheil Feizi, Kathleen Fisher, Tatsunori Hashimoto, Dan Hendrycks, Somesh Jha, Daniel Kang, Florian Kerschbaum, Eric Mitchell, John Mitchell, Zulfikar Ramzan, Khawaja Shams, Dawn Song, Ankur Taly, Diyi Yang

    Abstract: Every major technical invention resurfaces the dual-use dilemma -- the new technology has the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such as large language models (LLMs) and diffusion models, have shown remarkable capabilities (e.g., in-context learning, code-completion, and text-to-image generation and editing). However, GenAI can be used just as well… ▽ More

    Submitted 28 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Journal ref: Foundations and Trends in Privacy and Security 6 (2023) 1-52

  18. arXiv:2308.13106  [pdf, other

    cs.PL cs.AR cs.LO

    PEak: A Single Source of Truth for Hardware Design and Verification

    Authors: Caleb Donovick, Ross Daly, Jackson Melchert, Lenny Truong, Priyanka Raina, Pat Hanrahan, Clark Barrett

    Abstract: Domain-specific languages for hardware can significantly enhance designer productivity, but sometimes at the cost of ease of verification. On the other hand, ISA specification languages are too static to be used during early stage design space exploration. We present PEak, an open-source hardware design and specification language, which aims to improve both design productivity and verification cap… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  19. arXiv:2307.07885  [pdf, ps, other

    cs.LO

    Combining Finite Combination Properties: Finite Models and Busy Beavers

    Authors: Guilherme Toledo, Yoni Zohar, Clark Barrett

    Abstract: This work is a part of an ongoing effort to understand the relationships between properties used in theory combination. We here focus on including two properties that are related to shiny theories: the finite model property and stable finiteness. For any combination of properties, we consider the question of whether there exists a theory that exhibits it. When there is, we provide an example with… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

  20. arXiv:2306.14048  [pdf, other

    cs.LG

    H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

    Authors: Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang Wang, Beidi Chen

    Abstract: Large Language Models (LLMs), despite their recent impressive accomplishments, are notably cost-prohibitive to deploy, particularly for applications involving long-content generation, such as dialogue systems and story writing. Often, a large amount of transient state information, referred to as the KV cache, is stored in GPU memory in addition to model parameters, scaling linearly with the sequen… ▽ More

    Submitted 18 December, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

  21. arXiv:2306.05854  [pdf, other

    cs.DC cs.LO

    Partitioning Strategies for Distributed SMT Solving

    Authors: Amalee Wilson, Andres Noetzli, Andrew Reynolds, Byron Cook, Cesare Tinelli, Clark Barrett

    Abstract: For many users of Satisfiability Modulo Theories (SMT) solvers, the solver's performance is the main bottleneck in their application. One promising approach for improving performance is to leverage the increasing availability of parallel and cloud computing. However, despite many efforts, the best parallel approach to date consists of running a portfolio of solvers, meaning that performance is sti… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Submitted to FMCAD 2023

  22. arXiv:2306.02003  [pdf, other

    cs.LG cs.AI cs.PF eess.SY stat.ML

    On Optimal Caching and Model Multiplexing for Large Model Inference

    Authors: Banghua Zhu, Ying Sheng, Lianmin Zheng, Clark Barrett, Michael I. Jordan, Jiantao Jiao

    Abstract: Large Language Models (LLMs) and other large foundation models have achieved noteworthy success, but their size exacerbates existing resource consumption and latency challenges. In particular, the large-scale deployment of these models is hindered by the significant resource requirements during inference. In this paper, we study two approaches for mitigating these challenges: employing a cache to… ▽ More

    Submitted 28 August, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  23. arXiv:2305.16073  [pdf, ps, other

    cs.LO

    On the Simply-Typed Functional Machine Calculus: Categorical Semantics and Strong Normalisation

    Authors: Chris Barrett

    Abstract: The Functional Machine Calculus (FMC) was recently introduced as a generalization of the lambda-calculus to include higher-order global state, probabilistic and non-deterministic choice, and input and output, while retaining confluence. The calculus can encode both the call-by-name and call-by-value semantics of these effects. This is enabled by two independent generalisations, both natural from t… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Ph.D. thesis; 158 pages; 11 figures

    ACM Class: F.1.1; F.3.2

  24. arXiv:2305.11087  [pdf, other

    cs.AI cs.LO

    Lightweight Online Learning for Sets of Related Problems in Automated Reasoning

    Authors: Haoze Wu, Christopher Hahn, Florian Lonsing, Makai Mann, Raghuram Ramanujan, Clark Barrett

    Abstract: We present Self-Driven Strategy Learning ($\textit{sdsl}$), a lightweight online learning methodology for automated reasoning tasks that involve solving a set of related problems. $\textit{sdsl}$ does not require offline training, but instead automatically constructs a dataset while solving earlier problems. It fits a machine learning model to this data which is then used to adjust the solving str… ▽ More

    Submitted 15 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Condensed version published at FMCAD'23

  25. arXiv:2305.06064  [pdf, other

    cs.LO cs.CC

    DNN Verification, Reachability, and the Exponential Function Problem

    Authors: Omri Isac, Yoni Zohar, Clark Barrett, Guy Katz

    Abstract: Deep neural networks (DNNs) are increasingly being deployed to perform safety-critical tasks. The opacity of DNNs, which prevents humans from reasoning about them, presents new safety and security challenges. To address these challenges, the verification community has begun developing techniques for rigorously analyzing DNNs, with numerous verification algorithms proposed in recent years. While a… ▽ More

    Submitted 10 July, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: This is a preprint version of the paper that appears at CONCUR 2023

  26. arXiv:2305.02384  [pdf, ps, other

    cs.LO math.LO

    Combining Combination Properties: An Analysis of Stable Infiniteness, Convexity, and Politeness

    Authors: Guilherme Vicentin de Toledo, Yoni Zohar, Clark Barrett

    Abstract: We make two contributions to the study of theory combination in satisfiability modulo theories. The first is a table of examples for the combinations of the most common model-theoretic properties in theory combination, namely stable infiniteness, smoothness, convexity, finite witnessability, and strong finite witnessability (and therefore politeness and strong politeness as well). All of our examp… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  27. arXiv:2303.13697  [pdf, other

    cs.RO cs.LO eess.SY math.OC

    Soy: An Efficient MILP Solver for Piecewise-Affine Systems

    Authors: Haoze Wu, Min Wu, Dorsa Sadigh, Clark Barrett

    Abstract: Piecewise-affine (PWA) systems are widely used for modeling and control of robotics problems including modeling contact dynamics. A common approach is to encode the control problem of the PWA system as a Mixed-Integer Convex Program (MICP), which can be solved by general-purpose off-the-shelf MICP solvers. To mitigate the scalability challenge of solving these MICP problems, existing work focuses… ▽ More

    Submitted 15 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Same version published at IROS'2023

  28. arXiv:2303.06865  [pdf, other

    cs.LG cs.AI cs.PF

    FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

    Authors: Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, Daniel Y. Fu, Zhiqiang Xie, Beidi Chen, Clark Barrett, Joseph E. Gonzalez, Percy Liang, Christopher Ré, Ion Stoica, Ce Zhang

    Abstract: The high computational and memory requirements of large language model (LLM) inference make it feasible only with multiple high-end accelerators. Motivated by the emerging demand for latency-insensitive tasks with batched processing, this paper initiates the study of high-throughput LLM inference using limited resources, such as a single commodity GPU. We present FlexGen, a high-throughput generat… ▽ More

    Submitted 12 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  29. arXiv:2303.01713  [pdf, other

    cs.LG

    Convex Bounds on the Softmax Function with Applications to Robustness Verification

    Authors: Dennis Wei, Haoze Wu, Min Wu, Pin-Yu Chen, Clark Barrett, Eitan Farchi

    Abstract: The softmax function is a ubiquitous component at the output of neural networks and increasingly in intermediate layers as well. This paper provides convex lower bounds and concave upper bounds on the softmax function, which are compatible with convex optimization formulations for characterizing neural networks and other ML models. We derive bounds using both a natural exponential-reciprocal decom… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: AISTATS 2023

  30. arXiv:2212.01051  [pdf, other

    cs.LG

    VeriX: Towards Verified Explainability of Deep Neural Networks

    Authors: Min Wu, Haoze Wu, Clark Barrett

    Abstract: We present VeriX (Verified eXplainability), a system for producing optimal robust explanations and generating counterfactuals along decision boundaries of machine learning models. We build such explanations and counterfactuals iteratively using constraint solving techniques and a heuristic based on feature-level sensitivity ranking. We evaluate our method on image recognition benchmarks and a real… ▽ More

    Submitted 25 September, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

  31. arXiv:2211.13140  [pdf, other

    cs.LO cs.PL

    The Functional Machine Calculus II: Semantics

    Authors: Chris Barrett, Willem Heijltjes, Guy McCusker

    Abstract: The Functional Machine Calculus (FMC), recently introduced by the authors, is a generalization of the lambda-calculus which may faithfully encode the effects of higher-order mutable store, I/O and probabilistic/non-deterministic input. Significantly, it remains confluent and can be simply typed in the presence of these effects. In this paper, we explore the denotational semantics of the FMC. We ha… ▽ More

    Submitted 5 February, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 40 pages, published in Computer Science Logic 2023 Updated to conform to published version

    ACM Class: F.1.1; F.3.2

  32. arXiv:2210.12871  [pdf, other

    cs.LG cs.LO cs.NE

    Tighter Abstract Queries in Neural Network Verification

    Authors: Elazar Cohen, Yizhak Yisrael Elboher, Clark Barrett, Guy Katz

    Abstract: Neural networks have become critical components of reactive systems in various domains within computer science. Despite their excellent performance, using neural networks entails numerous risks that stem from our lack of ability to understand and reason about their behavior. Due to these risks, various formal methods have been proposed for verifying neural networks; but unfortunately, these typica… ▽ More

    Submitted 14 May, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

  33. arXiv:2209.05201  [pdf, other

    cs.LO

    Proof-Stitch: Proof Combination for Divide and Conquer SAT Solvers

    Authors: Abhishek Nair, Saranyu Chattopadhyay, Haoze Wu, Alex Ozdemir, Clark Barrett

    Abstract: With the increasing availability of parallel computing power, there is a growing focus on parallelizing algorithms for important automated reasoning problems such as Boolean satisfiability (SAT). Divide-and-Conquer (D&C) is a popular parallel SAT solving paradigm that partitions SAT instances into independent sub-problems which are then solved in parallel. For unsatisfiable instances, state-of-the… ▽ More

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: 6 pages

  34. arXiv:2208.07669  [pdf, ps, other

    cs.LG cs.LO

    On Optimizing Back-Substitution Methods for Neural Network Verification

    Authors: Tom Zelazny, Haoze Wu, Clark Barrett, Guy Katz

    Abstract: With the increasing application of deep learning in mission-critical systems, there is a growing need to obtain formal guarantees about the behaviors of neural networks. Indeed, many approaches for verifying neural networks have been recently proposed, but these generally struggle with limited scalability or insufficient accuracy. A key component in many state-of-the-art verification schemes is co… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: This is the extended version of a paper with the same title that appeared at FMCAD 2022

    MSC Class: 68Q60

  35. arXiv:2206.03669  [pdf, other

    cs.LG cs.AI cs.LO

    Toward Certified Robustness Against Real-World Distribution Shifts

    Authors: Haoze Wu, Teruhiro Tagomori, Alexander Robey, Fengjun Yang, Nikolai Matni, George Pappas, Hamed Hassani, Corina Pasareanu, Clark Barrett

    Abstract: We consider the problem of certifying the robustness of deep neural networks against real-world distribution shifts. To do so, we bridge the gap between hand-crafted specifications and realistic deployment settings by proposing a novel neural-symbolic verification framework, in which we train a generative model to learn perturbations from data and define specifications with respect to the output o… ▽ More

    Submitted 6 March, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: SatML'23. Keywords: certified robustness, distribution shift, generative models, S-shaped activations, CEGAR

  36. arXiv:2206.00512  [pdf, other

    cs.LO cs.LG

    Neural Network Verification with Proof Production

    Authors: Omri Isac, Clark Barrett, Min Zhang, Guy Katz

    Abstract: Deep neural networks (DNNs) are increasingly being employed in safety-critical systems, and there is an urgent need to guarantee their correctness. Consequently, the verification community has devised multiple techniques and tools for verifying DNNs. When DNN verifiers discover an input that triggers an error, that is easy to confirm; but when they report that no error exists, there is no way to e… ▽ More

    Submitted 27 August, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: This is a preprint version of the paper that appeared at FMCAD 2022

  37. arXiv:2205.14867  [pdf, other

    cs.CY cs.LG

    Measuring and mitigating voting access disparities: a study of race and polling locations in Florida and North Carolina

    Authors: Mohsen Abbasi, Suresh Venkatasubramanian, Sorelle A. Friedler, Kristian Lum, Calvin Barrett

    Abstract: Voter suppression and associated racial disparities in access to voting are long-standing civil rights concerns in the United States. Barriers to voting have taken many forms over the decades. A history of violent explicit discouragement has shifted to more subtle access limitations that can include long lines and wait times, long travel times to reach a polling station, and other logistical barri… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  38. arXiv:2205.08095  [pdf, other

    cs.LO

    Reasoning About Vectors using an SMT Theory of Sequences

    Authors: Ying Sheng, Andres Nötzli, Andrew Reynolds, Yoni Zohar, David Dill, Wolfgang Grieskamp, Junkil Park, Shaz Qadeer, Clark Barrett, Cesare Tinelli

    Abstract: Dynamic arrays, also referred to as vectors, are fundamental data structures used in many programs. Modeling their semantics efficiently is crucial when reasoning about such programs. The theory of arrays is widely supported but is not ideal, because the number of elements is fixed (determined by its index sort) and cannot be adjusted, which is a problem, given that the length of vectors often pla… ▽ More

    Submitted 21 May, 2022; v1 submitted 17 May, 2022; originally announced May 2022.

    Comments: IJCAR 2022

  39. arXiv:2203.11201  [pdf, other

    cs.LG cs.AI cs.LO math.OC

    Efficient Neural Network Analysis with Sum-of-Infeasibilities

    Authors: Haoze Wu, Aleksandar Zeljić, Guy Katz, Clark Barrett

    Abstract: Inspired by sum-of-infeasibilities methods in convex optimization, we propose a novel procedure for analyzing verification queries on neural networks with piecewise-linear activation functions. Given a convex relaxation which over-approximates the non-convex activation functions, we encode the violations of activation functions as a cost function and optimize it with respect to the convex relaxati… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: TACAS'22

  40. Scalable Verification of GNN-based Job Schedulers

    Authors: Haoze Wu, Clark Barrett, Mahmood Sharif, Nina Narodytska, Gagandeep Singh

    Abstract: Recently, Graph Neural Networks (GNNs) have been applied for scheduling jobs over clusters, achieving better performance than hand-crafted heuristics. Despite their impressive performance, concerns remain over whether these GNN-based job schedulers meet users' expectations about other important properties, such as strategy-proofness, sharing incentive, and stability. In this work, we consider form… ▽ More

    Submitted 15 September, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Condensed version published at OOPSLA'22

  41. arXiv:2201.01978  [pdf, other

    cs.LG cs.CV cs.LO

    An Abstraction-Refinement Approach to Verifying Convolutional Neural Networks

    Authors: Matan Ostrovsky, Clark Barrett, Guy Katz

    Abstract: Convolutional neural networks have gained vast popularity due to their excellent performance in the fields of computer vision, image processing, and others. Unfortunately, it is now well known that convolutional networks often produce erroneous results - for example, minor perturbations of the inputs of these networks can result in severe classification errors. Numerous verification approaches hav… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

    MSC Class: 68Q60

  42. Scaling Up Hardware Accelerator Verification using A-QED with Functional Decomposition

    Authors: Saranyu Chattopadhyay, Florian Lonsing, Luca Piccolboni, Deepraj Soni, Peng Wei, Xiaofan Zhang, Yuan Zhou, Luca Carloni, Deming Chen, Jason Cong, Ramesh Karri, Zhiru Zhang, Caroline Trippel, Clark Barrett, Subhasish Mitra

    Abstract: Hardware accelerators (HAs) are essential building blocks for fast and energy-efficient computing systems. Accelerator Quick Error Detection (A-QED) is a recent formal technique which uses Bounded Model Checking for pre-silicon verification of HAs. A-QED checks an HA for self-consistency, i.e., whether identical inputs within a sequence of operations always produce the same output. Under modest as… ▽ More

    Submitted 17 August, 2021; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: preprint of a paper to appear at FMCAD 2021, including appendix

  43. arXiv:2108.05987  [pdf, other

    cs.FL cs.AR

    Automating System Configuration

    Authors: Nestan Tsiskaridze, Maxwell Strange, Makai Mann, Kavya Sreedhar, Qiaoyi Liu, Mark Horowitz, Clark Barrett

    Abstract: The increasing complexity of modern configurable systems makes it critical to improve the level of automation in the process of system configuration. Such automation can also improve the agility of the development cycle, allowing for rapid and automated integration of decoupled workflows. In this paper, we present a new framework for automated configuration of systems representable as state machin… ▽ More

    Submitted 18 August, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

  44. arXiv:2106.10392  [pdf

    cs.AR cs.LO

    Effective Pre-Silicon Verification of Processor Cores by Breaking the Bounds of Symbolic Quick Error Detection

    Authors: Karthik Ganesan, Florian Lonsing, Srinivasa Shashank Nuthakki, Eshan Singh, Mohammad Rahmani Fadiheh, Wolfgang Kunz, Dominik Stoffel, Clark Barrett, Subhasish Mitra

    Abstract: We present a novel approach to pre-silicon verification of processor designs. The purpose of pre-silicon verification is to find logic bugs in a design at an early stage and thus avoid time- and cost-intensive post-silicon debugging. Our approach relies on symbolic quick error detection (Symbolic QED, or SQED). SQED is targeted at finding logic bugs in a symbolic representation of a design by comb… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: This article has the full author list which was missing in arXiv:1908.06757. arXiv admin note: substantial text overlap with arXiv:1908.06757

  45. arXiv:2105.09743  [pdf, ps, other

    cs.LO

    lazybvtoint at the SMT Competition 2020

    Authors: Yoni Zohar, Ahmed Irfan, Makai Mann, Andres Notzli, Andrew Reynolds, Clark Barrett

    Abstract: lazybvtoint is a new prototype SMT-solver, that will participate in the incremental and non-incremental tracks of the \qfbv logic.

    Submitted 7 May, 2021; originally announced May 2021.

  46. arXiv:2105.01382  [pdf, other

    cs.LO math.LO

    A Subatomic Proof System for Decision Trees

    Authors: Chris Barrett, Alessio Guglielmi

    Abstract: We design a proof system for propositional classical logic that integrates two languages for Boolean functions: standard conjunction-disjunction-negation and binary decision trees. We give two reasons to do so. The first is proof-theoretical naturalness: the system consists of all and only the inference rules generated by the single, simple, linear scheme of the recently introduced subatomic logic… ▽ More

    Submitted 30 June, 2022; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: To appear on ACM Transactions on Computational Logic

  47. arXiv:2104.14155  [pdf, other

    cs.AR

    Automated Design Space Exploration of CGRA Processing Element Architectures using Frequent Subgraph Analysis

    Authors: Jackson Melchert, Kathleen Feng, Caleb Donovick, Ross Daly, Clark Barrett, Mark Horowitz, Pat Hanrahan, Priyanka Raina

    Abstract: The architecture of a coarse-grained reconfigurable array (CGRA) processing element (PE) has a significant effect on the performance and energy efficiency of an application running on the CGRA. This paper presents an automated approach for generating specialized PE architectures for an application or an application domain. Frequent subgraphs mined from a set of applications are merged to form a PE… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  48. arXiv:2104.11738  [pdf, ps, other

    cs.LO

    Politeness and Stable Infiniteness: Stronger Together

    Authors: Ying Sheng, Yoni Zohar, Christophe Ringeissen, Andrew Reynolds, Clark Barrett, Cesare Tinelli

    Abstract: We make two contributions to the study of polite combination in satisfiability modulo theories. The first contribution is a separation between politeness and strong politeness, by presenting a polite theory that is not strongly polite. This result shows that proving strong politeness (which is often harder than proving politeness) is sometimes needed in order to use polite combination. The s… ▽ More

    Submitted 27 April, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

  49. arXiv:2103.01629  [pdf, other

    cs.LG

    DeepCert: Verification of Contextually Relevant Robustness for Neural Network Image Classifiers

    Authors: Colin Paterson, Haoze Wu, John Grese, Radu Calinescu, Corina S. Pasareanu, Clark Barrett

    Abstract: We introduce DeepCert, a tool-supported method for verifying the robustness of deep neural network (DNN) image classifiers to contextually relevant perturbations such as blur, haze, and changes in image contrast. While the robustness of DNN classifiers has been the subject of intense research in recent years, the solutions delivered by this research focus on verifying DNN robustness to small pertu… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

  50. Counterexample-Guided Prophecy for Model Checking Modulo the Theory of Arrays

    Authors: Makai Mann, Ahmed Irfan, Alberto Griggio, Oded Padon, Clark Barrett

    Abstract: We develop a framework for model checking infinite-state systems by automatically augmenting them with auxiliary variables, enabling quantifier-free induction proofs for systems that would otherwise require quantified invariants. We combine this mechanism with a counterexample-guided abstraction refinement scheme for the theory of arrays. Our framework can thus, in many cases, reduce inductive rea… ▽ More

    Submitted 30 August, 2022; v1 submitted 17 January, 2021; originally announced January 2021.

    Journal ref: Logical Methods in Computer Science, Volume 18, Issue 3 (August 31, 2022) lmcs:8436