Skip to main content

Showing 1–50 of 99 results for author: Hashimoto, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08351  [pdf, other

    cs.CL cs.LG

    AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models

    Authors: Xiang Lisa Li, Evan Zheran Liu, Percy Liang, Tatsunori Hashimoto

    Abstract: Evaluation is critical for assessing capabilities, tracking scientific progress, and informing model selection. In this paper, we present three desiderata for a good benchmark for language models: (i) salience (e.g., knowledge about World War II is more salient than a random day in history), (ii) novelty (i.e., the benchmark reveals new trends in model rankings not shown by previous benchmarks), a… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: preprint

  2. arXiv:2407.04620  [pdf, other

    cs.LG cs.AI cs.CL

    Learning to (Learn at Test Time): RNNs with Expressive Hidden States

    Authors: Yu Sun, Xinhao Li, Karan Dalal, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen, Xiaolong Wang, Sanmi Koyejo, Tatsunori Hashimoto, Carlos Guestrin

    Abstract: Self-attention performs well in long context but has quadratic complexity. Existing RNN layers have linear complexity, but their performance in long context is limited by the expressive power of their hidden state. We propose a new class of sequence modeling layers with linear complexity and an expressive hidden state. The key idea is to make the hidden state a machine learning model itself, and t… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2407.01023  [pdf, other

    cs.LG

    DistML.js: Installation-free Distributed Deep Learning Framework for Web Browsers

    Authors: Masatoshi Hidaka, Tomohiro Hashimoto, Yuto Nishizawa, Tatsuya Harada

    Abstract: We present "DistML.js", a library designed for training and inference of machine learning models within web browsers. Not only does DistML.js facilitate model training on local devices, but it also supports distributed learning through communication with servers. Its design and define-by-run API for deep learning model construction resemble PyTorch, thereby reducing the learning curve for prototyp… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2406.14785  [pdf, other

    cs.CL cs.LG

    Understanding Finetuning for Factual Knowledge Extraction

    Authors: Gaurav Ghosal, Tatsunori Hashimoto, Aditi Raghunathan

    Abstract: In this work, we study the impact of QA fine-tuning data on downstream factuality. We show that fine-tuning on lesser-known facts that are poorly stored during pretraining yields significantly worse factuality than fine-tuning on well-known facts, even when all facts are seen during pretraining. We prove this phenomenon theoretically, showing that training on lesser-known facts can lead the model… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  5. arXiv:2405.20456  [pdf, other

    cs.LG

    Scaling Laws for the Value of Individual Data Points in Machine Learning

    Authors: Ian Covert, Wenlong Ji, Tatsunori Hashimoto, James Zou

    Abstract: Recent works have shown that machine learning models improve at a predictable rate with the total amount of training data, leading to scaling laws that describe the relationship between error and dataset size. These scaling laws can help design a model's training dataset, but they typically take an aggregate view of the data by only considering the dataset's size. We introduce a new perspective by… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: ICML 2024 camera-ready

  6. arXiv:2405.10938  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Observational Scaling Laws and the Predictability of Language Model Performance

    Authors: Yangjun Ruan, Chris J. Maddison, Tatsunori Hashimoto

    Abstract: Understanding how language model performance varies with scale is critical to benchmark and algorithm development. Scaling laws are one approach to building this understanding, but the requirement of training models across many different scales has limited their use. We propose an alternative, observational approach that bypasses model training and instead builds scaling laws from ~80 publically a… ▽ More

    Submitted 2 July, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  7. arXiv:2404.04500  [pdf, other

    cs.CR cs.AI cs.CY cs.LG

    Trustless Audits without Revealing Data or Models

    Authors: Suppakit Waiwitlikhit, Ion Stoica, Yi Sun, Tatsunori Hashimoto, Daniel Kang

    Abstract: There is an increasing conflict between business incentives to hide models and data as trade secrets, and the societal need for algorithmic transparency. For example, a rightsholder wishing to know whether their copyrighted works have been used during training must convince the model provider to allow a third party to audit the model and data. Finding a mutually agreeable third party is difficult,… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  8. arXiv:2404.04475  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

    Authors: Yann Dubois, Balázs Galambosi, Percy Liang, Tatsunori B. Hashimoto

    Abstract: LLM-based auto-annotators have become a key component of the LLM development process due to their cost-effectiveness and scalability compared to human-based evaluation. However, these auto-annotators can introduce complex biases that are hard to remove. Even simple, known confounders such as preference for longer outputs remain in existing automated evaluation metrics. We propose a simple regressi… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  9. arXiv:2404.00474  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Linguistic Calibration of Long-Form Generations

    Authors: Neil Band, Xuechen Li, Tengyu Ma, Tatsunori Hashimoto

    Abstract: Language models (LMs) may lead their users to make suboptimal downstream decisions when they confidently hallucinate. This issue can be mitigated by having the LM verbally convey the probability that its claims are correct, but existing models cannot produce long-form text with calibrated confidence statements. Through the lens of decision-making, we define linguistic calibration for long-form gen… ▽ More

    Submitted 4 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: ICML 2024. Code available at https://github.com/tatsu-lab/linguistic_calibration

  10. arXiv:2402.16827  [pdf, other

    cs.CL cs.LG

    A Survey on Data Selection for Language Models

    Authors: Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang

    Abstract: A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available data may not be optimal (or feasible), as the quality of available text data can vary. Filtering out data can also decrease the carbon footprint and financial costs of training models by reducing the am… ▽ More

    Submitted 8 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Paper list available at https://github.com/alon-albalak/data-selection-survey

  11. arXiv:2402.10978  [pdf, other

    cs.LG cs.AI cs.CL

    Language Models with Conformal Factuality Guarantees

    Authors: Christopher Mohri, Tatsunori Hashimoto

    Abstract: Guaranteeing the correctness and factuality of language model (LM) outputs is a major open problem. In this work, we propose conformal factuality, a framework that can ensure high probability correctness guarantees for LMs by connecting language modeling and conformal prediction. We observe that the correctness of an LM output is equivalent to an uncertainty quantification problem, where the uncer… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  12. arXiv:2401.15866  [pdf, other

    cs.LG

    Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution

    Authors: Ian Covert, Chanwoo Kim, Su-In Lee, James Zou, Tatsunori Hashimoto

    Abstract: Many tasks in explainable machine learning, such as data valuation and feature attribution, perform expensive computation for each data point and can be intractable for large datasets. These methods require efficient approximations, and learning a network that directly predicts the desired output, which is commonly known as amortization, is a promising solution. However, training such models with… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  13. arXiv:2401.10005  [pdf, other

    cs.CV cs.CL

    Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

    Authors: Kohei Uehara, Nabarun Goswami, Hanqin Wang, Toshiaki Baba, Kohtaro Tanaka, Tomohiro Hashimoto, Kai Wang, Rei Ito, Takagi Naoya, Ryo Umagami, Yingyi Wen, Tanachai Anakewat, Tatsuya Harada

    Abstract: The increasing demand for intelligent systems capable of interpreting and reasoning about visual content requires the development of Large Multi-Modal Models (LMMs) that are not only accurate but also have explicit reasoning capabilities. This paper presents a novel approach to imbue an LMM with the ability to conduct explicit reasoning based on visual content and textual instructions. We introduc… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  14. arXiv:2312.04469  [pdf, other

    cs.LG cs.CL cs.CR

    On the Learnability of Watermarks for Language Models

    Authors: Chenchen Gu, Xiang Lisa Li, Percy Liang, Tatsunori Hashimoto

    Abstract: Watermarking of language model outputs enables statistical detection of model-generated text, which can mitigate harms and misuses of language models. Existing watermarking strategies operate by altering the decoder of an existing language model. In this paper, we ask whether language models can directly learn to generate watermarked text, which would have significant implications for the real-wor… ▽ More

    Submitted 2 May, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted at ICLR 2024

  15. arXiv:2312.01707  [pdf, other

    cs.HC

    Perceptual Dimensions of Physical Properties of Handheld Objects Induced by Impedance Changes

    Authors: Takeru Hashimoto, Shigeo Yoshida, Takuji Narumi

    Abstract: Haptics in virtual reality is the emerging dimension after audiovisual experiences. Researchers designed several handheld VR controllers to simulate haptic experiences in virtual reality environments. Some of these devices, equipped to deliver active force, can dynamically alter the timing and intensity of force feedback, potentially offering a wide array of haptic sensations. Past research primar… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  16. arXiv:2312.00364  [pdf, other

    cs.LG cs.CV

    Benchmarking Multi-Domain Active Learning on Image Classification

    Authors: Jiayi Li, Rohan Taori, Tatsunori B. Hashimoto

    Abstract: Active learning aims to enhance model performance by strategically labeling informative data points. While extensively studied, its effectiveness on large-scale, real-world datasets remains underexplored. Existing research primarily focuses on single-source data, ignoring the multi-domain nature of real-world data. We introduce a multi-domain active learning benchmark to bridge this gap. Our bench… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  17. arXiv:2311.05553  [pdf, other

    cs.CL cs.AI

    Removing RLHF Protections in GPT-4 via Fine-Tuning

    Authors: Qiusi Zhan, Richard Fang, Rohan Bindu, Akul Gupta, Tatsunori Hashimoto, Daniel Kang

    Abstract: As large language models (LLMs) have increased in their capabilities, so does their potential for dual use. To reduce harmful outputs, produces and vendors of LLMs have used reinforcement learning with human feedback (RLHF). In tandem, LLM vendors have been increasingly enabling fine-tuning of their most powerful models. However, concurrent work has shown that fine-tuning can remove RLHF protectio… ▽ More

    Submitted 5 April, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024. (7 pages)

  18. arXiv:2310.19677  [pdf, other

    cs.CL

    MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

    Authors: Allen Nie, Yuhui Zhang, Atharva Amdekar, Chris Piech, Tatsunori Hashimoto, Tobias Gerstenberg

    Abstract: Human commonsense understanding of the physical and social world is organized around intuitive theories. These theories support making causal and moral judgments. When something bad happens, we naturally ask: who did what, and why? A rich literature in cognitive science has studied people's causal and moral intuitions. This work has revealed a number of factors that systematically influence people… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 34 pages, 7 figures. NeurIPS 2023

  19. arXiv:2310.18413  [pdf, other

    cs.LG cs.AI stat.ML

    On the Fairness ROAD: Robust Optimization for Adversarial Debiasing

    Authors: Vincent Grari, Thibault Laugel, Tatsunori Hashimoto, Sylvain Lamprier, Marcin Detyniecki

    Abstract: In the field of algorithmic fairness, significant attention has been put on group fairness criteria, such as Demographic Parity and Equalized Odds. Nevertheless, these objectives, measured as global averages, have raised concerns about persistent local disparities between sensitive groups. In this work, we address the problem of local fairness, which ensures that the predictor is unbiased not only… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 23 pages, 10 figures

  20. arXiv:2310.17623  [pdf, other

    cs.CL cs.LG

    Proving Test Set Contamination in Black Box Language Models

    Authors: Yonatan Oren, Nicole Meister, Niladri Chatterji, Faisal Ladhak, Tatsunori B. Hashimoto

    Abstract: Large language models are trained on vast amounts of internet data, prompting concerns and speculation that they have memorized public benchmarks. Going from speculation to proof of contamination is challenging, as the pretraining data used by proprietary models are often not publicly accessible. We show that it is possible to provide provable guarantees of test set contamination in language model… ▽ More

    Submitted 23 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

  21. arXiv:2310.13807  [pdf, other

    cs.LG

    Learning to (Learn at Test Time)

    Authors: Yu Sun, Xinhao Li, Karan Dalal, Chloe Hsu, Sanmi Koyejo, Carlos Guestrin, Xiaolong Wang, Tatsunori Hashimoto, Xinlei Chen

    Abstract: We reformulate the problem of supervised learning as learning to learn with two nested loops (i.e. learning problems). The inner loop learns on each individual instance with self-supervision before final prediction. The outer loop learns the self-supervised task used by the inner loop, such that its final prediction improves. Our inner loop turns out to be equivalent to linear attention when the i… ▽ More

    Submitted 7 January, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Fixed a few small typos

  22. arXiv:2310.01846  [pdf, other

    cs.CL cs.LG

    Benchmarking and Improving Generator-Validator Consistency of Language Models

    Authors: Xiang Lisa Li, Vaishnavi Shrivastava, Siyan Li, Tatsunori Hashimoto, Percy Liang

    Abstract: As of September 2023, ChatGPT correctly answers "what is 7+8" with 15, but when asked "7+8=15, True or False" it responds with "False". This inconsistency between generating and validating an answer is prevalent in language models (LMs) and erodes trust. In this paper, we propose a framework for measuring the consistency between generation and validation (which we call generator-validator consiste… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: preprint

  23. arXiv:2309.15817  [pdf, other

    cs.AI cs.CL cs.LG

    Identifying the Risks of LM Agents with an LM-Emulated Sandbox

    Authors: Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto

    Abstract: Recent advances in Language Model (LM) agents and tool use, exemplified by applications like ChatGPT Plugins, enable a rich set of capabilities but also amplify potential risks - such as leaking private data or causing financial losses. Identifying these risks is labor-intensive, necessitating implementing the tools, setting up the environment for each test scenario manually, and finding risky cas… ▽ More

    Submitted 17 May, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  24. arXiv:2309.07875  [pdf, other

    cs.CL

    Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

    Authors: Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou

    Abstract: Training large language models to follow instructions makes them perform better on a wide range of tasks and generally become more helpful. However, a perfectly helpful model will follow even the most malicious instructions and readily generate harmful content. In this paper, we raise concerns over the safety of models that only emphasize helpfulness, not harmlessness, in their instruction-tuning.… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  25. Identifying and Mitigating the Security Risks of Generative AI

    Authors: Clark Barrett, Brad Boyd, Elie Burzstein, Nicholas Carlini, Brad Chen, Jihye Choi, Amrita Roy Chowdhury, Mihai Christodorescu, Anupam Datta, Soheil Feizi, Kathleen Fisher, Tatsunori Hashimoto, Dan Hendrycks, Somesh Jha, Daniel Kang, Florian Kerschbaum, Eric Mitchell, John Mitchell, Zulfikar Ramzan, Khawaja Shams, Dawn Song, Ankur Taly, Diyi Yang

    Abstract: Every major technical invention resurfaces the dual-use dilemma -- the new technology has the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such as large language models (LLMs) and diffusion models, have shown remarkable capabilities (e.g., in-context learning, code-completion, and text-to-image generation and editing). However, GenAI can be used just as well… ▽ More

    Submitted 28 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Journal ref: Foundations and Trends in Privacy and Security 6 (2023) 1-52

  26. Accelerating Aggregation Queries on Unstructured Streams of Data

    Authors: Matthew Russo, Tatsunori Hashimoto, Daniel Kang, Yi Sun, Matei Zaharia

    Abstract: Analysts and scientists are interested in querying streams of video, audio, and text to extract quantitative insights. For example, an urban planner may wish to measure congestion by querying the live feed from a traffic camera. Prior work has used deep neural networks (DNNs) to answer such queries in the batch setting. However, much of this work is not suited for the streaming setting because it… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 14 pages, 11 figures, to be published in Proceedings of the VLDB Endowment, Vol. 16, No. 11

    Journal ref: PVLDB, 16(11): 2897 - 2910, 2023

  27. arXiv:2308.04635  [pdf

    cs.CY cs.AI

    Where's the Liability in Harmful AI Speech?

    Authors: Peter Henderson, Tatsunori Hashimoto, Mark Lemley

    Abstract: Generative AI, in particular text-based "foundation models" (large models trained on a huge variety of information including the internet), can generate speech that could be problematic under a wide range of liability regimes. Machine learning practitioners regularly "red team" models to identify and mitigate such problematic speech: from "hallucinations" falsely accusing people of serious miscond… ▽ More

    Submitted 16 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Published in the Journal of Free Speech Law (2023)

  28. arXiv:2307.15593  [pdf, other

    cs.LG cs.CL cs.CR

    Robust Distortion-free Watermarks for Language Models

    Authors: Rohith Kuditipudi, John Thickstun, Tatsunori Hashimoto, Percy Liang

    Abstract: We propose a methodology for planting watermarks in text from an autoregressive language model that are robust to perturbations without changing the distribution over text up to a certain maximum generation budget. We generate watermarked text by mapping a sequence of random numbers -- which we compute using a randomized watermark key -- to a sample from the language model. To detect watermarked t… ▽ More

    Submitted 6 June, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: reformatting of camera-ready version accepted to TMLR, with minor edits to introduction

  29. arXiv:2307.03576  [pdf, ps, other

    cs.LG

    One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention

    Authors: Arvind Mahankali, Tatsunori B. Hashimoto, Tengyu Ma

    Abstract: Recent works have empirically analyzed in-context learning and shown that transformers trained on synthetic linear regression tasks can learn to implement ridge regression, which is the Bayes-optimal predictor, given sufficient capacity [Akyürek et al., 2023], while one-layer transformers with linear self-attention and no MLP layer will learn to implement one step of gradient descent (GD) on a lea… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  30. arXiv:2305.18619  [pdf, other

    cs.CL cs.LG

    Likelihood-Based Diffusion Language Models

    Authors: Ishaan Gulrajani, Tatsunori B. Hashimoto

    Abstract: Despite a growing interest in diffusion-based language models, existing work has not shown that these models can attain nontrivial likelihoods on standard language modeling benchmarks. In this work, we take the first steps towards closing the likelihood gap between autoregressive and diffusion-based language models, with the goal of building and releasing a diffusion model which outperforms a smal… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  31. arXiv:2305.14387  [pdf, other

    cs.LG cs.AI cs.CL

    AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

    Authors: Yann Dubois, Xuechen Li, Rohan Taori, Tianyi Zhang, Ishaan Gulrajani, Jimmy Ba, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto

    Abstract: Large language models (LLMs) such as ChatGPT have seen widespread adoption due to their strong instruction-following abilities. Developing these LLMs involves a complex yet poorly understood workflow requiring training with human feedback. Replicating and understanding this instruction-following requires tackling three major challenges: the high cost of data collection, the lack of trustworthy eva… ▽ More

    Submitted 7 January, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Spotlight at NeurIPS 2023

  32. arXiv:2303.17548  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Whose Opinions Do Language Models Reflect?

    Authors: Shibani Santurkar, Esin Durmus, Faisal Ladhak, Cinoo Lee, Percy Liang, Tatsunori Hashimoto

    Abstract: Language models (LMs) are increasingly being used in open-ended contexts, where the opinions reflected by LMs in response to subjective queries can have a profound impact, both on user satisfaction, as well as shaping the views of society at large. In this work, we put forth a quantitative framework to investigate the opinions reflected by LMs -- by leveraging high-quality public opinion polls and… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  33. arXiv:2303.15715  [pdf, other

    cs.CY cs.AI cs.LG

    Foundation Models and Fair Use

    Authors: Peter Henderson, Xuechen Li, Dan Jurafsky, Tatsunori Hashimoto, Mark A. Lemley, Percy Liang

    Abstract: Existing foundation models are trained on copyrighted material. Deploying these models can pose both legal and ethical risks when data creators fail to receive appropriate attribution or compensation. In the United States and several other countries, copyrighted content may be used to build foundation models without incurring liability due to the fair use doctrine. However, there is a caveat: If t… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  34. arXiv:2302.13439  [pdf, other

    cs.CL cs.AI

    Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models

    Authors: Kaitlyn Zhou, Dan Jurafsky, Tatsunori Hashimoto

    Abstract: The increased deployment of LMs for real-world tasks involving knowledge and facts makes it important to understand model epistemology: what LMs think they know, and how their attitudes toward that knowledge are affected by language use in their inputs. Here, we study an aspect of model epistemology: how epistemic markers of certainty, uncertainty, or evidentiality like "I'm sure it's", "I think i… ▽ More

    Submitted 13 November, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

    Comments: EMNLP 2023 (Oral)

  35. arXiv:2302.11861  [pdf, other

    cs.LG cs.CV

    Out-of-Domain Robustness via Targeted Augmentations

    Authors: Irena Gao, Shiori Sagawa, Pang Wei Koh, Tatsunori Hashimoto, Percy Liang

    Abstract: Models trained on one set of domains often suffer performance drops on unseen domains, e.g., when wildlife monitoring models are deployed in new camera locations. In this work, we study principles for designing data augmentations for out-of-domain (OOD) generalization. In particular, we focus on real-world scenarios in which some domain-dependent features are robust, i.e., some features that vary… ▽ More

    Submitted 6 February, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

  36. arXiv:2302.05733  [pdf, other

    cs.CR cs.LG

    Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks

    Authors: Daniel Kang, Xuechen Li, Ion Stoica, Carlos Guestrin, Matei Zaharia, Tatsunori Hashimoto

    Abstract: Recent advances in instruction-following large language models (LLMs) have led to dramatic improvements in a range of NLP tasks. Unfortunately, we find that the same improved capabilities amplify the dual-use risks for malicious purposes of these models. Dual-use is difficult to prevent as instruction-following capabilities now enable standard attacks from computer security. The capabilities of th… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

  37. arXiv:2302.03068  [pdf, other

    cs.LG cs.AI stat.ML

    Evaluating Self-Supervised Learning via Risk Decomposition

    Authors: Yann Dubois, Tatsunori Hashimoto, Percy Liang

    Abstract: Self-supervised learning (SSL) pipelines differ in many design choices such as the architecture, augmentations, or pretraining data. Yet SSL is typically evaluated using a single metric: linear probing on ImageNet. This does not provide much insight into why or when a model is better, now how to improve it. To address this, we propose an SSL risk decomposition, which generalizes the classical supe… ▽ More

    Submitted 8 January, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Oral at ICML 2023

  38. arXiv:2301.13848  [pdf, other

    cs.CL cs.AI cs.LG

    Benchmarking Large Language Models for News Summarization

    Authors: Tianyi Zhang, Faisal Ladhak, Esin Durmus, Percy Liang, Kathleen McKeown, Tatsunori B. Hashimoto

    Abstract: Large language models (LLMs) have shown promise for automatic summarization but the reasons behind their successes are poorly understood. By conducting a human evaluation on ten LLMs across different pretraining methods, prompts, and model scales, we make two important observations. First, we find instruction tuning, and not model size, is the key to the LLM's zero-shot summarization capability. S… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  39. arXiv:2212.10722  [pdf, other

    cs.CL

    Contrastive Error Attribution for Finetuned Language Models

    Authors: Faisal Ladhak, Esin Durmus, Tatsunori Hashimoto

    Abstract: Recent work has identified noisy and misannotated data as a core cause of hallucinations and unfaithful outputs in Natural Language Generation (NLG) tasks. Consequently, identifying and removing these examples is a key open challenge in creating reliable NLG systems. In this work, we introduce a framework to identify and remove low-quality training instances that lead to undesirable outputs, such… ▽ More

    Submitted 11 July, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  40. arXiv:2212.10520  [pdf, other

    cs.CL

    Privacy-Preserving Domain Adaptation of Semantic Parsers

    Authors: Fatemehsadat Mireshghallah, Yu Su, Tatsunori Hashimoto, Jason Eisner, Richard Shin

    Abstract: Task-oriented dialogue systems often assist users with personal or confidential matters. For this reason, the developers of such a system are generally prohibited from observing actual usage. So how can they know where the system is failing and needs more training data or new functionality? In this work, we study ways in which realistic user utterances can be generated synthetically, to help incre… ▽ More

    Submitted 8 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  41. arXiv:2211.16490  [pdf, other

    cs.LG cs.CL cs.PL cs.SE

    Coder Reviewer Reranking for Code Generation

    Authors: Tianyi Zhang, Tao Yu, Tatsunori B. Hashimoto, Mike Lewis, Wen-tau Yih, Daniel Fried, Sida I. Wang

    Abstract: Sampling diverse programs from a code language model and reranking with model likelihood is a popular method for code generation but it is prone to preferring degenerate solutions. Inspired by collaborative programming, we propose Coder-Reviewer reranking. We augment Coder language models from past work, which generate programs given language instructions, with Reviewer models, which evaluate the… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  42. arXiv:2211.11971  [pdf, other

    cs.CV

    Multi-View Neural Surface Reconstruction with Structured Light

    Authors: Chunyu Li, Taisuke Hashimoto, Eiichi Matsumoto, Hiroharu Kato

    Abstract: Three-dimensional (3D) object reconstruction based on differentiable rendering (DR) is an active research topic in computer vision. DR-based methods minimize the difference between the rendered and target images by optimizing both the shape and appearance and realizing a high visual reproductivity. However, most approaches perform poorly for textureless objects because of the geometrical ambiguity… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted by BMVC 2022

  43. arXiv:2211.09110  [pdf, other

    cs.CL cs.AI cs.LG

    Holistic Evaluation of Language Models

    Authors: Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao , et al. (25 additional authors not shown)

    Abstract: Language models (LMs) are becoming the foundation for almost all major language technologies, but their capabilities, limitations, and risks are not well understood. We present Holistic Evaluation of Language Models (HELM) to improve the transparency of language models. First, we taxonomize the vast space of potential scenarios (i.e. use cases) and metrics (i.e. desiderata) that are of interest fo… ▽ More

    Submitted 1 October, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Project page: https://crfm.stanford.edu/helm/v1.0

    Journal ref: Published in Transactions on Machine Learning Research (TMLR), 2023

  44. arXiv:2211.04775  [pdf, other

    cs.CR

    ZK-IMG: Attested Images via Zero-Knowledge Proofs to Fight Disinformation

    Authors: Daniel Kang, Tatsunori Hashimoto, Ion Stoica, Yi Sun

    Abstract: Over the past few years, AI methods of generating images have been increasing in capabilities, with recent breakthroughs enabling high-resolution, photorealistic "deepfakes" (artificially generated images with the purpose of misinformation or harm). The rise of deepfakes has potential for social disruption. Recent work has proposed using ZK-SNARKs (zero-knowledge succinct non-interactive argument… ▽ More

    Submitted 10 November, 2022; v1 submitted 9 November, 2022; originally announced November 2022.

  45. Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

    Authors: Federico Bianchi, Pratyusha Kalluri, Esin Durmus, Faisal Ladhak, Myra Cheng, Debora Nozza, Tatsunori Hashimoto, Dan Jurafsky, James Zou, Aylin Caliskan

    Abstract: Machine learning models that convert user-written text descriptions into images are now widely available online and used by millions of users to generate millions of images a day. We investigate the potential for these models to amplify dangerous and complex stereotypes. We find a broad range of ordinary prompts produce stereotypes, including prompts simply mentioning traits, descriptors, occupati… ▽ More

    Submitted 7 June, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: FAccT 2023 paper. The published version is available at 10.1145/3593013.3594095

  46. arXiv:2210.15097  [pdf, other

    cs.CL cs.AI cs.LG

    Contrastive Decoding: Open-ended Text Generation as Optimization

    Authors: Xiang Lisa Li, Ari Holtzman, Daniel Fried, Percy Liang, Jason Eisner, Tatsunori Hashimoto, Luke Zettlemoyer, Mike Lewis

    Abstract: Given a language model (LM), maximum probability is a poor decoding objective for open-ended generation, because it produces short and repetitive text. On the other hand, sampling can often produce incoherent text that drifts from the original topics. We propose contrastive decoding (CD), a reliable decoding approach that optimizes a contrastive objective subject to a plausibility constraint. The… ▽ More

    Submitted 10 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Main conference long paper at ACL 2023

  47. arXiv:2210.08674  [pdf, ps, other

    cs.CR cs.LG

    Scaling up Trustless DNN Inference with Zero-Knowledge Proofs

    Authors: Daniel Kang, Tatsunori Hashimoto, Ion Stoica, Yi Sun

    Abstract: As ML models have increased in capabilities and accuracy, so has the complexity of their deployments. Increasingly, ML model consumers are turning to service providers to serve the ML models in the ML-as-a-service (MLaaS) paradigm. As MLaaS proliferates, a critical requirement emerges: how can model consumers verify that the correct predictions were served, in the face of malicious, lazy, or buggy… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

  48. arXiv:2210.08248  [pdf, other

    cs.LG

    A Closer Look at the Calibration of Differentially Private Learners

    Authors: Hanlin Zhang, Xuechen Li, Prithviraj Sen, Salim Roukos, Tatsunori Hashimoto

    Abstract: We systematically study the calibration of classifiers trained with differentially private stochastic gradient descent (DP-SGD) and observe miscalibration across a wide range of vision and language tasks. Our analysis identifies per-example gradient clipping in DP-SGD as a major cause of miscalibration, and we show that existing approaches for improving calibration with differential privacy only p… ▽ More

    Submitted 14 November, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 Workshop on Algorithmic Fairness through the Lens of Causality and Privacy

  49. arXiv:2209.06235  [pdf, other

    cs.LG stat.ML

    Improving Self-Supervised Learning by Characterizing Idealized Representations

    Authors: Yann Dubois, Tatsunori Hashimoto, Stefano Ermon, Percy Liang

    Abstract: Despite the empirical successes of self-supervised learning (SSL) methods, it is unclear what characteristics of their representations lead to high downstream accuracies. In this work, we characterize properties that SSL representations should ideally satisfy. Specifically, we prove necessary and sufficient conditions such that for any task invariant to given data augmentations, desired probes (e.… ▽ More

    Submitted 12 December, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022

  50. arXiv:2209.03942  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Data Feedback Loops: Model-driven Amplification of Dataset Biases

    Authors: Rohan Taori, Tatsunori B. Hashimoto

    Abstract: Datasets scraped from the internet have been critical to the successes of large-scale machine learning. Yet, this very success puts the utility of future internet-derived datasets at potential risk, as model outputs begin to replace human annotations as a source of supervision. In this work, we first formalize a system where interactions with one model are recorded as history and scraped as trai… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.