Skip to main content

Showing 1–50 of 196 results for author: McAuley, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10946  [pdf, other

    cs.AI

    Large Language Model Driven Recommendation

    Authors: Anton Korikov, Scott Sanner, Yashar Deldjoo, Zhankui He, Julian McAuley, Arnau Ramisa, Rene Vidal, Mahesh Sathiamoorthy, Atoosa Kasrizadeh, Silvia Milano, Francesco Ricci

    Abstract: While previous chapters focused on recommendation systems (RSs) based on standardized, non-verbal user feedback such as purchases, views, and clicks -- the advent of LLMs has unlocked the use of natural language (NL) interactions for recommendation. This chapter discusses how LLMs' abilities for general NL reasoning present novel opportunities to build highly personalized RSs -- which can effectiv… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2408.05094  [pdf, other

    cs.CL

    Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts

    Authors: Tingchen Fu, Yupeng Hou, Julian McAuley, Rui Yan

    Abstract: The task of multi-objective alignment aims at balancing and controlling the different alignment objectives (e.g., helpfulness, harmlessness and honesty) of large language models to meet the personalized requirements of different users. However, previous methods tend to train multiple models to deal with various user preferences, with the number of trained models growing linearly with the number of… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

  3. arXiv:2408.04668  [pdf, other

    cs.CL cs.AI cs.IR

    Forecasting Live Chat Intent from Browsing History

    Authors: Se-eun Yoon, Ahmad Bin Rabiah, Zaid Alibadi, Surya Kallumadi, Julian McAuley

    Abstract: Customers reach out to online live chat agents with various intents, such as asking about product details or requesting a return. In this paper, we propose the problem of predicting user intent from browsing history and address it through a two-stage approach. The first stage classifies a user's browsing history into high-level intent categories. Here, we represent each browsing history as a text… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: CIKM 2024

  4. Calibration-Disentangled Learning and Relevance-Prioritized Reranking for Calibrated Sequential Recommendation

    Authors: Hyunsik Jeon, Se-eun Yoon, Julian McAuley

    Abstract: Calibrated recommendation, which aims to maintain personalized proportions of categories within recommendations, is crucial in practical scenarios since it enhances user satisfaction by reflecting diverse interests. However, achieving calibration in a sequential setting (i.e., calibrated sequential recommendation) is challenging due to the need to adapt to users' evolving preferences. Previous met… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: Published at CIKM '24 as a full research paper

  5. arXiv:2407.20454  [pdf, other

    cs.LG cs.CL

    CoMMIT: Coordinated Instruction Tuning for Multimodal Large Language Models

    Authors: Junda Wu, Xintong Li, Tong Yu, Yu Wang, Xiang Chen, Jiuxiang Gu, Lina Yao, Jingbo Shang, Julian McAuley

    Abstract: Instruction tuning in multimodal large language models (MLLMs) aims to smoothly integrate a backbone LLM with a pre-trained feature encoder for downstream tasks. The major challenge is how to efficiently find the synergy through cooperative learning where LLMs adapt their reasoning abilities in downstream tasks while feature encoders adjust their encoding to provide more relevant modal information… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 9 pages

  6. arXiv:2407.20445  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Futga: Towards Fine-grained Music Understanding through Temporally-enhanced Generative Augmentation

    Authors: Junda Wu, Zachary Novack, Amit Namburi, Jiaheng Dai, Hao-Wen Dong, Zhouhang Xie, Carol Chen, Julian McAuley

    Abstract: Existing music captioning methods are limited to generating concise global descriptions of short music clips, which fail to capture fine-grained musical characteristics and time-aware musical changes. To address these limitations, we propose FUTGA, a model equipped with fined-grained music understanding capabilities through learning from generative augmentation with temporal compositions. We lever… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 6 pages

  7. arXiv:2406.17260  [pdf, other

    cs.CL

    Mitigating Hallucination in Fictional Character Role-Play

    Authors: Nafis Sadeq, Zhouhang Xie, Byungkyu Kang, Prarit Lamba, Xiang Gao, Julian McAuley

    Abstract: Role-playing has wide-ranging applications in customer support, embodied agents, computational social science, etc. The influence of parametric world knowledge of large language models (LLMs) often causes role-playing characters to act out of character and hallucinate about things outside the scope of their knowledge. In this work, we focus on the evaluation and mitigation of hallucination in fict… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  8. arXiv:2406.02048  [pdf, other

    cs.IR

    Auto-Encoding or Auto-Regression? A Reality Check on Causality of Self-Attention-Based Sequential Recommenders

    Authors: Yueqi Wang, Zhankui He, Zhenrui Yue, Julian McAuley, Dong Wang

    Abstract: The comparison between Auto-Encoding (AE) and Auto-Regression (AR) has become an increasingly important topic with recent advances in sequential recommendation. At the heart of this discussion lies the comparison of BERT4Rec and SASRec, which serve as representative AE and AR models for self-attentive sequential recommenders. Yet the conclusion of this debate remains uncertain due to: (1) the lack… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  9. arXiv:2405.20289  [pdf, other

    cs.SD cs.AI cs.LG

    DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation

    Authors: Zachary Novack, Julian McAuley, Taylor Berg-Kirkpatrick, Nicholas Bryan

    Abstract: Controllable music generation methods are critical for human-centered AI-based music creation, but are currently limited by speed, quality, and control design trade-offs. Diffusion Inference-Time T-optimization (DITTO), in particular, offers state-of-the-art results, but is over 10x slower than real-time, limiting practical use. We propose Distilled Diffusion Inference-Time T -Optimization (or DIT… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  10. arXiv:2405.16871  [pdf, other

    cs.IR

    Multi-Behavior Generative Recommendation

    Authors: Zihan Liu, Yupeng Hou, Julian McAuley

    Abstract: Multi-behavior sequential recommendation (MBSR) aims to incorporate behavior types of interactions for better recommendations. Existing approaches focus on the next-item prediction objective, neglecting the value of integrating the target behavior type into the learning objective. In this paper, we propose MBGen, a novel Multi-Behavior sequential Generative recommendation framework. We formulate t… ▽ More

    Submitted 29 July, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Camera ready; accepted by CIKM 2024

  11. arXiv:2405.16720  [pdf, other

    cs.CL

    Large Scale Knowledge Washing

    Authors: Yu Wang, Ruihan Wu, Zexue He, Xiusi Chen, Julian McAuley

    Abstract: Large language models show impressive abilities in memorizing world knowledge, which leads to concerns regarding memorization of private information, toxic or sensitive knowledge, and copyrighted content. We introduce the problem of Large Scale Knowledge Washing, focusing on unlearning an extensive amount of factual knowledge. Previous unlearning methods usually define the reverse loss and update… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  12. arXiv:2405.14142  [pdf, other

    cs.CV cs.AI

    Imagery as Inquiry: Exploring A Multimodal Dataset for Conversational Recommendation

    Authors: Se-eun Yoon, Hyunsik Jeon, Julian McAuley

    Abstract: We introduce a multimodal dataset where users express preferences through images. These images encompass a broad spectrum of visual expressions ranging from landscapes to artistic depictions. Users request recommendations for books or music that evoke similar feelings to those captured in the images, and recommendations are endorsed by the community through upvotes. This dataset supports two recom… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  13. arXiv:2405.12119  [pdf, other

    cs.IR cs.AI cs.CL

    Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation

    Authors: Zhankui He, Zhouhang Xie, Harald Steck, Dawen Liang, Rahul Jha, Nathan Kallus, Julian McAuley

    Abstract: Large language models (LLMs) are revolutionizing conversational recommender systems by adeptly indexing item content, understanding complex conversational contexts, and generating relevant item titles. However, controlling the distribution of recommended items remains a challenge. This leads to suboptimal performance due to the failure to capture rapidly changing data distributions, such as item p… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  14. arXiv:2405.01769  [pdf, other

    cs.CL

    A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law

    Authors: Zhiyu Zoey Chen, Jing Ma, Xinlu Zhang, Nan Hao, An Yan, Armineh Nourbakhsh, Xianjun Yang, Julian McAuley, Linda Petzold, William Yang Wang

    Abstract: In the fast-evolving domain of artificial intelligence, large language models (LLMs) such as GPT-3 and GPT-4 are revolutionizing the landscapes of finance, healthcare, and law: domains characterized by their reliance on professional expertise, challenging data acquisition, high-stakes, and stringent regulatory compliance. This survey offers a detailed exploration of the methodologies, applications… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 35 pages, 6 figures

  15. arXiv:2404.16375  [pdf, other

    cs.CV cs.AI cs.CL

    List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

    Authors: An Yan, Zhengyuan Yang, Junda Wu, Wanrong Zhu, Jianwei Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Julian McAuley, Jianfeng Gao, Lijuan Wang

    Abstract: Set-of-Mark (SoM) Prompting unleashes the visual grounding capability of GPT-4V, by enabling the model to associate visual objects with tags inserted on the image. These tags, marked with alphanumerics, can be indexed via text tokens for easy reference. Despite the extraordinary performance from GPT-4V, we observe that other Multimodal Large Language Models (MLLMs) struggle to understand these vis… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Preprint

  16. arXiv:2404.15676  [pdf, other

    cs.CL cs.AI

    Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs

    Authors: Yu Xia, Rui Wang, Xu Liu, Mingyan Li, Tong Yu, Xiang Chen, Julian McAuley, Shuai Li

    Abstract: Chain-of-Thought (CoT) has been a widely adopted prompting method, eliciting impressive reasoning abilities of Large Language Models (LLMs). Inspired by the sequential thought structure of CoT, a number of Chain-of-X (CoX) methods have been developed to address various challenges across diverse domains and tasks involving LLMs. In this paper, we provide a comprehensive survey of Chain-of-X methods… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  17. arXiv:2404.00579  [pdf, other

    cs.IR cs.AI

    A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)

    Authors: Yashar Deldjoo, Zhankui He, Julian McAuley, Anton Korikov, Scott Sanner, Arnau Ramisa, René Vidal, Maheswaran Sathiamoorthy, Atoosa Kasirzadeh, Silvia Milano

    Abstract: Traditional recommender systems (RS) typically use user-item rating histories as their main data source. However, deep generative models now have the capability to model and sample from complex data distributions, including user-item interactions, text, images, and videos, enabling novel recommendation tasks. This comprehensive, multidisciplinary survey connects key advancements in RS using Genera… ▽ More

    Submitted 4 July, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: This survey accompanies a tutorial presented at ACM KDD'24

  18. arXiv:2403.15737  [pdf, other

    cs.CL

    Few-shot Dialogue Strategy Learning for Motivational Interviewing via Inductive Reasoning

    Authors: Zhouhang Xie, Bodhisattwa Prasad Majumder, Mengjie Zhao, Yoshinori Maeda, Keiichi Yamada, Hiromi Wakaki, Julian McAuley

    Abstract: We consider the task of building a dialogue system that can motivate users to adopt positive lifestyle changes: Motivational Interviewing. Addressing such a task requires a system that can infer \textit{how} to motivate a user effectively. We propose DIIT, a framework that is capable of learning and applying conversation strategies in the form of natural language inductive rules from expert demons… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  19. arXiv:2403.09738  [pdf, other

    cs.CL cs.AI cs.IR

    Evaluating Large Language Models as Generative User Simulators for Conversational Recommendation

    Authors: Se-eun Yoon, Zhankui He, Jessica Maria Echterhoff, Julian McAuley

    Abstract: Synthetic users are cost-effective proxies for real users in the evaluation of conversational recommender systems. Large language models show promise in simulating human-like behavior, raising the question of their ability to represent a diverse population of users. We introduce a new protocol to measure the degree to which language models can accurately emulate human behavior in conversational re… ▽ More

    Submitted 25 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  20. arXiv:2403.09606  [pdf, ps, other

    cs.CL cs.AI

    Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey

    Authors: Xiaoyu Liu, Paiheng Xu, Junda Wu, Jiaxin Yuan, Yifan Yang, Yuhang Zhou, Fuxiao Liu, Tianrui Guan, Haoliang Wang, Tong Yu, Julian McAuley, Wei Ai, Furong Huang

    Abstract: Causal inference has shown potential in enhancing the predictive accuracy, fairness, robustness, and explainability of Natural Language Processing (NLP) models by capturing causal relationships among variables. The emergence of generative Large Language Models (LLMs) has significantly impacted various NLP domains, particularly through their advanced reasoning capabilities. This survey focuses on e… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  21. arXiv:2403.06447  [pdf, other

    cs.IR cs.AI

    CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation

    Authors: Junda Wu, Cheng-Chun Chang, Tong Yu, Zhankui He, Jianing Wang, Yupeng Hou, Julian McAuley

    Abstract: The long-tail recommendation is a challenging task for traditional recommender systems, due to data sparsity and data imbalance issues. The recent development of large language models (LLMs) has shown their abilities in complex reasoning, which can help to deduce users' preferences based on very few previous interactions. However, since most LLM-based systems rely on items' semantic meaning as the… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 11 pages

  22. arXiv:2403.03952  [pdf, other

    cs.IR

    Bridging Language and Items for Retrieval and Recommendation

    Authors: Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, Julian McAuley

    Abstract: This paper introduces BLaIR, a series of pretrained sentence embedding models specialized for recommendation scenarios. BLaIR is trained to learn correlations between item metadata and potential natural language context, which is useful for retrieving and recommending items. To pretrain BLaIR, we collect Amazon Reviews 2023, a new dataset comprising over 570 million reviews and 48 million items fr… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  23. arXiv:2403.00811  [pdf, other

    cs.AI cs.CL

    Cognitive Bias in High-Stakes Decision-Making with LLMs

    Authors: Jessica Echterhoff, Yao Liu, Abeer Alessa, Julian McAuley, Zexue He

    Abstract: Large language models (LLMs) offer significant potential as tools to support an expanding range of decision-making tasks. Given their training on human (created) data, LLMs have been shown to inherit societal biases against protected groups, as well as be subject to bias functionally resembling cognitive bias. Human-like bias can impede fair and explainable decisions made with LLM assistance. Our… ▽ More

    Submitted 19 July, 2024; v1 submitted 24 February, 2024; originally announced March 2024.

  24. arXiv:2402.19173  [pdf, other

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  25. arXiv:2402.19009  [pdf, other

    cs.LG cs.AI

    Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-Decoding

    Authors: Guangyi Liu, Yu Wang, Zeyu Feng, Qiyu Wu, Liping Tang, Yuan Gao, Zhen Li, Shuguang Cui, Julian McAuley, Zichao Yang, Eric P. Xing, Zhiting Hu

    Abstract: The vast applications of deep generative models are anchored in three core capabilities -- generating new instances, reconstructing inputs, and learning compact representations -- across various data types, such as discrete text/protein sequences and continuous images. Existing model families, like variational autoencoders (VAEs), generative adversarial networks (GANs), autoregressive models, and… ▽ More

    Submitted 5 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: ICML 2024 camera-ready. Code is available at https://github.com/guangyliu/EDDPM

  26. arXiv:2402.15591  [pdf, other

    cs.IR cs.AI

    RecWizard: A Toolkit for Conversational Recommendation with Modular, Portable Models and Interactive User Interface

    Authors: Zeyuan Zhang, Tanmay Laud, Zihang He, Xiaojie Chen, Xinshuang Liu, Zhouhang Xie, Julian McAuley, Zhankui He

    Abstract: We present a new Python toolkit called RecWizard for Conversational Recommender Systems (CRS). RecWizard offers support for development of models and interactive user interface, drawing from the best practices of the Huggingface ecosystems. CRS with RecWizard are modular, portable, interactive and Large Language Models (LLMs)-friendly, to streamline the learning process and reduce the additional e… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: AAAI'24 Demo Track

  27. arXiv:2402.13449  [pdf, other

    cs.CL

    CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory

    Authors: Zexue He, Leonid Karlinsky, Donghyun Kim, Julian McAuley, Dmitry Krotov, Rogerio Feris

    Abstract: Large Language Models (LLMs) struggle to handle long input sequences due to high memory and runtime costs. Memory-augmented models have emerged as a promising solution to this problem, but current methods are hindered by limited memory capacity and require costly re-training to integrate with a new LLM. In this work, we introduce an associative memory module which can be coupled to any pre-trained… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  28. arXiv:2402.12079  [pdf, other

    cs.CV cs.CL

    LVCHAT: Facilitating Long Video Comprehension

    Authors: Yu Wang, Zeyuan Zhang, Julian McAuley, Zexue He

    Abstract: Enabling large language models (LLMs) to read videos is vital for multimodal LLMs. Existing works show promise on short videos whereas long video (longer than e.g.~1 minute) comprehension remains challenging. The major problem lies in the over-compression of videos, i.e., the encoded video representations are not enough to represent the whole video. To address this issue, we propose Long Video Cha… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 17 pages; 8 figures

  29. arXiv:2402.11558  [pdf, other

    cs.LG

    A Temporally Disentangled Contrastive Diffusion Model for Spatiotemporal Imputation

    Authors: Yakun Chen, Kaize Shi, Zhangkai Wu, Juan Chen, Xianzhi Wang, Julian McAuley, Guandong Xu, Shui Yu

    Abstract: Spatiotemporal data analysis is pivotal across various domains, such as transportation, meteorology, and healthcare. The data collected in real-world scenarios are often incomplete due to device malfunctions and network errors. Spatiotemporal imputation aims to predict missing values by exploiting the spatial and temporal dependencies in the observed data. Traditional imputation approaches based o… ▽ More

    Submitted 22 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  30. arXiv:2402.11143  [pdf, other

    cs.IR

    Foundation Models for Recommender Systems: A Survey and New Perspectives

    Authors: Chengkai Huang, Tong Yu, Kaige Xie, Shuai Zhang, Lina Yao, Julian McAuley

    Abstract: Recently, Foundation Models (FMs), with their extensive knowledge bases and complex architectures, have offered unique opportunities within the realm of recommender systems (RSs). In this paper, we attempt to thoroughly examine FM-based recommendation systems (FM4RecSys). We start by reviewing the research background of FM4RecSys. Then, we provide a systematic taxonomy of existing FM4RecSys resear… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  31. arXiv:2402.09668  [pdf, other

    cs.LG cs.AI cs.CL

    How to Train Data-Efficient LLMs

    Authors: Noveen Sachdeva, Benjamin Coleman, Wang-Cheng Kang, Jianmo Ni, Lichan Hong, Ed H. Chi, James Caverlee, Julian McAuley, Derek Zhiyuan Cheng

    Abstract: The training of large language models (LLMs) is expensive. In this paper, we study data-efficient approaches for pre-training LLMs, i.e., techniques that aim to optimize the Pareto frontier of model quality and training resource/data consumption. We seek to understand the tradeoffs associated with data selection routines based on (i) expensive-to-compute data-quality estimates, and (ii) maximizati… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Under review. 44 pages, 30 figures

  32. arXiv:2402.08785  [pdf, other

    cs.CL

    InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment

    Authors: Jianing Wang, Junda Wu, Yupeng Hou, Yao Liu, Ming Gao, Julian McAuley

    Abstract: Do current large language models (LLMs) better solve graph reasoning and generation tasks with parameter updates? In this paper, we propose InstructGraph, a framework that empowers LLMs with the abilities of graph reasoning and generation by instruction tuning and preference alignment. Specifically, we first propose a structured format verbalizer to unify all graph data into a universal code-like… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 19 pages

  33. arXiv:2402.04624  [pdf, other

    cs.CL

    MEMORYLLM: Towards Self-Updatable Large Language Models

    Authors: Yu Wang, Yifan Gao, Xiusi Chen, Haoming Jiang, Shiyang Li, Jingfeng Yang, Qingyu Yin, Zheng Li, Xian Li, Bing Yin, Jingbo Shang, Julian McAuley

    Abstract: Existing Large Language Models (LLMs) usually remain static after deployment, which might make it hard to inject new knowledge into the model. We aim to build models containing a considerable portion of self-updatable parameters, enabling the model to integrate new knowledge effectively and efficiently. To this end, we introduce MEMORYLLM, a model that comprises a transformer and a fixed-size memo… ▽ More

    Submitted 26 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 13 pages, 9 figures

  34. arXiv:2402.03481  [pdf, other

    cs.IR cs.LG cs.SI

    FINEST: Stabilizing Recommendations by Rank-Preserving Fine-Tuning

    Authors: Sejoon Oh, Berk Ustun, Julian McAuley, Srijan Kumar

    Abstract: Modern recommender systems may output considerably different recommendations due to small perturbations in the training data. Changes in the data from a single user will alter the recommendations as well as the recommendations of other users. In applications like healthcare, housing, and finance, this sensitivity can have adverse effects on user experience. We propose a method to stabilize a given… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted at the 6th FAccTRec Workshop on Responsible Recommendation @ ACM RecSys 2023

  35. arXiv:2401.12553  [pdf, other

    cs.IR

    InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization

    Authors: Jiarui Jin, Zexue He, Mengyue Yang, Weinan Zhang, Yong Yu, Jun Wang, Julian McAuley

    Abstract: Ranking items regarding individual user interests is a core technique of multiple downstream tasks such as recommender systems. Learning such a personalized ranker typically relies on the implicit feedback from users' past click-through behaviors. However, collected feedback is biased toward previously highly-ranked items and directly learning from it would result in a "rich-get-richer" phenomenon… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: WWW 2024

  36. arXiv:2401.12179  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    DITTO: Diffusion Inference-Time T-Optimization for Music Generation

    Authors: Zachary Novack, Julian McAuley, Taylor Berg-Kirkpatrick, Nicholas J. Bryan

    Abstract: We propose Diffusion Inference-Time T-Optimization (DITTO), a general-purpose frame-work for controlling pre-trained text-to-music diffusion models at inference-time via optimizing initial noise latents. Our method can be used to optimize through any differentiable feature matching loss to achieve a target (stylized) output and leverages gradient checkpointing for memory efficiency. We demonstrate… ▽ More

    Submitted 3 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Oral at ICML 2024

  37. arXiv:2312.11554  [pdf, other

    cs.CL cs.AI

    Deciphering Compatibility Relationships with Textual Descriptions via Extraction and Explanation

    Authors: Yu Wang, Zexue He, Zhankui He, Hao Xu, Julian McAuley

    Abstract: Understanding and accurately explaining compatibility relationships between fashion items is a challenging problem in the burgeoning domain of AI-driven outfit recommendations. Present models, while making strides in this area, still occasionally fall short, offering explanations that can be elementary and repetitive. This work aims to address these shortcomings by introducing the Pair Fashion Exp… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Journal ref: AAAI 2024

  38. arXiv:2311.12257  [pdf, other

    cs.SD cs.IR cs.MM eess.AS

    Equipping Pretrained Unconditional Music Transformers with Instrument and Genre Controls

    Authors: Weihan Xu, Julian McAuley, Shlomo Dubnov, Hao-Wen Dong

    Abstract: The ''pretraining-and-finetuning'' paradigm has become a norm for training domain-specific models in natural language processing and computer vision. In this work, we aim to examine this paradigm for symbolic music generation through leveraging the largest ever symbolic music dataset sourced from the MuseScore forum. We first pretrain a large unconditional transformer model using 1.5 million songs… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  39. arXiv:2311.07562  [pdf, other

    cs.CV cs.AI

    GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

    Authors: An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang

    Abstract: We present MM-Navigator, a GPT-4V-based agent for the smartphone graphical user interface (GUI) navigation task. MM-Navigator can interact with a smartphone screen as human users, and determine subsequent actions to fulfill given instructions. Our findings demonstrate that large multimodal models (LMMs), specifically GPT-4V, excel in zero-shot GUI navigation through its advanced screen interpretat… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Work in progress

  40. arXiv:2310.16639  [pdf, other

    cs.CV cs.LG

    Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving

    Authors: Jessica Echterhoff, An Yan, Kyungtae Han, Amr Abdelraouf, Rohit Gupta, Julian McAuley

    Abstract: Concept bottleneck models have been successfully used for explainable machine learning by encoding information within the model with a set of human-defined concepts. In the context of human-assisted or autonomous driving, explainability models can help user acceptance and understanding of decisions made by the autonomous vehicle, which can be used to rationalize and explain driver or vehicle behav… ▽ More

    Submitted 26 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  41. arXiv:2310.15433  [pdf, other

    cs.LG cs.IR

    Off-Policy Evaluation for Large Action Spaces via Policy Convolution

    Authors: Noveen Sachdeva, Lequn Wang, Dawen Liang, Nathan Kallus, Julian McAuley

    Abstract: Developing accurate off-policy estimators is crucial for both evaluating and optimizing for new policies. The main challenge in off-policy estimation is the distribution shift between the logging policy that generates data and the target policy that we aim to evaluate. Typically, techniques for correcting distribution shift involve some form of importance sampling. This approach results in unbiase… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Under review. 36 pages, 31 figures

  42. arXiv:2310.14633  [pdf, other

    cs.CL cs.LG

    Extending Input Contexts of Language Models through Training on Segmented Sequences

    Authors: Petros Karypis, Julian McAuley, George Karypis

    Abstract: Effectively training language models on long inputs poses many technical challenges. As a cost consideration, languages models are pretrained on a fixed sequence length before being adapted to longer sequences. We explore various methods for adapting models to longer inputs by training on segmented sequences and an interpolation-based method for extending absolute positional embeddings. We develop… ▽ More

    Submitted 19 June, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 11 pages, 3 figures

  43. arXiv:2310.14088  [pdf, other

    cs.CL

    MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation

    Authors: Zexue He, Yu Wang, An Yan, Yao Liu, Eric Y. Chang, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

    Abstract: Curated datasets for healthcare are often limited due to the need of human annotations from experts. In this paper, we present MedEval, a multi-level, multi-task, and multi-domain medical benchmark to facilitate the development of language models for healthcare. MedEval is comprehensive and consists of data from several healthcare systems and spans 35 human body regions from 8 examination modaliti… ▽ More

    Submitted 14 November, 2023; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023. Camera-ready version: updated IRB, added more evaluation results on LLMs such as GPT4, LLaMa2, and LLaMa2-chat

  44. arXiv:2310.10772  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Unsupervised Lead Sheet Generation via Semantic Compression

    Authors: Zachary Novack, Nikita Srivatsan, Taylor Berg-Kirkpatrick, Julian McAuley

    Abstract: Lead sheets have become commonplace in generative music research, being used as an initial compressed representation for downstream tasks like multitrack music generation and automatic arrangement. Despite this, researchers have often fallen back on deterministic reduction methods (such as the skyline algorithm) to generate lead sheets when seeking paired lead sheets and full scores, with little a… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  45. arXiv:2310.09983  [pdf, other

    cs.LG cs.AI cs.CL cs.IR

    Farzi Data: Autoregressive Data Distillation

    Authors: Noveen Sachdeva, Zexue He, Wang-Cheng Kang, Jianmo Ni, Derek Zhiyuan Cheng, Julian McAuley

    Abstract: We study data distillation for auto-regressive machine learning tasks, where the input and output have a strict left-to-right causal structure. More specifically, we propose Farzi, which summarizes an event sequence dataset into a small number of synthetic sequences -- Farzi Data -- which are optimized to maintain (if not improve) model performance compared to training on the full dataset. Under t… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Under review. 23 pages, 9 figures

  46. arXiv:2310.09653  [pdf, other

    cs.SD cs.AI eess.AS

    SelfVC: Voice Conversion With Iterative Refinement using Self Transformations

    Authors: Paarth Neekhara, Shehzeen Hussain, Rafael Valle, Boris Ginsburg, Rishabh Ranjan, Shlomo Dubnov, Farinaz Koushanfar, Julian McAuley

    Abstract: We propose SelfVC, a training strategy to iteratively improve a voice conversion model with self-synthesized examples. Previous efforts on voice conversion focus on factorizing speech into explicitly disentangled representations that separately encode speaker characteristics and linguistic content. However, disentangling speech representations to capture such attributes using task-specific loss te… ▽ More

    Submitted 3 May, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted at ICML 2024

  47. arXiv:2310.09233  [pdf, other

    cs.IR cs.CL

    AgentCF: Collaborative Learning with Autonomous Language Agents for Recommender Systems

    Authors: Junjie Zhang, Yupeng Hou, Ruobing Xie, Wenqi Sun, Julian McAuley, Wayne Xin Zhao, Leyu Lin, Ji-Rong Wen

    Abstract: Recently, there has been an emergence of employing LLM-powered agents as believable human proxies, based on their remarkable decision-making capability. However, existing studies mainly focus on simulating human dialogue. Human non-verbal behaviors, such as item clicking in recommender systems, although implicitly exhibiting user preferences and could enhance the modeling of users, have not been d… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  48. arXiv:2310.03182  [pdf, other

    cs.CV cs.CL cs.LG

    Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

    Authors: An Yan, Yu Wang, Yiwu Zhong, Zexue He, Petros Karypis, Zihan Wang, Chengyu Dong, Amilcare Gentili, Chun-Nan Hsu, Jingbo Shang, Julian McAuley

    Abstract: Medical image classification is a critical problem for healthcare, with the potential to alleviate the workload of doctors and facilitate diagnoses of patients. However, two challenges arise when deploying deep learning models to real-world healthcare applications. First, neural models tend to learn spurious correlations instead of desired features, which could fall short when generalizing to new… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, 12 figures

  49. arXiv:2310.02367  [pdf, other

    cs.IR

    Linear Recurrent Units for Sequential Recommendation

    Authors: Zhenrui Yue, Yueqi Wang, Zhankui He, Huimin Zeng, Julian McAuley, Dong Wang

    Abstract: State-of-the-art sequential recommendation relies heavily on self-attention-based recommender models. Yet such models are computationally expensive and often too slow for real-time recommendation. Furthermore, the self-attention operation is performed at a sequence-level, thereby making low-cost incremental inference challenging. Inspired by recent advances in efficient language modeling, we propo… ▽ More

    Submitted 8 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted to WSDM 2024

  50. arXiv:2310.02263  [pdf, other

    cs.CL cs.AI cs.LG

    Automatic Pair Construction for Contrastive Post-training

    Authors: Canwen Xu, Corby Rosset, Ethan C. Chau, Luciano Del Corro, Shweti Mahajan, Julian McAuley, Jennifer Neville, Ahmed Hassan Awadallah, Nikhil Rao

    Abstract: Alignment serves as an important step to steer large language models (LLMs) towards human preferences. In this paper, we propose an automatic way to construct contrastive data for LLM, using preference pairs from multiple models of varying strengths (e.g., InstructGPT, ChatGPT and GPT-4). We compare the contrastive techniques of SLiC and DPO to SFT baselines and find that DPO provides a step-funct… ▽ More

    Submitted 2 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: NAACL 2024 (Findings)