Showing 1–21 of 21 results for author: Hirschberg, J

Searching in archive cs.
  1. arXiv:2407.21315  [pdf, other]

    cs.CL cs.AI

    Beyond Silent Letters: Amplifying LLMs in Emotion Recognition with Vocal Nuances

    Authors: Zehui Wu, Ziwei Gong, Lin Ai, Pengyuan Shi, Kaan Donbekci, Julia Hirschberg

    Abstract: This paper introduces a novel approach to emotion detection in speech using Large Language Models (LLMs). We address the limitation of LLMs in processing audio inputs by translating speech characteristics into natural language descriptions. Our method integrates these descriptions into text prompts, enabling LLMs to perform multimodal emotion analysis without architectural modifications. We evalua…

    Submitted 31 July, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

  2. arXiv:2406.17982  [pdf, other]

    cs.CL

    EDEN: Empathetic Dialogues for English learning

    Authors: Li Siyan, Teresa Shao, Zhou Yu, Julia Hirschberg

    Abstract: Dialogue systems have been used as conversation partners in English learning, but few have studied whether these systems improve learning outcomes. Student passion and perseverance, or grit, has been associated with language learning success. Recent work establishes that as students perceive their English teachers to be more supportive, their grit improves. Hypothesizing that the same pattern appl…

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.12263  [pdf, other]

    cs.CL

    Defending Against Social Engineering Attacks in the Age of LLMs

    Authors: Lin Ai, Tharindu Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg

    Abstract: The proliferation of Large Language Models (LLMs) poses challenges in detecting and mitigating digital deception, as these models can emulate human conversational patterns and facilitate chat-based social engineering (CSE) attacks. This study investigates the dual capabilities of LLMs as both facilitators and defenders against CSE threats. We develop a novel dataset, SEConvo, simulating CSE scenar…

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2406.02826  [pdf, other]

    cs.CL cs.LG

    Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes

    Authors: Yu-Wen Chen, Julia Hirschberg

    Abstract: Summarizing medical conversations poses unique challenges due to the specialized domain and the difficulty of collecting in-domain training data. In this study, we investigate the performance of state-of-the-art doctor-patient conversation generative summarization models on the out-of-domain data. We divide the summarization model of doctor-patient conversation into two configurations: (1) a gener…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Clinical NLP Workshop 2024

  5. arXiv:2404.17991  [pdf, other]

    cs.CL

    Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension

    Authors: Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg

    Abstract: Machine Reading Comprehension (MRC) poses a significant challenge in the field of Natural Language Processing (NLP). While mainstream MRC methods predominantly leverage extractive strategies using encoder-only models such as BERT, generative approaches face the issue of out-of-control generation -- a critical problem where answers generated are often incorrect, irrelevant, or unfaithful to the sou…

    Submitted 12 June, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2403.04771

  6. arXiv:2404.14616  [pdf, other]

    cs.SI

    What Makes A Video Radicalizing? Identifying Sources of Influence in QAnon Videos

    Authors: Lin Ai, Yu-Wen Chen, Yuwen Yu, Seoyoung Kweon, Julia Hirschberg, Sarah Ita Levitan

    Abstract: In recent years, radicalization is being increasingly attempted on video-sharing platforms. Previous studies have been proposed to identify online radicalization using generic social context analysis, without taking into account comprehensive viewer traits and how those can affect viewers' perception of radicalizing content. To address the challenge, we examine QAnon, a conspiracy-based radicalizi…

    Submitted 22 April, 2024; originally announced April 2024.

  7. arXiv:2404.13764  [pdf, other]

    cs.CL

    Using Adaptive Empathetic Responses for Teaching English

    Authors: Li Siyan, Teresa Shao, Zhou Yu, Julia Hirschberg

    Abstract: Existing English-teaching chatbots rarely incorporate empathy explicitly in their feedback, but empathetic feedback could help keep students engaged and reduce learner anxiety. Toward this end, we propose the task of negative emotion detection via audio, for recognizing empathetic feedback opportunities in language learning. We then build the first spoken English-teaching chatbot with adaptive, em…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted to BEA workshop at NAACL 2024

  8. arXiv:2403.04771  [pdf, other]

    cs.CL

    QASE Enhanced PLMs: Improved Control in Text Generation for MRC

    Authors: Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg

    Abstract: To address the challenges of out-of-control generation in generative models for machine reading comprehension (MRC), we introduce the Question-Attended Span Extraction (QASE) module. Integrated during the fine-tuning of pre-trained generative language models (PLMs), QASE enables these PLMs to match SOTA extractive methods and outperform leading LLMs like GPT-4 in MRC tasks, without significant inc…

    Submitted 26 February, 2024; originally announced March 2024.

  9. arXiv:2311.07703  [pdf, other]

    cs.CL cs.SD eess.AS

    Measuring Entrainment in Spontaneous Code-switched Speech

    Authors: Debasmita Bhattacharya, Siying Ding, Alayna Nguyen, Julia Hirschberg

    Abstract: It is well-known that speakers who entrain to one another have more successful conversations than those who do not. Previous research has shown that interlocutors entrain on linguistic features in both written and spoken monolingual domains. More recent work on code-switched communication has also shown preliminary evidence of entrainment on certain aspects of code-switching (CSW). However, such s…

    Submitted 26 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Edits: camera-ready manuscript for NAACL 2024

  10. arXiv:2309.01164  [pdf, other]

    eess.AS cs.LG cs.SD

    Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

    Authors: Yu-Wen Chen, Julia Hirschberg, Yu Tsao

    Abstract: Speech emotion recognition (SER) often experiences reduced performance due to background noise. In addition, making a prediction on signals with only background noise could undermine user trust in the system. In this study, we propose a Noise Robust Speech Emotion Recognition system, NRSER. NRSER employs speech enhancement (SE) to effectively reduce the noise in input signals. Then, the signal-to-…

    Submitted 3 September, 2023; originally announced September 2023.

  11. arXiv:2308.12490  [pdf, other]

    cs.CL cs.SD eess.AS

    MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open Response Scenarios

    Authors: Yu-Wen Chen, Zhou Yu, Julia Hirschberg

    Abstract: Pronunciation assessment models designed for open response scenarios enable users to practice language skills in a manner similar to real-life communication. However, previous open-response pronunciation assessment models have predominantly focused on a single pronunciation task, such as sentence-level accuracy, rather than offering a comprehensive assessment in various aspects. We propose MultiPA…

    Submitted 4 June, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2024

  12. arXiv:2308.00264  [pdf, other]

    cs.CL cs.AI cs.LG cs.MM

    Multimodal Multi-loss Fusion Network for Sentiment Analysis

    Authors: Zehui Wu, Ziwei Gong, Jaywon Koo, Julia Hirschberg

    Abstract: This paper investigates the optimal selection and fusion of feature encoders across multiple modalities and combines these in one neural network to improve sentiment detection. We compare different fusion methods and examine the impact of multi-loss training within the multi-modality fusion network, identifying surprisingly important findings relating to subnet performance. We have also found that…

    Submitted 2 June, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: First two authors contributed equally to the paper

  13. arXiv:2212.10557  [pdf, other]

    cs.CL

    DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines

    Authors: Prakhar Gupta, Yang Liu, Di Jin, Behnam Hedayatnia, Spandana Gella, Sijia Liu, Patrick Lange, Julia Hirschberg, Dilek Hakkani-Tur

    Abstract: Dialogue models are able to generate coherent and fluent responses, but they can still be challenging to control and may produce non-engaging, unsafe results. This unpredictability diminishes user trust and can hinder the use of the models in the real world. To address this, we introduce DialGuide, a novel framework for controlling dialogue model behavior using natural language rules, or guideline…

    Submitted 21 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  14. arXiv:2211.06318  [pdf]

    cs.CY cs.AI cs.LG

    Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence

    Authors: Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram Kalyanakrishnan, Ece Kamar, Sarit Kraus, Kevin Leyton-Brown, David Parkes, William Press, AnnaLee Saxenian, Julie Shah, Milind Tambe, Astro Teller

    Abstract: In September 2016, Stanford's "One Hundred Year Study on Artificial Intelligence" project (AI100) issued the first report of its planned long-term periodic assessment of artificial intelligence (AI) and its impact on society. It was written by a panel of 17 study authors, each of whom is deeply rooted in AI research, chaired by Peter Stone of the University of Texas at Austin. The report, entitled…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 52 pages, https://ai100.stanford.edu/2016-report

  15. arXiv:2208.08690  [pdf, other]

    cs.CL

    A Survey on Open Information Extraction from Rule-based Model to Large Language Model

    Authors: Pai Liu, Wenyang Gao, Wenjie Dong, Lin Ai, Ziwei Gong, Songfang Huang, Zongsheng Li, Ehsan Hoque, Julia Hirschberg, Yue Zhang

    Abstract: Open Information Extraction (OpenIE) represents a crucial NLP task aimed at deriving structured information from unstructured text, unrestricted by relation type or domain. This survey paper provides an overview of OpenIE technologies spanning from 2007 to 2024, emphasizing a chronological perspective absent in prior surveys. It examines the evolution of task settings in OpenIE to align with the a…

    Submitted 10 May, 2024; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: The first five authors contributed to this work equally. Names are ordered randomly

  16. arXiv:2206.00167  [pdf, other]

    cs.CL

    Understanding How People Rate Their Conversations

    Authors: Alexandros Papangelis, Nicole Chartier, Pankaj Rajan, Julia Hirschberg, Dilek Hakkani-Tur

    Abstract: User ratings play a significant role in spoken dialogue systems. Typically, such ratings tend to be averaged across all users and then utilized as feedback to improve the system or personalize its behavior. While this method can be useful to understand broad, general issues with the system and its behavior, it does not take into account differences between users that affect their ratings. In this…

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: Published at IWSDS 2021

  17. Part of speech tagging for code switched data

    Authors: Fahad AlGhamdi, Giovanni Molina, Mona Diab, Thamar Solorio, Abdelati Hawwari, Victor Soto, Julia Hirschberg

    Abstract: We address the problem of Part of Speech tagging (POS) in the context of linguistic code switching (CS). CS is the phenomenon where a speaker switches between two languages or variants of the same language within or across utterances, known as intra-sentential or inter-sentential CS, respectively. Processing CS data is especially challenging in intra-sentential data given state of the art monoling…

    Submitted 3 November, 2019; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: Association for Computational Linguistics

  18. arXiv:1908.02308  [pdf]

    cs.MM

    Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps

    Authors: Shih-Fu Chang, Alex Hauptmann, Louis-Philippe Morency, Sameer Antani, Dick Bulterman, Carlos Busso, Joyce Chai, Julia Hirschberg, Ramesh Jain, Ketan Mayer-Patel, Reuven Meth, Raymond Mooney, Klara Nahrstedt, Shri Narayanan, Prem Natarajan, Sharon Oviatt, Balakrishnan Prabhakaran, Arnold Smeulders, Hari Sundaram, Zhengyou Zhang, Michelle Zhou

    Abstract: With the transformative technologies and the rapidly changing global R&D landscape, the multimedia and multimodal community is now faced with many new opportunities and uncertainties. With the open source dissemination platform and pervasive computing resources, new research results are being discovered at an unprecedented pace. In addition, the rapid exchange and influence of ideas across traditi…

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: Long Report of NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps, held in March 2017, Washington DC. Short report available separately

  19. arXiv:1906.04138  [pdf, other]

    cs.CL

    Named Entity Recognition on Code-Switched Data: Overview of the CALCS 2018 Shared Task

    Authors: Gustavo Aguilar, Fahad AlGhamdi, Victor Soto, Mona Diab, Julia Hirschberg, Thamar Solorio

    Abstract: In the third shared task of the Computational Approaches to Linguistic Code-Switching (CALCS) workshop, we focus on Named Entity Recognition (NER) on code-switched social-media data. We divide the shared task into two competitions based on the English-Spanish (ENG-SPA) and Modern Standard Arabic-Egyptian (MSA-EGY) language pairs. We use Twitter data and 9 entity types to establish a new dataset fo…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: ACL 2018 (CALCS)

    Journal ref: Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching, 2018, 138-147

  20. arXiv:1703.08537  [pdf, ps, other]

    cs.CL

    Crowdsourcing Universal Part-Of-Speech Tags for Code-Switching

    Authors: Victor Soto, Julia Hirschberg

    Abstract: Code-switching is the phenomenon by which bilingual speakers switch between multiple languages during communication. The importance of developing language technologies for codeswitching data is immense, given the large populations that routinely code-switch. High-quality linguistic annotations are extremely valuable for any NLP task, and performance is often limited by the amount of high-quality l…

    Submitted 24 March, 2017; originally announced March 2017.

    Comments: Submitted to Interspeech 2017

  21. Some Bibliographical References on Intonation and Intonational Meaning

    Authors: Julia Hirschberg

    Abstract: A by-no-means-complete collection of references for those interested in intonational meaning, with other miscellaneous references on intonation included. Additional references are welcome, and should be sent to [email protected].

    Submitted 2 May, 1994; originally announced May 1994.

    Comments: 14 pp of text and citations, bibtex added as separate file