Skip to main content

Showing 1–24 of 24 results for author: Blanco, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11546  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Memorization In In-Context Learning

    Authors: Shahriar Golchin, Mihai Surdeanu, Steven Bethard, Eduardo Blanco, Ellen Riloff

    Abstract: In-context learning (ICL) has proven to be an effective strategy for improving the performance of large language models (LLMs) with no additional training. However, the exact mechanism behind these performance improvements remains unclear. This study is the first to show how ICL surfaces memorized training data and to explore the correlation between this memorization and performance across various… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: v1

  2. arXiv:2407.03525  [pdf, other

    cs.CL

    UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization

    Authors: Md Nayem Uddin, Amir Saeidi, Divij Handa, Agastya Seth, Tran Cao Son, Eduardo Blanco, Steven R. Corman, Chitta Baral

    Abstract: This paper introduces UnSeenTimeQA, a novel time-sensitive question-answering (TSQA) benchmark that diverges from traditional TSQA benchmarks by avoiding factual and web-searchable queries. We present a series of time-sensitive event scenarios decoupled from real-world factual information. It requires large language models (LLMs) to engage in genuine temporal reasoning, disassociating from the kno… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2406.16253  [pdf, other

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo , et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th… ▽ More

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  4. arXiv:2406.07492  [pdf, other

    cs.CL

    Paraphrasing in Affirmative Terms Improves Negation Understanding

    Authors: MohammadHossein Rezaei, Eduardo Blanco

    Abstract: Negation is a common linguistic phenomenon. Yet language models face challenges with negation in many natural language understanding tasks such as question answering and natural language inference. In this paper, we experiment with seamless strategies that incorporate affirmative interpretations (i.e., paraphrases without negation) to make models more robust against negation. Crucially, our affirm… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  5. arXiv:2404.16413  [pdf, other

    cs.CL

    Asking and Answering Questions to Extract Event-Argument Structures

    Authors: Md Nayem Uddin, Enfa Rose George, Eduardo Blanco, Steven Corman

    Abstract: This paper presents a question-answering approach to extract document-level event-argument structures. We automatically ask and answer questions for each argument type an event may have. Questions are generated using manually defined templates and generative transformers. Template-based questions are generated using predefined role-specific wh-words and event triggers from the context document. Tr… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  6. arXiv:2404.16262  [pdf, other

    cs.CL

    Interpreting Answers to Yes-No Questions in Dialogues from Multiple Domains

    Authors: Zijie Wang, Farzana Rashid, Eduardo Blanco

    Abstract: People often answer yes-no questions without explicitly saying yes, no, or similar polar keywords. Figuring out the meaning of indirect answers is challenging, even for large language models. In this paper, we investigate this problem working with dialogues from multiple domains. We present new benchmarks in three diverse domains: movie scripts, tennis interviews, and airline customer service. We… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: To appear at NAACL 2024 Findings

  7. arXiv:2404.04770  [pdf, other

    cs.CL

    Generating Uncontextualized and Contextualized Questions for Document-Level Event Argument Extraction

    Authors: Md Nayem Uddin, Enfa Rose George, Eduardo Blanco, Steven Corman

    Abstract: This paper presents multiple question generation strategies for document-level event argument extraction. These strategies do not require human involvement and result in uncontextualized questions as well as contextualized questions grounded on the event and document of interest. Experimental results show that combining uncontextualized and contextualized questions is beneficial, especially when e… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL 2024

  8. arXiv:2403.17146  [pdf, other

    cs.CL

    Outcome-Constrained Large Language Models for Countering Hate Speech

    Authors: Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song

    Abstract: Counterspeech that challenges or responds to hate speech has been seen as an alternative to mitigate the negative impact of hate speech and foster productive online communications. Research endeavors have been directed to using language models for the automatic generation of counterspeech to assist efforts in combating online hate. Existing research focuses on the generation of counterspeech with… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  9. arXiv:2403.11082  [pdf, other

    cs.CL cs.AI cs.LG

    RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

    Authors: Javad Rafiei Asl, Prajwal Panzade, Eduardo Blanco, Daniel Takabi, Zhipeng Cai

    Abstract: Pre-trained language models (PLMs) have consistently demonstrated outstanding performance across a diverse spectrum of natural language processing tasks. Nevertheless, despite their success with unseen data, current PLM-based representations often exhibit poor robustness in adversarial settings. In this paper, we introduce RobustSentEmbed, a self-supervised sentence embedding framework designed to… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted at the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL Findings) 2024. [https://openreview.net/forum?id=9dEAg4lJEA]

  10. arXiv:2312.04804  [pdf, other

    cs.CY cs.CL

    Hate Cannot Drive out Hate: Forecasting Conversation Incivility following Replies to Hate Speech

    Authors: Xinchen Yu, Eduardo Blanco, Lingzi Hong

    Abstract: User-generated replies to hate speech are promising means to combat hatred, but questions about whether they can stop incivility in follow-up conversations linger. We argue that effective replies stop incivility from emerging in follow-up conversations - replies that elicit more incivility are counterproductive. This study introduces the task of predicting the incivility of conversations following… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: The 18th International AAAI Conference on Web and Social Media (ICWSM 2024) Accepted

  11. arXiv:2310.15464  [pdf, other

    cs.CL

    Interpreting Answers to Yes-No Questions in User-Generated Content

    Authors: Shivam Mathur, Keun Hee Park, Dhivya Chinnappa, Saketh Kotamraju, Eduardo Blanco

    Abstract: Interpreting answers to yes-no questions in social media is difficult. Yes and no keywords are uncommon, and the few answers that include them are rarely to be interpreted what the keywords suggest. In this paper, we present a new corpus of 4,442 yes-no question-answer pairs from Twitter. We discuss linguistic characteristics of answers whose interpretation is yes or no, as well as answers whose i… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at the Findings of EMNLP 2023

  12. arXiv:2310.13290  [pdf, other

    cs.CL

    Interpreting Indirect Answers to Yes-No Questions in Multiple Languages

    Authors: Zijie Wang, Md Mosharaf Hossain, Shivam Mathur, Terry Cruz Melo, Kadir Bulut Ozler, Keun Hee Park, Jacob Quintero, MohammadHossein Rezaei, Shreya Nupur Shakya, Md Nayem Uddin, Eduardo Blanco

    Abstract: Yes-no questions expect a yes or no for an answer, but people often skip polar keywords. Instead, they answer with long explanations that must be interpreted. In this paper, we focus on this challenging problem and release new benchmarks in eight languages. We present a distant supervision approach to collect training data. We also demonstrate that direct answers (i.e., with polar keywords) are us… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Findings

  13. arXiv:2307.05034  [pdf, other

    cs.CL

    Synthetic Dataset for Evaluating Complex Compositional Knowledge for Natural Language Inference

    Authors: Sushma Anand Akoju, Robert Vacareanu, Haris Riaz, Eduardo Blanco, Mihai Surdeanu

    Abstract: We introduce a synthetic dataset called Sentences Involving Complex Compositional Knowledge (SICCK) and a novel analysis that investigates the performance of Natural Language Inference (NLI) models to understand compositionality in logic. We produce 1,304 sentence pairs by modifying 15 examples from the SICK dataset (Marelli et al., 2014). To this end, we modify the original texts using a set of p… ▽ More

    Submitted 7 September, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted to Natural Language Reasoning and Structured Explanations (NLRSE) Workshop, ACL 2023. For dataset, please refer https://github.com/sushmaakoju/clulab-releases/blob/master/acl2023-nlrse-sicck/README.md and https://github.com/sushmaakoju/acl2023-nlrse-clulab-SICCK-dataset

  14. arXiv:2210.14486  [pdf, other

    cs.CL

    Leveraging Affirmative Interpretations from Negation Improves Natural Language Understanding

    Authors: Md Mosharaf Hossain, Eduardo Blanco

    Abstract: Negation poses a challenge in many natural language understanding tasks. Inspired by the fact that understanding a negated statement often requires humans to infer affirmative interpretations, in this paper we show that doing so benefits models for three natural language understanding tasks. We present an automated procedure to collect pairs of sentences with negation and their affirmative interpr… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: To appear at the main conference of EMNLP 2022

  15. arXiv:2206.06423  [pdf, other

    cs.CL

    Hate Speech and Counter Speech Detection: Conversational Context Does Matter

    Authors: Xinchen Yu, Eduardo Blanco, Lingzi Hong

    Abstract: Hate speech is plaguing the cyberspace along with user-generated content. This paper investigates the role of conversational context in the annotation and detection of online hate and counter speech, where context is defined as the preceding comment in a conversation thread. We created a context-aware dataset for a 3-way classification task on Reddit comments: hate speech, counter speech, or neutr… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: Accepted by NAACL 2022

  16. arXiv:2205.11467  [pdf, other

    cs.CL

    A Question-Answer Driven Approach to Reveal Affirmative Interpretations from Verbal Negations

    Authors: Md Mosharaf Hossain, Luke Holman, Anusha Kakileti, Tiffany Iris Kao, Nathan Raul Brito, Aaron Abraham Mathews, Eduardo Blanco

    Abstract: This paper explores a question-answer driven approach to reveal affirmative interpretations from verbal negations (i.e., when a negation cue grammatically modifies a verb). We create a new corpus consisting of 4,472 verbal negations and discover that 67.1% of them convey that an event actually occurred. Annotators generate and answer 7,277 questions for the 3,001 negations that convey an affirmati… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted at the Findings of NAACL 2022

  17. Applying Model Checking to Highly-Configurable Safety Critical Software: The SPS-PPS PLC Program

    Authors: Borja Fernandez Adiego, Ignacio D. Lopez-Miguel, Jean-Charles Tournier, Enrique Blanco, Tomasz Ladzinski, Frederic Havart

    Abstract: An important aspect of many particle accelerators is the constant evolution and frequent configuration changes that are needed to perform the experiments they are designed for. This often leads to the design of configurable software that can absorb these changes and perform the required control and protection actions. This design strategy minimizes the engineering and maintenance costs, but it mak… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: 18th International Conference on Accelerator and Large Experimental Physics Control Systems (ICALEPCS2021)

  18. arXiv:2203.08929  [pdf, other

    cs.CL

    An Analysis of Negation in Natural Language Understanding Corpora

    Authors: Md Mosharaf Hossain, Dhivya Chinnappa, Eduardo Blanco

    Abstract: This paper analyzes negation in eight popular corpora spanning six natural language understanding tasks. We show that these corpora have few negations compared to general-purpose English, and that the few negations in them are often unimportant. Indeed, one can often ignore negations and still make the right predictions. Additionally, experimental results show that state-of-the-art transformers tr… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: To appear in the proceedings of ACL 2022 (main conference)

  19. arXiv:2109.07017  [pdf, other

    cs.CL

    Written Justifications are Key to Aggregate Crowdsourced Forecasts

    Authors: Saketh Kotamraju, Eduardo Blanco

    Abstract: This paper demonstrates that aggregating crowdsourced forecasts benefits from modeling the written justifications provided by forecasters. Our experiments show that the majority and weighted vote baselines are competitive, and that the written justifications are beneficial to call a question throughout its life except in the last quarter. We also conduct an error analysis shedding light into the c… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021

  20. arXiv:2010.05432  [pdf, other

    cs.CL cs.AI

    It's not a Non-Issue: Negation as a Source of Error in Machine Translation

    Authors: Md Mosharaf Hossain, Antonios Anastasopoulos, Eduardo Blanco, Alexis Palmer

    Abstract: As machine translation (MT) systems progress at a rapid pace, questions of their adequacy linger. In this study we focus on negation, a universal, core property of human language that significantly affects the semantics of an utterance. We investigate whether translating negation is an issue for modern MT systems using 17 translation directions as test bed. Through thorough analysis, we find that… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted at the Findings of EMNLP2020

  21. Interactive Text Graph Mining with a Prolog-based Dialog Engine

    Authors: Paul Tarau, Eduardo Blanco

    Abstract: On top of a neural network-based dependency parser and a graph-based natural language processing module we design a Prolog-based dialog engine that explores interactively a ranked fact database extracted from a text document. We reorganize dependency graphs to focus on the most relevant content elements of a sentence and integrate sentence identifiers as graph nodes. Additionally, after rankin… ▽ More

    Submitted 30 July, 2020; originally announced August 2020.

    Comments: Under consideration in Theory and Practice of Logic Programming (TPLP). arXiv admin note: substantial text overlap with arXiv:1909.09742

    Journal ref: Theory and Practice of Logic Programming 21 (2021) 244-263

  22. arXiv:1909.09742  [pdf, other

    cs.AI

    Dependency-based Text Graphs for Keyphrase and Summary Extraction with Applications to Interactive Content Retrieval

    Authors: Paul Tarau, Eduardo Blanco

    Abstract: We build a bridge between neural network-based machine learning and graph-based natural language processing and introduce a unified approach to keyphrase, summary and relation extraction by aggregating dependency graphs from links provided by a deep-learning based dependency parser. We reorganize dependency graphs to focus on the most relevant content elements of a sentence, integrate sentence i… ▽ More

    Submitted 20 September, 2019; originally announced September 2019.

  23. Early warning in egg production curves from commercial hens: A SVM approach

    Authors: Iván Ramírez Morales, Daniel Rivero Cebrián, Enrique Fernández Blanco, Alejandro Pazos Sierra

    Abstract: Artificial Intelligence allows the improvement of our daily life, for instance, speech and handwritten text recognition, real time translation and weather forecasting are common used applications. In the livestock sector, machine learning algorithms have the potential for early detection and warning of problems, which represents a significant milestone in the poultry industry. Production problems… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Journal ref: Early warning in egg production curves from commercial hens: A SVM approach, Computers and Electronics in Agriculture, Volume 121, 2016, Pages 169-179, ISSN 0168-1699, https://doi.org/10.1016/j.compag.2015.12.009

  24. arXiv:1702.04415  [pdf, other

    cs.LG stat.ML

    Small Boxes Big Data: A Deep Learning Approach to Optimize Variable Sized Bin Packing

    Authors: Feng Mao, Edgar Blanco, Mingang Fu, Rohit Jain, Anurag Gupta, Sebastien Mancel, Rong Yuan, Stephen Guo, Sai Kumar, Yayang Tian

    Abstract: Bin Packing problems have been widely studied because of their broad applications in different domains. Known as a set of NP-hard problems, they have different vari- ations and many heuristics have been proposed for obtaining approximate solutions. Specifically, for the 1D variable sized bin packing problem, the two key sets of optimization heuristics are the bin assignment and the bin allocation.… ▽ More

    Submitted 14 February, 2017; originally announced February 2017.

    Comments: The Third IEEE International Conference on Big Data Computing Service and Applications, 2017

    ACM Class: I.1.2; I.2.8