Skip to main content

Showing 1–50 of 231 results for author: Rao, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.05895  [pdf, other

    cs.CY cs.CR

    Gender of Recruiter Makes a Difference: A study into Cybersecurity Graduate Recruitment

    Authors: Joanne L. Hall, Asha Rao

    Abstract: An ever-widening workforce gap exists in the global cybersecurity industry but diverse talent is underutilized. The global cybersecurity workforce is only 25% female. Much research exists on the effect of gender bias on the hiring of women into the technical workforce, but little on how the gender of the recruiter (gender difference) affects recruitment decisions. This research reveals differences… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: 22 pages, 4 figures

  2. arXiv:2408.00118  [pdf, other

    cs.CL cs.AI

    Gemma 2: Improving Open Language Models at a Practical Size

    Authors: Gemma Team, Morgane Riviere, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman , et al. (172 additional authors not shown)

    Abstract: In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We al… ▽ More

    Submitted 2 August, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  3. arXiv:2407.21783  [pdf, other

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang , et al. (510 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical… ▽ More

    Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  4. arXiv:2407.05483  [pdf, other

    cs.CL cs.LG

    Just read twice: closing the recall gap for recurrent language models

    Authors: Simran Arora, Aman Timalsina, Aaryan Singhal, Benjamin Spector, Sabri Eyuboglu, Xinyi Zhao, Ashish Rao, Atri Rudra, Christopher Ré

    Abstract: Recurrent large language models that compete with Transformers in language modeling perplexity are emerging at a rapid rate (e.g., Mamba, RWKV). Excitingly, these architectures use a constant amount of memory during inference. However, due to the limited memory, recurrent LMs cannot recall and use all the information in long contexts leading to brittle in-context learning (ICL) quality. A key chal… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  5. arXiv:2407.01802  [pdf, ps, other

    cs.CC

    An XOR Lemma for Deterministic Communication Complexity

    Authors: Siddharth Iyer, Anup Rao

    Abstract: We prove a lower bound on the communication complexity of computing the $n$-fold xor of an arbitrary function $f$, in terms of the communication complexity and rank of $f$. We prove that $D(f^{\oplus n}) \geq n \cdot \Big(\frac{Ω(D(f))}{\log \mathsf{rk}(f)} -\log \mathsf{rk}(f)\Big )$, where here $D(f), D(f^{\oplus n})$ represent the deterministic communication complexity, and $\mathsf{rk}(f)$ is… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  6. arXiv:2406.12702  [pdf, other

    cs.CL

    [WIP] Jailbreak Paradox: The Achilles' Heel of LLMs

    Authors: Abhinav Rao, Monojit Choudhury, Somak Aditya

    Abstract: We introduce two paradoxes concerning jailbreak of foundation models: First, it is impossible to construct a perfect jailbreak classifier, and second, a weaker model cannot consistently detect whether a stronger (in a pareto-dominant sense) model is jailbroken or not. We provide formal proofs for these paradoxes and a short case study on Llama and GPT4-o to demonstrate this. We discuss broader the… ▽ More

    Submitted 20 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.01866  [pdf, other

    cs.CL cs.CY cs.SI

    #EpiTwitter: Public Health Messaging During the COVID-19 Pandemic

    Authors: Ashwin Rao, Nazanin Sabri, Siyi Guo, Louiqa Raschid, Kristina Lerman

    Abstract: Effective communication during health crises is critical, with social media serving as a key platform for public health experts (PHEs) to engage with the public. However, it also amplifies pseudo-experts promoting contrarian views. Despite its importance, the role of emotional and moral language in PHEs' communication during COVID-19 remains under explored. This study examines how PHEs and pseudo-… ▽ More

    Submitted 10 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  8. arXiv:2404.12464  [pdf, other

    cs.CL

    NormAd: A Benchmark for Measuring the Cultural Adaptability of Large Language Models

    Authors: Abhinav Rao, Akhila Yerukola, Vishwa Shah, Katharina Reinecke, Maarten Sap

    Abstract: The integration of large language models (LLMs) into various global cultures fundamentally presents a challenge: LLMs must navigate interactions, respect social norms, and avoid transgressing cultural boundaries. However, it is still unclear if LLMs can adapt their outputs to diverse cultural norms. Our study focuses on this aspect. We introduce NormAd, a novel dataset, which includes 2.6k stories… ▽ More

    Submitted 11 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Preprint. In Review

  9. arXiv:2404.09147  [pdf

    cs.HC

    Evaluating the efficacy of haptic feedback, 360° treadmill-integrated Virtual Reality framework and longitudinal training on decision-making performance in a complex search-and-shoot simulation

    Authors: Akash K Rao, Arnav Bhavsar, Shubhajit Roy Chowdhury, Sushil Chandra, Ramsingh Negi, Prakash Duraisamy, Varun Dutt

    Abstract: Virtual Reality (VR) has made significant strides, offering users a multitude of ways to interact with virtual environments. Each sensory modality in VR provides distinct inputs and interactions, enhancing the user's immersion and presence. However, the potential of additional sensory modalities, such as haptic feedback and 360° locomotion, to improve decision-making performance has not been thoro… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 13 pages, 6 figures, 1 Table

  10. arXiv:2404.01588  [pdf, other

    cs.CL cs.AI cs.LG

    Hallucination Diversity-Aware Active Learning for Text Summarization

    Authors: Yu Xia, Xu Liu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Anup Rao, Tung Mai, Shuai Li

    Abstract: Large Language Models (LLMs) have shown propensity to generate hallucinated outputs, i.e., texts that are factually incorrect or unsupported. Existing methods for alleviating hallucinations typically require costly human annotations to identify and correct hallucinations in LLM outputs. Moreover, most of these methods focus on a specific type of hallucination, e.g., entity or token errors, which l… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024

  11. arXiv:2403.14872  [pdf, other

    cs.CR

    Structuring the Chaos: Enabling Small Business Cyber-Security Risks & Assets Modelling with a UML Class Model

    Authors: Tracy Tam, Asha Rao, Joanne Hall

    Abstract: Small businesses are increasingly adopting IT, and consequently becoming more vulnerable to cyber-incidents. Whilst small businesses are aware of the cyber-security risks, many struggle with implementing mitigations. Some of these can be traced to fundamental differences in the characteristics of small business versus large enterprises where modern cyber-security solutions are widely deployed. S… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  12. arXiv:2403.04085  [pdf, other

    cs.CL cs.CY

    Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations

    Authors: Abhishek Anand, Negar Mokhberian, Prathyusha Naresh Kumar, Anweasha Saha, Zihao He, Ashwin Rao, Fred Morstatter, Kristina Lerman

    Abstract: Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels show low confidence on high-disagreement data instances. While previous studies consider such instances as mislabeled, we argue that the reason the high-disagreem… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  13. arXiv:2403.00975  [pdf, other

    cs.LG cs.AI math.FA stat.AP

    Equipment Health Assessment: Time Series Analysis for Wind Turbine Performance

    Authors: Jana Backhus, Aniruddha Rajendra Rao, Chandrasekar Venkatraman, Abhishek Padmanabhan, A. Vinoth Kumar, Chetan Gupta

    Abstract: In this study, we leverage SCADA data from diverse wind turbines to predict power output, employing advanced time series methods, specifically Functional Neural Networks (FNN) and Long Short-Term Memory (LSTM) networks. A key innovation lies in the ensemble of FNN and LSTM models, capitalizing on their collective learning. This ensemble approach outperforms individual models, ensuring stable and a… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 19 Pages, 17 Figures, 3 Tables, Submitted at Applied Sciences (MDPI)

  14. arXiv:2402.11114  [pdf, other

    cs.CL cs.CY cs.SI

    Whose Emotions and Moral Sentiments Do Language Models Reflect?

    Authors: Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman

    Abstract: Language models (LMs) are known to represent the perspectives of some social groups better than others, which may impact their performance, especially on subjective tasks such as content moderation and hate speech detection. To explore how LMs represent different perspectives, existing research focused on positional alignment, i.e., how closely the models mimic the opinions and stances of differen… ▽ More

    Submitted 17 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  15. arXiv:2402.01711  [pdf, other

    cs.CY cs.AI

    LLM on FHIR -- Demystifying Health Records

    Authors: Paul Schmiedmayer, Adrit Rao, Philipp Zagar, Vishnu Ravi, Aydin Zahedivash, Arash Fereydooni, Oliver Aalami

    Abstract: Objective: To enhance health literacy and accessibility of health information for a diverse patient population by developing a patient-centered artificial intelligence (AI) solution using large language models (LLMs) and Fast Healthcare Interoperability Resources (FHIR) application programming interfaces (APIs). Materials and Methods: The research involved developing LLM on FHIR, an open-source mo… ▽ More

    Submitted 25 January, 2024; originally announced February 2024.

    Comments: Pre-print of the paper submitted to the Call for Papers for the Special Focus Issue on ChatGPT and Large Language Models (LLMs) in Biomedicine and Health at the Journal of the American Medical Informatics Association: https://academic.oup.com/jamia/pages/call-for-papers-for-special-focus-issue

  16. arXiv:2402.01091  [pdf, other

    cs.CL cs.CY cs.SI

    Reading Between the Tweets: Deciphering Ideological Stances of Interconnected Mixed-Ideology Communities

    Authors: Zihao He, Ashwin Rao, Siyi Guo, Negar Mokhberian, Kristina Lerman

    Abstract: Recent advances in NLP have improved our ability to understand the nuanced worldviews of online communities. Existing research focused on probing ideological stances treats liberals and conservatives as separate groups. However, this fails to account for the nuanced views of the organically formed online communities and the connections between them. In this paper, we study discussions of the 2020… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  17. arXiv:2402.00090  [pdf

    q-bio.NC cs.HC

    Classification of attention performance post-longitudinal tDCS via functional connectivity and machine learning methods

    Authors: Akash K Rao, Vishnu K Menon, Arnav Bhavsar, Shubhajit Roy Chowdhury, Ramsingh Negi, Varun Dutt

    Abstract: Attention is the brain's mechanism for selectively processing specific stimuli while filtering out irrelevant information. Characterizing changes in attention following long-term interventions (such as transcranial direct current stimulation (tDCS)) has seldom been emphasized in the literature. To classify attention performance post-tDCS, this study uses functional connectivity and machine learnin… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 6 pages, to be presented in the IEEE 9th International Conference for Convergence in Technology (I2CT),Pune, April 2024. arXiv admin note: substantial text overlap with arXiv:2401.17700

  18. arXiv:2401.17711  [pdf

    cs.HC cs.AI

    Prediction of multitasking performance post-longitudinal tDCS via EEG-based functional connectivity and machine learning methods

    Authors: Akash K Rao, Shashank Uttrani, Vishnu K Menon, Darshil Shah, Arnav Bhavsar, Shubhajit Roy Chowdhury, Varun Dutt

    Abstract: Predicting and understanding the changes in cognitive performance, especially after a longitudinal intervention, is a fundamental goal in neuroscience. Longitudinal brain stimulation-based interventions like transcranial direct current stimulation (tDCS) induce short-term changes in the resting membrane potential and influence cognitive processes. However, very little research has been conducted o… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 16 pages, presented at the 30th International Conference on Neural Information Processing (ICONIP2023), Changsha, China, November 2023

  19. arXiv:2401.17705  [pdf

    cs.LG cs.HC

    Predicting suicidal behavior among Indian adults using childhood trauma, mental health questionnaires and machine learning cascade ensembles

    Authors: Akash K Rao, Gunjan Y Trivedi, Riri G Trivedi, Anshika Bajpai, Gajraj Singh Chauhan, Vishnu K Menon, Kathirvel Soundappan, Hemalatha Ramani, Neha Pandya, Varun Dutt

    Abstract: Among young adults, suicide is India's leading cause of death, accounting for an alarming national suicide rate of around 16%. In recent years, machine learning algorithms have emerged to predict suicidal behavior using various behavioral traits. But to date, the efficacy of machine learning algorithms in predicting suicidal behavior in the Indian context has not been explored in literature. In th… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 11 pages, presnted at the 4th International Conference on Frontiers in Computing and Systems (COMSYS 2023), Himachal Pradesh, October 2023

  20. arXiv:2401.17700  [pdf

    cs.HC cs.AI

    Classification of executive functioning performance post-longitudinal tDCS using functional connectivity and machine learning methods

    Authors: Akash K Rao, Vishnu K Menon, Shashank Uttrani, Ayushman Dixit, Dipanshu Verma, Varun Dutt

    Abstract: Executive functioning is a cognitive process that enables humans to plan, organize, and regulate their behavior in a goal-directed manner. Understanding and classifying the changes in executive functioning after longitudinal interventions (like transcranial direct current stimulation (tDCS)) has not been explored in the literature. This study employs functional connectivity and machine learning al… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 7 pages, presented at the IEEE 20th India Council International Conference (INDICON 2023), Hyderabad, India, December 2023

  21. arXiv:2401.14498  [pdf, other

    cs.LG eess.SY stat.AP stat.ML

    Predictive Analysis for Optimizing Port Operations

    Authors: Aniruddha Rajendra Rao, Haiyan Wang, Chetan Gupta

    Abstract: Maritime transport is a pivotal logistics mode for the long-distance and bulk transportation of goods. However, the intricate planning involved in this mode is often hindered by uncertainties, including weather conditions, cargo diversity, and port dynamics, leading to increased costs. Consequently, accurately estimating vessel total (stay) time at port and potential delays becomes imperative for… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 13 pages, 9 figures, 4 Tables. Submitted at IEEE IJCNN 2024

  22. arXiv:2401.07453  [pdf, other

    cs.CL cs.AI cs.IR

    Model Editing at Scale leads to Gradual and Catastrophic Forgetting

    Authors: Akshat Gupta, Anurag Rao, Gopala Anumanchipalli

    Abstract: Editing knowledge in large language models is an attractive capability to have which allows us to correct incorrectly learnt facts during pre-training, as well as update the model with an ever-growing list of new facts. While existing model editing techniques have shown promise, they are usually evaluated using metrics for reliability, specificity and generalization over one or few edits. We argue… ▽ More

    Submitted 10 June, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: ACL 2024 Findings

  23. arXiv:2401.06275  [pdf, other

    cs.SI

    The Pulse of Mood Online: Unveiling Emotional Reactions in a Dynamic Social Media Landscape

    Authors: Siyi Guo, Zihao He, Ashwin Rao, Fred Morstatter, Jeffrey Brantingham, Kristina Lerman

    Abstract: The rich and dynamic information environment of social media provides researchers, policy makers, and entrepreneurs with opportunities to learn about social phenomena in a timely manner. However, using these data to understand social behavior is difficult due to heterogeneity of topics and events discussed in the highly dynamic online information environment. To address these challenges, we presen… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2307.10245

  24. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  25. arXiv:2312.11561  [pdf, other

    cs.LG cs.AI

    COPD-FlowNet: Elevating Non-invasive COPD Diagnosis with CFD Simulations

    Authors: Aryan Tyagi, Aryaman Rao, Shubhanshu Rao, Raj Kumar Singh

    Abstract: Chronic Obstructive Pulmonary Disorder (COPD) is a prevalent respiratory disease that significantly impacts the quality of life of affected individuals. This paper presents COPDFlowNet, a novel deep-learning framework that leverages a custom Generative Adversarial Network (GAN) to generate synthetic Computational Fluid Dynamics (CFD) velocity flow field images specific to the trachea of COPD patie… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 2 pages 2 tables 3 figures

  26. arXiv:2312.03076  [pdf, ps, other

    cs.CC

    XOR Lemmas for Communication via Marginal Information

    Authors: Siddharth Iyer, Anup Rao

    Abstract: We define the $\textit{marginal information}$ of a communication protocol, and use it to prove XOR lemmas for communication complexity. We show that if every $C$-bit protocol has bounded advantage for computing a Boolean function $f$, then every $\tilde Ω(C \sqrt{n})$-bit protocol has advantage $\exp(-Ω(n))$ for computing the $n$-fold xor $f^{\oplus n}$. We prove exponentially small bounds in the… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Fixed typos

  27. arXiv:2311.18676  [pdf, other

    cs.SI cs.AI

    DQSSA: A Quantum-Inspired Solution for Maximizing Influence in Online Social Networks (Student Abstract)

    Authors: Aryaman Rao, Parth Singh, Dinesh Kumar Vishwakarma, Mukesh Prasad

    Abstract: Influence Maximization is the task of selecting optimal nodes maximising the influence spread in social networks. This study proposes a Discretized Quantum-based Salp Swarm Algorithm (DQSSA) for optimizing influence diffusion in social networks. By discretizing meta-heuristic algorithms and infusing them with quantum-inspired enhancements, we address issues like premature convergence and low effic… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: AAAI Conference on Artificial Intelligence 2024

  28. arXiv:2311.17754  [pdf, other

    cs.CV cs.GR cs.HC cs.MM

    Cinematic Behavior Transfer via NeRF-based Differentiable Filming

    Authors: Xuekun Jiang, Anyi Rao, Jingbo Wang, Dahua Lin, Bo Dai

    Abstract: In the evolving landscape of digital media and video production, the precise manipulation and reproduction of visual elements like camera movements and character actions are highly desired. Existing SLAM methods face limitations in dynamic scenes and human pose estimation often focuses on 2D projections, neglecting 3D statuses. To address these issues, we first introduce a reverse filming behavior… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Project Page: https://virtualfilmstudio.github.io/projects/cinetransfer

  29. arXiv:2311.16933  [pdf, other

    cs.CV

    SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models

    Authors: Yuwei Guo, Ceyuan Yang, Anyi Rao, Maneesh Agrawala, Dahua Lin, Bo Dai

    Abstract: The development of text-to-video (T2V), i.e., generating videos with a given text prompt, has been significantly advanced in recent years. However, relying solely on text prompts often results in ambiguous frame composition due to spatial uncertainty. The research community thus leverages the dense structure signals, e.g., per-frame depth/edge sequences, to enhance controllability, whose collectio… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Project page: https://guoyww.github.io/projects/SparseCtrl

  30. arXiv:2311.16831  [pdf, other

    cs.CY

    Tracking a Year of Polarized Twitter Discourse on Abortion

    Authors: Ashwin Rao, Rong-Ching Chang, Qiankun Zhong, Kristina Lerman, Magdalena Wojcieszak

    Abstract: Abortion is one of the most contentious issues in American politics. The Dobbs v. Jackson Women's Health Organization ruling in 2022, which shifted the authority to regulate abortion from the federal government to the states, triggering intense protests and emotional debates across the nation. Yet, little is known about how online discourse about abortion rights fluctuated on social media platform… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  31. arXiv:2311.09687  [pdf, other

    cs.CL

    Inducing Political Bias Allows Language Models Anticipate Partisan Reactions to Controversies

    Authors: Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman

    Abstract: Social media platforms are rife with politically charged discussions. Therefore, accurately deciphering and predicting partisan biases using Large Language Models (LLMs) is increasingly critical. In this study, we address the challenge of understanding political bias in digitized discourse using LLMs. While traditional approaches often rely on finetuning separate models for each political faction,… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  32. arXiv:2311.04817  [pdf, other

    cs.LG cs.AI

    Decentralized Personalized Online Federated Learning

    Authors: Renzhi Wu, Saayan Mitra, Xiang Chen, Anup Rao

    Abstract: Vanilla federated learning does not support learning in an online environment, learning a personalized model on each client, and learning in a decentralized setting. There are existing methods extending federated learning in each of the three aspects. However, some important applications on enterprise edge servers (e.g. online item recommendation at global scale) involve the three aspects at the s… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Journal ref: IEEE BigData 2023

  33. arXiv:2311.02332  [pdf, other

    cs.LG cs.CV

    Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects

    Authors: Elisa Warner, Joonsang Lee, William Hsu, Tanveer Syeda-Mahmood, Charles Kahn, Olivier Gevaert, Arvind Rao

    Abstract: Machine learning (ML) applications in medical artificial intelligence (AI) systems have shifted from traditional and statistical methods to increasing application of deep learning models. This survey navigates the current landscape of multimodal ML, focusing on its profound impact on medical image analysis and clinical decision support systems. Emphasizing challenges and innovations in addressing… ▽ More

    Submitted 19 January, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

  34. arXiv:2310.18553  [pdf, other

    cs.SI physics.soc-ph

    Affective Polarization and Dynamics of Information Spread in Online Networks

    Authors: Kristina Lerman, Dan Feldman, Zihao He, Ashwin Rao

    Abstract: Members of different political groups not only disagree about issues but also dislike and distrust each other. While social media can amplify this emotional divide -- called affective polarization by political scientists -- there is a lack of agreement on its strength and prevalence. We measure affective polarization on social media by quantifying the emotions and toxicity of reply interactions. W… ▽ More

    Submitted 7 May, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  35. arXiv:2310.07251  [pdf, other

    cs.CL cs.AI

    Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs

    Authors: Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury

    Abstract: In this position paper, we argue that instead of morally aligning LLMs to specific set of ethical principles, we should infuse generic ethical reasoning capabilities into them so that they can handle value pluralism at a global scale. When provided with an ethical policy, an LLM should be capable of making decisions that are ethically consistent to the policy. We develop a framework that integrate… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  36. arXiv:2310.00073  [pdf, other

    cs.RO

    Multi-Objective Sparse Sensing with Ergodic Optimization

    Authors: Ananya Rao, Howie Choset

    Abstract: We consider a search problem where a robot has one or more types of sensors, each suited to detecting different types of targets or target information. Often, information in the form of a distribution of possible target locations, or locations of interest, may be available to guide the search. When multiple types of information exist, then a distribution for each type of information must also exis… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  37. arXiv:2309.03294  [pdf, other

    cs.CR

    MALITE: Lightweight Malware Detection and Classification for Constrained Devices

    Authors: Sidharth Anand, Barsha Mitra, Soumyadeep Dey, Abhinav Rao, Rupsa Dhar, Jaideep Vaidya

    Abstract: Today, malware is one of the primary cyberthreats to organizations. Malware has pervaded almost every type of computing device including the ones having limited memory, battery and computation power such as mobile phones, tablets and embedded devices like Internet-of-Things (IoT) devices. Consequently, the privacy and security of the malware infected systems and devices have been heavily jeopardiz… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  38. arXiv:2308.14922  [pdf, other

    cs.HC cs.CV cs.GR

    Automated Conversion of Music Videos into Lyric Videos

    Authors: Jiaju Ma, Anyi Rao, Li-Yi Wei, Rubaiat Habib Kazi, Hijung Valentina Shin, Maneesh Agrawala

    Abstract: Musicians and fans often produce lyric videos, a form of music videos that showcase the song's lyrics, for their favorite songs. However, making such videos can be challenging and time-consuming as the lyrics need to be added in synchrony and visual harmony with the video. Informed by prior work and close examination of existing lyric videos, we propose a set of design guidelines to help creators… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  39. Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization

    Authors: Yujie Zhou, Wenwen Qiang, Anyi Rao, Ning Lin, Bing Su, Jiaqi Wang

    Abstract: Zero-shot skeleton-based action recognition aims to recognize actions of unseen categories after training on data of seen categories. The key is to build the connection between visual and semantic space from seen to unseen classes. Previous studies have primarily focused on encoding sequences into a singular feature vector, with subsequent mapping the features to an identical anchor point within t… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  40. arXiv:2307.10245  [pdf, other

    cs.SI physics.soc-ph

    Measuring Online Emotional Reactions to Events

    Authors: Siyi Guo, Zihao He, Ashwin Rao, Eugene Jang, Yuanfeixue Nan, Fred Morstatter, Jeffrey Brantingham, Kristina Lerman

    Abstract: The rich and dynamic information environment of social media provides researchers, policy makers, and entrepreneurs with opportunities to learn about social phenomena in a timely manner. However, using this data to understand social behavior is difficult due heterogeneity of topics and events discussed in the highly dynamic online information environment. To address these challenges, we present a… ▽ More

    Submitted 28 March, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Proceedings of the International Conference on Advances in Social Networks Analysis and Mining. 2023

  41. arXiv:2307.04725  [pdf, other

    cs.CV cs.GR cs.LG

    AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

    Authors: Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai

    Abstract: With the advance of text-to-image (T2I) diffusion models (e.g., Stable Diffusion) and corresponding personalization techniques such as DreamBooth and LoRA, everyone can manifest their imagination into high-quality images at an affordable cost. However, adding motion dynamics to existing high-quality personalized T2Is and enabling them to generate animations remains an open challenge. In this paper… ▽ More

    Submitted 8 February, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: Codes and Supplementary Material: https://github.com/guoyww/AnimateDiff

  42. arXiv:2307.04345  [pdf, other

    cs.LG cs.AI

    Continual Learning as Computationally Constrained Reinforcement Learning

    Authors: Saurabh Kumar, Henrik Marklund, Ashish Rao, Yifan Zhu, Hong Jun Jeon, Yueyang Liu, Benjamin Van Roy

    Abstract: An agent that efficiently accumulates knowledge to develop increasingly sophisticated skills over a long lifetime could advance the frontier of artificial intelligence capabilities. The design of such agents, which remains a long-standing challenge of artificial intelligence, is addressed by the subject of continual learning. This monograph clarifies and formalizes concepts of continual learning,… ▽ More

    Submitted 20 August, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  43. arXiv:2307.03200  [pdf, other

    cs.CY cs.AI cs.MM

    Transcribing Educational Videos Using Whisper: A preliminary study on using AI for transcribing educational videos

    Authors: Ashwin Rao

    Abstract: Videos are increasingly being used for e-learning, and transcripts are vital to enhance the learning experience. The costs and delays of generating transcripts can be alleviated by automatic speech recognition (ASR) systems. In this article, we quantify the transcripts generated by whisper for 25 educational videos and identify some open avenues of research when leveraging ASR for transcribing edu… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Third Conference on Deployable AI: https://openreview.net/group?id=RBCDSAI.iitm.ac.in/DAI/2023/Conference

  44. arXiv:2306.17162  [pdf, other

    cs.RO

    Can Machines Garden? Systematically Comparing the AlphaGarden vs. Professional Horticulturalists

    Authors: Simeon Adebola, Rishi Parikh, Mark Presten, Satvik Sharma, Shrey Aeron, Ananth Rao, Sandeep Mukherjee, Tomson Qu, Christina Wistrom, Eugen Solowjow, Ken Goldberg

    Abstract: The AlphaGarden is an automated testbed for indoor polyculture farming which combines a first-order plant simulator, a gantry robot, a seed planting algorithm, plant phenotyping and tracking algorithms, irrigation sensors and algorithms, and custom pruning tools and algorithms. In this paper, we systematically compare the performance of the AlphaGarden to professional horticulturalists on the staf… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: International Conference on Robotics and Automation(ICRA) 2023 Oral

  45. arXiv:2306.02848  [pdf, other

    cs.LG cs.CV q-fin.PM

    HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE

    Authors: Zikai Wei, Anyi Rao, Bo Dai, Dahua Lin

    Abstract: Factor model is a fundamental investment tool in quantitative investment, which can be empowered by deep learning to become more flexible and efficient in practical complicated investing situations. However, it is still an open question to build a factor model that can conduct stock prediction in an online and adaptive setting, where the model can adapt itself to match the current market regime id… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to IJCAI 2023

  46. arXiv:2305.18533  [pdf, other

    cs.SI cs.CY

    Pandemic Culture Wars: Partisan Differences in the Moral Language of COVID-19 Discussions

    Authors: Ashwin Rao, Siyi Guo, Sze-Yuh Nina Wang, Fred Morstatter, Kristina Lerman

    Abstract: Effective response to pandemics requires coordinated adoption of mitigation measures, like masking and quarantines, to curb a virus's spread. However, as the COVID-19 pandemic demonstrated, political divisions can hinder consensus on the appropriate response. To better understand these divisions, our study examines a vast collection of COVID-19-related tweets. We focus on five contentious issues:… ▽ More

    Submitted 17 October, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  47. arXiv:2305.17455  [pdf, other

    cs.CV cs.CL

    CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers

    Authors: Dachuan Shi, Chaofan Tao, Anyi Rao, Zhendong Yang, Chun Yuan, Jiaqi Wang

    Abstract: Recent vision-language models have achieved tremendous advances. However, their computational costs are also escalating dramatically, making model acceleration exceedingly critical. To pursue more efficient vision-language Transformers, this paper introduces Cross-Guided Ensemble of Tokens (CrossGET), a general acceleration framework for vision-language Transformers. This framework adaptively comb… ▽ More

    Submitted 13 June, 2024; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: ICML 2024. Code: https://github.com/sdc17/CrossGET

  48. arXiv:2305.14965  [pdf, other

    cs.CL

    Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks

    Authors: Abhinav Rao, Sachin Vashistha, Atharva Naik, Somak Aditya, Monojit Choudhury

    Abstract: Recent explorations with commercial Large Language Models (LLMs) have shown that non-expert users can jailbreak LLMs by simply manipulating their prompts; resulting in degenerate output behavior, privacy and security breaches, offensive outputs, and violations of content regulator policies. Limited studies have been conducted to formalize and analyze these attacks and their mitigations. We bridge… ▽ More

    Submitted 27 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted at LREC-COLING 2024 - The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation

  49. arXiv:2305.11867  [pdf, other

    cs.SI

    Socio-Linguistic Characteristics of Coordinated Inauthentic Accounts

    Authors: Keith Burghardt, Ashwin Rao, Siyi Guo, Zihao He, Georgios Chochlakis, Baruah Sabyasachee, Andrew Rojecki, Shri Narayanan, Kristina Lerman

    Abstract: Online manipulation is a pressing concern for democracies, but the actions and strategies of coordinated inauthentic accounts, which have been used to interfere in elections, are not well understood. We analyze a five million-tweet multilingual dataset related to the 2017 French presidential election, when a major information campaign led by Russia called "#MacronLeaks" took place. We utilize heur… ▽ More

    Submitted 30 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: 12 pages, 9 figures; figures updated

  50. arXiv:2305.05532  [pdf, other

    eess.SP cs.AI cs.LG stat.AP stat.ML

    An ensemble of convolution-based methods for fault detection using vibration signals

    Authors: Xian Yeow Lee, Aman Kumar, Lasitha Vidyaratne, Aniruddha Rajendra Rao, Ahmed Farahat, Chetan Gupta

    Abstract: This paper focuses on solving a fault detection problem using multivariate time series of vibration signals collected from planetary gearboxes in a test rig. Various traditional machine learning and deep learning methods have been proposed for multivariate time-series classification, including distance-based, functional data-oriented, feature-driven, and convolution kernel-based methods. Recent st… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 12 Pages, 9 Figures, 2 Tables. Accepted at ICPHM 2023

    Journal ref: 2023 IEEE International Conference on Prognostics and Health Management (ICPHM)