Skip to main content

Showing 1–50 of 133 results for author: Verma, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.06122  [pdf, other

    cs.AI cs.SE

    Case Study: Leveraging GenAI to Build AI-based Surrogates and Regressors for Modeling Radio Frequency Heating in Fusion Energy Science

    Authors: E. Wes Bethel, Vianna Cramer, Alexander del Rio, Lothar Narins, Chris Pestano, Satvik Verma, Erick Arias, Nicola Bertelli, Talita Perciano, Syun'ichi Shiraiwa, Álvaro Sánchez Villar, Greg Wallace, John C. Wright

    Abstract: This work presents a detailed case study on using Generative AI (GenAI) to develop AI surrogates for simulation models in fusion energy research. The scope includes the methodology, implementation, and results of using GenAI to assist in model development and optimization, comparing these results with previous manually developed models.

    Submitted 9 September, 2024; originally announced September 2024.

    Report number: LBNL-2001609

  2. arXiv:2408.12112  [pdf, other

    cs.LG cs.AI cs.MA

    Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards

    Authors: Shresth Verma, Niclas Boehmer, Lingkai Kong, Milind Tambe

    Abstract: LLMs are increasingly used to design reward functions based on human preferences in Reinforcement Learning (RL). We focus on LLM-designed rewards for Restless Multi-Armed Bandits, a framework for allocating limited resources among agents. In applications such as public health, this approach empowers grassroots health workers to tailor automated allocation decisions to community needs. In the prese… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  3. arXiv:2407.21783  [pdf, other

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang , et al. (510 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical… ▽ More

    Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  4. arXiv:2407.12131  [pdf, other

    cs.CY cs.AI cs.LG cs.MA

    Improving Health Information Access in the World's Largest Maternal Mobile Health Program via Bandit Algorithms

    Authors: Arshika Lalan, Shresth Verma, Paula Rodriguez Diaz, Panayiotis Danassis, Amrita Mahale, Kumar Madhu Sudan, Aparna Hegde, Milind Tambe, Aparna Taneja

    Abstract: Harnessing the wide-spread availability of cell phones, many nonprofits have launched mobile health (mHealth) programs to deliver information via voice or text to beneficiaries in underserved communities, with maternal and infant health being a key area of such mHealth programs. Unfortunately, dwindling listenership is a major challenge, requiring targeted interventions using limited resources. Th… ▽ More

    Submitted 14 May, 2024; originally announced July 2024.

    Comments: Published at Innovative Applications of Artificial Intelligence (IAAI 2024)

  5. arXiv:2407.08003  [pdf, other

    cs.LG cs.AI

    Machine Learning for ALSFRS-R Score Prediction: Making Sense of the Sensor Data

    Authors: Ritesh Mehta, Aleksandar Pramov, Shashank Verma

    Abstract: Amyotrophic Lateral Sclerosis (ALS) is characterized as a rapidly progressive neurodegenerative disease that presents individuals with limited treatment options in the realm of medical interventions and therapies. The disease showcases a diverse range of onset patterns and progression trajectories, emphasizing the critical importance of early detection of functional decline to enable tailored care… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Paper submitted to CLEF 2024 CEUR-WS

  6. arXiv:2407.05255  [pdf, other

    cs.CV

    Estimation of the Area and Precipitation Associated with a Tropical Cyclone Biparjoy by using Image Processing

    Authors: Shikha Verma, Kuldeep Srivastava, Akhilesh Tiwari, Shekhar Verma

    Abstract: The rainfall associated with Topical Cyclone(TC) contributes a major amount to the annual rainfall in India. Due to the limited research on the quantitative precipitation associated with Tropical Cyclones (TC), the prediction of the amount of precipitation and area that it may cover remains a challenge. This paper proposes an approach to estimate the accumulated precipitation and impact on affecte… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  7. arXiv:2406.15444  [pdf, other

    cs.CL

    Investigating the Robustness of LLMs on Math Word Problems

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra

    Abstract: Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experim… ▽ More

    Submitted 3 September, 2024; v1 submitted 30 May, 2024; originally announced June 2024.

  8. arXiv:2406.13439  [pdf, other

    cs.CL

    Finding Blind Spots in Evaluator LLMs with Interpretable Checklists

    Authors: Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M. Khapra

    Abstract: Large Language Models (LLMs) are increasingly relied upon to evaluate text outputs of other LLMs, thereby influencing leaderboards and development decisions. However, concerns persist over the accuracy of these assessments and the potential for misleading conclusions. In this work, we investigate the effectiveness of LLMs as evaluators for text generation tasks. We propose FBI, a novel framework d… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  9. arXiv:2406.11930  [pdf, other

    cs.SE cs.AI cs.CL

    A Critical Study of What Code-LLMs (Do Not) Learn

    Authors: Abhinav Anand, Shweta Verma, Krishna Narasimhan, Mira Mezini

    Abstract: Large Language Models trained on code corpora (code-LLMs) have demonstrated impressive performance in various coding assistance tasks. However, despite their increased size and training dataset, code-LLMs still have limitations such as suggesting codes with syntactic errors, variable misuse etc. Some studies argue that code-LLMs perform well on coding tasks because they use self-attention and hidd… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  10. arXiv:2405.16681  [pdf, other

    cs.CL

    Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization

    Authors: Amir Saeidi, Shivanshu Verma, Aswin RRV, Chitta Baral

    Abstract: Large Language Models (LLMs) perform well across diverse tasks, but aligning them with human demonstrations is challenging. Recently, Reinforcement Learning (RL)-free methods like Direct Preference Optimization (DPO) have emerged, offering improved stability and scalability while retaining competitive performance relative to RL-based methods. However, while RL-free methods deliver satisfactory per… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  11. arXiv:2405.12299  [pdf, other

    cs.LG cs.AI cs.CV

    Perturbing the Gradient for Alleviating Meta Overfitting

    Authors: Manas Gogoi, Sambhavi Tiwari, Shekhar Verma

    Abstract: The reason for Meta Overfitting can be attributed to two factors: Mutual Non-exclusivity and the Lack of diversity, consequent to which a single global function can fit the support set data of all the meta-training tasks and fail to generalize to new unseen tasks. This issue is evidenced by low error rates on the meta-training tasks, but high error rates on new tasks. However, there can be a numbe… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  12. arXiv:2404.14723  [pdf, other

    cs.CL

    Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks

    Authors: Amir Saeidi, Shivanshu Verma, Chitta Baral

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across a spectrum of tasks. Recently, Direct Preference Optimization (DPO) has emerged as an RL-free approach to optimize the policy model on human preferences. However, several limitations hinder the widespread adoption of this method. To address these shortcomings, various versions of DPO have been introduced. Yet, a comprehen… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  13. arXiv:2402.13135  [pdf

    cs.DC

    A Systematic Literature Review on Task Allocation and Performance Management Techniques in Cloud Data Center

    Authors: Nidhika Chauhan, Navneet Kaur, Kamaljit Singh Saini, Sahil Verma, Abdulatif Alabdulatif, Ruba Abu Khurma, Maribel Garcia-Arenas, Pedro A. Castillo

    Abstract: As cloud computing usage grows, cloud data centers play an increasingly important role. To maximize resource utilization, ensure service quality, and enhance system performance, it is crucial to allocate tasks and manage performance effectively. The purpose of this study is to provide an extensive analysis of task allocation and performance management techniques employed in cloud data centers. The… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  14. arXiv:2402.07896  [pdf, other

    cs.CL

    Suppressing Pink Elephants with Direct Principle Feedback

    Authors: Louis Castricato, Nathan Lile, Suraj Anand, Hailey Schoelkopf, Siddharth Verma, Stella Biderman

    Abstract: Existing methods for controlling language models, such as RLHF and Constitutional AI, involve determining which LLM behaviors are desirable and training them into a language model. However, in many cases, it is desirable for LLMs to be controllable at inference time, so that they can be used in multiple contexts with diverse needs. We illustrate this with the Pink Elephant Problem: instructing an… ▽ More

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 8 pages, 6 figures

  15. arXiv:2401.00388  [pdf, other

    cs.CL

    FusionMind -- Improving question and answering with external context fusion

    Authors: Shreyas Verma, Manoj Parmar, Palash Choudhary, Sanchita Porwal

    Abstract: Answering questions using pre-trained language models (LMs) and knowledge graphs (KGs) presents challenges in identifying relevant knowledge and performing joint reasoning.We compared LMs (fine-tuned for the task) with the previously published QAGNN method for the Question-answering (QA) objective and further measured the impact of additional factual context on the QAGNN performance. The QAGNN met… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 5 pages, 4 figures, 4 tables

  16. arXiv:2312.15576  [pdf, other

    cs.CL

    Reducing LLM Hallucinations using Epistemic Neural Networks

    Authors: Shreyas Verma, Kien Tran, Yusuf Ali, Guangyu Min

    Abstract: Reducing and detecting hallucinations in large language models is an open research problem. In this project, we attempt to leverage recent advances in the field of uncertainty estimation to reduce hallucinations in frozen large language models. Epistemic neural networks have recently been proposed to improve output joint distributions for large pre-trained models. ENNs are small networks attached… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 12 pages,9 figures, 4 tables

  17. arXiv:2311.14948  [pdf, other

    cs.LG cs.AI cs.CV

    Effective Backdoor Mitigation Depends on the Pre-training Objective

    Authors: Sahil Verma, Gantavya Bhatt, Avi Schwarzschild, Soumye Singhal, Arnav Mohanty Das, Chirag Shah, John P Dickerson, Jeff Bilmes

    Abstract: Despite the advanced capabilities of contemporary machine learning (ML) models, they remain vulnerable to adversarial and backdoor attacks. This vulnerability is particularly concerning in real-world deployments, where compromised models may exhibit unpredictable behavior in critical scenarios. Such risks are heightened by the prevalent practice of collecting massive, internet-sourced datasets for… ▽ More

    Submitted 5 December, 2023; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted for oral presentation at BUGS workshop @ NeurIPS 2023 (https://neurips2023-bugs.github.io/)

  18. arXiv:2311.11694  [pdf, other

    cs.LG stat.ML

    Unveiling the Power of Self-Attention for Shipping Cost Prediction: The Rate Card Transformer

    Authors: P Aditya Sreekar, Sahil Verma, Varun Madhavan, Abhishek Persad

    Abstract: Amazon ships billions of packages to its customers annually within the United States. Shipping cost of these packages are used on the day of shipping (day 0) to estimate profitability of sales. Downstream systems utilize these days 0 profitability estimates to make financial decisions, such as pricing strategies and delisting loss-making products. However, obtaining accurate shipping cost estimate… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  19. arXiv:2311.10648  [pdf, other

    cs.CV cs.LG

    Self-trained Panoptic Segmentation

    Authors: Shourya Verma

    Abstract: Panoptic segmentation is an important computer vision task which combines semantic and instance segmentation. It plays a crucial role in domains of medical image analysis, self-driving vehicles, and robotics by providing a comprehensive understanding of visual environments. Traditionally, deep learning panoptic segmentation models have relied on dense and accurately annotated training data, which… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  20. arXiv:2311.07139  [pdf, other

    cs.LG cs.AI cs.MA

    Analyzing and Predicting Low-Listenership Trends in a Large-Scale Mobile Health Program: A Preliminary Investigation

    Authors: Arshika Lalan, Shresth Verma, Kumar Madhu Sudan, Amrita Mahale, Aparna Hegde, Milind Tambe, Aparna Taneja

    Abstract: Mobile health programs are becoming an increasingly popular medium for dissemination of health information among beneficiaries in less privileged communities. Kilkari is one of the world's largest mobile health programs which delivers time sensitive audio-messages to pregnant women and new mothers. We have been collaborating with ARMMAN, a non-profit in India which operates the Kilkari program, to… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to Data Science for Social Good Workshop, KDD 2023

  21. arXiv:2310.17876  [pdf, other

    cs.CL

    TarGEN: Targeted Data Generation with Large Language Models

    Authors: Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra

    Abstract: The rapid advancement of large language models (LLMs) has sparked interest in data synthesis techniques, aiming to generate diverse and high-quality synthetic datasets. However, these synthetic datasets often suffer from a lack of diversity and added noise. In this paper, we present TarGEN, a multi-step prompting strategy for generating high-quality synthetic datasets utilizing a LLM. An advantage… ▽ More

    Submitted 8 August, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: COLM 2024, 35 pages

  22. arXiv:2310.01794  [pdf, other

    cs.LG

    GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking

    Authors: Mert Kosan, Samidha Verma, Burouj Armgaan, Khushbu Pahwa, Ambuj Singh, Sourav Medya, Sayan Ranu

    Abstract: Numerous explainability methods have been proposed to shed light on the inner workings of GNNs. Despite the inclusion of empirical evaluations in all the proposed algorithms, the interrogative aspects of these evaluations lack diversity. As a result, various facets of explainability pertaining to GNNs, such as a comparative analysis of counterfactual reasoners, their stability to variational facto… ▽ More

    Submitted 14 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR 2024

  23. arXiv:2309.00864  [pdf, other

    cs.LG cs.AI cs.CV cs.DC

    Equitable-FL: Federated Learning with Sparsity for Resource-Constrained Environment

    Authors: Indrajeet Kumar Sinha, Shekhar Verma, Krishna Pratap Singh

    Abstract: In Federated Learning, model training is performed across multiple computing devices, where only parameters are shared with a common central server without exchanging their data instances. This strategy assumes abundance of resources on individual clients and utilizes these resources to build a richer model as user's models. However, when the assumption of the abundance of resources is violated, l… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: 12 pages, 7 figures

  24. arXiv:2308.14916  [pdf, other

    cs.IR cs.AI cs.LG

    RecRec: Algorithmic Recourse for Recommender Systems

    Authors: Sahil Verma, Ashudeep Singh, Varich Boonsanong, John P. Dickerson, Chirag Shah

    Abstract: Recommender systems play an essential role in the choices people make in domains such as entertainment, shopping, food, news, employment, and education. The machine learning models underlying these recommender systems are often enormously large and black-box in nature for users, content providers, and system developers alike. It is often crucial for all stakeholders to understand the model's ratio… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted as a short paper at CIKM 2023

  25. arXiv:2308.13970  [pdf, other

    cs.LG cs.AI cs.CV

    FAM: fast adaptive federated meta-learning

    Authors: Indrajeet Kumar Sinha, Shekhar Verma, Krishna Pratap Singh

    Abstract: In this work, we propose a fast adaptive federated meta-learning (FAM) framework for collaboratively learning a single global model, which can then be personalized locally on individual clients. Federated learning enables multiple clients to collaborate to train a model without sharing data. Clients with insufficient data or data diversity participate in federated learning to learn a model with su… ▽ More

    Submitted 1 September, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: 13 Pages, 1 figure

  26. arXiv:2308.12890  [pdf, other

    cs.CL cs.AI

    Large Language Models Vote: Prompting for Rare Disease Identification

    Authors: David Oniani, Jordan Hilsman, Hang Dong, Fengyi Gao, Shiven Verma, Yanshan Wang

    Abstract: The emergence of generative Large Language Models (LLMs) emphasizes the need for accurate and efficient prompting approaches. LLMs are often applied in Few-Shot Learning (FSL) contexts, where tasks are executed with minimal training data. FSL has become popular in many Artificial Intelligence (AI) subdomains, including AI for health. Rare diseases affect a small fraction of the population. Rare di… ▽ More

    Submitted 23 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  27. arXiv:2308.10995  [pdf, ps, other

    physics.ao-ph cs.LG

    Deep Learning Techniques in Extreme Weather Events: A Review

    Authors: Shikha Verma, Kuldeep Srivastava, Akhilesh Tiwari, Shekhar Verma

    Abstract: Extreme weather events pose significant challenges, thereby demanding techniques for accurate analysis and precise forecasting to mitigate its impact. In recent years, deep learning techniques have emerged as a promising approach for weather forecasting and understanding the dynamics of extreme weather events. This review aims to provide a comprehensive overview of the state-of-the-art deep learni… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

  28. arXiv:2307.02357  [pdf

    cs.DB cs.AI cs.DC cs.LG

    Decentralized Data Governance as Part of a Data Mesh Platform: Concepts and Approaches

    Authors: Arif Wider, Sumedha Verma, Atif Akhtar

    Abstract: Data mesh is a socio-technical approach to decentralized analytics data management. To manage this decentralization efficiently, data mesh relies on automation provided by a self-service data infrastructure platform. A key aspect of this platform is to enable decentralized data governance. Because data mesh is a young approach, there is a lack of coherence in how data mesh concepts are interpreted… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 30th International Conference on Web Services, July 2-8, 2023, Chicago, USA

  29. arXiv:2306.16503  [pdf, other

    cs.LG cs.AI

    SARC: Soft Actor Retrospective Critic

    Authors: Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence.… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at RLDM 2022

  30. arXiv:2306.04835  [pdf, other

    cs.LG cs.AI cs.SI

    Empowering Counterfactual Reasoning over Graph Neural Networks through Inductivity

    Authors: Samidha Verma, Burouj Armgaan, Sourav Medya, Sayan Ranu

    Abstract: Graph neural networks (GNNs) have various practical applications, such as drug discovery, recommendation engines, and chip design. However, GNNs lack transparency as they cannot provide understandable explanations for their predictions. To address this issue, counterfactual reasoning is used. The main goal is to make minimal changes to the input graph of a GNN in order to alter its prediction. Whi… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  31. A Survey on Multi-AP Coordination Approaches over Emerging WLANs: Future Directions and Open Challenges

    Authors: Shikhar Verma, Tiago Koketsu Rodrigues, Yuichi Kawamoto, Mostafa M. Fouda, Nei Kato

    Abstract: Recent advancements in wireless local area network (WLAN) technology include IEEE 802.11be and 802.11ay, often known as Wi-Fi 7 and WiGig, respectively. The goal of these developments is to provide Extremely High Throughput (EHT) and low latency to meet the demands of future applications like as 8K videos, augmented and virtual reality, the Internet of Things, telesurgery, and other developing tec… ▽ More

    Submitted 19 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: The reason for the replacement of the previous version of the paper is due to a change in the author's list. As a result, a new version has been created, which serves as the final draft version before acceptance. This updated version contains all the latest changes and improvements made to the paper

  32. arXiv:2305.12640  [pdf, other

    cs.AI cs.LG stat.ML

    Limited Resource Allocation in a Non-Markovian World: The Case of Maternal and Child Healthcare

    Authors: Panayiotis Danassis, Shresth Verma, Jackson A. Killian, Aparna Taneja, Milind Tambe

    Abstract: The success of many healthcare programs depends on participants' adherence. We consider the problem of scheduling interventions in low resource settings (e.g., placing timely support calls from health workers) to increase adherence and/or engagement. Past works have successfully developed several classes of Restless Multi-armed Bandit (RMAB) based solutions for this problem. Nevertheless, all past… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

  33. OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models

    Authors: Badr AlKhamissi, Siddharth Verma, Ping Yu, Zhijing Jin, Asli Celikyilmaz, Mona Diab

    Abstract: In this paper, we conduct a thorough investigation into the reasoning capabilities of Large Language Models (LLMs), focusing specifically on the Open Pretrained Transformers (OPT) models as a representative of such models. Our study entails finetuning three different sizes of OPT on a carefully curated reasoning corpus, resulting in two sets of finetuned models: OPT-R, finetuned without explanatio… ▽ More

    Submitted 24 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 1st Workshop on Natural Language Reasoning and Structured Explanations (NLRSE) at ACL 2023

  34. arXiv:2304.03233  [pdf, other

    cs.DS cs.DM

    Parameterized algorithms for Eccentricity Shortest Path Problem

    Authors: Sriram Bhyravarapu, Satyabrata Jana, Lawqueen Kanesh, Saket Saurabh, Shaily Verma

    Abstract: Given an undirected graph $G=(V,E)$ and an integer $\ell$, the Eccentricity Shortest Path (ESP) asks to find a shortest path $P$ such that for every vertex $v\in V(G)$, there is a vertex $w\in P$ such that $d_G(v,w)\leq \ell$, where $d_G(v,w)$ represents the distance between $v$ and $w$ in $G$. Dragan and Leitert [Theor. Comput. Sci. 2017] showed that the optimization version of this problem, whic… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: A preliminary version of this paper was accepted in IWOCA 2023

  35. arXiv:2304.02862  [pdf, other

    cs.LG

    Learning to Learn with Indispensable Connections

    Authors: Sambhavi Tiwari, Manas Gogoi, Shekhar Verma, Krishna Pratap Singh

    Abstract: Meta-learning aims to solve unseen tasks with few labelled instances. Nevertheless, despite its effectiveness for quick learning in existing optimization-based methods, it has several flaws. Inconsequential connections are frequently seen during meta-training, which results in an over-parameterized neural network. Because of this, meta-testing observes unnecessary computations and extra memory ove… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  36. arXiv:2303.15731  [pdf, other

    cs.NI

    Smart Handover with Predicted User Behavior using Convolutional Neural Networks for WiGig Systems

    Authors: Tiago Koketsu Rodrigues, Shikhar Verma, Yuichi Kawamoto, Nei Kato, Mostafa M. Fouda, Muhammad Ismail

    Abstract: WiGig networks and 60 GHz frequency communications have a lot of potential for commercial and personal use. They can offer extremely high transmission rates but at the cost of low range and penetration. Due to these issues, WiGig systems are unstable and need to rely on frequent handovers to maintain high-quality connections. However, this solution is problematic as it forces users into bad connec… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Submitted to IEEE Network

  37. Addressing DAO Insider Attacks in IPv6-Based Low-Power and Lossy Networks

    Authors: Sachin Kumar Verma, Abhishek Verma, Avinash Chandra Pandey

    Abstract: Low-Power and Lossy Networks (LLNs) run on resource-constrained devices and play a key role in many Industrial Internet of Things and Cyber-Physical Systems based applications. But, achieving an energy-efficient routing in LLNs is a major challenge nowadays. This challenge is addressed by Routing Protocol for Low-power Lossy Networks (RPL), which is specified in RFC 6550 as a "Proposed Standard" a… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Journal ref: In 2022 IEEE Region 10 Symposium (TENSYMP) (pp. 1-6). IEEE (July, 2022)

  38. arXiv:2302.10287  [pdf, other

    cs.CV

    CertViT: Certified Robustness of Pre-Trained Vision Transformers

    Authors: Kavya Gupta, Sagar Verma

    Abstract: Lipschitz bounded neural networks are certifiably robust and have a good trade-off between clean and certified accuracy. Existing Lipschitz bounding methods train from scratch and are limited to moderately sized networks (< 6M parameters). They require a fair amount of hyper-parameter tuning and are computationally prohibitive for large networks like Vision Transformers (5M to 660M parameters). Ob… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Preprint work. 13 pages, 3 figures, https://github.com/sagarverma/transformer-lipschitz

  39. arXiv:2302.07317  [pdf, other

    cs.LG cs.AI cs.CV

    Algorithm Selection for Deep Active Learning with Imbalanced Datasets

    Authors: Jifan Zhang, Shuai Shao, Saurabh Verma, Robert Nowak

    Abstract: Label efficiency has become an increasingly important objective in deep learning applications. Active learning aims to reduce the number of labeled examples needed to train deep networks, but the empirical performance of active learning algorithms can vary dramatically across datasets and applications. It is difficult to know in advance which active learning strategy will perform well or best in a… ▽ More

    Submitted 2 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  40. arXiv:2301.07835  [pdf, other

    cs.AI

    Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits

    Authors: Paritosh Verma, Shresth Verma, Aditya Mate, Aparna Taneja, Milind Tambe

    Abstract: Restless multi-arm bandits (RMABs) is a popular decision-theoretic framework that has been used to model real-world sequential decision making problems in public health, wildlife conservation, communication systems, and beyond. Deployed RMAB systems typically operate in two stages: the first predicts the unknown parameters defining the RMAB instance, and the second employs an optimization algorith… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: 11 pages, 3 figures, AI for Social Good Workshop (AAAI'23)

  41. arXiv:2301.07496  [pdf

    cs.LG

    Machine learning techniques for the Schizophrenia diagnosis: A comprehensive review and future research directions

    Authors: Shradha Verma, Tripti Goel, M Tanveer, Weiping Ding, Rahul Sharma, R Murugan

    Abstract: Schizophrenia (SCZ) is a brain disorder where different people experience different symptoms, such as hallucination, delusion, flat-talk, disorganized thinking, etc. In the long term, this can cause severe effects and diminish life expectancy by more than ten years. Therefore, early and accurate diagnosis of SCZ is prevalent, and modalities like structural magnetic resonance imaging (sMRI), functi… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

  42. arXiv:2212.08286  [pdf, other

    cs.CL

    ALERT: Adapting Language Models to Reasoning Tasks

    Authors: Ping Yu, Tianlu Wang, Olga Golovneva, Badr AlKhamissi, Siddharth Verma, Zhijing Jin, Gargi Ghosh, Mona Diab, Asli Celikyilmaz

    Abstract: Current large language models can perform reasonably well on complex tasks that require step-by-step reasoning with few-shot learning. Are these models applying reasoning skills they have learnt during pre-training and reason outside of their training context, or are they simply memorizing their training corpus at finer granularity and have learnt to better understand their context? To tease apart… ▽ More

    Submitted 7 July, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  43. arXiv:2212.05195  [pdf, other

    cs.LG

    Uniform Masking Prevails in Vision-Language Pretraining

    Authors: Siddharth Verma, Yuchen Lu, Rui Hou, Hanchao Yu, Nicolas Ballas, Madian Khabsa, Amjad Almahairi

    Abstract: Masked Language Modeling (MLM) has proven to be an essential component of Vision-Language (VL) pretraining. To implement MLM, the researcher must make two design choices: the masking strategy, which determines which tokens to mask, and the masking rate, which determines how many tokens to mask. Previous work has focused primarily on the masking strategy while setting the masking rate at a default… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

  44. arXiv:2212.03980  [pdf, other

    cs.HC cs.AI cs.LG

    DDoD: Dual Denial of Decision Attacks on Human-AI Teams

    Authors: Benjamin Tag, Niels van Berkel, Sunny Verma, Benjamin Zi Hao Zhao, Shlomo Berkovsky, Dali Kaafar, Vassilis Kostakos, Olga Ohrimenko

    Abstract: Artificial Intelligence (AI) systems have been increasingly used to make decision-making processes faster, more accurate, and more efficient. However, such systems are also at constant risk of being attacked. While the majority of attacks targeting AI-based applications aim to manipulate classifiers or training data and alter the output of an AI model, recently proposed Sponge Attacks against AI m… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 10 pages, 1 figure, IEEE Pervasive Computing, IEEE Special Issue on Human-Centered AI

  45. arXiv:2212.02794  [pdf

    cs.CV

    Hybrid Model using Feature Extraction and Non-linear SVM for Brain Tumor Classification

    Authors: Lalita Mishra, Shekhar Verma, Shirshu Varma

    Abstract: It is essential to classify brain tumors from magnetic resonance imaging (MRI) accurately for better and timely treatment of the patients. In this paper, we propose a hybrid model, using VGG along with Nonlinear-SVM (Soft and Hard) to classify the brain tumors: glioma and pituitary and tumorous and non-tumorous. The VGG-SVM model is trained for two different datasets of two classes; thus, we perfo… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  46. arXiv:2212.01745  [pdf, other

    cs.RO

    Design of an All-Purpose Terrace Farming Robot

    Authors: Vibhakar Mohta, Adarsh Patnaik, Shivam Kumar Panda, Siva Vignesh Krishnan, Abhinav Gupta, Abhay Shukla, Gauri Wadhwa, Shrey Verma, Aditya Bandopadhyay

    Abstract: Automation in farming processes is a growing field of research in both academia and industries. A considerable amount of work has been put into this field to develop systems robust enough for farming. Terrace farming, in particular, provides a varying set of challenges, including robust stair climbing methods and stable navigation in unstructured terrains. We propose the design of a novel autonomo… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  47. arXiv:2211.14935  [pdf, other

    cs.IR cs.AI cs.CY cs.LG

    RecXplainer: Amortized Attribute-based Personalized Explanations for Recommender Systems

    Authors: Sahil Verma, Chirag Shah, John P. Dickerson, Anurag Beniwal, Narayanan Sadagopan, Arjun Seshadri

    Abstract: Recommender systems influence many of our interactions in the digital world -- impacting how we shop for clothes, sorting what we see when browsing YouTube or TikTok, and determining which restaurants and hotels we are shown when using hospitality platforms. Modern recommender systems are large, opaque models trained on a mixture of proprietary and open-source datasets. Naturally, issues of trust… ▽ More

    Submitted 29 August, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Awarded the Best Student Paper at TEA Workshop at NeurIPS 2022

  48. arXiv:2211.13508  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

    Authors: Benjamin Kiefer, Matej Kristan, Janez Perš, Lojze Žust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Höfer, Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-ik Jeon, Impyeong Lee, Luca Zedda , et al. (48 additional authors not shown)

    Abstract: The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicle (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detec… ▽ More

    Submitted 28 November, 2022; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: MaCVi 2023 was part of WACV 2023. This report (38 pages) discusses the competition as part of MaCVi

  49. arXiv:2211.12479  [pdf, other

    cs.CV cs.LG

    Adaptive Prototypical Networks

    Authors: Manas Gogoi, Sambhavi Tiwari, Shekhar Verma

    Abstract: Prototypical network for Few shot learning tries to learn an embedding function in the encoder that embeds images with similar features close to one another in the embedding space. However, in this process, the support set samples for a task are embedded independently of one other, and hence, the inter-class closeness is not taken into account. Thus, in the presence of similar-looking classes in a… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  50. arXiv:2211.04764  [pdf

    q-bio.NC cs.AI

    Quantitative Susceptibility Mapping in Cognitive Decline: A Review of Technical Aspects and Applications

    Authors: Shradha Verma, Tripti Goel, M Tanveer

    Abstract: In the human brain, essential iron molecules for proper neurological functioning exist in transferrin (tf) and ferritin (Fe3) forms. However, its unusual increment manifests iron overload, which reacts with hydrogen peroxide. This reaction will generate hydroxyl radicals, and irons higher oxidation states. Further, this reaction causes tissue damage or cognitive decline in the brain and also leads… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.