Showing 1–50 of 738 results for author: Ghosh, S

  1. arXiv:2409.07256  [pdf, other]

    cs.CV

    MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing

    Authors: Shreya Ghosh, Zhixi Cai, Abhinav Dhall, Dimitrios Kollias, Roland Goecke, Tom Gedeon

    Abstract: With the rapid advancements in multimodal generative technology, Affective Computing research has provoked discussion about the potential consequences of AI systems equipped with emotional intelligence. Affective Computing involves the design, evaluation, and implementation of Emotion AI and related technologies aimed at improving people's lives. Designing a computational model in affective comput… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: ACM MM Workshop 2024. Workshop webpage: https://react-ws.github.io/2024/

  2. arXiv:2409.06991  [pdf, other]

    cs.CV

    1M-Deepfakes Detection Challenge

    Authors: Zhixi Cai, Abhinav Dhall, Shreya Ghosh, Munawar Hayat, Dimitrios Kollias, Kalin Stefanov, Usman Tariq

    Abstract: The detection and localization of deepfake content, particularly when small fake segments are seamlessly mixed with real videos, remains a significant challenge in the field of digital media security. Based on the recently released AV-Deepfake1M dataset, which contains more than 1 million manipulated videos across more than 2,000 subjects, we introduce the 1M-Deepfakes Detection Challenge. This ch… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: ACM MM 2024. Challenge webpage: https://deepfakes1m.github.io/

  3. arXiv:2409.06821  [pdf, other]

    cs.CV

    Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts

    Authors: Assefa Seyoum Wahd, Banafshe Felfeliyan, Yuyue Zhou, Shrimanti Ghosh, Adam McArthur, Jiechen Zhang, Jacob L. Jaremko, Abhilash Hareendranathan

    Abstract: Foundation models like the segment anything model require high-quality manual prompts for medical image segmentation, which is time-consuming and requires expertise. SAM and its variants often fail to segment structures in ultrasound (US) images due to domain shift. We propose Sam2Rad, a prompt learning approach to adapt SAM and its variants for US bone segmentation without human prompts. It int… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

  4. arXiv:2409.06224  [pdf, other]

    cs.CV cs.LG cs.MM

    MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding

    Authors: Surbhi Madan, Shreya Ghosh, Lownish Rai Sookha, M. A. Ganaie, Ramanathan Subramanian, Abhinav Dhall, Tom Gedeon

    Abstract: Estimating the Most Important Person (MIP) in any social event setup is a challenging problem mainly due to contextual complexity and scarcity of labeled data. Moreover, the causality aspects of MIP estimation are quite subjective and diverse. To this end, we aim to address the problem by annotating a large-scale 'in-the-wild' dataset for identifying human perceptions about the 'Most Important Per… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: Accepted for publication at WACV 2025

  5. arXiv:2409.03916  [pdf, other]

    cs.SI cs.LG

    A Survey on Signed Graph Embedding: Methods and Applications

    Authors: Shrabani Ghosh

    Abstract: A signed graph (SG) is a graph where edges carry sign information attached to it. The sign of a network can be positive, negative, or neutral. A signed network is ubiquitous in a real-world network like social networks, citation networks, and various technical networks. There are many network embedding models have been proposed and developed for signed networks for both homogeneous and heterogeneo… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  6. arXiv:2409.01628  [pdf, other]

    cs.LG cs.CL

    CTG-KrEW: Generating Synthetic Structured Contextually Correlated Content by Conditional Tabular GAN with K-Means Clustering and Efficient Word Embedding

    Authors: Riya Samanta, Bidyut Saha, Soumya K. Ghosh, Sajal K. Das

    Abstract: Conditional Tabular Generative Adversarial Networks (CTGAN) and their various derivatives are attractive for their ability to efficiently and flexibly create synthetic tabular data, showcasing strong performance and adaptability. However, there are certain critical limitations to such models. The first is their inability to preserve the semantic integrity of contextually correlated words or phrase… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  7. arXiv:2409.01484  [pdf, other]

    cs.CR

    Watermarking of Quantum Circuits

    Authors: Rupshali Roy, Swaroop Ghosh

    Abstract: Quantum circuits constitute Intellectual Property (IP) of the quantum developers and users, which needs to be protected from theft by adversarial agents, e.g., the quantum cloud provider or a rogue adversary present in the cloud. This necessitates the exploration of low-overhead techniques applicable to near-term quantum devices, to trace the quantum circuits/algorithms' IP and th… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  8. arXiv:2409.00093  [pdf, other]

    eess.SP cs.LG

    Towards Sustainable Personalized On-Device Human Activity Recognition with TinyML and Cloud-Enabled Auto Deployment

    Authors: Bidyut Saha, Riya Samanta, Soumya K Ghosh, Ram Babu Roy

    Abstract: Human activity recognition (HAR) holds immense potential for transforming health and fitness monitoring, yet challenges persist in achieving personalized outcomes and sustainability for on-device continuous inferences. This work introduces a wrist-worn smart band designed to address these challenges through a novel combination of on-device TinyML-driven computing and cloud-enabled auto-deployment.… ▽ More

    Submitted 26 August, 2024; originally announced September 2024.

  9. arXiv:2409.00081  [pdf, other]

    cs.DL cs.LG cs.SI

    Examining Different Research Communities: Authorship Network

    Authors: Shrabani Ghosh

    Abstract: Google Scholar is one of the top search engines to access research articles across multiple disciplines for scholarly literature. Google scholar advance search option gives the privilege to extract articles based on phrases, publishers name, authors name, time duration etc. In this work, we collected Google Scholar data (2000-2021) for two different research domains in computer science: Data Minin… ▽ More

    Submitted 24 August, 2024; originally announced September 2024.

  10. arXiv:2409.00051  [pdf, other]

    cs.HC cs.CY

    OnDiscuss: An Epistemic Network Analysis Learning Analytics Visualization Tool for Evaluating Asynchronous Online Discussions

    Authors: Yanye Luther, Marcia Moraes, Sudipto Ghosh, James Folkestad

    Abstract: Asynchronous online discussions are common assignments in both hybrid and online courses to promote critical thinking and collaboration among students. However, the evaluation of these assignments can require considerable time and effort from instructors. We created OnDiscuss, a learning analytics visualization tool for instructors that utilizes text mining algorithms and Epistemic Network Analysi… ▽ More

    Submitted 19 August, 2024; originally announced September 2024.

    Comments: Accepted to the International Conference on Quantitative Ethnography 2024 in Philadelphia, Pennsylvania

  11. arXiv:2408.17128  [pdf, ps, other]

    cs.NI

    Time varying channel estimation for RIS assisted network with outdated CSI: Looking beyond coherence time

    Authors: Souvik Deb, Sasthi C. Ghosh

    Abstract: The channel estimation (CE) overhead for unstructured multipath-rich channels increases linearly with the number of reflective elements of reconfigurable intelligent surface (RIS). This results in a significant portion of the channel coherence time being spent on CE, reducing data communication time. Furthermore, due to the mobility of the user equipment (UE) and the time consumed during CE, the e… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  12. arXiv:2408.17027  [pdf, other]

    cs.CV

    ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images

    Authors: Xiaoshuai Zhang, Zhicheng Wang, Howard Zhou, Soham Ghosh, Danushen Gnanapragasam, Varun Jampani, Hao Su, Leonidas Guibas

    Abstract: To advance the state of the art in the creation of 3D foundation models, this paper introduces the ConDense framework for 3D pre-training utilizing existing pre-trained 2D networks and large-scale multi-view datasets. We propose a novel 2D-3D joint training scheme to extract co-embedded 2D and 3D features in an end-to-end pipeline, where 2D-3D feature consistency is enforced through a volume rende… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: ECCV 2024

  13. arXiv:2408.16929  [pdf, other]

    quant-ph cs.ET cs.LG

    AI-driven Reverse Engineering of QML Models

    Authors: Archisman Ghosh, Swaroop Ghosh

    Abstract: Quantum machine learning (QML) is a rapidly emerging area of research, driven by the capabilities of Noisy Intermediate-Scale Quantum (NISQ) devices. With the progress in the research of QML models, there is a rise in third-party quantum cloud services to cater to the increasing demand for resources. New security concerns surface, specifically regarding the protection of intellectual property (IP)… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 7 pages, 4 figures

  14. arXiv:2408.16535  [pdf, other]

    cs.LG

    TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification

    Authors: Bidyut Saha, Riya Samanta, Soumya K. Ghosh, Ram Babu Roy

    Abstract: In this work, we present TinyTNAS, a novel hardware-aware multi-objective Neural Architecture Search (NAS) tool specifically designed for TinyML time series classification. Unlike traditional NAS methods that rely on GPU capabilities, TinyTNAS operates efficiently on CPUs, making it accessible for a broader range of applications. Users can define constraints on RAM, FLASH, and MAC operations to di… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  15. arXiv:2408.15605  [pdf, other]

    cs.RO cs.CV eess.SP

    ES-PTAM: Event-based Stereo Parallel Tracking and Mapping

    Authors: Suman Ghosh, Valentina Cavinato, Guillermo Gallego

    Abstract: Visual Odometry (VO) and SLAM are fundamental components for spatial perception in mobile robots. Despite enormous progress in the field, current VO/SLAM systems are limited by their sensors' capability. Event cameras are novel visual sensors that offer advantages to overcome the limitations of standard cameras, enabling robots to expand their operating range to challenging scenarios, such as high… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 17 pages, 7 figures, 4 tables, https://github.com/tub-rip/ES-PTAM

    Journal ref: European Conference on Computer Vision (ECCV) Workshops, Milan, Italy 2024

  16. arXiv:2408.15076  [pdf, other]

    cs.LG cs.AI

    MiWaves Reinforcement Learning Algorithm

    Authors: Susobhan Ghosh, Yongyi Guo, Pei-Yao Hung, Lara Coughlin, Erin Bonar, Inbal Nahum-Shani, Maureen Walton, Susan Murphy

    Abstract: The escalating prevalence of cannabis use poses a significant public health challenge globally. In the U.S., cannabis use is more prevalent among emerging adults (EAs) (ages 18-25) than any other age group, with legalization in the multiple states contributing to a public perception that cannabis is less risky than in prior decades. To address this growing concern, we developed MiWaves, a reinforc… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.17739

  17. arXiv:2408.14113  [pdf, other]

    cs.CR

    Fast Low Level Disk Encryption Using FPGAs

    Authors: Debrup Chakraborty, Sebati Ghosh, Cuauhtemoc Mancillas-Lopez, Palash Sarkar

    Abstract: A fixed length tweakable enciphering scheme (TES) is the appropriate cryptographic functionality for low level disk encryption. Research on TES over the last two decades have led to a number of proposals many of which have already been implemented using FPGAs. This paper considers the FPGA implementations of two more recent and promising TESs, namely AEZ and FAST. The relevant architectures are de… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  18. arXiv:2408.13720  [pdf, other]

    cs.LG cs.CV

    A prototype-based model for set classification

    Authors: Mohammad Mohammadi, Sreejita Ghosh

    Abstract: Classification of sets of inputs (e.g., images and texts) is an active area of research within both computer vision (CV) and natural language processing (NLP). A common way to represent a set of vectors is to model them as linear subspaces. In this contribution, we present a prototype-based approach for learning on the manifold formed from such linear subspaces, the Grassmann manifold. Our propose… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

  19. arXiv:2408.12021  [pdf, other]

    cs.CR eess.SP

    R-STELLAR: A Resilient Synthesizable Signature Attenuation SCA Protection on AES-256 with built-in Attack-on-Countermeasure Detection

    Authors: Archisman Ghosh, Dong-Hyun Seo, Debayan Das, Santosh Ghosh, Shreyas Sen

    Abstract: Side channel attacks (SCAs) remain a significant threat to the security of cryptographic systems in modern embedded devices. Even mathematically secure cryptographic algorithms, when implemented in hardware, inadvertently leak information through physical side channel signatures such as power consumption, electromagnetic (EM) radiation, light emissions, and acoustic emanations. Exploiting these si… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Extended from CICC. Now under revision at Journal of Solid-State Circuits

  20. arXiv:2408.11510  [pdf, other]

    cs.ET

    Empowering Volunteer Crowdsourcing Services: A Serverless-assisted, Skill and Willingness Aware Task Assignment Approach for Amicable Volunteer Involvement

    Authors: Riya Samanta, Biswajeet Sethi, Soumya K Ghosh

    Abstract: Volunteer crowdsourcing (VCS) leverages citizen interaction to address challenges by utilizing individuals' knowledge and skills. Complex social tasks often require collaboration among volunteers with diverse skill sets, and their willingness to engage is crucial. Matching tasks with the most suitable volunteers remains a significant challenge. VCS platforms face unpredictable demands in terms of… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  21. arXiv:2408.11498  [pdf, other]

    cs.ET

    Sustainable Volunteer Engagement: Ensuring Potential Retention and Skill Diversity for Balanced Workforce Composition in Crowdsourcing Paradigm

    Authors: Riya Samanta, Soumya K Ghosh

    Abstract: Crowdsourcing (CS) faces the challenge of managing complex, skill-demanding tasks, which requires effective task assignment and retention strategies to sustain a balanced workforce. This challenge has become more significant in Volunteer Crowdsourcing Services (VCS). This study introduces Workforce Composition Balance (WCB), a novel framework designed to maintain workforce diversity in VCS by dyna… ▽ More

    Submitted 31 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

  22. arXiv:2408.09562  [pdf, other]

    quant-ph cs.CR cs.LG

    Security Concerns in Quantum Machine Learning as a Service

    Authors: Satwik Kundu, Swaroop Ghosh

    Abstract: Quantum machine learning (QML) is a category of algorithms that employ variational quantum circuits (VQCs) to tackle machine learning tasks. Recent discoveries have shown that QML models can effectively generalize from limited training data samples. This capability has sparked increased interest in deploying these models to address practical, real-world challenges, resulting in the emergence of Qu… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: 9 pages, 3 figures

  23. arXiv:2408.07832  [pdf, other]

    cs.CL cs.CV

    LADDER: Language Driven Slice Discovery and Error Rectification

    Authors: Shantanu Ghosh, Rayan Syed, Chenyu Wang, Clare B. Poynton, Kayhan Batmanghelich

    Abstract: Error slice discovery associates structured patterns with model errors. Existing methods discover error slices by clustering the error-prone samples with similar patterns or assigning discrete attributes to each sample for post-hoc analysis. While these methods aim for interpretability and easier mitigation through reweighting or rebalancing, they may not capture the full complexity of error patte… ▽ More

    Submitted 4 September, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  24. arXiv:2408.06002  [pdf, other]

    cs.RO

    Generative Design of Multimodal Soft Pneumatic Actuators

    Authors: Saswath Ghosh, Sitikantha Roy

    Abstract: The recent advancements in machine learning techniques have steered us towards the data-driven design of products. Motivated by this objective, the present study proposes an automated design methodology that employs data-driven methods to generate new designs of soft actuators. One of the bottlenecks in the data-driven automated design process is having publicly available data to train the model.… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  25. arXiv:2408.01996  [pdf, other]

    cs.ET eess.SY

    Configuring Safe Spiking Neural Controllers for Cyber-Physical Systems through Formal Verification

    Authors: Arkaprava Gupta, Sumana Ghosh, Ansuman Banerjee, Swarup Kumar Mohalik

    Abstract: Spiking Neural Networks (SNNs) are a subclass of neuromorphic models that have great potential to be used as controllers in Cyber-Physical Systems (CPSs) due to their energy efficiency. They can benefit from the prevalent approach of first training an Artificial Neural Network (ANN) and then translating to an SNN with subsequent hyperparameter tuning. The tuning is required to ensure that the resu… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: This is the complete version of a paper with the same title that appeared at MEMOCODE 2024

  26. arXiv:2408.01594  [pdf, other]

    cs.CY cs.SI

    "I don't see myself represented here at all": User Experiences of Stable Diffusion Outputs Containing Representational Harms across Gender Identities and Nationalities

    Authors: Sourojit Ghosh, Nina Lutz, Aylin Caliskan

    Abstract: Though research into text-to-image generators (T2Is) such as Stable Diffusion has demonstrated their amplification of societal biases and potentials to cause harm, such research has primarily relied on computational methods instead of seeking information from real users who experience harm, which is a significant knowledge gap. In this paper, we conduct the largest human subjects study of Stable D… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: Upcoming Publication, AIES 2024

  27. arXiv:2408.01590  [pdf, other]

    cs.CY

    Interpretations, Representations, and Stereotypes of Caste within Text-to-Image Generators

    Authors: Sourojit Ghosh

    Abstract: The surge in the popularity of text-to-image generators (T2Is) has been matched by extensive research into ensuring fairness and equitable outcomes, with a focus on how they impact society. However, such work has typically focused on globally-experienced identities or centered Western contexts. In this paper, we address interpretations, representations, and stereotypes surrounding a tragically und… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: Upcoming Publication, AIES 2024

  28. arXiv:2407.19561  [pdf, ps, other]

    quant-ph cs.CC

    Anti-Concentration for the Unitary Haar Measure and Applications to Random Quantum Circuits

    Authors: Bill Fefferman, Soumik Ghosh, Wei Zhan

    Abstract: We prove a Carbery-Wright style anti-concentration inequality for the unitary Haar measure, by showing that the probability of a polynomial in the entries of a random unitary falling into an $\varepsilon$ range is at most a polynomial in $\varepsilon$. Using it, we show that the scrambling speed of a random quantum circuit is lower bounded: Namely, every input qubit has an influence that is at lea… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 31 pages

  29. arXiv:2407.19099  [pdf, other]

    cs.CY cs.CE cs.IR

    Sponsored is the New Organic: Implications of Sponsored Results on Quality of Search Results in the Amazon Marketplace

    Authors: Abhisek Dash, Saptarshi Ghosh, Animesh Mukherjee, Abhijnan Chakraborty, Krishna P. Gummadi

    Abstract: Interleaving sponsored results (advertisements) amongst organic results on search engine result pages (SERP) has become a common practice across multiple digital platforms. Advertisements have catered to consumer satisfaction and fostered competition in digital public spaces; making them an appealing gateway for businesses to reach their consumers. However, especially in the context of digital mar… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: This work has been accepted as a full paper in AAAI/ACM conference on Artificial Intelligence, Ethics and Society (AIES) 2024

  30. arXiv:2407.16661  [pdf, ps, other]

    math.NA cs.CC cs.DM

    Regenerative Ulam-von Neumann Algorithm: An Innovative Markov chain Monte Carlo Method for Matrix Inversion

    Authors: Soumyadip Ghosh, Lior Horesh, Vassilis Kalantzis, Yingdong Lu, Tomasz Nowicki

    Abstract: This paper presents an extension of the classical Ulam-von Neumann Markov chain Monte-Carlo algorithm for the computation of the matrix inverse. The algorithm presented in this paper, termed as regenerative Ulam-von Neumann algorithm, utilizes the regenerative structure of classical, non-truncated Neumann series defined by a non-singular matrix and produces an unbiased estimator of the matr… ▽ More

    Submitted 16 August, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    MSC Class: 68Q25; 68R10; 65C05

  31. arXiv:2407.15810  [pdf, other]

    cs.CV cs.CY

    Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems

    Authors: Siddharth D Jaiswal, Animesh Ganai, Abhisek Dash, Saptarshi Ghosh, Animesh Mukherjee

    Abstract: Facial Recognition Systems (FRSs) are being developed and deployed globally at unprecedented rates. Most platforms are designed in a limited set of countries but deployed in worldwide, without adequate checkpoints. This is especially problematic for Global South countries which lack strong legislation to safeguard persons facing disparate performance of these systems. A combination of unavailabili… ▽ More

    Submitted 26 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: This work has been accepted for publication at AAAI/ACM AIES 2024

  32. arXiv:2407.14779  [pdf, other]

    cs.CY cs.AI cs.HC

    Do Generative AI Models Output Harm while Representing Non-Western Cultures: Evidence from A Community-Centered Approach

    Authors: Sourojit Ghosh, Pranav Narayanan Venkit, Sanjana Gautam, Shomir Wilson, Aylin Caliskan

    Abstract: Our research investigates the impact of Generative Artificial Intelligence (GAI) models, specifically text-to-image generators (T2Is), on the representation of non-Western cultures, with a focus on Indian contexts. Despite the transformative potential of T2Is in content creation, concerns have arisen regarding biases that may lead to misrepresentations and marginalizations. Through a community-cen… ▽ More

    Submitted 3 August, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

    Comments: This is the pre-peer reviewed version, which has been accepted at the 7th AAAI ACM Conference on AI, Ethics, and Society, Oct. 21, 2024, California, USA

  33. arXiv:2407.14650  [pdf, other]

    cs.CY cs.HC cs.IR

    Auditing the Grid-Based Placement of Private Label Products on E-commerce Search Result Pages

    Authors: Siddharth D Jaiswal, Abhisek Dash, Nitika Shroff, Yashwanth Babu Vunnam, Saptarshi Ghosh, Animesh Mukherjee

    Abstract: E-commerce platforms support the needs and livelihoods of their two most important stakeholders -- customers and producers/sellers. Multiple algorithmic systems, like "search" systems mediate the interactions between these stakeholders by connecting customers to producers with relevant items. Search results include (i) private label (PL) products that are manufactured/sold by the platform itself… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  34. arXiv:2407.12869  [pdf, ps, other]

    cs.CL cs.AI

    Bilingual Adaptation of Monolingual Foundation Models

    Authors: Gurpreet Gosal, Yishi Xu, Gokul Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming Chen, Biswajit Mishra, Natalia Vassilieva, Joel Hestness, Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Onkar Pandit, Satheesh Katipomu, Samta Kamboj, Samujjwal Ghosh, Rahul Pal, Parvez Mullah, Soundar Doraiswamy, Mohamed El Karim Chami, Preslav Nakov

    Abstract: We present an efficient method for adapting a monolingual Large Language Model (LLM) to another language, addressing challenges of catastrophic forgetting and tokenizer limitations. We focus this study on adapting Llama 2 to Arabic. Our two-stage approach begins with expanding the vocabulary and training only the embeddings matrix, followed by full model continual pre-training on a bilingual corpu… ▽ More

    Submitted 25 July, 2024; v1 submitted 13 July, 2024; originally announced July 2024.

  35. arXiv:2407.12848  [pdf, ps, other]

    cs.CL cs.AI

    Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization

    Authors: Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh

    Abstract: Automatic summarization of legal case judgements, which are known to be long and complex, has traditionally been tried via extractive summarization models. In recent years, generative models including abstractive summarization models and Large language models (LLMs) have gained huge popularity. In this paper, we explore the applicability of such models for legal case judgement summarization. We ap… ▽ More

    Submitted 20 July, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at Artificial Intelligence and Law, Springer, 2024

  36. arXiv:2407.11268  [pdf, other]

    stat.ML cs.CE cs.LG

    Heterogenous Multi-Source Data Fusion Through Input Mapping and Latent Variable Gaussian Process

    Authors: Yigitcan Comlek, Sandipp Krishnan Ravi, Piyush Pandita, Sayan Ghosh, Liping Wang, Wei Chen

    Abstract: Artificial intelligence and machine learning frameworks have served as computationally efficient mapping between inputs and outputs for engineering problems. These mappings have enabled optimization and analysis routines that have warranted superior designs, ingenious material systems and optimized manufacturing processes. A common occurrence in such modeling endeavors is the existence of multiple… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 20 Pages, 9 Figures, Data is available per request

  37. arXiv:2407.07237  [pdf, other]

    quant-ph cs.CR cs.ET cs.LG

    The Quantum Imitation Game: Reverse Engineering of Quantum Machine Learning Models

    Authors: Archisman Ghosh, Swaroop Ghosh

    Abstract: Quantum Machine Learning (QML) amalgamates quantum computing paradigms with machine learning models, providing significant prospects for solving complex problems. However, with the expansion of numerous third-party vendors in the Noisy Intermediate-Scale Quantum (NISQ) era of quantum computing, the security of QML models is of prime importance, particularly against reverse engineering, which could… ▽ More

    Submitted 15 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 11 pages, 12 figures

  38. arXiv:2407.06323  [pdf, ps, other]

    cs.CL

    When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails

    Authors: Manish Nagireddy, Inkit Padhi, Soumya Ghosh, Prasanna Sattigeri

    Abstract: Large language models (LLMs) have convincing performance in a variety of downstream tasks. However, these systems are prone to generating undesirable outputs such as harmful and biased text. In order to remedy such generations, the development of guardrail (or detector) models has gained traction. Motivated by findings from developing a detector for social bias, we adopt the notion of a use-mentio… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  39. arXiv:2407.05901  [pdf, other]

    cs.NI

    Intelligent Routing as a Service (iRaaS)

    Authors: Saptarshi Ghosh, Konstantinos Antonakoglou, Ioannis Mavromatis, Kostas Katsaros

    Abstract: The scope of the Sixth-Generation Self-Organized Networks (6G-SON) advances its predecessor's capability towards agility, flexibility, and adaptability. On-demand overlay networking technologies have shown a prominent maturity while coping with the rising complexity and scale of enterprise, service provider, and data centre networks. The Software-Defined Networking paradigm has recently offered Mo… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  40. arXiv:2407.05399  [pdf, other]

    cs.CL cs.AI cs.LG

    IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning

    Authors: Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi

    Abstract: Legal systems worldwide are inundated with exponential growth in cases and documents. There is an imminent need to develop NLP and ML techniques for automatically processing and understanding legal documents to streamline the legal system. However, evaluating and comparing various NLP models designed specifically for the legal domain is challenging. This paper addresses this challenge by proposing… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024 Main Conference; 40 Pages (9 Pages + References + Appendix)

  41. arXiv:2407.03835  [pdf, other]

    cs.CV

    7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition

    Authors: Dimitrios Kollias, Stefanos Zafeiriou, Irene Kotsia, Abhinav Dhall, Shreya Ghosh, Chunchang Shao, Guanyu Hu

    Abstract: This paper describes the 7th Affective Behavior Analysis in-the-wild (ABAW) Competition, which is part of the respective Workshop held in conjunction with ECCV 2024. The 7th ABAW Competition addresses novel challenges in understanding human expressions and behaviors, crucial for the development of human-centered technologies. The Competition comprises of two sub-challenges: i) Multi-Task Learning… ▽ More

    Submitted 8 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  42. arXiv:2407.02658  [pdf, other]

    cs.CG

    Efficient Exact Algorithms for Minimum Covering of Orthogonal Polygons with Squares

    Authors: Anubhav Dhar, Subham Ghosh, Sudeshna Kolay

    Abstract: The Orthogonal Polygon Covering with Squares (OPCS) problem takes as input an orthogonal polygon $P$ without holes with $n$ vertices, where vertices have integral coordinates. The aim is to find a minimum number of axis-parallel, possibly overlapping squares which lie completely inside $P$, such that their union covers the entire region inside $P$. Aupperle et. al~\cite{aupperle1988covering} provi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  43. arXiv:2407.02536  [pdf, other]

    cs.LG cs.IR econ.GN stat.AP

    Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results

    Authors: Subhankar Ghosh, Jayant Gupta, Arun Sharma, Shuai An, Shashi Shekhar

    Abstract: Given a set $S$ of spatial feature types, its feature instances, a study area, and a neighbor relationship, the goal is to find pairs <a region ($r_{g}$), a subset $C$ of $S$> such that $C$ is a statistically significant regional-colocation pattern in $r_{g}$. This problem is important for applications in various domains including ecology, economics, and sociology. The prob…

    Submitted 1 July, 2024; originally announced July 2024.

    ACM Class: E.m; F.2; E.1; H.3; I.5; J.0

  44. arXiv:2407.01878  [pdf, other]

    cs.CL

    Compare without Despair: Reliable Preference Evaluation with Generation Separability

    Authors: Sayan Ghosh, Tejas Srinivasan, Swabha Swayamdipta

    Abstract: Human evaluation of generated language through pairwise preference judgments is pervasive. However, under common scenarios, such as when generations from a model pair are very similar, or when stochastic decoding results in large variations in generations, it results in inconsistent preference ratings. We address these challenges by introducing a meta-evaluation measure, separability, which estima…

    Submitted 8 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: Corrected description of reference in Related Work

  45. arXiv:2407.01732  [pdf, other]

    cs.CY cs.HC cs.IR

    Investigating Nudges toward Related Sellers on E-commerce Marketplaces: A Case Study on Amazon

    Authors: Abhisek Dash, Abhijnan Chakraborty, Saptarshi Ghosh, Animesh Mukherjee, Krishna P. Gummadi

    Abstract: E-commerce marketplaces provide business opportunities to millions of sellers worldwide. Some of these sellers have special relationships with the marketplace by virtue of using their subsidiary services (e.g., fulfillment and/or shipping services provided by the marketplace) -- we refer to such sellers collectively as Related Sellers. When multiple sellers offer to sell the same product, the mark…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: This work has been accepted for presentation at the ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW) 2024. It will appear in Proceedings of the ACM on Human-Computer Interaction

  46. arXiv:2407.00317  [pdf, other]

    cs.IR stat.AP

    Towards Statistically Significant Taxonomy Aware Co-location Pattern Detection

    Authors: Subhankar Ghosh, Arun Sharma, Jayant Gupta, Shashi Shekhar

    Abstract: Given a collection of Boolean spatial feature types, their instances, a neighborhood relation (e.g., proximity), and a hierarchical taxonomy of the feature types, the goal is to find the subsets of feature types or their parents whose spatial interaction is statistically significant. This problem is for taxonomy-reliant applications such as ecology (e.g., finding new symbiotic relationships across…

    Submitted 4 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted in The 16th Conference on Spatial Information Theory (COSIT) 2024

    ACM Class: E.m; H.3.3; I.5; J.4

  47. arXiv:2406.17957  [pdf, other]

    cs.SD cs.AI eess.AS

    Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment

    Authors: Paarth Neekhara, Shehzeen Hussain, Subhankar Ghosh, Jason Li, Rafael Valle, Rohan Badlani, Boris Ginsburg

    Abstract: Large Language Model (LLM) based text-to-speech (TTS) systems have demonstrated remarkable capabilities in handling large speech datasets and generating natural speech for new speakers. However, LLM-based TTS models are not robust as the generated output can contain repeating words, missing words and mis-aligned speech (referred to as hallucinations or attention errors), especially when the text c…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Published as a conference paper at INTERSPEECH 2024

  48. arXiv:2406.11768  [pdf, other]

    cs.SD cs.AI cs.CL eess.AS

    GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

    Authors: Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha

    Abstract: Perceiving and understanding non-speech sounds and non-verbal speech is essential to making decisions that help us interact with our surroundings. In this paper, we propose GAMA, a novel General-purpose Large Audio-Language Model (LALM) with Advanced Audio Understanding and Complex Reasoning Abilities. We build GAMA by integrating an LLM with multiple types of audio representations, including feat…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Project Website: https://sreyan88.github.io/gamaaudio/

  49. arXiv:2406.11704  [pdf, other]

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia: Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and their outputs. These models perform competitively with open access models on a wide range of evaluation be…

    Submitted 6 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  50. arXiv:2406.06818  [pdf, other]

    cs.LG

    Conformal Prediction for Class-wise Coverage via Augmented Label Rank Calibration

    Authors: Yuanjie Shi, Subhankar Ghosh, Taha Belkhouja, Janardhan Rao Doppa, Yan Yan

    Abstract: Conformal prediction (CP) is an emerging uncertainty quantification framework that allows us to construct a prediction set to cover the true label with a pre-specified marginal or conditional probability. Although the valid coverage guarantee has been extensively studied for classification problems, CP often produces large prediction sets which may not be practically useful. This issue is exacerba…

    Submitted 10 June, 2024; originally announced June 2024.