
Showing 1–50 of 246 results for author: Pham, H

Searching in archive cs.
  1. arXiv:2408.04165  [pdf, other]

    math.CO cs.DM math.PR

    Sunflowers in set systems with small VC-dimension

    Authors: József Balogh, Anton Bernshteyn, Michelle Delcourt, Asaf Ferber, Huy Tuan Pham

    Abstract: A family of $r$ distinct sets $\{A_1,\ldots, A_r\}$ is an $r$-sunflower if for all $1 \leqslant i < j \leqslant r$ and $1 \leqslant i' < j' \leqslant r$, we have $A_i \cap A_j = A_{i'} \cap A_{j'}$. Erdős and Rado conjectured in 1960 that every family $\mathcal{H}$ of $\ell$-element sets of size at least $K(r)^\ell$ contains an $r$-sunflower, where $K(r)$ is some function that depends only on $r$.…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 14 pages

  2. arXiv:2408.03035  [pdf, other]

    eess.IV cs.CV

    Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis

    Authors: Van Phi Nguyen, Tri Nhan Luong Ha, Huy Hieu Pham, Quoc Long Tran

    Abstract: Conditional video diffusion models (CDM) have shown promising results for video synthesis, potentially enabling the generation of realistic echocardiograms to address the problem of data scarcity. However, current CDMs require a paired segmentation map and echocardiogram dataset. We present a new method called Free-Echo for generating realistic echocardiograms from a single end-diastolic segmentat…

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: Accepted to MICCAI 2024

  3. arXiv:2408.00892  [pdf, other]

    q-bio.BM cs.LG

    Peptide Sequencing Via Protein Language Models

    Authors: Thuong Le Hoai Pham, Jillur Rahman Saurav, Aisosa A. Omere, Calvin J. Heyl, Mohammad Sadegh Nasr, Cody Tyler Reynolds, Jai Prakash Yadav Veerla, Helen H Shang, Justyn Jaworski, Alison Ravenscraft, Joseph Anthony Buonomo, Jacob M. Luber

    Abstract: We introduce a protein language model for determining the complete sequence of a peptide based on measurement of a limited set of amino acids. To date, protein sequencing relies on mass spectrometry, with some novel Edman degradation-based platforms able to sequence non-native peptides. Current protein sequencing techniques face limitations in accurately identifying all amino acids, hindering comp…

    Submitted 1 August, 2024; originally announced August 2024.

  4. arXiv:2407.18532  [pdf, other]

    math.OC cs.AI

    Outer Approximation and Super-modular Cuts for Constrained Assortment Optimization under Mixed-Logit Model

    Authors: Hoang Giang Pham, Tien Mai

    Abstract: In this paper, we study the assortment optimization problem under the mixed-logit customer choice model. While assortment optimization has been a major topic in revenue management for decades, the mixed-logit model is considered one of the most general and flexible approaches for modeling and predicting customer purchasing behavior. Existing exact methods have primarily relied on mixed-integer lin…

    Submitted 26 July, 2024; originally announced July 2024.

  5. arXiv:2407.17790  [pdf, other]

    cs.LG cs.AR

    Exploring the Limitations of Kolmogorov-Arnold Networks in Classification: Insights to Software Training and Hardware Implementation

    Authors: Van Duy Tran, Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima

    Abstract: Kolmogorov-Arnold Networks (KANs), a novel type of neural network, have recently gained popularity and attention due to the ability to substitute multi-layer perceptrons (MLPs) in artificial intelligence (AI) with higher accuracy and interoperability. However, KAN assessment is still limited and cannot provide an in-depth analysis of a specific domain. Furthermore, no study has been conducted on t…

    Submitted 25 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, 2 tables

  6. arXiv:2407.16497  [pdf, other]

    cs.CV

    Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection

    Authors: Trinh Le Ba Khanh, Huy-Hung Nguyen, Long Hoang Pham, Duong Nguyen-Ngoc Tran, Jae Wook Jeon

    Abstract: In object detection, unsupervised domain adaptation (UDA) aims to transfer knowledge from a labeled source domain to an unlabeled target domain. However, UDA's reliance on labeled source data restricts its adaptability in privacy-related scenarios. This study focuses on source-free object detection (SFOD), which adapts a source-trained detector to an unlabeled target domain without using labeled s…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  7. arXiv:2407.14738  [pdf, other]

    cs.LG cs.AI cs.CR

    Flatness-aware Sequential Learning Generates Resilient Backdoors

    Authors: Hoang Pham, The-Anh Ta, Anh Tran, Khoa D. Doan

    Abstract: Recently, backdoor attacks have become an emerging threat to the security of machine learning models. From the adversary's perspective, the implanted backdoors should be resistant to defensive algorithms, but some recently proposed fine-tuning defenses can remove these backdoors with notable efficacy. This is mainly due to the catastrophic forgetting (CF) property of deep neural networks. This pap…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  8. arXiv:2407.06697  [pdf, other]

    cs.LG

    Certified Continual Learning for Neural Network Regression

    Authors: Long H. Pham, Jun Sun

    Abstract: On the one hand, there has been considerable progress on neural network verification in recent years, which makes certifying neural networks a possibility. On the other hand, neural networks in practice are often re-trained over time to cope with new data distribution or for solving different tasks (a.k.a. continual learning). Once re-trained, the verified correctness of the neural network is like…

    Submitted 9 July, 2024; originally announced July 2024.

  9. arXiv:2407.02873  [pdf, other]

    cs.RO

    Robot Shape and Location Retention in Video Generation Using Diffusion Models

    Authors: Peng Wang, Zhihao Guo, Abdul Latheef Sait, Minh Huy Pham

    Abstract: Diffusion models have marked a significant milestone in the enhancement of image and video generation technologies. However, generating videos that precisely retain the shape and location of moving objects such as robots remains a challenge. This paper presents diffusion models specifically tailored to generate videos that accurately maintain the shape and location of mobile robots. This developme…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 10 figures

  10. arXiv:2406.17381  [pdf, other]

    cs.LG cs.CV

    Forget but Recall: Incremental Latent Rectification in Continual Learning

    Authors: Nghia D. Nguyen, Hieu Trung Nguyen, Ang Li, Hoang Pham, Viet Anh Nguyen, Khoa D. Doan

    Abstract: Intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper in…

    Submitted 25 June, 2024; originally announced June 2024.

  11. arXiv:2405.17926  [pdf, other]

    cs.CV

    SarcNet: A Novel AI-based Framework to Automatically Analyze and Score Sarcomere Organizations in Fluorescently Tagged hiPSC-CMs

    Authors: Huyen Le, Khiet Dang, Tien Lai, Nhung Nguyen, Mai Tran, Hieu Pham

    Abstract: Quantifying sarcomere structure organization in human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) is crucial for understanding cardiac disease pathology, improving drug screening, and advancing regenerative medicine. Traditional methods, such as manual annotation and Fourier transform analysis, are labor-intensive, error-prone, and lack high-throughput capabilities. In this st…

    Submitted 28 May, 2024; originally announced May 2024.

  12. arXiv:2405.17002  [pdf, other]

    cs.CV

    UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models

    Authors: Quan Van Nguyen, Huy Quang Pham, Dan Quang Tran, Thang Kien-Bao Nguyen, Nhat-Hao Nguyen-Dang, Bao-Thien Nguyen-Tat

    Abstract: Purpose: This study focuses on the development of automated text generation from radiology images, termed diagnostic captioning, to assist medical professionals in reducing clinical errors and improving productivity. The aim is to provide tools that enhance report quality and efficiency, which can significantly impact both clinical practice and deep learning research in the biomedical field. Metho…

    Submitted 27 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  13. arXiv:2405.14781  [pdf, other]

    cs.CR cs.AI

    Unified Neural Backdoor Removal with Only Few Clean Samples through Unlearning and Relearning

    Authors: Nay Myat Min, Long H. Pham, Jun Sun

    Abstract: The application of deep neural network models in various security-critical applications has raised significant security concerns, particularly the risk of backdoor attacks. Neural backdoors pose a serious security threat as they allow attackers to maliciously alter model behavior. While many defenses have been explored, existing approaches are often bounded by model-specific constraints, or necess…

    Submitted 23 May, 2024; originally announced May 2024.

  14. arXiv:2405.02287  [pdf, other]

    cs.CL cs.AI cs.CV

    Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

    Authors: Piotr Padlewski, Max Bain, Matthew Henderson, Zhongkai Zhu, Nishant Relan, Hai Pham, Donovan Ong, Kaloyan Aleksiev, Aitor Ormazabal, Samuel Phua, Ethan Yeo, Eugenie Lamprecht, Qi Liu, Yuqi Wang, Eric Chen, Deyu Fu, Lei Li, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Mikel Artetxe, Yi Tay

    Abstract: We introduce Vibe-Eval: a new open benchmark and framework for evaluating multimodal chat models. Vibe-Eval consists of 269 visual understanding prompts, including 100 of hard difficulty, complete with gold-standard responses authored by experts. Vibe-Eval is open-ended and challenging with dual objectives: (i) vibe checking multimodal chat models for day-to-day tasks and (ii) rigorously testing a…

    Submitted 3 May, 2024; originally announced May 2024.

  15. arXiv:2405.01488  [pdf, other]

    cs.LG stat.ML

    Digital Twin Generators for Disease Modeling

    Authors: Nameyeh Alam, Jake Basilico, Daniele Bertolini, Satish Casie Chetty, Heather D'Angelo, Ryan Douglas, Charles K. Fisher, Franklin Fuller, Melissa Gomes, Rishabh Gupta, Alex Lang, Anton Loukianov, Rachel Mak-McCully, Cary Murray, Hanalei Pham, Susanna Qiao, Elena Ryapolova-Webb, Aaron Smith, Dimitri Theoharatos, Anil Tolwani, Eric W. Tramel, Anna Vidovszky, Judy Viduya, Jonathan R. Walsh

    Abstract: A patient's digital twin is a computational model that describes the evolution of their health over time. Digital twins have the potential to revolutionize medicine by enabling individual-level computer simulations of human health, which can be used to conduct more efficient clinical trials or to recommend personalized treatment options. Due to the overwhelming complexity of human biology, machine…

    Submitted 2 May, 2024; originally announced May 2024.

  16. arXiv:2405.01054  [pdf, other]

    cs.RO cs.CV cs.LG

    Continual Learning for Robust Gate Detection under Dynamic Lighting in Autonomous Drone Racing

    Authors: Zhongzheng Qiao, Xuan Huy Pham, Savitha Ramasamy, Xudong Jiang, Erdal Kayacan, Andriy Sarabakha

    Abstract: In autonomous and mobile robotics, a principal challenge is resilient real-time environmental perception, particularly in situations characterized by unknown and dynamic elements, as exemplified in the context of autonomous drone racing. This study introduces a perception technique for detecting drone racing gates under illumination variations, which is common during high-speed drone flights. The…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 8 pages, 6 figures, in 2024 International Joint Conference on Neural Networks (IJCNN)

  17. arXiv:2404.18831  [pdf, other]

    cs.CV cs.AI

    ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization

    Authors: Hong Nguyen, Hoang Nguyen, Melinda Chang, Hieu Pham, Shrikanth Narayanan, Michael Pazzani

    Abstract: Understanding the severity of conditions shown in images in medical diagnosis is crucial, serving as a key guide for clinical assessment, treatment, as well as evaluating longitudinal progression. This paper proposes ConPrO: a novel representation learning method for severity assessment in medical images using Contrastive learning-integrated Preference Optimization. Different from conventional co…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 8 pages

  18. arXiv:2404.18397  [pdf, other]

    cs.CV

    ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images

    Authors: Huy Quang Pham, Thang Kien-Bao Nguyen, Quan Van Nguyen, Dan Quang Tran, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Optical Character Recognition - Visual Question Answering (OCR-VQA) is the task of answering text information contained in images that have just been significantly developed in the English language in recent years. However, there are limited studies of this task in low-resource languages such as Vietnamese. To this end, we introduce a novel dataset, ViOCRVQA (Vietnamese Optical Character Recogniti…

    Submitted 28 April, 2024; originally announced April 2024.

  19. arXiv:2404.12387  [pdf, other]

    cs.CL cs.CV

    Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

    Authors: Reka Team, Aitor Ormazabal, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Deyu Fu, Donovan Ong, Eric Chen, Eugenie Lamprecht, Hai Pham, Isaac Ong, Kaloyan Aleksiev, Lei Li, Matthew Henderson, Max Bain, Mikel Artetxe, Nishant Relan, Piotr Padlewski, Qi Liu, Ren Chen, Samuel Phua, Yazheng Yang, Yi Tay, Yuqi Wang, Zhongkai Zhu, et al. (1 additional author not shown)

    Abstract: We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka. Reka models are able to process and reason with text, images, video, and audio inputs. This technical report discusses details of training some of these models and provides comprehensive evaluation results. We show that Reka Edge and Reka Flash are not only state-of-the-art but al…

    Submitted 18 April, 2024; originally announced April 2024.

  20. arXiv:2404.10652  [pdf, other]

    cs.CL

    ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images

    Authors: Quan Van Nguyen, Dan Quang Tran, Huy Quang Pham, Thang Kien-Bao Nguyen, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Visual Question Answering (VQA) is a complicated task that requires the capability of simultaneously processing natural language and images. Initially, this task was researched, focusing on methods to help machines understand objects and scene contexts in images. However, some text appearing in the image that carries explicit information about the full content of the image is not mentioned. Along…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Preprint submitted to IJCV

  21. arXiv:2403.19867  [pdf, ps, other]

    cs.DS cs.AI cs.LG

    Finding Decision Tree Splits in Streaming and Massively Parallel Models

    Authors: Huy Pham, Hoang Ta, Hoa T. Vu

    Abstract: In this work, we provide data stream algorithms that compute optimal splits in decision tree learning. In particular, given a data stream of observations $x_i$ and their labels $y_i$, the goal is to find the optimal split point $j$ that divides the data into two sets such that the mean squared error (for regression) or misclassification rate (for classification) is minimized. We provide various fa…

    Submitted 17 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  22. arXiv:2403.15959  [pdf, other]

    cs.RO eess.SY math.OC

    Risk-Calibrated Human-Robot Interaction via Set-Valued Intent Prediction

    Authors: Justin Lidard, Hang Pham, Ariel Bachman, Bryan Boateng, Anirudha Majumdar

    Abstract: Tasks where robots must anticipate human intent, such as navigating around a cluttered home or sorting everyday items, are challenging because they exhibit a wide range of valid actions that lead to similar outcomes. Moreover, zero-shot cooperation between human-robot partners is an especially challenging problem because it requires the robot to infer and adapt on the fly to a latent human intent,…

    Submitted 23 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Website with additional information, videos, and code: https://risk-calibrated-planning.github.io/

  23. arXiv:2403.04264  [pdf, other]

    cs.AI

    Competitive Facility Location under Random Utilities and Routing Constraints

    Authors: Hoang Giang Pham, Tien Thanh Dam, Ngan Ha Duong, Tien Mai, Minh Hoang Ha

    Abstract: In this paper, we study a facility location problem within a competitive market context, where customer demand is predicted by a random utility choice model. Unlike prior research, which primarily focuses on simple constraints such as a cardinality constraint on the number of selected locations, we introduce routing constraints that necessitate the selection of locations in a manner that guarantee…

    Submitted 9 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  24. arXiv:2402.13822  [pdf, other]

    cs.CV

    MSTAR: Multi-Scale Backbone Architecture Search for Timeseries Classification

    Authors: Tue M. Cao, Nhat H. Tran, Hieu H. Pham, Hung T. Nguyen, Le P. Nguyen

    Abstract: Most of the previous approaches to Time Series Classification (TSC) highlight the significance of receptive fields and frequencies while overlooking the time resolution. Hence, unavoidably suffered from scalability issues as they integrated an extensive range of receptive fields into classification models. Other methods, while having a better adaptation for large datasets, require manual design an…

    Submitted 21 February, 2024; originally announced February 2024.

  25. arXiv:2402.11702  [pdf, other]

    cs.SE cs.AI cs.LG

    Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation

    Authors: Kailun Jin, Chung-Yu Wang, Hung Viet Pham, Hadi Hemmati

    Abstract: Large language models (LLMs) have demonstrated notable proficiency in code generation, with numerous prior studies showing their promising capabilities in various development scenarios. However, these studies mainly provide evaluations in research settings, which leaves a significant gap in understanding how effectively LLMs can support developers in real-world. To address this, we conducted an em…

    Submitted 16 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 4 pages, 3 figures, 21st International Conference on Mining Software Repositories (MSR '24), April 15-16, 2024, Lisbon, Portugal

    ACM Class: I.2.2

  26. arXiv:2401.13594  [pdf, other]

    cs.CL cs.AI

    Graph Guided Question Answer Generation for Procedural Question-Answering

    Authors: Hai X. Pham, Isma Hadji, Xinnuo Xu, Ziedune Degutyte, Jay Rainey, Evangelos Kazakos, Afsaneh Fazly, Georgios Tzimiropoulos, Brais Martinez

    Abstract: In this paper, we focus on task-specific question answering (QA). To this end, we introduce a method for generating exhaustive and high-quality training data, which allows us to train compact (e.g., run on a mobile device), task-specific QA models that are competitive against GPT variants. The key technological enabler is a novel mechanism for automatic question-answer generation from procedural t…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted to EACL 2024 as long paper. 25 pages including appendix

    MSC Class: I.2.7

  27. arXiv:2401.00827  [pdf, ps, other]

    math.CO cs.DM

    A multipartite analogue of Dilworth's Theorem

    Authors: Jacob Fox, Huy Tuan Pham

    Abstract: We prove that every partially ordered set on $n$ elements contains $k$ subsets $A_{1},A_{2},\dots,A_{k}$ such that either each of these subsets has size $\Omega(n/k^{5})$ and, for every $i<j$, every element in $A_{i}$ is less than or equal to every element in $A_{j}$, or each of these subsets has size $\Omega(n/(k^{2}\log n))$ and, for every $i \neq j$, every element in $A_{i}$ is incomparable with every…

    Submitted 1 January, 2024; originally announced January 2024.

  28. arXiv:2312.11011  [pdf, other]

    cs.CL

    VinaLLaMA: LLaMA-based Vietnamese Foundation Model

    Authors: Quan Nguyen, Huy Pham, Dung Dao

    Abstract: In this technical report, we present VinaLLaMA, an open-weight, state-of-the-art (SOTA) Large Language Model for the Vietnamese language, built upon LLaMA-2 with an additional 800 billion trained tokens. VinaLLaMA not only demonstrates fluency in Vietnamese but also exhibits a profound understanding of Vietnamese culture, making it a truly indigenous model. VinaLLaMA-7B-chat, trained on 1 million…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: VinaLLaMA Technical Report - 13 pages

  29. arXiv:2312.09445  [pdf, other]

    eess.SP cs.CV cs.LG

    IncepSE: Leveraging InceptionTime's performance with Squeeze and Excitation mechanism in ECG analysis

    Authors: Tue Minh Cao, Nhat Hong Tran, Le Phi Nguyen, Hieu Huy Pham, Hung Thanh Nguyen

    Abstract: Our study focuses on the potential for modifications of Inception-like architecture within the electrocardiogram (ECG) domain. To this end, we introduce IncepSE, a novel network characterized by strategic architectural incorporation that leverages the strengths of both InceptionTime and channel attention mechanisms. Furthermore, we propose a training setup that employs stabilization techniques tha…

    Submitted 16 November, 2023; originally announced December 2023.

  30. arXiv:2312.00398  [pdf, other]

    cs.CV

    Learning to Estimate Critical Gait Parameters from Single-View RGB Videos with Transformer-Based Attention Network

    Authors: Quoc Hung T. Le, Hieu H. Pham

    Abstract: Musculoskeletal diseases and cognitive impairments in patients lead to difficulties in movement as well as negative effects on their psychological health. Clinical gait analysis, a vital tool for early diagnosis and treatment, traditionally relies on expensive optical motion capture systems. Recent advances in computer vision and deep learning have opened the door to more accessible and cost-effec…

    Submitted 1 March, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted at ISBI 2024 (21st IEEE International Symposium on Biomedical Imaging)

  31. arXiv:2311.15041  [pdf, other]

    cs.LG cs.AI eess.SP

    MPCNN: A Novel Matrix Profile Approach for CNN-based Sleep Apnea Classification

    Authors: Hieu X. Nguyen, Duong V. Nguyen, Hieu H. Pham, Cuong D. Do

    Abstract: Sleep apnea (SA) is a significant respiratory condition that poses a major global health challenge. Previous studies have investigated several machine and deep learning models for electrocardiogram (ECG)-based SA diagnoses. Despite these advancements, conventional feature extractions derived from ECG signals, such as R-peaks and RR intervals, may fail to capture crucial information encompassed wit…

    Submitted 25 November, 2023; originally announced November 2023.

  32. arXiv:2311.05192  [pdf, other]

    cs.CV

    TransReg: Cross-transformer as auto-registration module for multi-view mammogram mass detection

    Authors: Hoang C. Nguyen, Chi Phan, Hieu H. Pham

    Abstract: Screening mammography is the most widely used method for early breast cancer detection, significantly reducing mortality rates. The integration of information from multi-view mammograms enhances radiologists' confidence and diminishes false-positive rates since they can examine on dual-view of the same breast to cross-reference the existence and location of the lesion. Inspired by this, we present…

    Submitted 9 November, 2023; originally announced November 2023.

  33. arXiv:2311.00638  [pdf, other]

    cs.LG cs.AI

    FAIRLABEL: Correcting Bias in Labels

    Authors: Srinivasan H Sengamedu, Hien Pham

    Abstract: There are several algorithms for measuring fairness of ML models. A fundamental assumption in these approaches is that the ground truth is fair or unbiased. In real-world datasets, however, the ground truth often contains data that is a result of historical and societal biases and discrimination. Models trained on these datasets will inherit and propagate the biases to the model outputs. We propos…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: ICDM LegalAI Workshop 2023

    MSC Class: 68T07 ACM Class: I.2.6

    Journal ref: ICDM 2023 Workshop

  34. arXiv:2310.18532  [pdf, other]

    cs.SE

    SkipAnalyzer: A Tool for Static Code Analysis with Large Language Models

    Authors: Mohammad Mahdi Mohajer, Reem Aleithan, Nima Shiri Harzevili, Moshi Wei, Alvine Boaye Belle, Hung Viet Pham, Song Wang

    Abstract: We introduce SkipAnalyzer, a large language model (LLM)-powered tool for static code analysis. SkipAnalyzer has three components: 1) an LLM-based static bug detector that scans source code and reports specific types of bugs, 2) an LLM-based false-positive filter that can identify false-positive bugs in the results of static bug detectors (e.g., the result of step 1) to improve detection accuracy,…

    Submitted 17 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

  35. arXiv:2310.14030  [pdf, other]

    cs.RO

    Visual Tracking Nonlinear Model Predictive Control Method for Autonomous Wind Turbine Inspection

    Authors: Abdelhakim Amer, Mohit Mehndiratta, Jonas le Fevre Sejersen, Huy Xuan Pham, Erdal Kayacan

    Abstract: Automated visual inspection of on- and offshore wind turbines using aerial robots provides several benefits, namely, a safe working environment by circumventing the need for workers to be suspended high above the ground, reduced inspection time, preventive maintenance, and access to hard-to-reach areas. A novel nonlinear model predictive control (NMPC) framework alongside a global wind turbine path…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 8 pages, accepted for publication at ICAR conference

  36. arXiv:2310.06912  [pdf, other]

    cs.SE

    Benchmarking Deep Learning Fuzzers

    Authors: Nima Shiri Harzevili, Hung Viet Pham, Song Wang

    Abstract: In this work, we set out to conduct the first ground-truth empirical evaluation of state-of-the-art DL fuzzers. Specifically, we first manually created an extensive DL bug benchmark dataset, which includes 627 real-world DL bugs from TensorFlow and PyTorch libraries reported by users between 2020 and 2022. Then we run three state-of-the-art DL fuzzers, i.e., FreeFuzz, DeepRel, and DocTer, on the b…

    Submitted 10 October, 2023; originally announced October 2023.

  37. arXiv:2309.04317  [pdf, other]

    stat.ML cs.LG math.OC

    Actor critic learning algorithms for mean-field control with moment neural networks

    Authors: Huyên Pham, Xavier Warin

    Abstract: We develop a new policy gradient and actor-critic algorithm for solving mean-field control problems within a continuous time reinforcement learning setting. Our approach leverages a gradient-based representation of the value function, employing parametrized randomized policies. The learning for both the actor (policy) and critic (value function) is facilitated by a class of moment neural network f…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 16 pages, 11 figures

    MSC Class: 68T07

  38. arXiv:2308.15772  [pdf, other]

    cs.CL

    Task-Based MoE for Multitask Multilingual Machine Translation

    Authors: Hai Pham, Young Jin Kim, Subhabrata Mukherjee, David P. Woodruff, Barnabas Poczos, Hany Hassan Awadalla

    Abstract: Mixture-of-experts (MoE) architecture has been proven a powerful method for diverse tasks in training deep models in many applications. However, current MoE implementations are task agnostic, treating all tokens from different tasks in the same manner. In this work, we instead design a novel method that incorporates task information into MoE models at different granular levels with shared dynamic…

    Submitted 24 October, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  39. arXiv:2308.07832  [pdf, ps, other]

    cs.LG cs.AI stat.ME

    REFORMS: Reporting Standards for Machine Learning Based Science

    Authors: Sayash Kapoor, Emily Cantrell, Kenny Peng, Thanh Hien Pham, Christopher A. Bail, Odd Erik Gundersen, Jake M. Hofman, Jessica Hullman, Michael A. Lones, Momin M. Malik, Priyanka Nanayakkara, Russell A. Poldrack, Inioluwa Deborah Raji, Michael Roberts, Matthew J. Salganik, Marta Serra-Garcia, Brandon M. Stewart, Gilles Vandewiele, Arvind Narayanan

    Abstract: Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways acros…

    Submitted 19 September, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  40. arXiv:2307.16834  [pdf]

    cs.CV cs.AI cs.LG eess.IV

    Benchmarking Jetson Edge Devices with an End-to-end Video-based Anomaly Detection System

    Authors: Hoang Viet Pham, Thinh Gia Tran, Chuong Dinh Le, An Dinh Le, Hien Bich Vo

    Abstract: Innovative enhancement in embedded system platforms, specifically hardware accelerations, significantly influence the application of deep learning in real-world scenarios. These innovations translate human labor efforts into automated intelligent systems employed in various areas such as autonomous driving, robotics, Internet-of-Things (IoT), and numerous other impactful applications. NVIDIA's Jet…

    Submitted 12 September, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted in Future of Information and Communication Conference (FICC) 2024

  41. arXiv:2307.10466  [pdf, ps, other]

    math.PR cs.DS math-ph

    Universality of Spectral Independence with Applications to Fast Mixing in Spin Glasses

    Authors: Nima Anari, Vishesh Jain, Frederic Koehler, Huy Tuan Pham, Thuy-Duong Vuong

    Abstract: We study Glauber dynamics for sampling from discrete distributions $\mu$ on the hypercube $\{\pm 1\}^n$. Recently, techniques based on spectral independence have successfully yielded optimal $O(n)$ relaxation times for a host of different distributions $\mu$. We show that spectral independence is universal: a relaxation time of $O(n)$ implies spectral independence. We then study a notion of tractabi…

    Submitted 19 July, 2023; originally announced July 2023.

  42. arXiv:2307.02289  [pdf, other]

    cs.CR cs.SE

    Fuzzing with Quantitative and Adaptive Hot-Bytes Identification

    Authors: Tai D. Nguyen, Long H. Pham, Jun Sun

    Abstract: Fuzzing has emerged as a powerful technique for finding security bugs in complicated real-world applications. American fuzzy lop (AFL), a leading fuzzing tool, has demonstrated its powerful bug finding ability through a vast number of reported CVEs. However, its random mutation strategy is unable to generate test inputs that satisfy complicated branching conditions (e.g., magic-byte comparisons, c…

    Submitted 5 July, 2023; originally announced July 2023.

  43. arXiv:2306.06579  [pdf, other]

    cs.LG

    Learning Robust and Consistent Time Series Representations: A Dilated Inception-Based Approach

    Authors: Anh Duy Nguyen, Trang H. Tran, Hieu H. Pham, Phi Le Nguyen, Lam M. Nguyen

    Abstract: Representation learning for time series has been an important research area for decades. Since the emergence of the foundation models, this topic has attracted a lot of attention in contrastive self-supervised learning, to solve a wide range of downstream tasks. However, there have been several challenges for contrastive time series processing. First, there is no work considering noise, which is o…

    Submitted 11 June, 2023; originally announced June 2023.

  44. arXiv:2306.03269  [pdf, other]

    cs.CR cs.SE

    Security Knowledge-Guided Fuzzing of Deep Learning Libraries

    Authors: Nima Shiri Harzevili, Mohammad Mahdi Mohajer, Moshi Wei, Hung Viet Pham, Song Wang

    Abstract: Recently, many Deep Learning fuzzers have been proposed for testing of DL libraries. However, they either perform unguided input generation (e.g., not considering the relationship between API arguments when generating inputs) or only support a limited set of corner case test inputs. Furthermore, a substantial number of developer APIs crucial for library development remain untested, as they are typ…

    Submitted 24 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  45. arXiv:2305.18607  [pdf, other]

    cs.SE cs.AI cs.CR

    How Effective Are Neural Networks for Fixing Security Vulnerabilities

    Authors: Yi Wu, Nan Jiang, Hung Viet Pham, Thibaud Lutellier, Jordan Davis, Lin Tan, Petr Babkin, Sameena Shah

    Abstract: Security vulnerability repair is a difficult task that is in dire need of automation. Two groups of techniques have shown promise: (1) large code language models (LLMs) that have been pre-trained on source code for tasks such as code completion, and (2) automated program repair (APR) techniques that use deep learning (DL) models to automatically fix software bugs. This paper is the first to stud…

    Submitted 1 April, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: This paper was accepted in the proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2023) and was presented at the conference, which was held in Seattle, USA, 17-21 July 2023

  46. arXiv:2305.14707  [pdf, other]

    cs.CL cs.AI cs.LG

    SciFix: Outperforming GPT3 on Scientific Factual Error Correction

    Authors: Dhananjay Ashok, Atharva Kulkarni, Hai Pham, Barnabás Póczos

    Abstract: Due to the prohibitively high cost of creating error correction datasets, most Factual Claim Correction methods rely on a powerful verification model to guide the correction process. This leads to a significant drop in performance in domains like scientific claims, where good verification models do not always exist. In this work, we introduce SciFix, a scientific claim correction system that does…

    Submitted 12 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: To appear in proceedings of EMNLP2023 (findings)

  47. arXiv:2305.14132  [pdf, ps, other]

    math.CO cs.DM cs.IT

    Set-coloring Ramsey numbers and error-correcting codes near the zero-rate threshold

    Authors: David Conlon, Jacob Fox, Huy Tuan Pham, Yufei Zhao

    Abstract: For positive integers $n,r,s$ with $r > s$, the set-coloring Ramsey number $R(n;r,s)$ is the minimum $N$ such that if every edge of the complete graph $K_N$ receives a set of $s$ colors from a palette of $r$ colors, then there is a subset of $n$ vertices where all of the edges between them receive a common color. If $n$ is fixed and $\frac{s}{r}$ is less than and bounded away from…

    Submitted 14 August, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  48. arXiv:2305.10429  [pdf, other]

    cs.CL cs.LG

    DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining

    Authors: Sang Michael Xie, Hieu Pham, Xuanyi Dong, Nan Du, Hanxiao Liu, Yifeng Lu, Percy Liang, Quoc V. Le, Tengyu Ma, Adams Wei Yu

    Abstract: The mixture proportions of pretraining data domains (e.g., Wikipedia, books, web text) greatly affect language model (LM) performance. In this paper, we propose Domain Reweighting with Minimax Optimization (DoReMi), which first trains a small proxy model using group distributionally robust optimization (Group DRO) over domains to produce domain weights (mixture proportions) without knowledge of do…

    Submitted 20 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  49. arXiv:2305.06198  [pdf, ps, other]

    cs.DS math.CO math.PR

    Optimal mixing of the down-up walk on independent sets of a given size

    Authors: Vishesh Jain, Marcus Michelen, Huy Tuan Pham, Thuy-Duong Vuong

    Abstract: Let $G$ be a graph on $n$ vertices of maximum degree $Δ$. We show that, for any $δ > 0$, the down-up walk on independent sets of size $k \leq (1-δ)α_c(Δ)n$ mixes in time $O_{Δ,δ}(k\log{n})$, thereby resolving a conjecture of Davies and Perkins in an optimal form. Here, $α_{c}(Δ)n$ is the NP-hardness threshold for the problem of counting independent sets of a given size in a graph on $n$ vertices of…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 25 pages; comments welcome!

  50. arXiv:2305.00627  [pdf]

    eess.IV cs.CV

    CNN-based fully automatic mitral valve extraction using CT images and existence probability maps

    Authors: Yukiteru Masuda, Ryo Ishikawa, Toru Tanaka, Gakuto Aoyama, Keitaro Kawashima, James V. Chapman, Masahiko Asami, Michael Huy Cuong Pham, Klaus Fuglsang Kofoed, Takuya Sakaguchi, Kiyohide Satoh

    Abstract: Accurate extraction of mitral valve shape from clinical tomographic images acquired in patients has proven useful for planning surgical and interventional mitral valve treatments. However, manual extraction of the mitral valve shape is laborious, and the existing automatic extraction methods have not been sufficiently accurate. In this paper, we propose a fully automated method of extracting mitra…

    Submitted 18 May, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: 15 pages, 6 figures, 3 tables. Changed title, corrected typo