Showing 1–32 of 32 results for author: Tran, B

Searching in archive cs.
  1. arXiv:2406.01494  [pdf, other]

    cs.CV cs.LG stat.ML

    Robust Classification by Coupling Data Mollification with Label Smoothing

    Authors: Markus Heinonen, Ba-Hien Tran, Michael Kampffmeyer, Maurizio Filippone

    Abstract: Introducing training-time augmentations is a key technique to enhance generalization and prepare deep neural networks against test-time corruptions. Inspired by the success of generative diffusion models, we propose a novel approach coupling data augmentation, in the form of image noising and blurring, with label smoothing to align predicted label confidences with image degradation. The method is… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2405.16339  [pdf, other]

    stat.ML cs.LG

    BOLD: Boolean Logic Deep Learning

    Authors: Van Minh Nguyen, Cristian Ocampo, Aymen Askri, Louis Leconte, Ba-Hien Tran

    Abstract: Deep learning is computationally intensive, with significant efforts focused on reducing arithmetic complexity, particularly regarding energy consumption dominated by data movement. While existing literature emphasizes inference, training is considerably more resource-intensive. This paper proposes a novel mathematical principle by introducing the notion of Boolean variation such that neurons made… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Under review

  3. arXiv:2405.11697  [pdf, other]

    cs.CY

    AMMeBa: A Large-Scale Survey and Dataset of Media-Based Misinformation In-The-Wild

    Authors: Nicholas Dufour, Arkanath Pathak, Pouya Samangouei, Nikki Hariri, Shashi Deshetti, Andrew Dudfield, Christopher Guess, Pablo Hernández Escayola, Bobby Tran, Mevan Babakar, Christoph Bregler

    Abstract: The prevalence and harms of online misinformation is a perennial concern for internet platforms, institutions and society at large. Over time, information shared online has become more media-heavy and misinformation has readily adapted to these new modalities. The rise of generative AI-based tools, which provide widely-accessible methods for synthesizing realistic audio, images, video and human-li… ▽ More

    Submitted 21 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: Grammar, spelling corrections. Minor rewording and clarification of one sentence. 24 pages, 31 figures

  4. arXiv:2404.12076  [pdf, other]

    cs.AI cs.NE

    Evolutionary Multi-Objective Optimisation for Fairness-Aware Self Adjusting Memory Classifiers in Data Streams

    Authors: Pivithuru Thejan Amarasinghe, Diem Pham, Binh Tran, Su Nguyen, Yuan Sun, Damminda Alahakoon

    Abstract: This paper introduces a novel approach, evolutionary multi-objective optimisation for fairness-aware self-adjusting memory classifiers, designed to enhance fairness in machine learning algorithms applied to data stream classification. With the growing concern over discrimination in algorithmic decision-making, particularly in dynamic data stream environments, there is a need for methods that ensur… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by GECCO 2024

  5. arXiv:2401.01108  [pdf, other]

    cs.CL

    Unveiling Comparative Sentiments in Vietnamese Product Reviews: A Sequential Classification Framework

    Authors: Ha Le, Bao Tran, Phuong Le, Tan Nguyen, Dac Nguyen, Ngoan Pham, Dang Huynh

    Abstract: Comparative opinion mining is a specialized field of sentiment analysis that aims to identify and extract sentiments expressed comparatively. To address this task, we propose an approach that consists of solving three sequential sub-tasks: (i) identifying comparative sentence, i.e., if a sentence has a comparative meaning, (ii) extracting comparative elements, i.e., what are comparison subjects, o… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted manuscript at VLSP 2023

  6. arXiv:2311.09491  [pdf, other]

    stat.ML cs.LG

    Spatial Bayesian Neural Networks

    Authors: Andrew Zammit-Mangion, Michael D. Kaminski, Ba-Hien Tran, Maurizio Filippone, Noel Cressie

    Abstract: interpretable, and well understood models that are routinely employed even though, as is revealed through prior and posterior predictive checks, these can poorly characterise the spatial heterogeneity in the underlying process of interest. Here, we propose a new, flexible class of spatial-process models, which we refer to as spatial Bayesian neural networks (SBNNs). An SBNN leverages the represent… ▽ More

    Submitted 4 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 35 pages, 21 figures

  7. arXiv:2308.16501  [pdf, other]

    cs.MA cs.AI

    Individually Rational Collaborative Vehicle Routing through Give-And-Take Exchanges

    Authors: Paul Mingzheng Tang, Ba Phong Tran, Hoong Chuin Lau

    Abstract: In this paper, we are concerned with the automated exchange of orders between logistics companies in a marketplace platform to optimize total revenues. We introduce a novel multi-agent approach to this problem, focusing on the Collaborative Vehicle Routing Problem (CVRP) through the lens of individual rationality. Our proposed algorithm applies the principles of Vehicle Routing Problem (VRP) to pa… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 7 pages, 4 figures. This paper was presented in the IJCAI 2023 First International Workshop on Search and Planning with Complex Objectives (WoSePCO) http://idm-lab.org/wiki/complex-objective

  8. arXiv:2305.18900  [pdf, other]

    cs.LG

    One-Line-of-Code Data Mollification Improves Optimization of Likelihood-based Generative Models

    Authors: Ba-Hien Tran, Giulio Franzese, Pietro Michiardi, Maurizio Filippone

    Abstract: Generative Models (GMs) have attracted considerable attention due to their tremendous success in various domains, such as computer vision where they are capable to generate impressive realistic-looking images. Likelihood-based GMs are attractive due to the possibility to generate new data by a single model evaluation. However, they typically achieve lower sample quality compared to state-of-the-ar… ▽ More

    Submitted 21 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  9. arXiv:2302.04534  [pdf, other]

    cs.LG stat.ML

    Fully Bayesian Autoencoders with Latent Sparse Gaussian Processes

    Authors: Ba-Hien Tran, Babak Shahbaba, Stephan Mandt, Maurizio Filippone

    Abstract: Autoencoders and their variants are among the most widely used models in representation learning and generative modeling. However, autoencoder-based models usually assume that the learned representations are i.i.d. and fail to capture the correlations between the data samples. To address this issue, we propose a novel Sparse Gaussian Process Bayesian Autoencoder (SGPBAE) model in which we impose f… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  10. arXiv:2210.15904  [pdf, other]

    cs.CV cs.AI cs.GR

    Self-Supervised Learning with Multi-View Rendering for 3D Point Cloud Analysis

    Authors: Bach Tran, Binh-Son Hua, Anh Tuan Tran, Minh Hoai

    Abstract: Recently, great progress has been made in 3D deep learning with the emergence of deep neural networks specifically designed for 3D point clouds. These networks are often trained from scratch or from pre-trained models learned purely from point cloud data. Inspired by the success of deep learning in the image domain, we devise a novel pre-training technique for better model initialization by utiliz… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: ACCV 2022 paper. 14 pages of content, 4 pages of references, 6 pages of supplementary material

  11. arXiv:2208.11035  [pdf, other]

    cs.DC

    Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems

    Authors: Prasoon Sinha, Akhil Guliani, Rutwik Jain, Brandon Tran, Matthew D. Sinclair, Shivaram Venkataraman

    Abstract: Scientists are increasingly exploring and utilizing the massive parallelism of general-purpose accelerators such as GPUs for scientific breakthroughs. As a result, datacenters, hyperscalers, national computing centers, and supercomputers have procured hardware to support this evolving application paradigm. These systems contain hundreds to tens of thousands of accelerators, enabling peta- and exa-… ▽ More

    Submitted 8 November, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: 14 pages, 18 figures, to appear at The 34th International Conference for High Performance Computing, Networking, Storage, and Analysis (SC '22)

  12. arXiv:2203.10609  [pdf, other]

    cs.CV

    A Novel Transparency Strategy-based Data Augmentation Approach for BI-RADS Classification of Mammograms

    Authors: Sam B. Tran, Huyen T. X. Nguyen, Chi Phan, Hieu H. Pham, Ha Q. Nguyen

    Abstract: Image augmentation techniques have been widely investigated to improve the performance of deep learning (DL) algorithms on mammography classification tasks. Recent methods have proved the efficiency of image augmentation on data deficiency or data imbalance issues. In this paper, we propose a novel transparency strategy to boost the Breast Imaging Reporting and Data System (BI-RADS) scores of mamm… ▽ More

    Submitted 17 April, 2023; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted for presentation at the 22nd IEEE Statistical Signal Processing (SSP) workshop

  13. SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

    Authors: Bao Hieu Tran, Thanh Le-Cong, Huu Manh Nguyen, Duc Anh Le, Thanh Hung Nguyen, Phi Le Nguyen

    Abstract: In the last decades, scene text recognition has gained worldwide attention from both the academic community and actual users due to its importance in a wide range of applications. Despite achievements in optical character recognition, scene text recognition remains challenging due to inherent problems such as distortions or irregular layout. Most of the existing approaches mainly leverage recurren… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: Accepted to ICMLA 2020

    Journal ref: 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)

  14. arXiv:2112.04490  [pdf, other]

    eess.IV cs.CV

    A novel multi-view deep learning approach for BI-RADS and density assessment of mammograms

    Authors: Huyen T. X. Nguyen, Sam B. Tran, Dung B. Nguyen, Hieu H. Pham, Ha Q. Nguyen

    Abstract: Advanced deep learning (DL) algorithms may predict the patient's risk of developing breast cancer based on the Breast Imaging Reporting and Data System (BI-RADS) and density standards. Recent studies have suggested that the combination of multi-view analysis improved the overall breast exam classification. In this paper, we propose a novel multi-view DL approach for BI-RADS and density assessment… ▽ More

    Submitted 17 April, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: This paper has been accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (2022 IEEE EMBC)

  15. arXiv:2111.14684  [pdf, other]

    cs.CL

    Speech Tasks Relevant to Sleepiness Determined with Deep Transfer Learning

    Authors: Bang Tran, Youxiang Zhu, Xiaohui Liang, James W. Schwoebel, Lindsay A. Warrenburg

    Abstract: Excessive sleepiness in attention-critical contexts can lead to adverse events, such as car crashes. Detecting and monitoring sleepiness can help prevent these adverse events from happening. In this paper, we use the Voiceome dataset to extract speech from 1,828 participants to develop a deep transfer learning model using Hidden-Unit BERT (HuBERT) speech representations to detect sleepiness from i… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

  16. arXiv:2111.07454  [pdf, other]

    cs.CL cs.SD eess.AS

    Towards Interpretability of Speech Pause in Dementia Detection using Adversarial Learning

    Authors: Youxiang Zhu, Bang Tran, Xiaohui Liang, John A. Batsis, Robert M. Roth

    Abstract: Speech pause is an effective biomarker in dementia detection. Recent deep learning models have exploited speech pauses to achieve highly accurate dementia detection, but have not exploited the interpretability of speech pauses, i.e., what and how positions and lengths of speech pauses affect the result of dementia detection. In this paper, we will study the positions and lengths of dementia-sensit… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

  17. arXiv:2106.06245  [pdf, other]

    stat.ML cs.LG

    Model Selection for Bayesian Autoencoders

    Authors: Ba-Hien Tran, Simone Rossi, Dimitrios Milios, Pietro Michiardi, Edwin V. Bonilla, Maurizio Filippone

    Abstract: We develop a novel method for carrying out model selection for Bayesian autoencoders (BAEs) by means of prior hyper-parameter optimization. Inspired by the common practice of type-II maximum likelihood optimization and its equivalence to Kullback-Leibler divergence minimization, we propose to optimize the distributional sliced-Wasserstein distance (DSWD) between the output of the autoencoder and t… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  18. arXiv:2011.12829  [pdf, other]

    stat.ML cs.LG

    All You Need is a Good Functional Prior for Bayesian Deep Learning

    Authors: Ba-Hien Tran, Simone Rossi, Dimitrios Milios, Maurizio Filippone

    Abstract: The Bayesian treatment of neural networks dictates that a prior distribution is specified over their weight and bias parameters. This poses a challenge because modern neural networks are characterized by a large number of parameters, and the choice of these priors has an uncontrolled effect on the induced functional prior, which is the distribution of the functions obtained by sampling the paramet… ▽ More

    Submitted 25 April, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

  19. Integrated Model-Driven Engineering of Blockchain Applications for Business Processes and Asset Management

    Authors: Qinghua Lu, An Binh Tran, Ingo Weber, Hugo O'Connor, Paul Rimba, Xiwei Xu, Mark Staples, Liming Zhu, Ross Jeffery

    Abstract: Blockchain has attracted broad interests to build decentralised applications. However, developing such applications without introducing vulnerabilities is hard for developers, not the least because the deployed code is immutable and can be called by anyone with access to the network. Model-driven engineering (MDE) helps… ▽ More

    Submitted 22 October, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: to appear in Software: Practice and Experience (2020)

  20. arXiv:2003.11948  [pdf, ps, other]

    cs.LG stat.ML

    Bag of biterms modeling for short texts

    Authors: Anh Phan Tuan, Bach Tran, Thien Nguyen Huu, Linh Ngo Van, Khoat Than

    Abstract: Analyzing texts from social media encounters many challenges due to their unique characteristics of shortness, massiveness, and dynamic. Short texts do not provide enough context information, causing the failure of the traditional statistical models. Furthermore, many applications often face with massive and dynamic short texts, causing various computational challenges to the current batch learnin… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

  21. arXiv:2001.10281  [pdf, other]

    cs.SE

    Efficient Logging for Blockchain Applications

    Authors: Christopher Klinkmüller, Ingo Weber, Alexander Ponomarev, An Binh Tran, Wil van der Aalst

    Abstract: Second generation blockchain platforms, like Ethereum, can store arbitrary data and execute user-defined smart contracts. Due to the shared nature of blockchains, understanding the usage of blockchain-based applications and the underlying network is crucial. Although log analysis is a well-established means, data extraction from blockchain platforms can be highly inconvenient and slow, not least d… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

  22. arXiv:1911.03992  [pdf, ps, other]

    math.OC cs.LG math.NA

    Stochastic DCA for minimizing a large sum of DC functions with application to Multi-class Logistic Regression

    Authors: Hoai An Le Thi, Hoai Minh Le, Duy Nhat Phan, Bach Tran

    Abstract: We consider the large sum of DC (Difference of Convex) functions minimization problem which appear in several different areas, especially in stochastic optimization and machine learning. Two DCA (DC Algorithm) based algorithms are proposed: stochastic DCA and inexact stochastic DCA. We prove that the convergence of both algorithms to a critical point is guaranteed with probability one. Furthermore… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

  23. arXiv:1906.09528  [pdf, other]

    q-bio.NC cs.LG

    Neural networks with motivation

    Authors: Sergey A. Shuvaev, Ngoc B. Tran, Marcus Stephenson-Jones, Bo Li, Alexei A. Koulakov

    Abstract: How can animals behave effectively in conditions involving different motivational contexts? Here, we propose how reinforcement learning neural networks can learn optimal behavior for dynamically changing motivational salience vectors. First, we show that Q-learning neural networks with motivation can navigate in environment with dynamic rewards. Second, we show that such networks can learn complex… ▽ More

    Submitted 18 November, 2019; v1 submitted 22 June, 2019; originally announced June 2019.

    Comments: Added the Methods section

  24. arXiv:1906.09453  [pdf, other]

    cs.CV cs.LG cs.NE stat.ML

    Image Synthesis with a Single (Robust) Classifier

    Authors: Shibani Santurkar, Dimitris Tsipras, Brandon Tran, Andrew Ilyas, Logan Engstrom, Aleksander Madry

    Abstract: We show that the basic classification framework alone can be used to tackle some of the most challenging tasks in image synthesis. In contrast to other state-of-the-art approaches, the toolkit we develop is rather minimal: it uses a single, off-the-shelf classifier for all these tasks. The crux of our approach is that we train this classifier to be adversarially robust. It turns out that adversari… ▽ More

    Submitted 8 August, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

  25. Domain Adaptation for Enterprise Email Search

    Authors: Brandon Tran, Maryam Karimzadehgan, Rama Kumar Pasumarthi, Michael Bendersky, Donald Metzler

    Abstract: In the enterprise email search setting, the same search engine often powers multiple enterprises from various industries: technology, education, manufacturing, etc. However, using the same global ranking model across different enterprises may result in suboptimal search quality, due to the corpora differences and distinct information needs. On the other hand, training an individual ranking model f… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

    Journal ref: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

  26. arXiv:1906.00945  [pdf, other]

    stat.ML cs.CV cs.LG cs.NE

    Adversarial Robustness as a Prior for Learned Representations

    Authors: Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Brandon Tran, Aleksander Madry

    Abstract: An important goal in deep learning is to learn versatile, high-level feature representations of input data. However, standard networks' representations seem to possess shortcomings that, as we illustrate, prevent them from fully realizing this goal. In this work, we show that robust optimization can be re-cast as a tool for enforcing priors on the features learned by deep neural networks. It turns… ▽ More

    Submitted 27 September, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

  27. arXiv:1905.02175  [pdf, other]

    stat.ML cs.CR cs.CV cs.LG

    Adversarial Examples Are Not Bugs, They Are Features

    Authors: Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Logan Engstrom, Brandon Tran, Aleksander Madry

    Abstract: Adversarial examples have attracted significant attention in machine learning, but the reasons for their existence and pervasiveness remain unclear. We demonstrate that adversarial examples can be directly attributed to the presence of non-robust features: features derived from patterns in the data distribution that are highly predictive, yet brittle and incomprehensible to humans. After capturing… ▽ More

    Submitted 12 August, 2019; v1 submitted 6 May, 2019; originally announced May 2019.

  28. arXiv:1901.11219  [pdf, other]

    cs.SE cs.CR

    A Platform Architecture for Multi-Tenant Blockchain-Based Systems

    Authors: Ingo Weber, Qinghua Lu, An Binh Tran, Amit Deshmukh, Marek Gorski, Markus Strazds

    Abstract: Blockchain has attracted a broad range of interests from start-ups, enterprises and governments to build next generation applications in a decentralized manner. Similar to cloud platforms, a single blockchain-based system may need to serve multiple tenants simultaneously. However, design of multi-tenant blockchain-based systems is challenging to architects in terms of data and performance isolatio… ▽ More

    Submitted 31 January, 2019; originally announced January 2019.

    Comments: 10 pages, IEEE International Conference on Software Architecture (ICSA2019)

  29. arXiv:1811.00636  [pdf, other]

    cs.LG cs.CR stat.ML

    Spectral Signatures in Backdoor Attacks

    Authors: Brandon Tran, Jerry Li, Aleksander Madry

    Abstract: A recent line of work has uncovered a new form of data poisoning: so-called backdoor attacks. These attacks are particularly dangerous because they do not affect a network's behavior on typical, benign data. Rather, the network only deviates from its expected output when triggered by a perturbation planted by an adversary. In this paper, we identify a new property of all known backdoor at… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: 16 pages, accepted to NIPS 2018

  30. arXiv:1806.09620  [pdf, other]

    math.OC cs.LG math.NA

    A DCA-Like Algorithm and its Accelerated Version with Application in Data Visualization

    Authors: Hoai An Le Thi, Hoai Minh Le, Duy Nhat Phan, Bach Tran

    Abstract: In this paper, we present two variants of DCA (Difference of Convex functions Algorithm) to solve the constrained sum of differentiable function and composite functions minimization problem, with the aim of increasing the convergence speed of DCA. In the first variant, DCA-Like, we introduce a new technique to iteratively modify the decomposition of the objective function. This successive decomposi… ▽ More

    Submitted 25 June, 2018; originally announced June 2018.

  31. arXiv:1712.02779  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Exploring the Landscape of Spatial Robustness

    Authors: Logan Engstrom, Brandon Tran, Dimitris Tsipras, Ludwig Schmidt, Aleksander Madry

    Abstract: The study of adversarial robustness has so far largely focused on perturbations bound in p-norms. However, state-of-the-art models turn out to be also vulnerable to other, more natural classes of perturbations such as translations and rotations. In this work, we thoroughly investigate the vulnerability of neural network--based classifiers to rotations and translations. While data augmentation offe… ▽ More

    Submitted 16 September, 2019; v1 submitted 7 December, 2017; originally announced December 2017.

    Comments: ICML 2019. Presented in NIPS 2017 Workshop on Machine Learning and Computer Security as "A Rotation and a Translation Suffice: Fooling CNNs with Simple Transformations."

  32. arXiv:1709.05935  [pdf, other]

    cs.CR

    Data Integrity Threats and Countermeasures in Railway Spot Transmission Systems

    Authors: Hoon Wei Lim, William G. Temple, Bao Anh N. Tran, Binbin Chen, Zbigniew Kalbarczyk, Jianying Zhou

    Abstract: Modern trains rely on balises (communication beacons) located on the track to provide location information as they traverse a rail network. Balises, such as those conforming to the Eurobalise standard, were not designed with security in mind and are thus vulnerable to cyber attacks targeting data availability, integrity, or authenticity. In this work, we discuss data integrity threats to balise tr… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.