
Showing 1–50 of 151 results for author: Liu, W

Searching in archive stat.
  1. arXiv:2408.11272  [pdf, other]

    stat.ME

    High-Dimensional Overdispersed Generalized Factor Model with Application to Single-Cell Sequencing Data Analysis

    Authors: Jinyu Nie, Zhilong Qin, Wei Liu

    Abstract: The current high-dimensional linear factor models fail to account for the different types of variables, while high-dimensional nonlinear factor models often overlook the overdispersion present in mixed-type data. However, overdispersion is prevalent in practical applications, particularly in fields like biomedical and genomics studies. To address this practical demand, we propose an overdispersed…

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2408.10542  [pdf, other]

    stat.ME

    High-Dimensional Covariate-Augmented Overdispersed Multi-Study Poisson Factor Model

    Authors: Wei Liu, Qingzhi Zhong

    Abstract: Factor analysis for high-dimensional data is a canonical problem in statistics and has a wide range of applications. However, there is currently no factor model tailored to effectively analyze high-dimensional count responses with corresponding covariates across multiple studies, such as the single-cell sequencing dataset from a case-control study. In this paper, we introduce factor models designe…

    Submitted 20 August, 2024; originally announced August 2024.

  3. arXiv:2407.02010  [pdf, other]

    stat.ML cs.LG

    Feynman-Kac Operator Expectation Estimator

    Authors: Jingyuan Li, Wei Liu

    Abstract: The Feynman-Kac Operator Expectation Estimator (FKEE) is an innovative method for estimating the target Mathematical Expectation $\mathbb{E}_{X\sim P}[f(X)]$ without relying on a large number of samples, in contrast to the commonly used Markov Chain Monte Carlo (MCMC) Expectation Estimator. FKEE comprises diffusion bridge models and approximation of the Feynman-Kac operator. The key idea is to use…

    Submitted 2 July, 2024; originally announced July 2024.

  4. arXiv:2405.10925  [pdf]

    stat.ME cs.AI cs.LG

    High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates

    Authors: Janick Weberpals, Pamela A. Shaw, Kueiyu Joshua Lin, Richard Wyss, Joseph M Plasek, Li Zhou, Kerry Ngan, Thomas DeRamus, Sudha R. Raman, Bradley G. Hammill, Hana Lee, Sengwee Toh, John G. Connolly, Kimberly J. Dandreo, Fang Tian, Wei Liu, Jie Li, José J. Hernández-Muñoz, Sebastian Schneeweiss, Rishi J. Desai

    Abstract: Multiple imputation (MI) models can be improved by including auxiliary covariates (AC), but their performance in high-dimensional data is not well understood. We aimed to develop and compare high-dimensional MI (HDMI) approaches using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation study using data from…

    Submitted 17 May, 2024; originally announced May 2024.

  5. arXiv:2404.18732  [pdf, other]

    stat.ME

    Two-way Homogeneity Pursuit for Quantile Network Vector Autoregression

    Authors: Wenyang Liu, Ganggang Xu, Jianqing Fan, Xuening Zhu

    Abstract: While the Vector Autoregression (VAR) model has received extensive attention for modelling complex time series, quantile VAR analysis remains relatively underexplored for high-dimensional time series data. To address this disparity, we introduce a two-way grouped network quantile (TGNQ) autoregression model for time series collected on large-scale networks, known for their significant heterogeneou…

    Submitted 29 April, 2024; originally announced April 2024.

  6. arXiv:2404.15207  [pdf, other]

    cs.CE cond-mat.mtrl-sci cs.LG stat.AP

    Simulation-Free Determination of Microstructure Representative Volume Element Size via Fisher Scores

    Authors: Wei Liu, Satyajit Mojumder, Wing Kam Liu, Wei Chen, Daniel W. Apley

    Abstract: A representative volume element (RVE) is a reasonably small unit of microstructure that can be simulated to obtain the same effective properties as the entire microstructure sample. Finite element (FE) simulation of RVEs, as opposed to much larger samples, saves computational expense, especially in multiscale modeling. Therefore, it is desirable to have a framework that determines RVE size prior t…

    Submitted 7 April, 2024; originally announced April 2024.

    Journal ref: APL Mach. Learn. 2(2): 026101 (2024)

  7. arXiv:2404.13056  [pdf, other]

    cs.LG cs.CE stat.CO stat.ME stat.ML

    Variational Bayesian Optimal Experimental Design with Normalizing Flows

    Authors: Jiayuan Dong, Christian Jacobsen, Mehdi Khalloufi, Maryam Akram, Wanjiao Liu, Karthik Duraisamy, Xun Huan

    Abstract: Bayesian optimal experimental design (OED) seeks experiments that maximize the expected information gain (EIG) in model parameters. Directly estimating the EIG using nested Monte Carlo is computationally expensive and requires an explicit likelihood. Variational OED (vOED), in contrast, estimates a lower bound of the EIG without likelihood evaluations by approximating the posterior distributions w…

    Submitted 8 April, 2024; originally announced April 2024.

    MSC Class: 62K05; 94A17; 62C10; 62F15

  8. arXiv:2404.08679  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector

    Authors: Andi Zhang, Tim Z. Xiao, Weiyang Liu, Robert Bamler, Damon Wischik

    Abstract: We revisit the likelihood ratio between a pretrained large language model (LLM) and its finetuned variant as a criterion for out-of-distribution (OOD) detection. The intuition behind such a criterion is that the pretrained LLM has the prior knowledge about OOD data due to its large amount of training data, and once finetuned with the in-distribution data, the LLM has sufficient knowledge to disti…

    Submitted 7 April, 2024; originally announced April 2024.

  9. arXiv:2402.15071  [pdf, other]

    stat.ME

    High-Dimensional Covariate-Augmented Overdispersed Poisson Factor Model

    Authors: Wei Liu, Qingzhi Zhong

    Abstract: The current Poisson factor models often assume that the factors are unknown, which overlooks the explanatory potential of certain observable covariates. This study focuses on high dimensional settings, where the number of the count response variables and/or covariates can diverge as the sample size increases. A covariate-augmented overdispersed Poisson factor model is proposed to jointly perform a…

    Submitted 22 February, 2024; originally announced February 2024.

  10. arXiv:2402.07227  [pdf, other]

    math.DS econ.GN stat.AP

    Time-Delayed Game Strategy Analysis Among Japan, Other Nations, and the International Atomic Energy Agency in the Context of Fukushima Nuclear Wastewater Discharge Decision

    Authors: Mingyang Li, Han Pengsihua, Fujiao Meng, Zejun Wang, Weian Liu

    Abstract: This academic paper examines the strategic interactions between Japan, other nations, and the International Atomic Energy Agency (IAEA) regarding Japan's decision to release treated nuclear wastewater from the Fukushima Daiichi Nuclear Power Plant into the sea. It introduces a payoff matrix and time-delay elements in replicator dynamic equations to mirror real-world decision-making delays. The pap…

    Submitted 11 February, 2024; originally announced February 2024.

  11. arXiv:2402.07210  [pdf, other]

    math.DS econ.GN physics.soc-ph stat.AP

    Fukushima Nuclear Wastewater Discharge: An Evolutionary Game Theory Approach to International and Domestic Interaction and Strategic Decision-Making

    Authors: Mingyang Li, Han Pengsihua, Songqing Zhao, Zejun Wang, Limin Yang, Weian Liu

    Abstract: On August 24, 2023, Japan controversially decided to discharge nuclear wastewater from the Fukushima Daiichi Nuclear Power Plant into the ocean, sparking intense domestic and global debates. This study uses evolutionary game theory to analyze the strategic dynamics between Japan, other countries, and the Japan Fisheries Association. By incorporating economic, legal, international aid, and environm…

    Submitted 11 February, 2024; originally announced February 2024.

  12. arXiv:2402.02687  [pdf, other]

    cs.LG cs.AI stat.ML

    Poisson Process for Bayesian Optimization

    Authors: Xiaoxing Wang, Jiaxing Li, Chao Xue, Wei Liu, Weifeng Liu, Xiaokang Yang, Junchi Yan, Dacheng Tao

    Abstract: Bayesian Optimization (BO) is a sample-efficient black-box optimizer, and extensive methods have been proposed to build the absolute function response of the black-box function through a probabilistic surrogate model, including Tree-structured Parzen Estimator (TPE), random forest (SMAC), and Gaussian process (GP). However, few methods have been explored to estimate the relative rankings of candidat…

    Submitted 4 February, 2024; originally announced February 2024.

  13. arXiv:2402.00440  [pdf, other]

    stat.AP

    Optimal investment, consumption and life insurance decisions for households with consumption habits under the health shock risk

    Authors: Zhen Zhao, Wei Liu, Xiaoyi Tang

    Abstract: This paper investigates the optimal investment, consumption, and life insurance strategies for households under the impact of health shock risk. Considering the uncertainty of the future health status of family members, a non-homogeneous Markov process is used to model the health status of the breadwinner. Drawing upon the theory of habit formation, we investigate the influence of different consum…

    Submitted 1 February, 2024; originally announced February 2024.

  14. arXiv:2401.01294  [pdf, other]

    stat.ML cs.LG stat.ME

    Efficient Sparse Least Absolute Deviation Regression with Differential Privacy

    Authors: Weidong Liu, Xiaojun Mao, Xiaofei Zhang, Xin Zhang

    Abstract: In recent years, privacy-preserving machine learning algorithms have attracted increasing attention because of their important applications in many scientific fields. However, in the literature, most privacy-preserving algorithms demand learning objectives to be strongly convex and Lipschitz smooth, which thus cannot cover a wide class of robust loss functions (e.g., quantile/least absolute loss).…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: IEEE Transactions on Information Forensics and Security, 2024

    MSC Class: 62J07

  15. arXiv:2401.00611  [pdf, other]

    stat.ML cs.AI cs.LG

    A Compact Representation for Bayesian Neural Networks By Removing Permutation Symmetry

    Authors: Tim Z. Xiao, Weiyang Liu, Robert Bamler

    Abstract: Bayesian neural networks (BNNs) are a principled approach to modeling predictive uncertainties in deep learning, which are important in safety-critical applications. Since exact Bayesian inference over the weights in a BNN is intractable, various approximate inference methods exist, among which sampling methods such as Hamiltonian Monte Carlo (HMC) are often considered the gold standard. While HMC…

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Accepted at NeurIPS 2023 Workshop on Unifying Representations in Neural Models; 4 pages + appendix

  16. arXiv:2310.18286  [pdf, other]

    cs.LG stat.AP stat.ML

    Optimal Transport for Treatment Effect Estimation

    Authors: Hao Wang, Zhichao Chen, Jiajun Fan, Haoxuan Li, Tianqiao Liu, Weiming Liu, Quanyu Dai, Yichao Wang, Zhenhua Dong, Ruiming Tang

    Abstract: Estimating conditional average treatment effect from observational data is highly challenging due to the existence of treatment selection bias. Prevalent methods mitigate this issue by aligning distributions of different treatment groups in the latent space. However, there are two critical problems that these methods fail to address: (1) mini-batch sampling effects (MSE), which cause misalignment…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted as NeurIPS 2023 Poster

  17. arXiv:2310.02581  [pdf, other]

    stat.ML cs.LG

    Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning

    Authors: Weidong Liu, Jiyuan Tu, Yichen Zhang, Xi Chen

    Abstract: Recently, reinforcement learning has gained prominence in modern statistics, with policy evaluation being a key component. Unlike traditional machine learning literature on this topic, our work places emphasis on statistical inference for the parameter estimates computed using reinforcement learning algorithms. While most existing analyses assume random rewards to follow standard distributions, li…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 63 pages, 32 figures

  18. arXiv:2309.05620  [pdf, other]

    stat.ME

    Minimum Area Confidence Set Optimality for Simultaneous Confidence Bands for Percentiles in Linear Regression

    Authors: Lingjiao Wang, Yang Han, Wei Liu, Frank Bretz

    Abstract: Simultaneous confidence bands (SCBs) for percentiles in linear regression are valuable tools with many applications. In this paper, we propose a novel criterion for comparing SCBs for percentiles, termed the Minimum Area Confidence Set (MACS) criterion. This criterion utilizes the area of the confidence set for the pivotal quantities, which are generated from the confidence set of the unknown para…

    Submitted 13 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 26 pages, 6 figures

  19. arXiv:2308.12680  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

    Authors: Hanchi Huang, Li Shen, Deheng Ye, Wei Liu

    Abstract: We propose a novel master-slave architecture to solve the top-$K$ combinatorial multi-armed bandits problem with non-linear bandit feedback and diversity constraints, which, to the best of our knowledge, is the first combinatorial bandits setting considering diversity constraints under bandit feedback. Specifically, to efficiently explore the combinatorial and constrained action space, we introduc…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: IEEE Transactions on Neural Networks and Learning Systems

  20. arXiv:2306.10395  [pdf, other]

    stat.ML cs.LG

    Distributed Semi-Supervised Sparse Statistical Inference

    Authors: Jiyuan Tu, Weidong Liu, Xiaojun Mao, Mingyue Xu

    Abstract: The debiased estimator is a crucial tool in statistical inference for high-dimensional model parameters. However, constructing such an estimator involves estimating the high-dimensional inverse Hessian matrix, incurring significant computational costs. This challenge becomes particularly acute in distributed setups, where traditional methods necessitate computing a debiased estimator on every mach…

    Submitted 15 December, 2023; v1 submitted 17 June, 2023; originally announced June 2023.

    Comments: IEEE Transactions on Information Theory, 2023

  21. arXiv:2305.17665  [pdf, other]

    cs.LG stat.ML

    Acceleration of stochastic gradient descent with momentum by averaging: finite-sample rates and asymptotic normality

    Authors: Kejie Tang, Weidong Liu, Yichen Zhang, Xi Chen

    Abstract: Stochastic gradient descent with momentum (SGDM) has been widely used in many machine learning and statistical applications. Despite the observed empirical benefits of SGDM over traditional SGD, the theoretical understanding of the role of momentum for different learning rates in the optimization process remains widely open. We analyze the finite-sample convergence rate of SGDM under the strongly…

    Submitted 1 February, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

  22. Detector Design and Performance Analysis for Target Detection in Subspace Interference

    Authors: Weijian Liu, Jun Liu, Tao Liu, Hui Chen, Yong-Liang Wang

    Abstract: It is often difficult to obtain sufficient training data for adaptive signal detection, which is required to calculate the unknown noise covariance matrix. Additionally, interference is frequently present, which complicates the detection problem. We provide a two-step method, termed interference cancellation before detection (ICBD), to address the issue of signal detection in the unknown Gaussian no…

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: This manuscript is submitted to IEEE SPL with paper ID SPL-35580-2023 and the decision "AQ - Publish In Minor, Required Changes"

  23. A review of distributed statistical inference

    Authors: Yuan Gao, Weidong Liu, Hansheng Wang, Xiaozhou Wang, Yibo Yan, Riquan Zhang

    Abstract: The rapid emergence of massive datasets in various fields poses a serious challenge to traditional statistical methods. Meanwhile, it provides opportunities for researchers to develop novel algorithms. Inspired by the idea of divide-and-conquer, various distributed frameworks for statistical estimation and inference have been proposed. They were developed to deal with large-scale statistical optim…

    Submitted 12 April, 2023; originally announced April 2023.

    Journal ref: Statistical Theory and Related Fields, 6(2), 89-99 (2022)

  24. arXiv:2303.15464  [pdf, other]

    cs.LG cs.AI math.ST stat.ML

    Mathematical Challenges in Deep Learning

    Authors: Vahid Partovi Nia, Guojun Zhang, Ivan Kobyzev, Michael R. Metel, Xinlin Li, Ke Sun, Sobhan Hemati, Masoud Asgharian, Linglong Kong, Wulong Liu, Boxing Chen

    Abstract: Deep models have dominated the artificial intelligence (AI) industry since the ImageNet challenge in 2012. The size of deep models has been increasing ever since, which brings new challenges to this field with applications in cell phones, personal computers, autonomous cars, and wireless base stations. Here we list a set of problems, ranging from training, inference, generalization bound, and optimizati…

    Submitted 24 March, 2023; originally announced March 2023.

  25. arXiv:2303.06484  [pdf, other]

    cs.LG cs.CV stat.ML

    Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity Gap

    Authors: Weiyang Liu, Longhui Yu, Adrian Weller, Bernhard Schölkopf

    Abstract: The neural collapse (NC) phenomenon describes an underlying geometric symmetry for deep neural networks, where both deeply learned features and classifiers converge to a simplex equiangular tight frame. It has been shown that both cross-entropy loss and mean square error can provably lead to NC. We remove NC's key assumption on the feature dimension and the number of classes, and then present a ge…

    Submitted 15 April, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 (v2: fixed typos)

  26. arXiv:2302.10633  [pdf, other]

    cs.LG stat.ML

    Generalization Bounds for Adversarial Contrastive Learning

    Authors: Xin Zou, Weiwei Liu

    Abstract: Deep networks are well-known to be fragile to adversarial attacks, and adversarial training is one of the most popular methods used to train a robust model. To take advantage of unlabeled data, recent works have applied adversarial training to contrastive learning (Adversarial Contrastive Learning; ACL for short) and obtain promising robust performance. However, the theory of ACL is not well under…

    Submitted 21 February, 2023; originally announced February 2023.

  27. arXiv:2210.11039  [pdf, other]

    cs.LG cs.AI stat.ML

    Entire Space Counterfactual Learning: Tuning, Analytical Properties and Industrial Applications

    Authors: Hao Wang, Zhichao Chen, Jiajun Fan, Yuxin Huang, Weiming Liu, Xinggao Liu

    Abstract: As a basic research problem for building effective recommender systems, post-click conversion rate (CVR) estimation has long been plagued by sample selection bias and data sparsity issues. To address the data sparsity issue, prevalent methods based on entire space multi-task model leverage the sequential pattern of user actions, i.e. exposure $\rightarrow$ click $\rightarrow$ conversion to constru…

    Submitted 20 February, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: This submission is an extension of arXiv:2204.05125

  28. arXiv:2210.08393  [pdf, other]

    math.ST stat.ML

    Distributed Estimation and Inference for Semi-parametric Binary Response Models

    Authors: Xi Chen, Wenbo Jing, Weidong Liu, Yichen Zhang

    Abstract: The development of modern technology has enabled data collection of unprecedented size, which poses new challenges to many statistical estimation and inference problems. This paper studies the maximum score estimator of a semi-parametric binary choice model under a distributed computing environment without pre-specifying the noise distribution. An intuitive divide-and-conquer estimator is computat…

    Submitted 15 August, 2024; v1 submitted 15 October, 2022; originally announced October 2022.

  29. arXiv:2210.04165  [pdf, other]

    cs.LG cs.CE nlin.CD stat.ML

    Neural Extended Kalman Filters for Learning and Predicting Dynamics of Structural Systems

    Authors: Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

    Abstract: Accurate structural response prediction forms a main driver for structural health monitoring and control applications. This often requires the proposed model to adequately capture the underlying dynamics of complex structural systems. In this work, we utilize a learnable Extended Kalman Filter (EKF), named the Neural Extended Kalman Filter (Neural EKF) throughout this paper, for learning the laten…

    Submitted 3 July, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: Accepted for publication in Structural Health Monitoring

    Journal ref: Structural Health Monitoring, 2023

  30. arXiv:2210.02015  [pdf, other]

    stat.ML cs.CY cs.LG

    Conformalized Fairness via Quantile Regression

    Authors: Meichen Liu, Lei Ding, Dengdeng Yu, Wulong Liu, Linglong Kong, Bei Jiang

    Abstract: Algorithmic fairness has received increased attention in socially sensitive domains. While rich literature on mean fairness has been established, research on quantile fairness remains sparse but vital. To fulfill great needs and advocate the significance of quantile fairness, we propose a novel framework to learn a real-valued quantile function under the fairness requirement of Demographic Parity…

    Submitted 14 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 18 pages, 5 figures, 2 tables

  31. arXiv:2209.04419  [pdf, other]

    cs.CR cs.LG stat.ME stat.ML

    Majority Vote for Distributed Differentially Private Sign Selection

    Authors: Weidong Liu, Jiyuan Tu, Xiaojun Mao, Xi Chen

    Abstract: Privacy-preserving data analysis has become more prevalent in recent years. In this study, we propose a distributed group differentially private Majority Vote mechanism, for the sign selection problem in a distributed setup. To achieve this, we apply the iterative peeling to the stability function and use the exponential mechanism to recover the signs. For enhanced applicability, we study the priv…

    Submitted 4 June, 2024; v1 submitted 8 September, 2022; originally announced September 2022.

    Comments: 41 pages, 5 figures

  32. arXiv:2207.04300  [pdf, other]

    stat.ME

    Confidence Sets for a level set in linear regression

    Authors: Fang Wan, Wei Liu, Frank Bretz

    Abstract: Regression modeling is the workhorse of statistics and there is a vast literature on estimation of the regression function. It has been realized in recent years that in regression analysis the ultimate aim may be the estimation of a level set of the regression function, instead of the estimation of the regression function itself. The published work on estimation of the level set has thus far focused mai…

    Submitted 26 July, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

  33. arXiv:2202.05498  [pdf, other]

    stat.ML cs.LG stat.ME

    Fast and Robust Sparsity Learning over Networks: A Decentralized Surrogate Median Regression Approach

    Authors: Weidong Liu, Xiaojun Mao, Xin Zhang

    Abstract: Decentralized sparsity learning has attracted a significant amount of attention recently due to its rapidly growing applications. To obtain the robust and sparse estimators, a natural idea is to adopt the non-smooth median loss combined with an $\ell_1$ sparsity regularizer. However, most of the existing methods suffer from slow convergence performance caused by the double non-smooth objectiv…

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: IEEE Transactions on Signal Processing, 2022

  34. arXiv:2202.02416  [pdf, other]

    stat.ME stat.AP stat.ML

    Generalized Causal Tree for Uplift Modeling

    Authors: Preetam Nandy, Xiufan Yu, Wanjun Liu, Ye Tu, Kinjal Basu, Shaunak Chatterjee

    Abstract: Uplift modeling is crucial in various applications ranging from marketing and policy-making to personalized recommendations. The main objective is to learn optimal treatment allocations for a heterogeneous population. A primary line of existing work modifies the loss function of the decision tree algorithm to identify cohorts with heterogeneous treatment effects. Another line of work estimates the…

    Submitted 19 December, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

  35. arXiv:2202.00769  [pdf, other]

    cs.LG stat.ML

    Distributional Reinforcement Learning by Sinkhorn Divergence

    Authors: Ke Sun, Yingnan Zhao, Wulong Liu, Bei Jiang, Linglong Kong

    Abstract: The empirical success of distributional reinforcement learning (RL) highly depends on the distribution representation and the choice of distribution divergence. In this paper, we propose Sinkhorn distributional RL (SinkhornDRL) that learns unrestricted statistics from return distributions and leverages Sinkhorn divergence to minimize the difference between current and target Bellman retur…

    Submitted 2 February, 2024; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2110.03155

  36. arXiv:2112.02504  [pdf, ps, other]

    cs.LG stat.ML

    A Novel Sequential Coreset Method for Gradient Descent Algorithms

    Authors: Jiawei Huang, Ruomin Huang, Wenjie Liu, Nikolaos M. Freris, Hu Ding

    Abstract: A wide range of optimization problems arising in machine learning can be solved by gradient descent algorithms, and a central question in this area is how to efficiently compress a large-scale dataset so as to reduce the computational complexity. Coreset is a popular data compression technique that has been extensively studied before. However, most existing coreset methods are problem-dep…

    Submitted 8 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

  37. arXiv:2110.15480  [pdf, other]

    stat.ME math.ST

    Multiple-Splitting Projection Test for High-Dimensional Mean Vectors

    Authors: Wanjun Liu, Xiufan Yu, Runze Li

    Abstract: We propose a multiple-splitting projection test (MPT) for one-sample mean vectors in high-dimensional settings. The idea of projection test is to project high-dimensional samples to a 1-dimensional space using an optimal projection direction such that traditional tests can be carried out with projected samples. However, estimation of the optimal projection direction has not been systematically stu…

    Submitted 17 April, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

  38. arXiv:2110.14098  [pdf, other]

    cs.LG cs.AI stat.ML

    Provable Lifelong Learning of Representations

    Authors: Xinyuan Cao, Weiyang Liu, Santosh S. Vempala

    Abstract: In lifelong learning, tasks (or classes) to be learned arrive sequentially over time in arbitrary order. During training, knowledge from previous tasks can be captured and transferred to subsequent ones to improve sample efficiency. We consider the setting where all target tasks can be represented in the span of a small number of unknown linear or nonlinear features of the input data. We propose a…

    Submitted 1 March, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted to AISTATS 2022

  39. arXiv:2110.08607  [pdf, other]

    cs.LG cs.CE nlin.CD stat.ML

    Physics-guided Deep Markov Models for Learning Nonlinear Dynamical Systems with Uncertainty

    Authors: Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

    Abstract: In this paper, we propose a probabilistic physics-guided framework, termed Physics-guided Deep Markov Model (PgDMM). The framework targets the inference of the characteristics and latent structure of nonlinear dynamical systems from measurement data, where exact inference of latent variables is typically intractable. A recently surfaced option pertains to leveraging variational inference to perfor…

    Submitted 25 May, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in Mechanical Systems and Signal Processing

    Journal ref: Mechanical Systems and Signal Processing 178 (2022) 109276

  40. arXiv:2108.10015  [pdf, other]

    cs.CL stat.ML

    Semantic-Preserving Adversarial Text Attacks

    Authors: Xinghao Yang, Weifeng Liu, James Bailey, Dacheng Tao, Wei Liu

    Abstract: Deep neural networks (DNNs) are known to be vulnerable to adversarial images, while their robustness in text classification is rarely studied. Several lines of text attack methods have been proposed in the literature, including character-level, word-level, and sentence-level attacks. However, it is still a challenge to minimize the number of word changes necessary to induce misclassification, whil…

    Submitted 2 March, 2023; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: 12 pages, 3 figures, 10 tables

  41. arXiv:2107.02999  [pdf, other]

    math.ST stat.ME

    A unified precision matrix estimation framework via sparse column-wise inverse operator under weak sparsity

    Authors: Zeyu Wu, Cheng Wang, Weidong Liu

    Abstract: In this paper, we estimate the high dimensional precision matrix under the weak sparsity condition where many entries are nearly zero. We revisit the sparse column-wise inverse operator (SCIO) estimator \cite{liu2015fast} and derive its general error bounds under the weak sparsity condition. A unified framework is established to deal with various cases including the heavy-tailed data, the non-para…

    Submitted 20 October, 2022; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 29 pages, 5 figures

    MSC Class: 62H12; 62H22

  42. A Generative Node-attribute Network Model for Detecting Generalized Structure

    Authors: Wei Liu, Zhenhai Chang, Caiyan Jia, Yimei Zheng

    Abstract: Exploring meaningful structural regularities embedded in networks is a key to understanding and analyzing the structure and function of a network. The node-attribute information can help improve such understanding and analysis. However, most of the existing methods focus on detecting traditional communities, i.e., groupings of nodes with dense internal connections and sparse external ones. In this…

    Submitted 5 June, 2021; originally announced June 2021.

  43. arXiv:2105.13282  [pdf, ps, other]

    eess.SP cs.IT stat.AP

    Detection of a rank-one signal with limited training data

    Authors: Weijian Liu, Zhaojian Zhang, Jun Liu, Zheran Shang, Yong-Liang Wang

    Abstract: In this paper, we reconsider the problem of detecting a matrix-valued rank-one signal in unknown Gaussian noise, which was previously addressed for the case of sufficient training data. We relax the above assumption to the case of limited training data. We re-derive the corresponding generalized likelihood ratio test (GLRT) and two-step GLRT (2S-GLRT) based on certain unitary transformation on th…

    Submitted 13 April, 2021; originally announced May 2021.

    Comments: This manuscript is accepted by Signal Processing

    Report number: SIGPRO_108120

  44. arXiv:2103.02860  [pdf, other]

    stat.ML cs.LG stat.ME

    Variance Reduced Median-of-Means Estimator for Byzantine-Robust Distributed Inference

    Authors: Jiyuan Tu, Weidong Liu, Xiaojun Mao, Xi Chen

    Abstract: This paper develops an efficient distributed inference algorithm, which is robust against a moderate fraction of Byzantine nodes, namely arbitrary and possibly adversarial machines in a distributed learning system. In robust statistics, the median-of-means (MOM) has been a popular approach to hedge against Byzantine failures due to its ease of implementation and computational efficiency. However,…

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 64 pages, 3 figures

  45. arXiv:2102.03474  [pdf, ps, other]

    stat.AP

    Multichannel adaptive signal detection: Basic theory and literature review

    Authors: Weijian Liu, Jun Liu, Chengpeng Hao, Yongchan Gao, Yong-Liang Wang

    Abstract: Multichannel adaptive signal detection jointly uses the test and training data to form an adaptive detector, and then makes a decision on whether a target exists or not. Remarkably, the resulting adaptive detectors usually possess the constant false alarm rate (CFAR) properties, and hence no additional CFAR processing is needed. Filtering is not needed as a processing procedure either, since the fu…

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: 10 pages, 5 figures. This manuscript is accepted in Science China: Information Sciences

    Report number: Manuscript No. SCIS-2020-1112.R1

  46. arXiv:2012.11100  [pdf, other]

    stat.ME

    Two-directional simultaneous inference for high-dimensional models

    Authors: Wei Liu, Huazhen Lin, Jin Liu, Shurong Zheng

    Abstract: This paper proposes a general two-directional simultaneous inference (TOSI) framework for high-dimensional models with a manifest variable or latent variable structure, for example, high-dimensional mean models, high-dimensional sparse regression models, and high-dimensional latent factor models. TOSI performs simultaneous inference on a set of parameters from two directions, one to test whether…

    Submitted 6 February, 2023; v1 submitted 20 December, 2020; originally announced December 2020.

  47. arXiv:2011.05885  [pdf, other]

    cs.LG stat.ML

    Leveraged Matrix Completion with Noise

    Authors: Xinjian Huang, Weiwei Liu, Bo Du, Dacheng Tao

    Abstract: Completing low-rank matrices from subsampled measurements has received much attention in the past decade. Existing works indicate that $\mathcal{O}(nr\log^2(n))$ datums are required to theoretically secure the completion of an $n \times n$ noisy matrix of rank $r$ with high probability, under some quite restrictive assumptions: (1) the underlying matrix must be incoherent; (2) observations follow…

    Submitted 14 August, 2023; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: This manuscript has been accepted for publication as a regular paper in the IEEE Transactions on Cybernetics

  48. arXiv:2010.15703  [pdf, other]

    cs.CV stat.ML

    Permute, Quantize, and Fine-tune: Efficient Compression of Neural Networks

    Authors: Julieta Martinez, Jashan Shewakramani, Ting Wei Liu, Ioan Andrei Bârsan, Wenyuan Zeng, Raquel Urtasun

    Abstract: Compressing large neural networks is an important step for their deployment in resource-constrained computational platforms. In this context, vector quantization is an appealing framework that expresses multiple parameters using a single code, and has recently achieved state-of-the-art network compression on a range of core vision and natural language processing tasks. Key to the success of vector…

    Submitted 10 April, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: CVPR 21 Oral

  49. arXiv:2009.13891  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning

    Authors: Haotian Fu, Hongyao Tang, Jianye Hao, Chen Chen, Xidong Feng, Dong Li, Wulong Liu

    Abstract: Context, the embedding of previously collected trajectories, is a powerful construct for Meta-Reinforcement Learning (Meta-RL) algorithms. By conditioning on an effective context, Meta-RL policies can easily generalize to new tasks within a few adaptation steps. We argue that improving the quality of context involves answering two questions: 1. How to train a compact and sufficient encoder that can…

    Submitted 15 December, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to AAAI 2021

  50. arXiv:2008.12522  [pdf, other]

    cs.LG stat.ML

    An Intelligent CNN-VAE Text Representation Technology Based on Text Semantics for Comprehensive Big Data

    Authors: Genggeng Liu, Canyang Guo, Lin Xie, Wenxi Liu, Naixue Xiong, Guolong Chen

    Abstract: In the era of big data, the large volume of text data generated on the Internet has given rise to a variety of text representation methods. In natural language processing (NLP), text representation transforms text into vectors that can be processed by a computer without losing the original semantic information. However, these methods struggle to effectively extract the semantic features among wo…

    Submitted 28 August, 2020; originally announced August 2020.