Skip to main content

Showing 1–32 of 32 results for author: Lyu, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11500  [pdf, other

    cs.LG

    Slicing Input Features to Accelerate Deep Learning: A Case Study with Graph Neural Networks

    Authors: Zhengjia Xu, Dingyang Lyu, Jinghui Zhang

    Abstract: As graphs grow larger, full-batch GNN training becomes hard for single GPU memory. Therefore, to enhance the scalability of GNN training, some studies have proposed sampling-based mini-batch training and distributed graph learning. However, these methods still have drawbacks, such as performance degradation and heavy communication. This paper introduces SliceGCN, a feature-sliced distributed large… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  2. arXiv:2408.09667  [pdf, other

    cs.CL

    BLADE: Benchmarking Language Model Agents for Data-Driven Science

    Authors: Ken Gu, Ruoxi Shang, Ruien Jiang, Keying Kuang, Richard-John Lin, Donghe Lyu, Yue Mao, Youran Pan, Teng Wu, Jiaqian Yu, Yikun Zhang, Tianmai M. Zhang, Lanyi Zhu, Mike A. Merrill, Jeffrey Heer, Tim Althoff

    Abstract: Data-driven scientific discovery requires the iterative integration of scientific domain knowledge, statistical expertise, and an understanding of data semantics to make nuanced analytical decisions, e.g., about which variables, transformations, and statistical models to consider. LM-based agents equipped with planning, memory, and code execution capabilities have the potential to support data-dri… ▽ More

    Submitted 20 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

  3. arXiv:2406.05688  [pdf, other

    cs.CL cs.AI cs.LG

    Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

    Authors: Cheng Tan, Dongxin Lyu, Siyuan Li, Zhangyang Gao, Jingxuan Wei, Siqi Ma, Zicheng Liu, Stan Z. Li

    Abstract: Large Language Models (LLMs) have demonstrated wide-ranging applications across various fields and have shown significant potential in the academic peer-review process. However, existing applications are primarily limited to static review generation based on submitted papers, which fail to capture the dynamic and iterative nature of real-world peer reviews. In this paper, we reformulate the peer-r… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2405.20516  [pdf, other

    cs.LG physics.geo-ph

    WaveCastNet: An AI-enabled Wavefield Forecasting Framework for Earthquake Early Warning

    Authors: Dongwei Lyu, Rie Nakata, Pu Ren, Michael W. Mahoney, Arben Pitarka, Nori Nakata, N. Benjamin Erichson

    Abstract: Large earthquakes can be destructive and quickly wreak havoc on a landscape. To mitigate immediate threats, early warning systems have been developed to alert residents, emergency responders, and critical infrastructure operators seconds to a minute before seismic waves arrive. These warnings provide time to take precautions and prevent damage. The success of these systems relies on fast, accurate… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  5. arXiv:2405.04066  [pdf, other

    cs.SI eess.SY

    Characterizing Regional Importance in Cities with Human Mobility Motifs in Metro Networks

    Authors: Shuyang Shi, Ding Lyu, Lin Wang, Xiaofan Wang, Guanrong Chen

    Abstract: Uncovering higher-order spatiotemporal dependencies within human mobility networks offers valuable insights into the analysis of urban structures. In most existing studies, human mobility networks are typically constructed by aggregating all trips without distinguishing who takes which specific trip. Instead, we claim individual mobility motifs, higher-order structures generated by daily trips of… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2403.10913  [pdf, other

    cs.AR

    DEFA: Efficient Deformable Attention Acceleration via Pruning-Assisted Grid-Sampling and Multi-Scale Parallel Processing

    Authors: Yansong Xu, Dongxu Lyu, Zhenyu Li, Zilong Wang, Yuzhou Chen, Gang Wang, Zhican Wang, Haomin Li, Guanghui He

    Abstract: Multi-scale deformable attention (MSDeformAttn) has emerged as a key mechanism in various vision tasks, demonstrating explicit superiority attributed to multi-scale grid-sampling. However, this newly introduced operator incurs irregular data access and enormous memory requirement, leading to severe PE underutilization. Meanwhile, existing approaches for attention acceleration cannot be directly ap… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Accepted to DAC 2024

  7. arXiv:2402.05211  [pdf

    cs.DL

    A Maturity Model for Urban Dataset Meta-data

    Authors: Mark S. Fox, Bart Gajderowicz, Dishu Lyu

    Abstract: In the current environment of data generation and publication, there is an ever-growing number of datasets available for download. This growth precipitates an existing challenge: sourcing and integrating relevant datasets for analysis is becoming more complex. Despite efforts by open data platforms, obstacles remain, predominantly rooted in inadequate metadata, unsuitable data presentation, compli… ▽ More

    Submitted 6 August, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  8. arXiv:2311.11689  [pdf, other

    cs.AI

    Causal Structure Learning Supervised by Large Language Model

    Authors: Taiyu Ban, Lyuzhou Chen, Derui Lyu, Xiangyu Wang, Huanhuan Chen

    Abstract: Causal discovery from observational data is pivotal for deciphering complex relationships. Causal Structure Learning (CSL), which focuses on deriving causal Directed Acyclic Graphs (DAGs) from data, faces challenges due to vast DAG spaces and data sparsity. The integration of Large Language Models (LLMs), recognized for their causal reasoning capabilities, offers a promising direction to enhance C… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  9. arXiv:2308.09249  [pdf, other

    cs.AR

    SpOctA: A 3D Sparse Convolution Accelerator with Octree-Encoding-Based Map Search and Inherent Sparsity-Aware Processing

    Authors: Dongxu Lyu, Zhenyu Li, Yuzhou Chen, Jinming Zhang, Ningyi Xu, Guanghui He

    Abstract: Point-cloud-based 3D perception has attracted great attention in various applications including robotics, autonomous driving and AR/VR. In particular, the 3D sparse convolution (SpConv) network has emerged as one of the most popular backbones due to its excellent performance. However, it poses severe challenges to real-time perception on general-purpose platforms, such as lengthy map search latenc… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCAD 2023

  10. arXiv:2306.07032  [pdf, other

    cs.LG cs.AI

    Mitigating Prior Errors in Causal Structure Learning: Towards LLM driven Prior Knowledge

    Authors: Lyuzhou Chen, Taiyu Ban, Xiangyu Wang, Derui Lyu, Huanhuan Chen

    Abstract: Causal structure learning, a prominent technique for encoding cause and effect relationships among variables, through Bayesian Networks (BNs). Merely recovering causal structures from real-world observed data lacks precision, while the development of Large Language Models (LLM) is opening a new frontier of causality. LLM presents strong capability in discovering causal relationships between variab… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 14 pages, 4 figures

  11. arXiv:2304.05617  [pdf, other

    cs.SE eess.SY

    AutoRepair: Automated Repair for AI-Enabled Cyber-Physical Systems under Safety-Critical Conditions

    Authors: Deyun Lyu, Jiayang Song, Zhenya Zhang, Zhijie Wang, Tianyi Zhang, Lei Ma, Jianjun Zhao

    Abstract: Cyber-Physical Systems (CPS) have been widely deployed in safety-critical domains such as transportation, power and energy. Recently, there comes an increasing demand in employing deep neural networks (DNNs) in CPS for more intelligent control and decision making in sophisticated industrial safety-critical conditions, giving birth to the class of DNN controllers. However, due to the inherent uncer… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  12. arXiv:2210.11761  [pdf

    cs.CE math.NA

    RVE Analysis in LS-DYNA for High-fidelity Multiscale Material Modeling

    Authors: Haoyan Wei, Dandan Lyu, Wei Hu, C. T. Wu

    Abstract: In modern engineering designs, advanced materials (e.g., fiber/particle-reinforced polymers, metallic alloys, laminar composites, etc.) are widely used, where microscale heterogeneities such as grains, inclusions, voids, micro-cracks, and interfaces significantly affect the macroscopic constitutive behaviors. Obviously, an accurate description of the multiscale material behaviors is of great impor… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 11 pages, 4 figures

  13. arXiv:2202.06257  [pdf

    cs.LG

    Fine-Grained Population Mobility Data-Based Community-Level COVID-19 Prediction Model

    Authors: Pengyue Jia, Ling Chen, Dandan Lyu

    Abstract: Predicting the number of infections in the anti-epidemic process is extremely beneficial to the government in developing anti-epidemic strategies, especially in fine-grained geographic units. Previous works focus on low spatial resolution prediction, e.g., county-level, and preprocess data to the same geographic level, which loses some useful information. In this paper, we propose a fine-grained p… ▽ More

    Submitted 15 July, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: Accepted by Cybernetics and Systems

  14. arXiv:2202.00531  [pdf, other

    cs.AI

    PRIMA: Planner-Reasoner Inside a Multi-task Reasoning Agent

    Authors: Daoming Lyu, Bo Liu, Jianshu Chen

    Abstract: We consider the problem of multi-task reasoning (MTR), where an agent can solve multiple tasks via (first-order) logic reasoning. This capability is essential for human-like intelligence due to its strong generalizability and simplicity for handling multiple tasks. However, a major challenge in developing effective MTR is the intrinsic conflict between reasoning capability and efficiency. An MTR-c… ▽ More

    Submitted 12 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

  15. arXiv:2201.09857  [pdf, other

    cs.LG

    STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence

    Authors: Liangliang Xu, Daoming Lyu, Yangchen Pan, Aiwen Jiang, Bo Liu

    Abstract: It remains challenging to deploy existing risk-averse approaches to real-world applications. The reasons are multi-fold, including the lack of global optimality guarantee and the necessity of learning from long-term consecutive trajectories. Long-term consecutive trajectories are prone to involving visiting hazardous states, which is a major concern in the risk-averse setting. This paper proposes… ▽ More

    Submitted 22 July, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

  16. When Cyber-Physical Systems Meet AI: A Benchmark, an Evaluation, and a Way Forward

    Authors: Jiayang Song, Deyun Lyu, Zhenya Zhang, Zhijie Wang, Tianyi Zhang, Lei Ma

    Abstract: Cyber-physical systems (CPS) have been broadly deployed in safety-critical domains, such as automotive systems, avionics, medical devices, etc. In recent years, Artificial Intelligence (AI) has been increasingly adopted to control CPS. Despite the popularity of AI-enabled CPS, few benchmarks are publicly available. There is also a lack of deep understanding on the performance and reliability of AI… ▽ More

    Submitted 19 April, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

  17. arXiv:2110.13596  [pdf

    cs.SI cs.LG

    TME-BNA: Temporal Motif-Preserving Network Embedding with Bicomponent Neighbor Aggregation

    Authors: Ling Chen, Da Wang, Dandan Lyu, Xing Tang, Hongyu Shi

    Abstract: Evolving temporal networks serve as the abstractions of many real-life dynamic systems, e.g., social network and e-commerce. The purpose of temporal network embedding is to map each node to a time-evolving low-dimension vector for downstream tasks, e.g., link prediction and node classification. The difficulty of temporal network embedding lies in how to utilize the topology and time information jo… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  18. arXiv:2110.04503  [pdf

    cs.LG

    Multi-Relation Aware Temporal Interaction Network Embedding

    Authors: Ling Chen, Shanshan Yu, Dandan Lyu, Da Wang

    Abstract: Temporal interaction networks are formed in many fields, e.g., e-commerce, online education, and social network service. Temporal interaction network embedding can effectively mine the information in temporal interaction networks, which is of great significance to the above fields. Usually, the occurrence of an interaction affects not only the nodes directly involved in the interaction (interactin… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: 18 pages, 4 figures

  19. arXiv:2109.10523  [pdf, other

    cs.SI cs.GT cs.LG physics.soc-ph

    Investigating and Modeling the Dynamics of Long Ties

    Authors: Ding Lyu, Yuan Yuan, Lin Wang, Xiaofan Wang, Alex Pentland

    Abstract: Long ties, the social ties that bridge different communities, are widely believed to play crucial roles in spreading novel information in social networks. However, some existing network theories and prediction models indicate that long ties might dissolve quickly or eventually become redundant, thus putting into question the long-term value of long ties. Our empirical analysis of real-world dynami… ▽ More

    Submitted 2 April, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: Forthcoming at Communications Physics (Nature portfolio)

    MSC Class: 05C85; 62P25; 91B16 ACM Class: J.4

    Journal ref: Commun. Phys. 5 (2022) 87

  20. arXiv:2109.08275  [pdf

    cs.MM cs.CV cs.LG

    Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos

    Authors: Ling Chen, Dandan Lyu, Shanshan Yu, Gencai Chen

    Abstract: Geo-tagged photo based tourist attraction recommendation can discover users' travel preferences from their taken photos, so as to recommend suitable tourist attractions to them. However, existing visual content based methods cannot fully exploit the user and tourist attraction information of photos to extract visual features, and do not differentiate the significances of different photos. In this… ▽ More

    Submitted 27 January, 2023; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted by TKDD

  21. TDM: Trustworthy Decision-Making via Interpretability Enhancement

    Authors: Daoming Lyu, Fangkai Yang, Hugh Kwon, Wen Dong, Levent Yilmaz, Bo Liu

    Abstract: Human-robot interactive decision-making is increasingly becoming ubiquitous, and trust is an influential factor in determining the reliance on autonomy. However, it is not reasonable to trust systems that are beyond our comprehension, and typical machine learning and data-driven decision-making are black-box paradigms that impede interpretability. Therefore, it is critical to establish computation… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Journal ref: IEEE Transactions on Emerging Topics in Computational Intelligence 0 (2021) 1-12

  22. arXiv:2011.11545  [pdf, other

    cs.AI cs.DB cs.SI

    APAN: Asynchronous Propagation Attention Network for Real-time Temporal Graph Embedding

    Authors: Xuhong Wang, Ding Lyu, Mengjian Li, Yang Xia, Qi Yang, Xinwen Wang, Xinguang Wang, Ping Cui, Yupu Yang, Bowen Sun, Zhenyu Guo

    Abstract: Limited by the time complexity of querying k-hop neighbors in a graph database, most graph algorithms cannot be deployed online and execute millisecond-level inference. This problem dramatically limits the potential of applying graph algorithms in certain areas, such as financial fraud detection. Therefore, we propose Asynchronous Propagation Attention Network, an asynchronous continuous time dyna… ▽ More

    Submitted 26 March, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: In Proceedings of the 2021 International Conference on Management of Data (SIGMOD/PODS '21)

  23. arXiv:2009.06548  [pdf, other

    cs.LG stat.ML

    Variance-Reduced Off-Policy Memory-Efficient Policy Search

    Authors: Daoming Lyu, Qi Qi, Mohammad Ghavamzadeh, Hengshuai Yao, Tianbao Yang, Bo Liu

    Abstract: Off-policy policy optimization is a challenging problem in reinforcement learning (RL). The algorithms designed for this problem often suffer from high variance in their estimators, which results in poor sample efficiency, and have issues with convergence. A few variance-reduced on-policy policy gradient algorithms have been recently proposed that use methods from stochastic optimization to reduce… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

  24. ATG-PVD: Ticketing Parking Violations on A Drone

    Authors: Hengli Wang, Yuxuan Liu, Huaiyang Huang, Yuheng Pan, Wenbin Yu, Jialin Jiang, Dianbin Lyu, Mohammud J. Bocus, Ming Liu, Ioannis Pitas, Rui Fan

    Abstract: In this paper, we introduce a novel suspect-and-investigate framework, which can be easily embedded in a drone for automated parking violation detection (PVD). Our proposed framework consists of: 1) SwiftFlow, an efficient and accurate convolutional neural network (CNN) for unsupervised optical flow estimation; 2) Flow-RCNN, a flow-guided CNN for car detection and classification; and 3) an illegal… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: 17 pages, 11 figures and 3 tables. This paper is accepted by ECCV Workshops 2020

  25. arXiv:2006.03978  [pdf, other

    cs.LG stat.ML

    Stable and Efficient Policy Evaluation

    Authors: Daoming Lyu, Bo Liu, Matthieu Geist, Wen Dong, Saad Biaz, Qi Wang

    Abstract: Policy evaluation algorithms are essential to reinforcement learning due to their ability to predict the performance of a policy. However, there are two long-standing issues lying in this prediction problem that need to be tackled: off-policy stability and on-policy efficiency. The conventional temporal difference (TD) algorithm is known to perform very well in the on-policy setting, yet is not of… ▽ More

    Submitted 27 December, 2021; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: IEEE Transactions on Neural Networks and Learning Systems (IEEE-TNNLS). arXiv admin note: text overlap with arXiv:1704.05147

  26. arXiv:1909.09209  [pdf, other

    cs.AI cs.HC cs.LG cs.LO

    A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming

    Authors: Daoming Lyu, Fangkai Yang, Bo Liu, Steven Gustafson

    Abstract: Recent successes of Reinforcement Learning (RL) allow an agent to learn policies that surpass human experts but suffers from being time-hungry and data-hungry. By contrast, human learning is significantly faster because prior and general knowledge and multiple information resources are utilized. In this paper, we propose a Planner-Actor-Critic architecture for huMAN-centered planning and learning… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: In Proceedings ICLP 2019, arXiv:1909.07646. arXiv admin note: significant text overlap with arXiv:1906.07268

    Journal ref: EPTCS 306, 2019, pp. 182-195

  27. arXiv:1906.07268  [pdf, other

    cs.AI

    A Joint Planning and Learning Framework for Human-Aided Decision-Making

    Authors: Daoming Lyu, Fangkai Yang, Bo Liu, Steven Gustafson

    Abstract: Conventional reinforcement learning (RL) allows an agent to learn policies via environmental rewards only, with a long and slow learning curve, especially at the beginning stage. On the contrary, human learning is usually much faster because prior and general knowledge and multiple information resources are utilized. In this paper, we propose a \textbf{P}lanner-\textbf{A}ctor-\textbf{C}ritic archi… ▽ More

    Submitted 24 December, 2019; v1 submitted 17 June, 2019; originally announced June 2019.

  28. arXiv:1905.07030  [pdf, other

    cs.AI cs.LO

    Knowledge-Based Sequential Decision-Making Under Uncertainty

    Authors: Daoming Lyu

    Abstract: Deep reinforcement learning (DRL) algorithms have achieved great success on sequential decision-making problems, yet is criticized for the lack of data-efficiency and explainability. Especially, explainability of subtasks is critical in hierarchical decision-making since it enhances the transparency of black-box-style DRL methods and helps the RL practitioners to understand the high-level behavior… ▽ More

    Submitted 15 May, 2020; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: 5 pages, submitted for the Doctoral Consortium at the 15th International Conference on Logic Programming and Non-monotonic Reasoning (LPNMR 2019). arXiv admin note: text overlap with arXiv:1811.00090

  29. arXiv:1811.00090  [pdf, other

    cs.AI

    SDRL: Interpretable and Data-efficient Deep Reinforcement Learning Leveraging Symbolic Planning

    Authors: Daoming Lyu, Fangkai Yang, Bo Liu, Steven Gustafson

    Abstract: Deep reinforcement learning (DRL) has gained great success by learning directly from high-dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of the subtasks is critical in hierarchical decision-making as it increases the transparency of black-box-style DRL approach and helps the RL practitioners to understand the high-level behavior of the system better… ▽ More

    Submitted 28 February, 2019; v1 submitted 31 October, 2018; originally announced November 2018.

  30. arXiv:1809.02292  [pdf, other

    cs.LG stat.ML

    A Block Coordinate Ascent Algorithm for Mean-Variance Optimization

    Authors: Bo Liu, Tengyang Xie, Yangyang Xu, Mohammad Ghavamzadeh, Yinlam Chow, Daoming Lyu, Daesub Yoon

    Abstract: Risk management in dynamic decision problems is a primary concern in many fields, including financial investment, autonomous driving, and healthcare. The mean-variance function is one of the most widely used objective functions in risk management due to its simplicity and interpretability. Existing algorithms for mean-variance optimization are based on multi-time-scale stochastic approximation, wh… ▽ More

    Submitted 1 November, 2018; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: Accepted by NIPS 2018

  31. arXiv:1804.07779  [pdf, other

    cs.LG cs.AI stat.ML

    PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making

    Authors: Fangkai Yang, Daoming Lyu, Bo Liu, Steven Gustafson

    Abstract: Reinforcement learning and symbolic planning have both been used to build intelligent autonomous agents. Reinforcement learning relies on learning from interactions with real world, which often requires an unfeasibly large amount of experience. Symbolic planning relies on manually crafted symbolic knowledge, which may not be robust to domain uncertainties and changes. In this paper we present a un… ▽ More

    Submitted 5 June, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: conference version accepted by IJCAI-ECAI-18

  32. arXiv:1704.05147  [pdf, other

    cs.LG stat.ML

    O$^2$TD: (Near)-Optimal Off-Policy TD Learning

    Authors: Bo Liu, Daoming Lyu, Wen Dong, Saad Biaz

    Abstract: Temporal difference learning and Residual Gradient methods are the most widely used temporal difference based learning algorithms; however, it has been shown that none of their objective functions is optimal w.r.t approximating the true value function $V$. Two novel algorithms are proposed to approximate the true value function $V$. This paper makes the following contributions: (1) A batch algorit… ▽ More

    Submitted 19 April, 2017; v1 submitted 17 April, 2017; originally announced April 2017.

    Comments: 10 pages, 7 figures