Skip to main content

Showing 1–50 of 58 results for author: Xiong, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.07989  [pdf, other

    cs.CV cs.AI

    IIU: Independent Inference Units for Knowledge-based Visual Question Answering

    Authors: Yili Li, Jing Yu, Keke Gai, Gang Xiong

    Abstract: Knowledge-based visual question answering requires external knowledge beyond visible content to answer the question correctly. One limitation of existing methods is that they focus more on modeling the inter-modal and intra-modal correlations, which entangles complex multimodal clues by implicit embeddings and lacks interpretability and generalization ability. The key challenge to solve the above… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2408.00727  [pdf, other

    cs.CL cs.AI

    Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions

    Authors: Guangzhi Xiong, Qiao Jin, Xiao Wang, Minjia Zhang, Zhiyong Lu, Aidong Zhang

    Abstract: The emergent abilities of large language models (LLMs) have demonstrated great potential in solving medical questions. They can possess considerable medical knowledge, but may still hallucinate and are inflexible in the knowledge updates. While Retrieval-Augmented Generation (RAG) has been proposed to enhance the medical question-answering capabilities of LLMs with external knowledge bases, it may… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  3. arXiv:2407.19300  [pdf, other

    cs.LG cs.AI

    CoLiDR: Concept Learning using Aggregated Disentangled Representations

    Authors: Sanchit Sinha, Guangzhi Xiong, Aidong Zhang

    Abstract: Interpretability of Deep Neural Networks using concept-based models offers a promising way to explain model behavior through human-understandable concepts. A parallel line of research focuses on disentangling the data distribution into its underlying generative factors, in turn explaining the data generation process. While both directions have received extensive attention, little work has been don… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: KDD 2024

  4. arXiv:2407.15613  [pdf, other

    cs.CV

    Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning

    Authors: Xiangyan Qu, Jing Yu, Keke Gai, Jiamin Zhuang, Yuanmin Tang, Gang Xiong, Gaopeng Gou, Qi Wu

    Abstract: Recent work shows that documents from encyclopedias serve as helpful auxiliary information for zero-shot learning. Existing methods align the entire semantics of a document with corresponding images to transfer knowledge. However, they disregard that semantic information is not equivalent between them, resulting in a suboptimal alignment. In this work, we propose a novel network to extract multi-v… ▽ More

    Submitted 23 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Accepted to ACM International Conference on Multimedia (MM) 2024

  5. arXiv:2407.06567  [pdf, other

    cs.CL

    FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making

    Authors: Yangyang Yu, Zhiyuan Yao, Haohang Li, Zhiyang Deng, Yupeng Cao, Zhi Chen, Jordan W. Suchow, Rong Liu, Zhenyu Cui, Denghui Zhang, Koduvayur Subbalakshmi, Guojun Xiong, Yueru He, Jimin Huang, Dong Li, Qianqian Xie

    Abstract: Large language models (LLMs) have demonstrated notable potential in conducting complex tasks and are increasingly utilized in various financial applications. However, high-quality sequential financial investment decision-making remains challenging. These tasks require multiple interactions with a volatile environment for every decision, demanding sufficient intelligence to maximize returns and man… ▽ More

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: LLM Applications, LLM Agents, Financial Technology, Quantitative Finance, Algorithmic Trading, Cognitive Science

  6. arXiv:2406.12036  [pdf, other

    cs.CL cs.AI

    MedCalc-Bench: Evaluating Large Language Models for Medical Calculations

    Authors: Nikhil Khandekar, Qiao Jin, Guangzhi Xiong, Soren Dunn, Serina S Applebaum, Zain Anwar, Maame Sarfo-Gyamfi, Conrad W Safranek, Abid A Anwar, Andrew Zhang, Aidan Gilson, Maxwell B Singer, Amisha Dave, Andrew Taylor, Aidong Zhang, Qingyu Chen, Zhiyong Lu

    Abstract: As opposed to evaluating computation and logic-based reasoning, current benchmarks for evaluating large language models (LLMs) in medicine are primarily focused on question-answering involving domain knowledge and descriptive reasoning. While such qualitative capabilities are vital to medical diagnosis, in real-world scenarios, doctors frequently use clinical calculators that follow quantitative e… ▽ More

    Submitted 30 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Github link: https://github.com/ncbi-nlp/MedCalc-Bench HuggingFace link: https://huggingface.co/datasets/nsk7153/MedCalc-Bench

  7. arXiv:2405.00950  [pdf, other

    cs.LG cs.AI stat.ML

    Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback

    Authors: Guojun Xiong, Jian Li

    Abstract: Restless multi-armed bandits (RMAB) play a central role in modeling sequential decision making problems under an instantaneous activation constraint that at most B arms can be activated at any decision epoch. Each restless arm is endowed with a state that evolves independently according to a Markov decision process regardless of being activated or not. In this paper, we consider the task of learni… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  8. arXiv:2405.00349  [pdf, other

    cs.LG

    A Self-explaining Neural Architecture for Generalizable Concept Learning

    Authors: Sanchit Sinha, Guangzhi Xiong, Aidong Zhang

    Abstract: With the wide proliferation of Deep Neural Networks in high-stake applications, there is a growing demand for explainability behind their decision-making process. Concept learning models attempt to learn high-level 'concepts' - abstract entities that align with human understanding, and thus provide interpretability to DNN architectures. However, in this paper, we demonstrate that present SOTA conc… ▽ More

    Submitted 5 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024. 16 pages (7 main content, 2 references, 7 Appendix) Code available at https://github.com/sanchit97/secl

  9. arXiv:2404.16920  [pdf, other

    cs.NI cs.IT cs.LG eess.SP

    Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

    Authors: Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li, Shivendra Panwar

    Abstract: We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i.e., users sending data packet requests to access points (APs) via uplinks and APs transmitting requested data packets to users via downlinks. Our objective is to minimize the average delay in the system due to APs' limited service capacity and unreliable wireless channels between APs and u… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: IEEE Transactions on Wireless Communications

  10. arXiv:2403.13263  [pdf, other

    cs.CV

    SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

    Authors: Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu

    Abstract: Recent trends in Large Vision Language Models (LVLMs) research have been increasingly focusing on advancing beyond general image understanding towards more nuanced, object-level referential comprehension. In this paper, we present and delve into the self-consistency capability of LVLMs, a crucial aspect that reflects the models' ability to both generate informative captions for specific objects an… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  11. arXiv:2402.17257  [pdf, other

    cs.LG cs.AI cs.RO

    RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences

    Authors: Jie Cheng, Gang Xiong, Xingyuan Dai, Qinghai Miao, Yisheng Lv, Fei-Yue Wang

    Abstract: Preference-based Reinforcement Learning (PbRL) circumvents the need for reward engineering by harnessing human preferences as the reward signal. However, current PbRL methods excessively depend on high-quality feedback from domain experts, which results in a lack of robustness. In this paper, we present RIME, a robust PbRL algorithm for effective reward learning from noisy preferences. Our method… ▽ More

    Submitted 30 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML2024

  12. arXiv:2402.15131  [pdf, other

    cs.CL cs.AI

    Interactive-KBQA: Multi-Turn Interactions for Knowledge Base Question Answering with Large Language Models

    Authors: Guanming Xiong, Junwei Bao, Wen Zhao

    Abstract: This study explores the realm of knowledge base question answering (KBQA). KBQA is considered a challenging task, particularly in parsing intricate questions into executable logical forms. Traditional semantic parsing (SP)-based methods require extensive data annotations, which result in significant costs. Recently, the advent of few-shot in-context learning, powered by large language models (LLMs… ▽ More

    Submitted 19 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: This work has been accepted by the ACL 2024 main conference. Code and data are available at: https://github.com/JimXiongGM/Interactive-KBQA

    ACM Class: I.2.7

  13. arXiv:2402.13178  [pdf, other

    cs.CL cs.AI

    Benchmarking Retrieval-Augmented Generation for Medicine

    Authors: Guangzhi Xiong, Qiao Jin, Zhiyong Lu, Aidong Zhang

    Abstract: While large language models (LLMs) have achieved state-of-the-art performance on a wide range of medical question answering (QA) tasks, they still face challenges with hallucinations and outdated knowledge. Retrieval-augmented generation (RAG) is a promising solution and has been widely adopted. However, a RAG system can involve multiple flexible components, and there is a lack of best practices r… ▽ More

    Submitted 23 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Homepage: https://teddy-xionggz.github.io/benchmark-medical-rag/

  14. arXiv:2402.12659  [pdf, other

    cs.CL cs.AI cs.CE

    FinBen: A Holistic Financial Benchmark for Large Language Models

    Authors: Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu , et al. (9 additional authors not shown)

    Abstract: LLMs have transformed NLP and shown promise in various fields, yet their potential in finance is underexplored due to a lack of comprehensive evaluation benchmarks, the rapid development of LLMs, and the complexity of financial tasks. In this paper, we introduce FinBen, the first extensive open-source evaluation benchmark, including 36 datasets spanning 24 financial tasks, covering seven critical… ▽ More

    Submitted 18 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 26 pages, 11 figures

  15. arXiv:2402.09325  [pdf, other

    cs.CV cs.RO

    PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments

    Authors: Xiuzhong Hu, Guangming Xiong, Zheng Zang, Peng Jia, Yuxuan Han, Junyi Ma

    Abstract: Large-scale 3D scene reconstruction and novel view synthesis are vital for autonomous vehicles, especially utilizing temporally sparse LiDAR frames. However, conventional explicit representations remain a significant bottleneck towards representing the reconstructed and synthetic scenes at unlimited resolution. Although the recently developed neural radiance fields (NeRF) have shown compelling res… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.00874

  16. GI-PIP: Do We Require Impractical Auxiliary Dataset for Gradient Inversion Attacks?

    Authors: Yu Sun, Gaojian Xiong, Xianxun Yao, Kailang Ma, Jian Cui

    Abstract: Deep gradient inversion attacks expose a serious threat to Federated Learning (FL) by accurately recovering private data from shared gradients. However, the state-of-the-art heavily relies on impractical assumptions to access excessive auxiliary data, which violates the basic data partitioning principle of FL. In this paper, a novel method, Gradient Inversion Attack using Practical Image Prior (GI… ▽ More

    Submitted 1 April, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  17. arXiv:2401.05459  [pdf, other

    cs.HC cs.AI cs.SE

    Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security

    Authors: Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu

    Abstract: Since the advent of personal computing devices, intelligent personal assistants (IPAs) have been one of the key technologies that researchers and engineers have focused on, aiming to help users efficiently obtain information and execute tasks, and provide users with more intelligent, convenient, and rich interaction experiences. With the development of smartphones and IoT, computing and sensing de… ▽ More

    Submitted 8 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: https://github.com/MobileLLM/Personal_LLM_Agents_Survey

  18. arXiv:2312.10815  [pdf, other

    cs.LG cs.DC math.OC

    DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations

    Authors: Guojun Xiong, Gang Yan, Shiqiang Wang, Jian Li

    Abstract: Decentralized learning has emerged as an alternative method to the popular parameter-server framework which suffers from high communication burden, single-point failure and scalability issues due to the need of a central server. However, most existing works focus on a single shared model for all workers regardless of the data heterogeneity problem, rendering the resulting model performing poorly o… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  19. arXiv:2312.10303  [pdf, ps, other

    cs.LG

    Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints

    Authors: Shufan Wang, Guojun Xiong, Jian Li

    Abstract: Restless multi-armed bandits (RMAB) have been widely used to model sequential decision making problems with constraints. The decision maker (DM) aims to maximize the expected total reward over an infinite horizon under an "instantaneous activation constraint" that at most B arms can be activated at any decision epoch, where the state of each arm evolves stochastically according to a Markov decisio… ▽ More

    Submitted 21 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  20. arXiv:2311.05863  [pdf, other

    cs.CR cs.CV

    Watermarking Vision-Language Pre-trained Models for Multi-modal Embedding as a Service

    Authors: Yuanmin Tang, Jing Yu, Keke Gai, Xiangyan Qu, Yue Hu, Gang Xiong, Qi Wu

    Abstract: Recent advances in vision-language pre-trained models (VLPs) have significantly increased visual understanding and cross-modal analysis capabilities. Companies have emerged to provide multi-modal Embedding as a Service (EaaS) based on VLPs (e.g., CLIP-based VLPs), which cost a large amount of training data and resources for high-performance service. However, existing studies indicate that EaaS is… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  21. LCPR: A Multi-Scale Attention-Based LiDAR-Camera Fusion Network for Place Recognition

    Authors: Zijie Zhou, Jingyi Xu, Guangming Xiong, Junyi Ma

    Abstract: Place recognition is one of the most crucial modules for autonomous vehicles to identify places that were previously visited in GPS-invalid environments. Sensor fusion is considered an effective method to overcome the weaknesses of individual sensors. In recent years, multimodal place recognition fusing information from multiple sensors has gathered increasing attention. However, most existing mul… ▽ More

    Submitted 30 December, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE Robotics and Automation Letters (RAL) 2023

  22. arXiv:2310.02147  [pdf, other

    cs.LG cs.AI

    Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation

    Authors: Guojun Xiong, Jian Li

    Abstract: Whittle index policy is a heuristic to the intractable restless multi-armed bandits (RMAB) problem. Although it is provably asymptotically optimal, finding Whittle indices remains difficult. In this paper, we present Neural-Q-Whittle, a Whittle index based Q-learning algorithm for RMAB with neural network function approximation, which is an example of nonlinear two-timescale stochastic approximati… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 26 pages, 4 figures, Neurips 2023

  23. arXiv:2310.00874  [pdf, other

    cs.CV cs.RO

    PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments

    Authors: Xiuzhong Hu, Guangming Xiong, Zheng Zang, Peng Jia, Yuxuan Han, Junyi Ma

    Abstract: Reconstructing large-scale 3D scenes is essential for autonomous vehicles, especially when partial sensor data is lost. Although the recently developed neural radiance fields (NeRF) have shown compelling results in implicit representations, the large-scale 3D scene reconstruction using partially lost LiDAR point cloud data still needs to be explored. To bridge this gap, we propose a novel 3D scene… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  24. arXiv:2309.16141  [pdf, other

    cs.CV

    Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search

    Authors: Yuanmin Tang, Jing Yu, Keke Gai, Yujing Wang, Yue Hu, Gang Xiong, Qi Wu

    Abstract: Cross-Modal sponsored search displays multi-modal advertisements (ads) when consumers look for desired products by natural language queries in search engines. Since multi-modal ads bring complementary details for query-ads matching, the ability to align ads-specific information in both images and texts is crucial for accurate and flexible sponsored search. Conventional research mainly studies from… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  25. arXiv:2309.16137  [pdf, other

    cs.CV

    Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval

    Authors: Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gang Xiong, Yue Hu, Qi Wu

    Abstract: Different from Composed Image Retrieval task that requires expensive labels for training task-specific models, Zero-Shot Composed Image Retrieval (ZS-CIR) involves diverse tasks with a broad range of visual content manipulation intent that could be related to domain, scene, object, and attribute. The key challenge for ZS-CIR tasks is to learn a more accurate image representation that has adaptive… ▽ More

    Submitted 15 December, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Journal ref: AAAI 2024

  26. arXiv:2306.06559  [pdf, other

    cs.LG cs.DC

    Straggler-Resilient Decentralized Learning via Adaptive Asynchronous Updates

    Authors: Guojun Xiong, Gang Yan, Shiqiang Wang, Jian Li

    Abstract: With the increasing demand for large-scale training of machine learning models, fully decentralized optimization methods have recently been advocated as alternatives to the popular parameter server framework. In this paradigm, each worker maintains a local estimate of the optimal parameter vector, and iteratively updates it by waiting and averaging all estimates obtained from its neighbors, and th… ▽ More

    Submitted 8 July, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

  27. arXiv:2305.13246  [pdf, other

    cs.CL cs.AI

    Interactive Natural Language Processing

    Authors: Zekun Wang, Ge Zhang, Kexin Yang, Ning Shi, Wangchunshu Zhou, Shaochun Hao, Guangzheng Xiong, Yizhi Li, Mong Yuan Sim, Xiuying Chen, Qingqing Zhu, Zhenzhu Yang, Adam Nik, Qi Liu, Chenghua Lin, Shi Wang, Ruibo Liu, Wenhu Chen, Ke Xu, Dayiheng Liu, Yike Guo, Jie Fu

    Abstract: Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within the field of NLP, aimed at addressing limitations in existing frameworks while aligning with the ultimate goals of artificial intelligence. This paradigm considers language models as agents capable of observing, acting, and receiving feedback iteratively from external entities. Specifically, language models in th… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 110 pages

  28. arXiv:2304.07773  [pdf, other

    cs.CV cs.RO

    PCPNet: An Efficient and Semantic-Enhanced Transformer Network for Point Cloud Prediction

    Authors: Zhen Luo, Junyi Ma, Zijie Zhou, Guangming Xiong

    Abstract: The ability to predict future structure features of environments based on past perception information is extremely needed by autonomous vehicles, which helps to make the following decision-making and path planning more reasonable. Recently, point cloud prediction (PCP) is utilized to predict and describe future environmental structures by the point cloud form. In this letter, we propose a novel ef… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  29. arXiv:2304.04351  [pdf, other

    cs.CV

    Evaluate Geometry of Radiance Fields with Low-frequency Color Prior

    Authors: Qihang Fang, Yafei Song, Keqiang Li, Li Shen, Huaiyu Wu, Gang Xiong, Liefeng Bo

    Abstract: A radiance field is an effective representation of 3D scenes, which has been widely adopted in novel-view synthesis and 3D reconstruction. It is still an open and challenging problem to evaluate the geometry, i.e., the density field, as the ground-truth is almost impossible to obtain. One alternative indirect solution is to transform the density field into a point-cloud and compute its Chamfer Dis… ▽ More

    Submitted 17 January, 2024; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: This paper has been accepted by AAAI 2024

  30. arXiv:2304.03879  [pdf, other

    cs.IR cs.LG

    GPT4Rec: A Generative Framework for Personalized Recommendation and User Interests Interpretation

    Authors: Jinming Li, Wentao Zhang, Tian Wang, Guanglei Xiong, Alan Lu, Gerard Medioni

    Abstract: Recent advancements in Natural Language Processing (NLP) have led to the development of NLP-based recommender systems that have shown superior performance. However, current models commonly treat items as mere IDs and adopt discriminative modeling, resulting in limitations of (1) fully leveraging the content information of items and the language modeling capabilities of NLP models; (2) interpreting… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  31. arXiv:2302.01665  [pdf, other

    cs.CV cs.RO

    CVTNet: A Cross-View Transformer Network for Place Recognition Using LiDAR Data

    Authors: Junyi Ma, Guangming Xiong, Jingyi Xu, Xieyuanli Chen

    Abstract: LiDAR-based place recognition (LPR) is one of the most crucial components of autonomous vehicles to identify previously visited places in GPS-denied environments. Most existing LPR methods use mundane representations of the input point cloud without considering different views, which may not fully exploit the information from LiDAR sensors. In this paper, we propose a cross-view transformer-based… ▽ More

    Submitted 6 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: accepted by IEEE Transactions on Industrial Informatics 2023

  32. arXiv:2212.06279  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits

    Authors: Guojun Xiong, Jian Li

    Abstract: Multi-player multi-armed bandit is an increasingly relevant decision-making problem, motivated by applications to cognitive radio systems. Most research for this problem focuses exclusively on the settings that players have \textit{full access} to all arms and receive no reward when pulling the same arm. Hence all players solve the same bandit problem with the goal of maximizing their cumulative r… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: AAAI 2023

  33. arXiv:2209.07951  [pdf, other

    cs.CV cs.RO

    SeqOT: A Spatial-Temporal Transformer Network for Place Recognition Using Sequential LiDAR Data

    Authors: Junyi Ma, Xieyuanli Chen, Jingyi Xu, Guangming Xiong

    Abstract: Place recognition is an important component for autonomous vehicles to achieve loop closing or global localization. In this paper, we tackle the problem of place recognition based on sequential 3D LiDAR scans obtained by an onboard LiDAR sensor. We propose a transformer-based network named SeqOT to exploit the temporal and spatial information provided by sequential range images generated from the… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Submitted to IEEE Transactions on Industrial Electronics

  34. arXiv:2208.12461  [pdf, other

    cs.CL

    AutoQGS: Auto-Prompt for Low-Resource Knowledge-based Question Generation from SPARQL

    Authors: Guanming Xiong, Junwei Bao, Wen Zhao, Youzheng Wu, Xiaodong He

    Abstract: This study investigates the task of knowledge-based question generation (KBQG). Conventional KBQG works generated questions from fact triples in the knowledge graph, which could not express complex operations like aggregation and comparison in SPARQL. Moreover, due to the costly annotation of large-scale SPARQL-question pairs, KBQG from SPARQL under low-resource scenarios urgently needs to be expl… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: Accepted to CIKM2022

  35. arXiv:2207.11484  [pdf, other

    cs.CV

    GraphFit: Learning Multi-scale Graph-Convolutional Representation for Point Cloud Normal Estimation

    Authors: Keqiang Li, Mingyang Zhao, Huaiyu Wu, Dong-Ming Yan, Zhen Shen, Fei-Yue Wang, Gang Xiong

    Abstract: We propose a precise and efficient normal estimation method that can deal with noise and nonuniform density for unstructured 3D point clouds. Unlike existing approaches that directly take patches and ignore the local neighborhood relationships, which make them susceptible to challenging regions such as sharp edges, we propose to learn graph convolutional feature representation for normal estimatio… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

  36. TTAGN: Temporal Transaction Aggregation Graph Network for Ethereum Phishing Scams Detection

    Authors: Sijia Li, Gaopeng Gou, Chang Liu, Chengshang Hou, Zhenzhen Li, Gang Xiong

    Abstract: In recent years, phishing scams have become the most serious type of crime involved in Ethereum, the second-largest blockchain platform. The existing phishing scams detection technology on Ethereum mostly uses traditional machine learning or network representation learning to mine the key information from the transaction network to identify phishing addresses. However, these methods adopt the last… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: WWW 2022

  37. 6GAN: IPv6 Multi-Pattern Target Generation via Generative Adversarial Nets with Reinforcement Learning

    Authors: Tianyu Cui, Gaopeng Gou, Gang Xiong, Chang Liu, Peipei Fu, Zhen Li

    Abstract: Global IPv6 scanning has always been a challenge for researchers because of the limited network speed and computational power. Target generation algorithms are recently proposed to overcome the problem for Internet assessments by predicting a candidate set to scan. However, IPv6 custom address configuration emerges diverse addressing patterns discouraging algorithmic inference. Widespread IPv6 ali… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: The paper has been accepted at the 2021 IEEE International Conference on Computer Communications (INFOCOM 2021). The source code has been published at https://github.com/CuiTianyu961030/6GAN

  38. A Comprehensive Study of Accelerating IPv6 Deployment

    Authors: Tianyu Cui, Chang Liu, Gaopeng Gou, Junzheng Shi, Gang Xiong

    Abstract: Since the lack of IPv6 network development, China is currently accelerating IPv6 deployment. In this scenario, traffic and network structure show a huge shift. However, due to the long-term prosperity, we are ignorant of the problems behind such outbreak of traffic and performance improvement events in accelerating deployment. IPv6 development in some regions will still face similar challenges in… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: The paper has been accepted at the IEEE International Performance Computing and Communications Conference (IPCCC 2019)

  39. arXiv:2204.09465  [pdf, other

    cs.CR cs.AI cs.NI

    SiamHAN: IPv6 Address Correlation Attacks on TLS Encrypted Traffic via Siamese Heterogeneous Graph Attention Network

    Authors: Tianyu Cui, Gaopeng Gou, Gang Xiong, Zhen Li, Mingxin Cui, Chang Liu

    Abstract: Unlike IPv4 addresses, which are typically masked by a NAT, IPv6 addresses could easily be correlated with user activity, endangering their privacy. Mitigations to address this privacy concern have been deployed, making existing approaches for address-to-user correlation unreliable. This work demonstrates that an adversary could still correlate IPv6 addresses with users accurately, even with these… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: The paper has been accepted at the 30th USENIX Security Symposium (USENIX Security 2021). The source code has been published at https://github.com/CuiTianyu961030/SiamHAN

  40. 6GCVAE: Gated Convolutional Variational Autoencoder for IPv6 Target Generation

    Authors: Tianyu Cui, Gaopeng Gou, Gang Xiong

    Abstract: IPv6 scanning has always been a challenge for researchers in the field of network measurement. Due to the considerable IPv6 address space, while recent network speed and computational power have been improved, using a brute-force approach to probe the entire network space of IPv6 is almost impossible. Systems are required an algorithmic approach to generate more possible active target candidate se… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: The paper has been accepted at the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2020)

  41. arXiv:2202.13187  [pdf, other

    cs.NI cs.LG

    Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation

    Authors: Guojun Xiong, Shufan Wang, Jian Li, Rahul Singh

    Abstract: We consider the problem of content caching at the wireless edge to serve a set of end users via unreliable wireless channels so as to minimize the average latency experienced by end users due to the constrained wireless edge cache capacity. We formulate this problem as a Markov decision process, or more specifically a restless multi-armed bandit problem, which is provably hard to solve. We begin b… ▽ More

    Submitted 22 February, 2023; v1 submitted 26 February, 2022; originally announced February 2022.

  42. arXiv:2202.06335  [pdf, other

    cs.CR cs.AI cs.NI

    ET-BERT: A Contextualized Datagram Representation with Pre-training Transformers for Encrypted Traffic Classification

    Authors: Xinjie Lin, Gang Xiong, Gaopeng Gou, Zhen Li, Junzheng Shi, Jing Yu

    Abstract: Encrypted traffic classification requires discriminative and robust traffic representation captured from content-invisible and imbalanced traffic data for accurate classification, which is challenging but indispensable to achieve network security and network management. The major limitation of existing solutions is that they highly rely on the deep features, which are overly dependent on data size… ▽ More

    Submitted 19 February, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: This work has been accepted in Security, Privacy, and Trust track at The Web Conference 2022 (WWW'22)(see https://www2022.thewebconf.org/cfp/research/security/)

  43. arXiv:2201.12358  [pdf, other

    cs.LG eess.SY

    EVBattery: A Large-Scale Electric Vehicle Dataset for Battery Health and Capacity Estimation

    Authors: Haowei He, Jingzhao Zhang, Yanan Wang, Benben Jiang, Shaobo Huang, Chen Wang, Yang Zhang, Gengang Xiong, Xuebing Han, Dongxu Guo, Guannan He, Minggao Ouyang

    Abstract: Electric vehicles (EVs) play an important role in reducing carbon emissions. As EV adoption accelerates, safety issues caused by EV batteries have become an important research topic. In order to benchmark and develop data-driven methods for this task, we introduce a large and comprehensive dataset of EV batteries. Our dataset includes charging records collected from hundreds of EVs from three manu… ▽ More

    Submitted 1 November, 2023; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: 15 pages, 8 figures

  44. arXiv:2109.09855  [pdf, other

    cs.LG math.OC stat.ML

    Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits

    Authors: Guojun Xiong, Jian Li, Rahul Singh

    Abstract: We study a finite-horizon restless multi-armed bandit problem with multiple actions, dubbed R(MA)^2B. The state of each arm evolves according to a controlled Markov decision process (MDP), and the reward of pulling an arm depends on both the current state of the corresponding MDP and the action taken. The goal is to sequentially choose actions for arms so as to maximize the expected value of the c… ▽ More

    Submitted 23 March, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Appeared in AAAI 2022 with title "Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits"

  45. arXiv:2103.11080  [pdf, other

    cs.DB

    Greenplum: A Hybrid Database for Transactional and Analytical Workloads

    Authors: Zhenghua Lyu, Huan Hubert Zhang, Gang Xiong, Haozhou Wang, Gang Guo, Jinbao Chen, Asim Praveen, Yu Yang, Xiaoming Gao, Ashwin Agrawal, Alexandra Wang, Wen Lin, Junfeng Yang, Hao Wu, Xiaoliang Li, Feng Guo, Jiang Wu, Jesse Zhang, Venkatesh Raghavan

    Abstract: Demand for enterprise data warehouse solutions to support real-time Online Transaction Processing (OLTP) queries as well as long-running Online Analytical Processing (OLAP) workloads is growing. Greenplum database is traditionally known as an OLAP data warehouse system with limited ability to process OLTP workloads. In this paper, we augment Greenplum into a hybrid system to serve both OLTP and OL… ▽ More

    Submitted 13 May, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

  46. arXiv:2102.06280  [pdf, other

    cs.LG cs.DC stat.ML

    Straggler-Resilient Distributed Machine Learning with Dynamic Backup Workers

    Authors: Guojun Xiong, Gang Yan, Rahul Singh, Jian Li

    Abstract: With the increasing demand for large-scale training of machine learning models, consensus-based distributed optimization methods have recently been advocated as alternatives to the popular parameter server framework. In this paradigm, each worker maintains a local estimate of the optimal parameter vector, and iteratively updates it by waiting and averaging all estimates obtained from its neighbors… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  47. Biomedical Question Answering: A Survey of Approaches and Challenges

    Authors: Qiao Jin, Zheng Yuan, Guangzhi Xiong, Qianlan Yu, Huaiyuan Ying, Chuanqi Tan, Mosha Chen, Songfang Huang, Xiaozhong Liu, Sheng Yu

    Abstract: Automatic Question Answering (QA) has been successfully applied in various domains such as search engines and chatbots. Biomedical QA (BQA), as an emerging QA task, enables innovative applications to effectively perceive, access and understand complex biomedical knowledge. There have been tremendous developments of BQA in the past two decades, which we classify into 5 distinctive approaches: class… ▽ More

    Submitted 8 September, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: In submission to ACM Computing Surveys

  48. arXiv:2101.03641  [pdf, other

    cs.NI cs.LG

    Learning Augmented Index Policy for Optimal Service Placement at the Network Edge

    Authors: Guojun Xiong, Rahul Singh, Jian Li

    Abstract: We consider the problem of service placement at the network edge, in which a decision maker has to choose between $N$ services to host at the edge to satisfy the demands of customers. Our goal is to design adaptive algorithms to minimize the average service delivery latency for customers. We pose the problem as a Markov decision process (MDP) in which the system state is given by describing, for e… ▽ More

    Submitted 13 January, 2021; v1 submitted 10 January, 2021; originally announced January 2021.

  49. arXiv:2008.02213  [pdf, other

    cs.NI cs.CL cs.LG

    6VecLM: Language Modeling in Vector Space for IPv6 Target Generation

    Authors: Tianyu Cui, Gang Xiong, Gaopeng Gou, Junzheng Shi, Wei Xia

    Abstract: Fast IPv6 scanning is challenging in the field of network measurement as it requires exploring the whole IPv6 address space but limited by current computational power. Researchers propose to obtain possible active target candidate sets to probe by algorithmically analyzing the active seed sets. However, IPv6 addresses lack semantic information and contain numerous addressing schemes, leading to th… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: The paper has been accepted at the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2020) (https://ecmlpkdd2020.net/programme/accepted/#ADSTab)

  50. arXiv:2004.13821   

    cs.CL cs.AI cs.LG

    Fine-tuning Multi-hop Question Answering with Hierarchical Graph Network

    Authors: Guanming Xiong

    Abstract: In this paper, we present a two stage model for multi-hop question answering. The first stage is a hierarchical graph network, which is used to reason over multi-hop question and is capable to capture different levels of granularity using the nature structure(i.e., paragraphs, questions, sentences and entities) of documents. The reasoning process is convert to node classify task(i.e., paragraph no… ▽ More

    Submitted 27 August, 2022; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: the experience result is not good and this work is not done