Skip to main content

Showing 1–50 of 2,099 results for author: Sun, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.08998  [pdf, other

    stat.ML cs.LG

    A Confidence Interval for the $\ell_2$ Expected Calibration Error

    Authors: Yan Sun, Pratik Chaudhari, Ian J. Barnett, Edgar Dobriban

    Abstract: Recent advances in machine learning have significantly improved prediction accuracy in various applications. However, ensuring the calibration of probabilistic predictions remains a significant challenge. Despite efforts to enhance model calibration, the rigorous statistical evaluation of model calibration remains less explored. In this work, we develop confidence intervals the $\ell_2$ Expected C… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  2. arXiv:2408.08912  [pdf, other

    cs.DL cs.GR cs.SI

    GeneticPrism: Multifaceted Visualization of Scientific Impact Evolutions

    Authors: Ye Sun, Zipeng Liu, Yuankai Luo, Lei Xia, Lei Shi

    Abstract: Understanding the evolution of scholarly impact is essential for many real-life decision-making processes in academia, such as research planning, frontier exploration, and award selection. Popular platforms like Google Scholar and Web of Science rely on numerical indicators that are too abstract to convey the context and content of scientific impact, while most existing visualization approaches on… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 13 pages, 8 figures, excluding appendix. Submitted to TVCG on 20240813

  3. arXiv:2408.08852  [pdf, other

    cs.AI cs.LG

    GeoTransformer: Enhancing Urban Forecasting with Geospatial Attention Mechanisms

    Authors: Yuhao Jia, Zile Wu, Shengao Yi, Yifei Sun

    Abstract: Recent advancements have focused on encoding urban spatial information into high-dimensional spaces, with notable efforts dedicated to integrating sociodemographic data and satellite imagery. These efforts have established foundational models in this field. However, the effective utilization of these spatial representations for urban forecasting applications remains under-explored. To address this… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  4. arXiv:2408.08202  [pdf, other

    cs.CV

    Towards Practical Human Motion Prediction with LiDAR Point Clouds

    Authors: Xiao Han, Yiming Ren, Yichen Yao, Yujing Sun, Yuexin Ma

    Abstract: Human motion prediction is crucial for human-centric multimedia understanding and interacting. Current methods typically rely on ground truth human poses as observed input, which is not practical for real-world scenarios where only raw visual sensor data is available. To implement these methods in practice, a pre-phrase of pose estimation is essential. However, such two-stage approaches often lead… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  5. arXiv:2408.08147  [pdf, other

    cs.DC cs.CL cs.LG

    P/D-Serve: Serving Disaggregated Large Language Model at Scale

    Authors: Yibo Jin, Tao Wang, Huimin Lin, Mingyang Song, Peiyang Li, Yipeng Ma, Yicheng Shan, Zhengfan Yuan, Cailong Li, Yajing Sun, Tiandeng Wu, Xing Chu, Ruizhi Huan, Li Ma, Xiao You, Wenting Zhou, Yunpeng Ye, Wen Liu, Xiangkun Xu, Yongsheng Zhang, Tiantian Dong, Jiawei Zhu, Zhe Wang, Xijian Ju, Jianxun Song , et al. (5 additional authors not shown)

    Abstract: Serving disaggregated large language models (LLMs) over tens of thousands of xPU devices (GPUs or NPUs) with reliable performance faces multiple challenges. 1) Ignoring the diversity (various prefixes and tidal requests), treating all the prompts in a mixed pool is inadequate. To facilitate the similarity per scenario and minimize the inner mismatch on P/D (prefill and decoding) processing, fine-g… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  6. arXiv:2408.07820  [pdf, other

    cs.NI cs.IT eess.SY

    Hybrid Semantic/Bit Communication Based Networking Problem Optimization

    Authors: Le Xia, Yao Sun, Dusit Niyato, Lan Zhang, Lei Zhang, Muhammad Ali Imran

    Abstract: This paper jointly investigates user association (UA), mode selection (MS), and bandwidth allocation (BA) problems in a novel and practical next-generation cellular network where two modes of semantic communication (SemCom) and conventional bit communication (BitCom) coexist, namely hybrid semantic/bit communication network (HSB-Net). Concretely, we first identify a unified performance metric of m… ▽ More

    Submitted 19 August, 2024; v1 submitted 30 July, 2024; originally announced August 2024.

    Comments: This paper has been accepted for publication and will be presented in 2024 IEEE Global Communications Conference (GlobeCom 2024). Copyright may be transferred without notice, after which this version may no longer be accessible. arXiv admin note: substantial text overlap with arXiv:2404.04162

  7. arXiv:2408.07341  [pdf, other

    cs.CV

    Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration

    Authors: Xiaogen Zhon, Yiyou Sun, Min Deng, Winnie Chiu Wing Chu, Qi Dou

    Abstract: Multimodal learning leverages complementary information derived from different modalities, thereby enhancing performance in medical image segmentation. However, prevailing multimodal learning methods heavily rely on extensive well-annotated data from various modalities to achieve accurate segmentation performance. This dependence often poses a challenge in clinical settings due to limited availabi… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  8. arXiv:2408.07317  [pdf, other

    cs.HC

    Connecting Dreams with Visual Brainstorming Instruction

    Authors: Yasheng Sun, Bohan Li, Mingchen Zhuge, Deng-Ping Fan, Salman Khan, Fahad Shahbaz Khan, Hideki Koike

    Abstract: Recent breakthroughs in understanding the human brain have revealed its impressive ability to efficiently process and interpret human thoughts, opening up possibilities for intervening in brain signals. In this paper, we aim to develop a straightforward framework that uses other modalities, such as natural language, to translate the original dreamland. We present DreamConnect, employing a dual-str… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  9. OFL-W3: A One-shot Federated Learning System on Web 3.0

    Authors: Linshan Jiang, Moming Duan, Bingsheng He, Yulin Sun, Peishen Yan, Yang Hua, Tao Song

    Abstract: Federated Learning (FL) addresses the challenges posed by data silos, which arise from privacy, security regulations, and ownership concerns. Despite these barriers, FL enables these isolated data repositories to participate in collaborative learning without compromising privacy or security. Concurrently, the advancement of blockchain technology and decentralized applications (DApps) within Web 3.… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: VLDB 24 demo paper

  10. arXiv:2408.06402  [pdf, other

    q-bio.QM cs.AI cs.LG

    PhaGO: Protein function annotation for bacteriophages by integrating the genomic context

    Authors: Jiaojiao Guan, Yongxin Ji, Cheng Peng, Wei Zou, Xubo Tang, Jiayu Shang, Yanni Sun

    Abstract: Bacteriophages are viruses that target bacteria, playing a crucial role in microbial ecology. Phage proteins are important in understanding phage biology, such as virus infection, replication, and evolution. Although a large number of new phages have been identified via metagenomic sequencing, many of them have limited protein function annotation. Accurate function annotation of phage proteins pre… ▽ More

    Submitted 17 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: 17 pages,6 figures

  11. arXiv:2408.06300  [pdf

    cond-mat.mtrl-sci cs.LG

    Inverse designing metamaterials with programmable nonlinear functional responses in graph space

    Authors: Marco Maurizi, Derek Xu, Yu-Tong Wang, Desheng Yao, David Hahn, Mourad Oudich, Anish Satpati, Mathieu Bauchy, Wei Wang, Yizhou Sun, Yun Jing, Xiaoyu Rayne Zheng

    Abstract: Material responses to static and dynamic stimuli, represented as nonlinear curves, are design targets for engineering functionalities like structural support, impact protection, and acoustic and photonic bandgaps. Three-dimensional metamaterials offer significant tunability due to their internal structure, yet existing methods struggle to capture their complex behavior-to-structure relationships.… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 19 pages, 5 figures

  12. arXiv:2408.03675  [pdf, other

    cs.CL

    NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time

    Authors: Yilong Chen, Guoxia Wang, Junyuan Shang, Shiyao Cui, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun, Dianhai Yu, Hua Wu

    Abstract: Large Language Models (LLMs) have ignited an innovative surge of AI applications, marking a new era of exciting possibilities equipped with extended context windows. However, hosting these models is cost-prohibitive mainly due to the extensive memory consumption of KV Cache involving long-context modeling. Despite several works proposing to evict unnecessary tokens from the KV Cache, most of them… ▽ More

    Submitted 7 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted by ACL 2024 (main conference, long paper)

  13. arXiv:2408.03572  [pdf, other

    cs.LG cs.AI

    2D-OOB: Attributing Data Contribution through Joint Valuation Framework

    Authors: Yifan Sun, Jingyan Shen, Yongchan Kwon

    Abstract: Data valuation has emerged as a powerful framework to quantify the contribution of each datum to the training of a particular machine learning model. However, it is crucial to recognize that the quality of various cells within a single data point can vary greatly in practice. For example, even in the case of an abnormal data point, not all cells are necessarily noisy. The single scalar valuation a… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  14. arXiv:2408.02937  [pdf, other

    cs.IR

    A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighborhood Search

    Authors: Yiping Sun, Yang Shi, Jiaolong Du

    Abstract: In recent years, Approximate Nearest Neighbor Search (ANNS) has played a pivotal role in modern search and recommendation systems, especially in emerging LLM applications like Retrieval-Augmented Generation. There is a growing exploration into harnessing the parallel computing capabilities of GPUs to meet the substantial demands of ANNS. However, existing systems primarily focus on offline scenari… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted by CIKM'24

  15. arXiv:2408.02025  [pdf, other

    cs.SD cs.AI eess.AS

    Contrastive Learning-based Chaining-Cluster for Multilingual Voice-Face Association

    Authors: Wuyang Chen, Yanjie Sun, Kele Xu, Yong Dou

    Abstract: The innate correlation between a person's face and voice has recently emerged as a compelling area of study, especially within the context of multilingual environments. This paper introduces our novel solution to the Face-Voice Association in Multilingual Environments (FAME) 2024 challenge, focusing on a contrastive learning-based chaining-cluster method to enhance face-voice association. This tas… ▽ More

    Submitted 19 August, 2024; v1 submitted 4 August, 2024; originally announced August 2024.

  16. arXiv:2408.00420  [pdf, other

    cs.CV cs.AI

    MPT-PAR:Mix-Parameters Transformer for Panoramic Activity Recognition

    Authors: Wenqing Gan, Yan Sun, Feiran Liu, Xiangfeng Luo

    Abstract: The objective of the panoramic activity recognition task is to identify behaviors at various granularities within crowded and complex environments, encompassing individual actions, social group activities, and global activities. Existing methods generally use either parameter-independent modules to capture task-specific features or parameter-sharing modules to obtain common features across all tas… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  17. arXiv:2408.00275  [pdf, other

    cs.RO

    A Reinforcement Learning Based Motion Planner for Quadrotor Autonomous Flight in Dense Environment

    Authors: Zhaohong Liu, Wenxuan Gao, Yinshuai Sun, Peng Dong

    Abstract: Quadrotor motion planning is critical for autonomous flight in complex environments, such as rescue operations. Traditional methods often employ trajectory generation optimization and passive time allocation strategies, which can limit the exploitation of the quadrotor's dynamic capabilities and introduce delays and inaccuracies. To address these challenges, we propose a novel motion planning fram… ▽ More

    Submitted 5 August, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

  18. arXiv:2408.00114  [pdf, other

    cs.AI

    Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs

    Authors: Kewei Cheng, Jingfeng Yang, Haoming Jiang, Zhengyang Wang, Binxuan Huang, Ruirui Li, Shiyang Li, Zheng Li, Yifan Gao, Xian Li, Bing Yin, Yizhou Sun

    Abstract: Reasoning encompasses two typical types: deductive reasoning and inductive reasoning. Despite extensive research into the reasoning capabilities of Large Language Models (LLMs), most studies have failed to rigorously differentiate between inductive and deductive reasoning, leading to a blending of the two. This raises an essential question: In LLM reasoning, which poses a greater challenge - deduc… ▽ More

    Submitted 6 August, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  19. arXiv:2408.00112  [pdf, other

    cs.CV

    Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation

    Authors: Wenyuan Chen, Haocong Song, Changsheng Dai, Aojun Jiang, Guanqiao Shan, Hang Liu, Yanlong Zhou, Khaled Abdalla, Shivani N Dhanani, Katy Fatemeh Moosavi, Shruti Pathak, Clifford Librach, Zhuoran Zhang, Yu Sun

    Abstract: Traditional sperm morphology analysis is based on tedious manual annotation. Automated morphology analysis of a high number of sperm requires accurate segmentation of each sperm part and quantitative morphology evaluation. State-of-the-art instance-aware part segmentation networks follow a "detect-then-segment" paradigm. However, due to sperm's slim shape, their segmentation suffers from large con… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: Accepted to ICRA 2024

  20. arXiv:2408.00001  [pdf, other

    cs.CV cs.AI cs.CY

    Replication in Visual Diffusion Models: A Survey and Outlook

    Authors: Wenhao Wang, Yifan Sun, Zongxin Yang, Zhengdong Hu, Zhentao Tan, Yi Yang

    Abstract: Visual diffusion models have revolutionized the field of creative AI, producing high-quality and diverse content. However, they inevitably memorize training images or videos, subsequently replicating their concepts, content, or styles during inference. This phenomenon raises significant concerns about privacy, security, and copyright within generated outputs. In this survey, we provide the first c… ▽ More

    Submitted 7 July, 2024; originally announced August 2024.

    Comments: The first survey focuses on replication in visual diffusion models. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  21. HGOE: Hybrid External and Internal Graph Outlier Exposure for Graph Out-of-Distribution Detection

    Authors: Junwei He, Qianqian Xu, Yangbangyan Jiang, Zitai Wang, Yuchen Sun, Qingming Huang

    Abstract: With the progressive advancements in deep graph learning, out-of-distribution (OOD) detection for graph data has emerged as a critical challenge. While the efficacy of auxiliary datasets in enhancing OOD detection has been extensively studied for image and text data, such approaches have not yet been explored for graph data. Unlike Euclidean data, graph data exhibits greater diversity but lower ro… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: Proceedings of the 32nd ACM International Conference on Multimedia

  22. arXiv:2407.21631  [pdf, other

    cs.CV

    RoadFormer+: Delivering RGB-X Scene Parsing through Scale-Aware Information Decoupling and Advanced Heterogeneous Feature Fusion

    Authors: Jianxin Huang, Jiahang Li, Ning Jia, Yuxiang Sun, Chengju Liu, Qijun Chen, Rui Fan

    Abstract: Task-specific data-fusion networks have marked considerable achievements in urban scene parsing. Among these networks, our recently proposed RoadFormer successfully extracts heterogeneous features from RGB images and surface normal maps and fuses these features through attention mechanisms, demonstrating compelling efficacy in RGB-Normal road scene parsing. However, its performance significantly d… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures

  23. arXiv:2407.21622  [pdf, other

    stat.ML cs.LG math.ST

    Extended Fiducial Inference: Toward an Automated Process of Statistical Inference

    Authors: Faming Liang, Sehwan Kim, Yan Sun

    Abstract: While fiducial inference was widely considered a big blunder by R.A. Fisher, the goal he initially set --`inferring the uncertainty of model parameters on the basis of observations' -- has been continually pursued by many statisticians. To this end, we develop a new statistical inference method called extended Fiducial inference (EFI). The new method achieves the goal of fiducial inference by leve… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  24. Joint Power Allocation and Placement Scheme for UAV-assisted IoT with QoS Guarantee

    Authors: Ruirui Chen, Yanjing Sun, Liping Liang, Wenchi Cheng

    Abstract: In the disaster and remote regions, unmanned aerial vehicles (UAVs) can assist the data acquisition for Internet of Things (IoT). How to cover massive IoT devices (IDs), which require diverse quality-of-service (QoS), is a crucial challenge. For UAV-assisted IoT, this paper studies the deployment scheme with QoS guarantee to place multiple UAVs for covering all ground IDs and maximizing the averag… ▽ More

    Submitted 2 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

    Journal ref: IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 71, NO. 1, JANUARY 2022

  25. arXiv:2407.20570  [pdf, other

    cs.HC

    Fine-Tuned Large Language Model for Visualization System: A Study on Self-Regulated Learning in Education

    Authors: Lin Gao, Jing Lu, Zekai Shao, Ziyue Lin, Shengbin Yue, Chiokit Ieong, Yi Sun, Rory James Zauner, Zhongyu Wei, Siming Chen

    Abstract: Large Language Models (LLMs) have shown great potential in intelligent visualization systems, especially for domain-specific applications. Integrating LLMs into visualization systems presents challenges, and we categorize these challenges into three alignments: domain problems with LLMs, visualization with LLMs, and interaction with LLMs. To achieve these alignments, we propose a framework and out… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  26. arXiv:2407.20177  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    AutoScale: Automatic Prediction of Compute-optimal Data Composition for Training LLMs

    Authors: Feiyang Kang, Yifan Sun, Bingbing Wen, Si Chen, Dawn Song, Rafid Mahmood, Ruoxi Jia

    Abstract: To ensure performance on a diverse set of downstream tasks, LLMs are pretrained via data mixtures over different domains. In this work, we demonstrate that the optimal data composition for a fixed compute budget varies depending on the scale of the training data, suggesting that the common practice of empirically determining an optimal composition using small-scale experiments will not yield the o… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  27. Multiscale Representation Enhanced Temporal Flow Fusion Model for Long-Term Workload Forecasting

    Authors: Shiyu Wang, Zhixuan Chu, Yinbo Sun, Yu Liu, Yuliang Guo, Yang Chen, Huiyang Jian, Lintao Ma, Xingyu Lu, Jun Zhou

    Abstract: Accurate workload forecasting is critical for efficient resource management in cloud computing systems, enabling effective scheduling and autoscaling. Despite recent advances with transformer-based forecasting models, challenges remain due to the non-stationary, nonlinear characteristics of workload time series and the long-term dependencies. In particular, inconsistent performance between long-te… ▽ More

    Submitted 18 August, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM '24), October 21--25, 2024, Boise, ID, USA

  28. arXiv:2407.19512  [pdf, other

    cs.CV

    Large-scale cervical precancerous screening via AI-assisted cytology whole slide image analysis

    Authors: Honglin Li, Yusuan Sun, Chenglu Zhu, Yunlong Zhang, Shichuan Zhang, Zhongyi Shui, Pingyi Chen, Jingxiong Li, Sunyi Zheng, Can Cui, Lin Yang

    Abstract: Cervical Cancer continues to be the leading gynecological malignancy, posing a persistent threat to women's health on a global scale. Early screening via cytology Whole Slide Image (WSI) diagnosis is critical to prevent this Cancer progression and improve survival rate, but pathologist's single test suffers inevitable false negative due to the immense number of cells that need to be reviewed withi… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  29. arXiv:2407.18961  [pdf, other

    cs.AI

    MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

    Authors: Guoli Yin, Haoping Bai, Shuang Ma, Feng Nan, Yanchao Sun, Zhaoyang Xu, Shen Ma, Jiarui Lu, Xiang Kong, Aonan Zhang, Dian Ang Yap, Yizhe zhang, Karsten Ahnert, Vik Kamath, Mathias Berglund, Dominic Walsh, Tobias Gindele, Juergen Wiest, Zhengfeng Lai, Xiaoming Wang, Jiulong Shan, Meng Cao, Ruoming Pang, Zirui Wang

    Abstract: Recent advances in large language models (LLMs) have increased the demand for comprehensive benchmarks to evaluate their capabilities as human-like agents. Existing benchmarks, while useful, often focus on specific application scenarios, emphasizing task completion but failing to dissect the underlying skills that drive these outcomes. This lack of granularity makes it difficult to deeply discern… ▽ More

    Submitted 15 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  30. arXiv:2407.18468  [pdf, other

    cs.LG cs.AI

    Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints

    Authors: Lei Guo, Wei Chen, Yuxuan Sun, Bo Ai, Nikolaos Pappas, Tony Quek

    Abstract: Diffusion models have been extensively utilized in AI-generated content (AIGC) in recent years, thanks to the superior generation capabilities. Combining with semantic communications, diffusion models are used for tasks such as denoising, data reconstruction, and content generation. However, existing diffusion-based generative models do not consider the stringent bandwidth limitation, which limits… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 13 pages, 7 figures, submitted to IEEE for possible publication

  31. arXiv:2407.16975  [pdf, other

    cs.LG stat.ME

    On the Parameter Identifiability of Partially Observed Linear Causal Models

    Authors: Xinshuai Dong, Ignavier Ng, Biwei Huang, Yuewen Sun, Songyao Jin, Roberto Legaspi, Peter Spirtes, Kun Zhang

    Abstract: Linear causal models are important tools for modeling causal dependencies and yet in practice, only a subset of the variables can be observed. In this paper, we examine the parameter identifiability of these models by investigating whether the edge coefficients can be recovered given the causal structure and partially observed data. Our setting is more general than that of prior research - we allo… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  32. arXiv:2407.16682  [pdf, other

    cs.CV

    SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation

    Authors: Pengfei Chen, Lingxi Xie, Xinyue Huo, Xuehui Yu, Xiaopeng Zhang, Yingfei Sun, Zhenjun Han, Qi Tian

    Abstract: The Segment Anything model (SAM) has shown a generalized ability to group image pixels into patches, but applying it to semantic-aware segmentation still faces major challenges. This paper presents SAM-CP, a simple approach that establishes two types of composable prompts beyond SAM and composes them for versatile segmentation. Specifically, given a set of classes (in texts) and a set of SAM patch… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  33. arXiv:2407.16397  [pdf, other

    cs.LG cs.AI

    On ADMM in Heterogeneous Federated Learning: Personalization, Robustness, and Fairness

    Authors: Shengkun Zhu, Jinshan Zeng, Sheng Wang, Yuan Sun, Xiaodong Li, Yuan Yao, Zhiyong Peng

    Abstract: Statistical heterogeneity is a root cause of tension among accuracy, fairness, and robustness of federated learning (FL), and is key in paving a path forward. Personalized FL (PFL) is an approach that aims to reduce the impact of statistical heterogeneity by developing personalized models for individual users, while also inherently providing benefits in terms of fairness and robustness. However, e… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.06756

  34. arXiv:2407.15869  [pdf, other

    cs.LG cs.AI

    Long Input Sequence Network for Long Time Series Forecasting

    Authors: Chao Ma, Yikai Hou, Xiang Li, Yinggang Sun, Haining Yu

    Abstract: Short fixed-length inputs are the main bottleneck of deep learning methods in long time-series forecasting tasks. Prolonging input length causes overfitting, rapidly deteriorating accuracy. Our research indicates that the overfitting is a combination reaction of the multi-scale pattern coupling in time series and the fixed focusing scale of current models. First, we find that the patterns exhibite… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 9 pages

  35. arXiv:2407.15071  [pdf, other

    cs.DB cs.CL

    Relational Database Augmented Large Language Model

    Authors: Zongyue Qin, Chen Luo, Zhengyang Wang, Haoming Jiang, Yizhou Sun

    Abstract: Large language models (LLMs) excel in many natural language processing (NLP) tasks. However, since LLMs can only incorporate new knowledge through training or supervised fine-tuning processes, they are unsuitable for applications that demand precise, up-to-date, and private information not available in the training corpora. This precise, up-to-date, and private information is typically stored in r… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  36. arXiv:2407.14829  [pdf, other

    cs.CL

    Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

    Authors: Jiayu Lin, Guanrong Chen, Bojun Jin, Chenyang Li, Shutong Jia, Wancong Lin, Yang Sun, Yuhang He, Caihua Yang, Jianzhu Bao, Jipeng Wu, Wen Su, Jinglu Chen, Xinyi Li, Tianyu Chen, Mingjie Han, Shuaiwen Du, Zijian Wang, Jiyin Li, Fuzhong Suo, Hao Wang, Nuanchen Lin, Xuanjing Huang, Changjian Jiang, RuiFeng Xu , et al. (4 additional authors not shown)

    Abstract: In this paper we present the results of the AI-Debater 2023 Challenge held by the Chinese Conference on Affect Computing (CCAC 2023), and introduce the related datasets. We organize two tracks to handle the argumentative generation tasks in different scenarios, namely, Counter-Argument Generation (Track 1) and Claim-based Argument Generation (Track 2). Each track is equipped with its distinct data… ▽ More

    Submitted 24 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

  37. arXiv:2407.14532  [pdf, other

    cs.DC cs.LG

    A Scenario-Oriented Benchmark for Assessing AIOps Algorithms in Microservice Management

    Authors: Yongqian Sun, Jiaju Wang, Zhengdan Li, Xiaohui Nie, Minghua Ma, Shenglin Zhang, Yuhe Ji, Lu Zhang, Wen Long, Hengmao Chen, Yongnan Luo, Dan Pei

    Abstract: AIOps algorithms play a crucial role in the maintenance of microservice systems. Many previous benchmarks' performance leaderboard provides valuable guidance for selecting appropriate algorithms. However, existing AIOps benchmarks mainly utilize offline datasets to evaluate algorithms. They cannot consistently evaluate the performance of algorithms using real-time datasets, and the operation scena… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Codes are available at https://github.com/MicroServo/microservo, datasets are available at https://github.com/MicroServo/hot-plugging

  38. arXiv:2407.14530  [pdf, other

    cs.DB cs.AI

    FuncEvalGMN: Evaluating Functional Correctness of SQL via Graph Matching Network

    Authors: Yi Zhan, Yang Sun, Han Weng, Longjie Cui, Guifeng Wang, Jiajun Xie, Yu Tian, Xiaoming Yin, Boyi Liu, Dongchi Huang

    Abstract: In this paper, we propose a novel graph-based methodology to evaluate the functional correctness of SQL generation. Conventional metrics for assessing SQL code generation, such as matching-based and execution-based methods (e.g., exact set match and execution accuracy), are subject to two primary limitations. Firstly, the former fails to effectively assess functional correctness, as different SQL… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  39. arXiv:2407.13986  [pdf, other

    cs.CV

    Deep Feature Surgery: Towards Accurate and Efficient Multi-Exit Networks

    Authors: Cheng Gong, Yao Chen, Qiuyang Luo, Ye Lu, Tao Li, Yuzhi Zhang, Yufei Sun, Le Zhang

    Abstract: Multi-exit network is a promising architecture for efficient model inference by sharing backbone networks and weights among multiple exits. However, the gradient conflict of the shared weights results in sub-optimal accuracy. This paper introduces Deep Feature Surgery (\methodname), which consists of feature partitioning and feature referencing approaches to resolve gradient conflict issues during… ▽ More

    Submitted 9 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  40. arXiv:2407.13942  [pdf, other

    cs.CY cs.AI cs.CL cs.SI

    Harmful Suicide Content Detection

    Authors: Kyumin Park, Myung Jae Baik, YeongJun Hwang, Yen Shin, HoJae Lee, Ruda Lee, Sang Min Lee, Je Young Hannah Sun, Ah Rah Lee, Si Yeun Yoon, Dong-ho Lee, Jihyung Moon, JinYeong Bak, Kyunghyun Cho, Jong-Woo Paik, Sungjoon Park

    Abstract: Harmful suicide content on the Internet is a significant risk factor inducing suicidal thoughts and behaviors among vulnerable populations. Despite global efforts, existing resources are insufficient, specifically in high-risk regions like the Republic of Korea. Current research mainly focuses on understanding negative effects of such content or suicide risk in individuals, rather than on automati… ▽ More

    Submitted 2 June, 2024; originally announced July 2024.

    Comments: 30 pages, 7 figures

  41. arXiv:2407.13274  [pdf, other

    cs.IR

    Aligning Explanations for Recommendation with Rating and Feature via Maximizing Mutual Information

    Authors: Yurou Zhao, Yiding Sun, Ruidong Han, Fei Jiang, Lu Guan, Xiang Li, Wei Lin, Jiaxin Mao

    Abstract: Providing natural language-based explanations to justify recommendations helps to improve users' satisfaction and gain users' trust. However, as current explanation generation methods are commonly trained with an objective to mimic existing user reviews, the generated explanations are often not aligned with the predicted ratings or some important features of the recommended items, and thus, are su… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: this paper has been accepted by cikm2024, and the camera-ready version will be updated soon

  42. arXiv:2407.13201  [pdf, other

    cs.SE

    $μ$Drive: User-Controlled Autonomous Driving

    Authors: Kun Wang, Christopher M. Poskitt, Yang Sun, Jun Sun, Jingyi Wang, Peng Cheng, Jiming Chen

    Abstract: Autonomous Vehicles (AVs) rely on sophisticated Autonomous Driving Systems (ADSs) to provide passengers a satisfying and safe journey. The individual preferences of riders plays a crucial role in shaping the perception of safety and comfort while they are in the car. Existing ADSs, however, lack mechanisms to systematically capture and integrate rider preferences into their planning modules. To br… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  43. arXiv:2407.12292  [pdf, other

    cs.CV cs.AI

    Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection

    Authors: Youheng Sun, Shengming Yuan, Xuanhan Wang, Lianli Gao, Jingkuan Song

    Abstract: Targeted adversarial attack, which aims to mislead a model to recognize any image as a target object by imperceptible perturbations, has become a mainstream tool for vulnerability assessment of deep neural networks (DNNs). Since existing targeted attackers only learn to attack known target classes, they cannot generalize well to unknown classes. To tackle this issue, we propose $\bf{G}$eneralized… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  44. arXiv:2407.11335  [pdf, other

    cs.CV

    LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

    Authors: Penghui Du, Yu Wang, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang, Jingdong Wang, Si Liu

    Abstract: Existing methods enhance open-vocabulary object detection by leveraging the robust open-vocabulary recognition capabilities of Vision-Language Models (VLMs), such as CLIP.However, two main challenges emerge:(1) A deficiency in concept representation, where the category names in CLIP's text space lack textual and visual knowledge.(2) An overfitting tendency towards base categories, with the open vo… ▽ More

    Submitted 18 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV2024

  45. arXiv:2407.11308  [pdf, other

    cs.LG cs.DC

    Detection of Global Anomalies on Distributed IoT Edges with Device-to-Device Communication

    Authors: Hideya Ochiai, Riku Nishihata, Eisuke Tomiyama, Yuwei Sun, Hiroshi Esaki

    Abstract: Anomaly detection is an important function in IoT applications for finding outliers caused by abnormal events. Anomaly detection sometimes comes with high-frequency data sampling which should be carried out at Edge devices rather than Cloud. In this paper, we consider the case that multiple IoT devices are installed in a single remote site and that they collaboratively detect anomalies from the ob… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, ACM MobiHoc AIoT 2023 (accepted)

  46. arXiv:2407.11086  [pdf, other

    cs.LG cs.AI physics.chem-ph

    Pre-training with Fractional Denoising to Enhance Molecular Property Prediction

    Authors: Yuyan Ni, Shikun Feng, Xin Hong, Yuancheng Sun, Wei-Ying Ma, Zhi-Ming Ma, Qiwei Ye, Yanyan Lan

    Abstract: Deep learning methods have been considered promising for accelerating molecular screening in drug discovery and material design. Due to the limited availability of labelled data, various self-supervised molecular pre-training methods have been presented. While many existing methods utilize common pre-training tasks in computer vision (CV) and natural language processing (NLP), they often overlook… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  47. arXiv:2407.11059  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Was it Slander? Towards Exact Inversion of Generative Language Models

    Authors: Adrians Skapars, Edoardo Manino, Youcheng Sun, Lucas C. Cordeiro

    Abstract: Training large language models (LLMs) requires a substantial investment of time and money. To get a good return on investment, the developers spend considerable effort ensuring that the model never produces harmful and offensive outputs. However, bad-faith actors may still try to slander the reputation of an LLM by publicly reporting a forged output. In this paper, we show that defending against s… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 4 pages, 3 figures

  48. arXiv:2407.11018  [pdf, other

    cs.NI eess.SP

    Online Multi-Task Offloading for Semantic-Aware Edge Computing Systems

    Authors: Xuyang Chen, Qu Luo, Gaojie Chen, Daquan Feng, Yao Sun

    Abstract: Mobile edge computing (MEC) provides low-latency offloading solutions for computationally intensive tasks, effectively improving the computing efficiency and battery life of mobile devices. However, for data-intensive tasks or scenarios with limited uplink bandwidth, network congestion might occur due to massive simultaneous offloading nodes, increasing transmission latency and affecting task perf… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  49. arXiv:2407.10105  [pdf, other

    cs.CV cs.AI

    Hierarchical Multi-modal Transformer for Cross-modal Long Document Classification

    Authors: Tengfei Liu, Yongli Hu, Junbin Gao, Yanfeng Sun, Baocai Yin

    Abstract: Long Document Classification (LDC) has gained significant attention recently. However, multi-modal data in long documents such as texts and images are not being effectively utilized. Prior studies in this area have attempted to integrate texts and images in document-related tasks, but they have only focused on short text sequences and images of pages. How to classify long documents with hierarchic… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: IEEE Transactions on Multimedia

  50. arXiv:2407.09852  [pdf

    cs.LG cs.CE

    Free-form Grid Structure Form Finding based on Machine Learning and Multi-objective Optimisation

    Authors: Yiping Meng, Yiming Sun

    Abstract: Free-form structural forms are widely used to design spatial structures for their irregular spatial morphology. Current free-form form-finding methods cannot adequately meet the material properties, structural requirements or construction conditions, which brings the deviation between the initial 3D geometric design model and the constructed free-form structure. Thus, the main focus of this paper… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 11 pages, 9 figures