Showing 1–50 of 847 results for author: Zheng, Z

Searching in archive cs.
  1. arXiv:2409.06744  [pdf, other]

    q-bio.QM cs.AI cs.LG q-bio.BM

    ProteinBench: A Holistic Evaluation of Protein Foundation Models

    Authors: Fei Ye, Zaixiang Zheng, Dongyu Xue, Yuning Shen, Lihao Wang, Yiming Ma, Yan Wang, Xinyou Wang, Xiangxin Zhou, Quanquan Gu

    Abstract: Recent years have witnessed a surge in the development of protein foundation models, significantly improving performance in protein prediction and generative tasks ranging from 3D structure prediction and protein design to conformational dynamics. However, the capabilities and limitations associated with these models remain poorly understood due to the absence of a unified evaluation framework. To…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 29 pages, 1 figure and 11 tables

  2. arXiv:2409.06506  [pdf, other]

    cs.CV cs.GR

    Neural Laplacian Operator for 3D Point Clouds

    Authors: Bo Pang, Zhongtian Zheng, Yilong Li, Guoping Wang, Peng-Shuai Wang

    Abstract: The discrete Laplacian operator holds a crucial role in 3D geometry processing, yet it is still challenging to define it on point clouds. Previous works mainly focused on constructing a local triangulation around each point to approximate the underlying manifold for defining the Laplacian operator, which may not be robust or accurate. In contrast, we simply use the K-nearest neighbors (KNN) graph…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: SIGGRAPH Asia 2024 (Journal Track)

  3. arXiv:2409.04937  [pdf, other]

    cs.SE

    CONNECTOR: Enhancing the Traceability of Decentralized Bridge Applications via Automatic Cross-chain Transaction Association

    Authors: Dan Lin, Jiajing Wu, Yuxin Su, Ziye Zheng, Yuhong Nan, Zibin Zheng

    Abstract: Decentralized bridge applications are important software that connects various blockchains and facilitates cross-chain asset transfer in the decentralized finance (DeFi) ecosystem which currently operates in a multi-chain environment. Cross-chain transaction association identifies and matches unique transactions executed by bridge DApps, which is important research to enhance the traceability of c…

    Submitted 7 September, 2024; originally announced September 2024.

  4. arXiv:2409.03788  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    HSF: Defending against Jailbreak Attacks with Hidden State Filtering

    Authors: Cheng Qian, Hainan Zhang, Lei Sha, Zhiming Zheng

    Abstract: With the growing deployment of LLMs in daily applications like chatbots and content generation, efforts to ensure outputs align with human values and avoid harmful content have intensified. However, increasingly sophisticated jailbreak attacks threaten this alignment, aiming to induce unsafe outputs. Current defense efforts either focus on prompt rewriting or detection, which are limited in effect…

    Submitted 31 August, 2024; originally announced September 2024.

    Comments: 13 pages

  5. arXiv:2409.03752  [pdf, other]

    cs.CL

    Attention Heads of Large Language Models: A Survey

    Authors: Zifan Zheng, Yezhaohui Wang, Yuxin Huang, Shichao Song, Bo Tang, Feiyu Xiong, Zhiyu Li

    Abstract: Since the advent of ChatGPT, Large Language Models (LLMs) have excelled in various tasks but remain largely as black-box systems. Consequently, their development relies heavily on data-driven approaches, limiting performance enhancement through changes in internal architecture and reasoning pathways. As a result, many researchers have begun exploring the potential internal mechanisms of LLMs, aimi…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 20 pages, 11 figures, 4 tables

  6. arXiv:2409.01579  [pdf, other]

    cs.CL cs.AI

    AdaComp: Extractive Context Compression with Adaptive Predictor for Retrieval-Augmented Large Language Models

    Authors: Qianchi Zhang, Hainan Zhang, Liang Pang, Hongwei Zheng, Zhiming Zheng

    Abstract: Retrieved documents containing noise will hinder RAG from detecting answer clues and make the inference process slow and expensive. Therefore, context compression is necessary to enhance its accuracy and efficiency. Existing context compression methods use extractive or generative models to retain the most query-relevant sentences or apply the information bottleneck theory to preserve sufficient i…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 8 pages, 5 figures, code available at https://anonymous.4open.science/r/AdaComp-8C0C/

  7. arXiv:2409.01570  [pdf, other]

    stat.ML cs.LG eess.SP math.ST stat.ME

    Smoothed Robust Phase Retrieval

    Authors: Zhong Zheng, Lingzhou Xue

    Abstract: The phase retrieval problem in the presence of noise aims to recover the signal vector of interest from a set of quadratic measurements with infrequent but arbitrary corruptions, and it plays an important role in many scientific applications. However, the essential geometric structure of the nonconvex robust phase retrieval based on the $\ell_1$-loss is largely unknown to study spurious local solu…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 32 pages, 8 figures

  8. arXiv:2409.01071  [pdf, other]

    cs.CV cs.CL

    VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges

    Authors: Yuxuan Wang, Cihang Xie, Yang Liu, Zilong Zheng

    Abstract: Recent advancements in large-scale video-language models have shown significant potential for real-time planning and detailed interactions. However, their high computational demands and the scarcity of annotated datasets limit their practicality for academic researchers. In this work, we introduce VideoLLaMB, a novel framework that utilizes temporal memory tokens within bridge layers to allow for…

    Submitted 2 September, 2024; originally announced September 2024.

  9. arXiv:2409.00685  [pdf, other]

    cs.CV

    Accurate Forgetting for All-in-One Image Restoration Model

    Authors: Xin Su, Zhuoran Zheng

    Abstract: Privacy protection has always been an ongoing topic, especially for AI. Currently, a low-cost scheme called Machine Unlearning forgets the private data remembered in the model. Specifically, given a private dataset and a trained neural network, we need to use e.g. pruning, fine-tuning, and gradient ascent to remove the influence of the private dataset on the neural network. Inspired by this, we tr…

    Submitted 1 September, 2024; originally announced September 2024.

  10. arXiv:2408.17072  [pdf, other]

    cs.CL

    MaFeRw: Query Rewriting with Multi-Aspect Feedbacks for Retrieval-Augmented Large Language Models

    Authors: Yujing Wang, Hainan Zhang, Liang Pang, Hongwei Zheng, Zhiming Zheng

    Abstract: In a real-world RAG system, the current query often involves spoken ellipses and ambiguous references from dialogue contexts, necessitating query rewriting to better describe the user's information needs. However, traditional context-based rewriting has minimal enhancement on downstream generation tasks due to the lengthy process from query rewriting to response generation. Some researchers try to uti…

    Submitted 30 August, 2024; originally announced August 2024.

  11. arXiv:2408.15501  [pdf, other]

    cs.LG cs.AI

    MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning

    Authors: Yifu Yuan, Zhenrui Zheng, Zibin Dong, Jianye Hao

    Abstract: Multi-objective Reinforcement Learning (MORL) seeks to develop policies that simultaneously optimize multiple conflicting objectives, but it requires extensive online interactions. Offline MORL provides a promising solution by training on pre-collected datasets to generalize to any preference upon deployment. However, real-world offline datasets are often conservatively and narrowly distributed, f…

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 23 pages, 7 figures

  12. arXiv:2408.14765  [pdf, other]

    cs.CV

    CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis

    Authors: Weijia Li, Jun He, Junyan Ye, Huaping Zhong, Zhimeng Zheng, Zilong Huang, Dahua Lin, Conghui He

    Abstract: Satellite-to-street view synthesis aims at generating a realistic street-view image from its corresponding satellite-view image. Although stable diffusion models have exhibited remarkable performance in a variety of image generation applications, their reliance on similar-view inputs to control the generated structure or texture restricts their application to the challenging cross-view synthesis tas…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 21 pages, 11 figures

  13. arXiv:2408.14342  [pdf, other]

    cs.CV physics.med-ph

    Dual-Domain CLIP-Assisted Residual Optimization Perception Model for Metal Artifact Reduction

    Authors: Xinrui Zhang, Ailong Cai, Shaoyu Wang, Linyuan Wang, Zhizhong Zheng, Lei Li, Bin Yan

    Abstract: Metal artifacts in computed tomography (CT) imaging pose significant challenges to accurate clinical diagnosis. The presence of high-density metallic implants results in artifacts that deteriorate image quality, manifesting in the forms of streaking, blurring, or beam hardening effects, etc. Nowadays, various deep learning-based approaches, particularly generative models, have been proposed for me…

    Submitted 29 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: 14 pages, 18 figures

  14. arXiv:2408.13367  [pdf]

    cs.CR cs.ET

    Generative Blockchain: Transforming Blockchain from Transaction Recording to Transaction Generation through Proof-of-Merit

    Authors: Haozhao Zhang, Zhe Zhang, Zhiqiang Zheng, Varghese Jacob

    Abstract: This paper proposes a new paradigm: generative blockchain, which aims to transform conventional blockchain technology by combining transaction generation and recording, rather than focusing solely on transaction recording. Central to our design is a novel consensus mechanism, Proof-of-Merit (PoM), specifically crafted for environments where businesses must solve complex problems before transaction…

    Submitted 23 August, 2024; originally announced August 2024.

  15. arXiv:2408.12791  [pdf, other]

    cs.CV

    Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture

    Authors: Chenqi Kong, Anwei Luo, Peijun Bao, Haoliang Li, Renjie Wan, Zengwei Zheng, Anderson Rocha, Alex C. Kot

    Abstract: Open-set face forgery detection poses significant security threats and presents substantial challenges for existing detection models. These detectors primarily have two limitations: they cannot generalize across unknown forgery domains and inefficiently adapt to new data. To address these issues, we introduce an approach that is both general and parameter-efficient for face forgery detection. It b…

    Submitted 22 August, 2024; originally announced August 2024.

  16. arXiv:2408.12605  [pdf]

    eess.IV cs.AI cs.CV

    Convolutional Neural Networks for Predictive Modeling of Lung Disease

    Authors: Yingbin Liang, Xiqing Liu, Haohao Xia, Yiru Cang, Zitao Zheng, Yuanfang Yang

    Abstract: In this paper, Pro-HRnet-CNN, an innovative model combining HRNet and void-convolution techniques, is proposed for disease prediction under lung imaging. Through the experimental comparison on the authoritative LIDC-IDRI dataset, we found that compared with the traditional ResNet-50, Pro-HRnet-CNN showed better performance in the feature extraction and recognition of small-size nodules, significan…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 7 pages

  17. arXiv:2408.12439  [pdf, other]

    cs.CV

    Adapting MIMO video restoration networks to low latency constraints

    Authors: Valéry Dewil, Zhe Zheng, Arnaud Barral, Lara Raad, Nao Nicolas, Ioannis Cassagne, Jean-michel Morel, Gabriele Facciolo, Bruno Galerne, Pablo Arias

    Abstract: MIMO (multiple input, multiple output) approaches are a recent trend in neural network architectures for video restoration problems, where each network evaluation produces multiple output frames. The video is split into non-overlapping stacks of frames that are processed independently, resulting in a very appealing trade-off between output quality and computational cost. In this work we focus on t…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: See the project web page to download the associated videos

  18. arXiv:2408.11720  [pdf, other]

    cs.LG cs.CV

    On Learnable Parameters of Optimal and Suboptimal Deep Learning Models

    Authors: Ziwei Zheng, Huizhi Liang, Vaclav Snasel, Vito Latora, Panos Pardalos, Giuseppe Nicosia, Varun Ojha

    Abstract: We scrutinize the structural and operational aspects of deep learning models, particularly focusing on the nuances of learnable parameters (weight) statistics, distribution, node interaction, and visualization. By establishing correlations between variance in weight patterns and overall network performance, we investigate the varying (optimal and suboptimal) performances of various deep-learning m…

    Submitted 21 August, 2024; originally announced August 2024.

    Journal ref: 31st International Conference on Neural Information Processing (ICONIP) 2024

  19. arXiv:2408.09698  [pdf, other]

    cs.IR cs.AI

    Harnessing Multimodal Large Language Models for Multimodal Sequential Recommendation

    Authors: Yuyang Ye, Zhi Zheng, Yishan Shen, Tianshu Wang, Hengruo Zhang, Peijun Zhu, Runlong Yu, Kai Zhang, Hui Xiong

    Abstract: Recent advances in Large Language Models (LLMs) have demonstrated significant potential in the field of Recommendation Systems (RSs). Most existing studies have focused on converting user behavior logs into textual prompts and leveraging techniques such as prompt tuning to enable LLMs for recommendation tasks. Meanwhile, research interest has recently grown in multimodal recommendation systems tha…

    Submitted 20 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  20. arXiv:2408.09230  [pdf, other]

    cs.AI

    Siamese Multiple Attention Temporal Convolution Networks for Human Mobility Signature Identification

    Authors: Zhipeng Zheng, Yuchen Jiang, Shiyao Zhang, Xuetao Wei

    Abstract: The Human Mobility Signature Identification (HuMID) problem stands as a fundamental task within the realm of driving style representation, dedicated to discerning latent driving behaviors and preferences from diverse driver trajectories for driver identification. Its solutions hold significant implications across various domains (e.g., ride-hailing, insurance), wherein their application serves to…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 27th IEEE International Conference on Intelligent Transportation Systems (ITSC) (ITSC 2024)

  21. arXiv:2408.09093  [pdf, other]

    cs.CR

    BaThe: Defense against the Jailbreak Attack in Multimodal Large Language Models by Treating Harmful Instruction as Backdoor Trigger

    Authors: Yulin Chen, Haoran Li, Zihao Zheng, Yangqiu Song

    Abstract: Multimodal Large Language Models (MLLMs) have showcased impressive performance in a variety of multimodal tasks. On the other hand, the integration of additional image modality may allow malicious users to inject harmful content inside the images for jailbreaking. Unlike text-based LLMs, where adversaries need to select discrete tokens to conceal their malicious intent using specific algorithm…

    Submitted 17 August, 2024; originally announced August 2024.

  22. arXiv:2408.07084  [pdf]

    cs.LG cs.AI

    Dynamic Hypergraph-Enhanced Prediction of Sequential Medical Visits

    Authors: Wangying Yang, Zitao Zheng, Shi Bo, Zhizhong Wu, Bo Zhang, Yuanfang Yang

    Abstract: This study introduces a pioneering Dynamic Hypergraph Networks (DHCE) model designed to predict future medical diagnoses from electronic health records with enhanced accuracy. The DHCE model innovates by identifying and differentiating acute and chronic diseases within a patient's visit history, constructing dynamic hypergraphs that capture the complex, high-order interactions between diseases. It…

    Submitted 19 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

  23. arXiv:2408.06709  [pdf, other]

    cs.CV

    Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method

    Authors: Xin Su, Zhuoran Zheng, Chen Wu

    Abstract: All-in-one image restoration tasks are becoming increasingly important, especially for ultra-high-definition (UHD) images. Existing all-in-one UHD image restoration methods usually boost the model's performance by introducing prompt or customized dynamized networks for different degradation types. For the inference stage, it might be friendly, but in the training stage, since the model encounters…

    Submitted 13 August, 2024; originally announced August 2024.

  24. arXiv:2408.06245  [pdf, other]

    cs.CV

    Latent Disentanglement for Low Light Image Enhancement

    Authors: Zhihao Zheng, Mooi Choo Chuah

    Abstract: Many learning-based low-light image enhancement (LLIE) algorithms are based on the Retinex theory. However, the Retinex-based decomposition techniques in such models introduce corruptions which limit their enhancement performance. In this paper, we propose a Latent Disentangle-based Enhancement Network (LDE-Net) for low light vision tasks. The latent disentanglement module disentangles the input i…

    Submitted 12 August, 2024; originally announced August 2024.

  25. arXiv:2408.06037  [pdf, other]

    cs.SE

    Hyperion: Unveiling DApp Inconsistencies using LLM and Dataflow-Guided Symbolic Execution

    Authors: Shuo Yang, Xingwei Lin, Jiachi Chen, Qingyuan Zhong, Lei Xiao, Renke Huang, Yanlin Wang, Zibin Zheng

    Abstract: The rapid advancement of blockchain platforms has significantly accelerated the growth of decentralized applications (DApps). Similar to traditional applications, DApps integrate front-end descriptions that showcase their features to attract users, and back-end smart contracts for executing their business logic. However, inconsistencies between the features promoted in front-end descriptions and t…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: Accepted by ICSE 2025

  26. arXiv:2408.05956  [pdf, other]

    cs.CV

    Boosting Adverse Weather Crowd Counting via Multi-queue Contrastive Learning

    Authors: Tianhang Pan, Zhuoran Zheng, Xiuyi Jia

    Abstract: Currently, most crowd counting methods have outstanding performance under normal weather conditions. However, they often struggle to maintain their performance in extreme and adverse weather conditions due to significant differences in the domain and a lack of adverse weather images for training. To address this issue and enhance the model's robustness in adverse weather, we propose a two-stage cr…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 11 pages, 7 figures

  27. arXiv:2408.05542  [pdf, other]

    cs.SE

    You Augment Me: Exploring ChatGPT-based Data Augmentation for Semantic Code Search

    Authors: Yanlin Wang, Lianghong Guo, Ensheng Shi, Wenqing Chen, Jiachi Chen, Wanjun Zhong, Menghan Wang, Hui Li, Hongyu Zhang, Ziyu Lyu, Zibin Zheng

    Abstract: Code search plays a crucial role in software development, enabling developers to retrieve and reuse code using natural language queries. While the performance of code search models improves with an increase in high-quality data, obtaining such data can be challenging and expensive. Recently, large language models (LLMs) such as ChatGPT have made remarkable progress in both natural and programming…

    Submitted 17 August, 2024; v1 submitted 10 August, 2024; originally announced August 2024.

    Comments: Accepted at ICSME 2023

  28. arXiv:2408.02210  [pdf, other]

    cs.CV

    ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning

    Authors: Yuxuan Wang, Alan Yuille, Zhuowan Li, Zilong Zheng

    Abstract: Compositional visual reasoning methods, which translate a complex query into a structured composition of feasible visual tasks, have exhibited a strong potential in complicated multi-modal tasks. Empowered by recent advances in large language models (LLMs), this multi-modal challenge has been brought to a new stage by treating LLMs as few-shot/zero-shot planners, i.e., vision-language (VL) program…

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: To Appear at COLM 2024

  29. arXiv:2408.01800  [pdf, other]

    cs.CV

    MiniCPM-V: A GPT-4V Level MLLM on Your Phone

    Authors: Yuan Yao, Tianyu Yu, Ao Zhang, Chongyi Wang, Junbo Cui, Hongji Zhu, Tianchi Cai, Haoyu Li, Weilin Zhao, Zhihui He, Qianyu Chen, Huarong Zhou, Zhensheng Zou, Haoye Zhang, Shengding Hu, Zhi Zheng, Jie Zhou, Jie Cai, Xu Han, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

    Abstract: The recent surge of Multimodal Large Language Models (MLLMs) has fundamentally reshaped the landscape of AI research and industry, shedding light on a promising path toward the next AI milestone. However, significant challenges remain preventing MLLMs from being practical in real-world applications. The most notable challenge comes from the huge cost of running an MLLM with a massive number of par…

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: preprint

  30. arXiv:2408.01354  [pdf, other]

    cs.CR cs.SE

    MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code

    Authors: Kaiwen Ning, Jiachi Chen, Qingyuan Zhong, Tao Zhang, Yanlin Wang, Wei Li, Yu Zhang, Weizhe Zhang, Zibin Zheng

    Abstract: With the advent of large language models (LLMs), numerous software service providers (SSPs) are dedicated to developing LLMs customized for code generation tasks, such as CodeLlama and Copilot. However, these LLMs can be leveraged by attackers to create malicious software, which may pose potential threats to the software ecosystem. For example, they can automate the creation of advanced phishing m…

    Submitted 2 August, 2024; originally announced August 2024.

  31. arXiv:2408.00619  [pdf, other]

    cs.CV

    Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection

    Authors: Ruiyang Zhang, Hu Zhang, Hang Yu, Zhedong Zheng

    Abstract: Unsupervised 3D object detection aims to identify objects of interest from unlabeled raw data, such as LiDAR points. Recent approaches usually adopt pseudo 3D bounding boxes (3D bboxes) from a clustering algorithm to initialize the model training, and then iteratively update both pseudo labels and the trained model. However, pseudo bboxes inevitably contain noise, and such inaccurate annotation a…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Preprint, 14 pages, 4 figures, 4 tables

  32. arXiv:2407.20874  [pdf, ps, other]

    cs.CR

    On the MacWilliams Theorem over Codes and Lattices

    Authors: Zhiyong Zheng, Fengxia Liu, Kun Tian

    Abstract: Analogies between codes and lattices have been extensively studied over the last decades; in this dictionary, the MacWilliams identity is the finite analog of the Jacobi-Poisson formula of the Theta function. Motivated by the random theory of lattices, the statistical significance of the MacWilliams theorem is considered; indeed, the MacWilliams distribution provides a finite analog of the classical Gauss…

    Submitted 30 July, 2024; originally announced July 2024.

  33. arXiv:2407.20042  [pdf, other]

    cs.SE

    When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention

    Authors: Lianghong Guo, Yanlin Wang, Ensheng Shi, Wanjun Zhong, Hongyu Zhang, Jiachi Chen, Ruikai Zhang, Yuchi Ma, Zibin Zheng

    Abstract: Code generation aims to automatically generate code snippets that meet given natural language requirements and plays an important role in software development. Although Code LLMs have shown excellent performance in this domain, their long generation time poses a significant limitation in practical use. In this paper, we first conduct an in-depth preliminary study with different Code LLMs on code…

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: To appear at ISSTA 2024

  34. arXiv:2407.19740  [pdf, other]

    cs.CL cs.AI

    KNOWCOMP POKEMON Team at DialAM-2024: A Two-Stage Pipeline for Detecting Relations in Dialogical Argument Mining

    Authors: Zihao Zheng, Zhaowei Wang, Qing Zong, Yangqiu Song

    Abstract: Dialogical Argument Mining (DialAM) is an important branch of Argument Mining (AM). DialAM-2024 is a shared task focusing on dialogical argument mining, which requires us to identify argumentative relations and illocutionary relations among proposition nodes and locution nodes. To accomplish this, we propose a two-stage pipeline, which includes the Two-Step S-Node Prediction Model in Stage 1 and the…

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Published at the 11th Workshop on Argument Mining

  35. arXiv:2407.19487  [pdf, other]

    cs.SE

    RLCoder: Reinforcement Learning for Repository-Level Code Completion

    Authors: Yanlin Wang, Yanli Wang, Daya Guo, Jiachi Chen, Ruikai Zhang, Yuchi Ma, Zibin Zheng

    Abstract: Repository-level code completion aims to generate code for unfinished code snippets within the context of a specified repository. Existing approaches mainly rely on retrieval-augmented generation strategies due to limitations in input sequence length. However, traditional lexical-based retrieval methods like BM25 struggle to capture code semantics, while model-based retrieval methods face challeng…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: To appear at ICSE 2025

    Journal ref: 47th International Conference on Software Engineering (ICSE 2025)

  36. arXiv:2407.17101  [pdf, other]

    cs.CV cs.AI

    PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning

    Authors: Mu Chen, Zhedong Zheng, Yi Yang

    Abstract: Unsupervised domain adaptive segmentation aims to improve the segmentation accuracy of models on target domains without relying on labeled data from those domains. This approach is crucial when labeled target domain data is scarce or unavailable. It seeks to align the feature representations of the source domain (where labeled data is available) and the target domain (where only unlabeled data is…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: This study is under IEEE TMM review. arXiv admin note: substantial text overlap with arXiv:2211.07609

  37. arXiv:2407.15317  [pdf, other]

    cs.CV

    Open-CD: A Comprehensive Toolbox for Change Detection

    Authors: Kaiyu Li, Jiawei Jiang, Andrea Codegoni, Chengxi Han, Yupeng Deng, Keyan Chen, Zhuo Zheng, Hao Chen, Zhengxia Zou, Zhenwei Shi, Sheng Fang, Deyu Meng, Zhi Wang, Xiangyong Cao

    Abstract: We present Open-CD, a change detection toolbox that contains a rich set of change detection methods as well as related components and modules. The toolbox started from a series of open source general vision task tools, including OpenMMLab Toolkits, PyTorch Image Models, etc. It gradually evolves into a unified platform that covers many popular change detection methods and contemporary modules. It…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: 9 pages

  38. arXiv:2407.15252  [pdf, other]

    cs.CV

    An Adaptive System for Wearable Devices to Detect Stress Using Physiological Signals

    Authors: Gelei Xu, Ruiyang Qin, Zhi Zheng, Yiyu Shi

    Abstract: Timely stress detection is crucial for protecting vulnerable groups from long-term detrimental effects by enabling early intervention. Wearable devices, by collecting real-time physiological signals, offer a solution for accurate stress detection accommodating individual differences. This position paper introduces an adaptive framework for personalized stress detection using PPG and EDA signals. U…

    Submitted 21 July, 2024; originally announced July 2024.

  39. arXiv:2407.15070  [pdf, other]

    cs.CV

    3D Gaussian Parametric Head Model

    Authors: Yuelang Xu, Lizhen Wang, Zerong Zheng, Zhaoqi Su, Yebin Liu

    Abstract: Creating high-fidelity 3D human head avatars is crucial for applications in VR/AR, telepresence, digital human interfaces, and film production. Recent advances have leveraged morphable face models to generate animated head avatars from easily accessible data, representing varying identities and expressions within a low-dimensional parametric space. However, existing methods often struggle with mod…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: project page: https://yuelangx.github.io/gphm/

  40. arXiv:2407.14507  [pdf, other]

    cs.CL

    Internal Consistency and Self-Feedback in Large Language Models: A Survey

    Authors: Xun Liang, Shichao Song, Zifan Zheng, Hanyu Wang, Qingchen Yu, Xunkai Li, Rong-Hua Li, Peng Cheng, Zhonghao Wang, Feiyu Xiong, Zhiyu Li

    Abstract: Large language models (LLMs) often exhibit deficient reasoning or generate hallucinations. To address these, studies prefixed with "Self-" such as Self-Consistency, Self-Improve, and Self-Refine have been initiated. They share a commonality: involving LLMs evaluating and updating themselves. Nonetheless, these efforts lack a unified perspective on summarization, as existing surveys predominantly f…

    Submitted 29 August, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: 24 pages, 9 figures, 7 tables, 14 equations

  41. arXiv:2407.14266  [pdf, other]

    cs.IR cs.LG

    L^2CL: Embarrassingly Simple Layer-to-Layer Contrastive Learning for Graph Collaborative Filtering

    Authors: Xinzhou Jin, Jintang Li, Liang Chen, Chenyun Yu, Yuanzhen Xie, Tao Xie, Chengxiang Zhuo, Zang Li, Zibin Zheng

    Abstract: Graph neural networks (GNNs) have recently emerged as an effective approach to model neighborhood signals in collaborative filtering. Towards this research line, graph contrastive learning (GCL) demonstrates robust capabilities to address the supervision label shortage issue through generating massive self-supervised signals. Despite its effectiveness, GCL for recommendation suffers seriously from…

    Submitted 19 July, 2024; originally announced July 2024.

  42. Identifying Smart Contract Security Issues in Code Snippets from Stack Overflow

    Authors: Jiachi Chen, Chong Chen, Jiang Hu, John Grundy, Yanlin Wang, Ting Chen, Zibin Zheng

    Abstract: Smart contract developers frequently seek solutions to developmental challenges on Q&A platforms such as Stack Overflow (SO). Although community responses often provide viable solutions, the embedded code snippets can also contain hidden vulnerabilities. Integrating such code directly into smart contracts may make them susceptible to malicious attacks. We conducted an online survey and received 74…

    Submitted 23 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  43. arXiv:2407.12622  [pdf, other

    cs.CV

    Rethinking the Architecture Design for Efficient Generic Event Boundary Detection

    Authors: Ziwei Zheng, Zechuan Zhang, Yulin Wang, Shiji Song, Gao Huang, Le Yang

    Abstract: Generic event boundary detection (GEBD), inspired by human visual cognitive behaviors of consistently segmenting videos into meaningful temporal chunks, finds utility in various applications such as video editing. In this paper, we demonstrate that SOTA GEBD models often prioritize final performance over model complexity, resulting in low inference speed and hindering efficient deployment in r…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: ACM MM 2024

  44. arXiv:2407.12112  [pdf, other

    cs.LG cs.CY cs.SI

    A Benchmark for Fairness-Aware Graph Learning

    Authors: Yushun Dong, Song Wang, Zhenyu Lei, Zaiyi Zheng, Jing Ma, Chen Chen, Jundong Li

    Abstract: Fairness-aware graph learning has gained increasing attention in recent years. Nevertheless, the field lacks a comprehensive benchmark to evaluate and compare different fairness-aware graph learning methods, which blocks practitioners from choosing appropriate ones for broader real-world applications. In this paper, we present an extensive benchmark on ten representative fairness-aware graph learning…

    Submitted 16 July, 2024; originally announced July 2024.

  45. arXiv:2407.11095  [pdf, other

    cs.LG cs.AI

    DeepGate3: Towards Scalable Circuit Representation Learning

    Authors: Zhengyuan Shi, Ziyang Zheng, Sadaf Khan, Jianyuan Zhong, Min Li, Qiang Xu

    Abstract: Circuit representation learning has shown promising results in advancing the field of Electronic Design Automation (EDA). Existing models, such as DeepGate Family, primarily utilize Graph Neural Networks (GNNs) to encode circuit netlists into gate-level embeddings. However, the scalability of GNN-based models is fundamentally constrained by architectural limitations, impacting their ability to gen…

    Submitted 14 July, 2024; originally announced July 2024.

  46. arXiv:2407.10714  [pdf, other

    cs.IR cs.AI

    SEMINAR: Search Enhanced Multi-modal Interest Network and Approximate Retrieval for Lifelong Sequential Recommendation

    Authors: Kaiming Shen, Xichen Ding, Zixiang Zheng, Yuqi Gong, Qianqian Li, Zhongyi Liu, Guannan Zhang

    Abstract: The modeling of users' behaviors is crucial in modern recommendation systems. A lot of research focuses on modeling users' lifelong sequences, which can be extremely long and sometimes exceed thousands of items. These models use the target item to search for the most relevant items from the historical sequence. However, training lifelong sequences in click-through rate (CTR) prediction or personal…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 9 pages, code released

  47. arXiv:2407.09053  [pdf, other

    cs.RO

    Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing

    Authors: Jun Zhu, Zihao Du, Haotian Xu, Fengbo Lan, Zilong Zheng, Bo Ma, Shengjie Wang, Tao Zhang

    Abstract: Task-aware navigation continues to be a challenging area of research, especially in scenarios involving open vocabulary. Previous studies primarily focus on finding suitable locations for task completion, often overlooking the importance of the robot's pose. However, the robot's orientation is crucial for successfully completing tasks because of how objects are arranged (e.g., to open a refrigerat…

    Submitted 12 July, 2024; originally announced July 2024.

  48. arXiv:2407.09026  [pdf, other

    cs.CV cs.LG cs.MM eess.IV

    HPC: Hierarchical Progressive Coding Framework for Volumetric Video

    Authors: Zihan Zheng, Houqiang Zhong, Qiang Hu, Xiaoyun Zhang, Li Song, Ya Zhang, Yanfeng Wang

    Abstract: Volumetric video based on Neural Radiance Field (NeRF) holds vast potential for various 3D applications, but its substantial data volume poses significant challenges for compression and transmission. Current NeRF compression lacks the flexibility to adjust video quality and bitrate within a single model for various network and device capacities. To address these issues, we propose HPC, a novel hie…

    Submitted 2 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 11 pages, 7 figures, ACM Multimedia 24

  49. arXiv:2407.08933  [pdf

    cs.LG

    Machine Learning in High Volume Media Manufacturing

    Authors: Siddarth Reddy Karuka, Abhinav Sunderrajan, Zheng Zheng, Yong Woon Tiean, Ganesh Nagappan, Allan Luk

    Abstract: Errors or failures in a high-volume manufacturing environment can have a significant impact, resulting in the loss of both time and money. Identifying such failures early has been a top priority for manufacturing industries and various rule-based algorithms have been developed over the years. However, catching these failures is time-consuming and such algorithms cannot adapt well to changes in…

    Submitted 11 July, 2024; originally announced July 2024.

  50. arXiv:2407.08569  [pdf, other

    cs.CV

    Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

    Authors: Ruiyang Zhang, Hu Zhang, Hang Yu, Zhedong Zheng

    Abstract: Unsupervised 3D object detection aims to accurately detect objects in unstructured environments with no explicit supervisory signals. This task, given sparse LiDAR point clouds, often results in compromised performance for detecting distant or small objects due to the inherent sparsity and limited spatial resolution. This paper is among the early attempts to integrate LiDAR data with 2D…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV'24, 18 pages, 5 figures, 6 tables