Skip to main content

Showing 1–50 of 2,063 results for author: Ma, S

.
  1. arXiv:2409.06946  [pdf, other

    cs.IT eess.SP

    Refracting Reconfigurable Intelligent Surface Assisted URLLC for Millimeter Wave High-Speed Train Communication Coverage Enhancement

    Authors: Changzhu Liu, Ruisi He, Yong Niu, Shiwen Mao, Bo Ai, Ruifeng Chen

    Abstract: High-speed train (HST) has garnered significant attention from both academia and industry due to the rapid development of railways worldwide. Millimeter wave (mmWave) communication, known for its large bandwidth is an effective way to address performance bottlenecks in cellular network based HST wireless communication systems. However, mmWave signals suffer from significant path loss when traversi… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 11 figures, accepted by IEEE Transactions on Vehicular Technology

  2. arXiv:2409.06747  [pdf, other

    astro-ph.IM astro-ph.CO

    Wave effect of gravitational waves intersected with a microlens field II: an adaptive hierarchical tree algorithm and population study

    Authors: Xikai Shan, Guoliang Li, Xuechun Chen, Wen Zhao, Bin Hu, Shude Mao

    Abstract: The gravitational lensing wave effect generated by a microlensing field embedded in a lens galaxy is an inevitable phenomenon in strong lensed gravitational waves (SLGWs). This effect presents both challenges and opportunities for the detection and application of SLGWs. However, investigating this wave effect requires computing a complete diffraction integral over each microlens in the field. This… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 19 pages, 11 figures, minor revision before publication

  3. arXiv:2409.06141  [pdf, other

    gr-qc

    Einstein-Klein-Gordon system via Cauchy-characteristic evolution: Computation of memory and ringdown tail

    Authors: Sizheng Ma, Kyle C. Nelli, Jordan Moxon, Mark A. Scheel, Nils Deppe, Lawrence E. Kidder, William Throwe, Nils L. Vu

    Abstract: Cauchy-characteristic evolution (CCE) is a powerful method for accurately extracting gravitational waves at future null infinity. In this work, we extend the previously implemented CCE system within the numerical relativity code SpECTRE by incorporating a scalar field. This allows the system to capture features of beyond-general-relativity theories. We derive scalar contributions to the equations… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  4. arXiv:2409.05229  [pdf, other

    astro-ph.GA

    Two Channels of Metal-Rich Compact Stellar System Formation: Starbursts Under High Ram Pressure vs. Tidal Stripping

    Authors: Yuan Bian, Min Du, Victor P. Debattista, Dylan Nelson, Mark A. Norris, Luis C. Ho, Shuai Lu, Renyue Cen, Shuo Ma, Chong Ge, Taotao Fang, Hui Li

    Abstract: Most galaxies follow well-defined scaling relations of metallicity and stellar mass; however, some outliers at the low mass end of the observed galaxy population exhibit unusually high metallicity for their mass. Understanding how these objects get to be so metal-rich is vital for understanding the role of feedback in galaxy formation. Using the TNG50 simulation, we explore the origins of this phe… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 28 pages, 13 figures. Submitted

  5. arXiv:2409.04315  [pdf, ps, other

    math.AG math.NT

    Siegel operators for holomorphic differential forms

    Authors: Shouhei Ma

    Abstract: We give a geometric interpretation of the Siegel operators for holomorphic differential forms on Siegel modular varieties. This involves extension of the differential forms over a toroidal compactification, and we show that the Siegel operator essentially describes the restriction and descent to the boundary Kuga variety via holomorphic Leray filtration. As a consequence, we obtain equivalence of… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    MSC Class: 11F46; 11F55; 11F75

  6. arXiv:2409.02260  [pdf, other

    math.OC

    Penalty Adversarial Network (PAN): A neural network-based method to solve PDE-constrained optimal control problems

    Authors: Shilin Ma, Yukun Yue

    Abstract: In this work, we introduce a novel strategy for tackling constrained optimization problems through a modified penalty method. Conventional penalty methods convert constrained problems into unconstrained ones by incorporating constraints into the loss function via a penalty term. However, selecting an optimal penalty parameter remains challenging; an improper choice, whether excessively high or low… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  7. arXiv:2409.02157  [pdf, other

    astro-ph.EP astro-ph.SR

    An Earth-Mass Planet and a Brown Dwarf in Orbit Around a White Dwarf

    Authors: Keming Zhang, Weicheng Zang, Kareem El-Badry, Jessica R. Lu, Joshua S. Bloom, Eric Agol, B. Scott Gaudi, Quinn Konopacky, Natalie LeBaron, Shude Mao, Sean Terry

    Abstract: Terrestrial planets born beyond 1-3 AU have been theorized to avoid being engulfed during the red-giant phases of their host stars. Nevertheless, only a few gas-giant planets have been observed around white dwarfs (WDs) -- the end product left behind by a red giant. Here we report on evidence that the lens system that produced the microlensing event KMT-2020-BLG-0414 is composed of a WD orbited by… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: Accepted. 25 pages, 7 figures, 4 tables

  8. arXiv:2409.00956  [pdf

    eess.IV cs.CV

    Physics-Informed Neural Network Based Digital Image Correlation Method

    Authors: Boda Li, Shichao Zhou, Qinwei Ma, Shaopeng Ma

    Abstract: Digital Image Correlation (DIC) is a key technique in experimental mechanics for full-field deformation measurement, traditionally relying on subset matching to determine displacement fields. However, selecting optimal parameters like shape functions and subset size can be challenging in non-uniform deformation scenarios. Recent deep learning-based DIC approaches, both supervised and unsupervised,… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  9. arXiv:2408.15245  [pdf, other

    cs.CV cs.AI

    An Edge AI System Based on FPGA Platform for Railway Fault Detection

    Authors: Jiale Li, Yulin Fu, Dongwei Yan, Sean Longyu Ma, Chiu-Wing Sham

    Abstract: As the demands for railway transportation safety increase, traditional methods of rail track inspection no longer meet the needs of modern railway systems. To address the issues of automation and efficiency in rail fault detection, this study introduces a railway inspection system based on Field Programmable Gate Array (FPGA). This edge AI system collects track images via cameras and uses Convolut… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Accepted at the 2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  10. arXiv:2408.14478  [pdf, other

    q-bio.NC cs.AI cs.CY cs.IT

    Uncertainty Quantification in Alzheimer's Disease Progression Modeling

    Authors: Wael Mobeirek, Shirley Mao

    Abstract: With the increasing number of patients diagnosed with Alzheimer's Disease, prognosis models have the potential to aid in early disease detection. However, current approaches raise dependability concerns as they do not account for uncertainty. In this work, we compare the performance of Monte Carlo Dropout, Variational Inference, Markov Chain Monte Carlo, and Ensemble Learning trained on 512 patien… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: This work was done as part of degree requirements for the authors in 2021-2022

  11. arXiv:2408.13960  [pdf, other

    cs.LG cs.AI cs.CY

    Time Series Analysis for Education: Methods, Applications, and Future Directions

    Authors: Shengzhong Mao, Chaoli Zhang, Yichi Song, Jindong Wang, Xiao-Jun Zeng, Zenglin Xu, Qingsong Wen

    Abstract: Recent advancements in the collection and analysis of sequential educational data have brought time series analysis to a pivotal position in educational research, highlighting its essential role in facilitating data-driven decision-making. However, there is a lack of comprehensive summaries that consolidate these advancements. To the best of our knowledge, this paper is the first to provide a comp… ▽ More

    Submitted 27 August, 2024; v1 submitted 25 August, 2024; originally announced August 2024.

    Comments: 24 pages, 3 figures, 6 tables, project page: see https://github.com/ai-for-edu/time-series-analysis-for-education

  12. arXiv:2408.13759  [pdf, other

    cs.RO

    MASQ: Multi-Agent Reinforcement Learning for Single Quadruped Robot Locomotion

    Authors: Qi Liu, Jingxiang Guo, Sixu Lin, Shuaikang Ma, Jinxuan Zhu, Yanjie Li

    Abstract: This paper proposes a novel method to improve locomotion learning for a single quadruped robot using multi-agent deep reinforcement learning (MARL). Many existing methods use single-agent reinforcement learning for an individual robot or MARL for the cooperative task in multi-robot systems. Unlike existing methods, this paper proposes using MARL for the locomotion learning of a single quadruped ro… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

  13. arXiv:2408.11398  [pdf, other

    eess.SP

    Generative AI based Secure Wireless Sensing for ISAC Networks

    Authors: Jiacheng Wang, Hongyang Du, Yinqiu Liu, Geng Sun, Dusit Niyato, Shiwen Mao, Dong In Kim, Xuemin Shen

    Abstract: Integrated sensing and communications (ISAC) is expected to be a key technology for 6G, and channel state information (CSI) based sensing is a key component of ISAC. However, current research on ISAC focuses mainly on improving sensing performance, overlooking security issues, particularly the unauthorized sensing of users. In this paper, we propose a secure sensing system (DFSS) based on two dist… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  14. arXiv:2408.11313  [pdf, other

    cs.AI

    Unlocking Adversarial Suffix Optimization Without Affirmative Phrases: Efficient Black-box Jailbreaking via LLM as Optimizer

    Authors: Weipeng Jiang, Zhenting Wang, Juan Zhai, Shiqing Ma, Zhengyu Zhao, Chao Shen

    Abstract: Despite prior safety alignment efforts, mainstream LLMs can still generate harmful and unethical content when subjected to jailbreaking attacks. Existing jailbreaking methods fall into two main categories: template-based and optimization-based methods. The former requires significant manual effort and domain knowledge, while the latter, exemplified by Greedy Coordinate Gradient (GCG), which seeks… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  15. arXiv:2408.08977  [pdf, other

    cs.DC

    FedFQ: Federated Learning with Fine-Grained Quantization

    Authors: Haowei Li, Weiying Xie, Hangyu Ye, Jitao Ma, Shuran Ma, Yunsong Li

    Abstract: Federated learning (FL) is a decentralized approach, enabling multiple participants to collaboratively train a model while ensuring the protection of data privacy. The transmission of updates from numerous edge clusters to the server creates a significant communication bottleneck in FL. Quantization is an effective compression technology, showcasing immense potential in addressing this bottleneck… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  16. arXiv:2408.08862  [pdf, other

    cs.LG

    Visual Agents as Fast and Slow Thinkers

    Authors: Guangyan Sun, Mingyu Jin, Zhenting Wang, Cheng-Long Wang, Siqi Ma, Qifan Wang, Ying Nian Wu, Yongfeng Zhang, Dongfang Liu

    Abstract: Achieving human-level intelligence requires refining cognitive distinctions between System 1 and System 2 thinking. While contemporary AI, driven by large language models, demonstrates human-like traits, it falls short of genuine cognition. Transitioning from structured benchmarks to real-world scenarios presents challenges for visual agents, often leading to inaccurate and overly confident respon… ▽ More

    Submitted 6 September, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

  17. arXiv:2408.08833  [pdf, other

    eess.SP

    Intra-symbol Differential Amplitude Shift Keying-aided Blind Detector for Ambient Backscatter Communication Systems

    Authors: Shuaijun Ma, Peng Wei, Sa Xiao, Jianquan Wang, Wanbin Tang, Wei Xiang

    Abstract: Ambient backscatter communications (AmBC) are a promising technology for addressing the energy consumption challenge in wireless communications through the reflection or absorption of surrounding radio frequency (RF) signals. However, it grapples with the intricacies of ambient RF signal and the round-trip path loss. For traditional detectors, the incorporation of pilot sequences results in a redu… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  18. arXiv:2408.08765  [pdf, other

    cs.NI

    Rethinking Generative Semantic Communication for Multi-User Systems with Multi-Modal LLM

    Authors: Wanting Yang, Zehui Xiong, Shiwen Mao, Tony Q. S. Quek, Ping Zhang, Merouane Debbah, Rahim Tafazolli

    Abstract: The surge in connected devices in 6G with typical massive access scenarios, such as smart agriculture, and smart cities, poses significant challenges to unsustainable traditional communication with limited radio resources and already high system complexity. Fortunately, the booming artificial intelligence technology and the growing computational power of devices offer a promising 6G enabler: seman… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  19. arXiv:2408.06648  [pdf, other

    cs.RO

    A Miniature Vision-Based Localization System for Indoor Blimps

    Authors: Shicong Ma

    Abstract: With increasing attention paid to blimp research, I hope to build an indoor blimp to interact with humans. To begin with, I propose developing a visual localization system to enable blimps to localize themselves in an indoor environment autonomously. This system initially reconstructs an indoor environment by employing Structure from Motion with Superpoint visual features. Next, with the previousl… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  20. Towards Effective and Interpretable Semantic Communications

    Authors: Youlong Wu, Yuanmin Shi, Shuai Ma, Chunxiao Jiang, Wei Zhang, Khaled B. Letaief

    Abstract: With the exponential surge in traffic data and the pressing need for ultra-low latency in emerging intelligence applications, it is envisioned that 6G networks will demand disruptive communication technologies to foster ubiquitous intelligence and succinctness within the human society. Semantic communication, a novel paradigm, holds the promise of significantly curtailing communication overhead an… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted by IEEE Network Magazine

  21. arXiv:2408.04682  [pdf, other

    cs.CL cs.AI cs.LG

    ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

    Authors: Jiarui Lu, Thomas Holleis, Yizhe Zhang, Bernhard Aumayer, Feng Nan, Felix Bai, Shuang Ma, Shen Ma, Mengyu Li, Guoli Yin, Zirui Wang, Ruoming Pang

    Abstract: Recent large language models (LLMs) advancements sparked a growing research interest in tool assisted LLMs solving real-world challenges, which calls for comprehensive evaluation of tool-use capabilities. While previous works focused on either evaluating over stateless web services (RESTful API), based on a single turn user prompt, or an off-policy dialog trajectory, ToolSandbox includes stateful… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  22. SDSS-IV MaNGA: Stellar rotational support in disk galaxies vs. central surface density and stellar population age

    Authors: Xiaohan Wang, Yifei Luo, S. M. Faber, David C. Koo, Shude Mao, Kyle B. Westfall, Shengdong Lu, Weichen Wang, Kevin Bundy, N. Boardman, Vladimir Avila-Reese, José G. Fernández-Trincado, Richard R. Lane

    Abstract: We investigate how the stellar rotational support changes as a function of spatially resolved stellar population age ($\rm D_n4000$) and relative central stellar surface density ($ΔΣ_1$) for MaNGA isolated/central disk galaxies. We find that the galaxy rotational support $λ_{R_\mathrm{e}}$ varies smoothly as a function of $ΔΣ_1$ and $\rm D_n4000$. $\rm D_n4000$ vs. $ΔΣ_1$ follows a "J-shape", with… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 24 pages, 22 figures (including Appendix), accepted for publication in MNRAS

  23. arXiv:2408.02103  [pdf, other

    cs.CL

    Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process

    Authors: Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang

    Abstract: In-context learning (ICL) is a few-shot learning paradigm that involves learning mappings through input-output pairs and appropriately applying them to new instances. Despite the remarkable ICL capabilities demonstrated by Large Language Models (LLMs), existing works are highly dependent on large-scale labeled support sets, not always feasible in practical scenarios. To refine this approach, we fo… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

  24. arXiv:2408.01946  [pdf, other

    cs.CV

    Masked Angle-Aware Autoencoder for Remote Sensing Images

    Authors: Zhihao Li, Biao Hou, Siteng Ma, Zitong Wu, Xianpeng Guo, Bo Ren, Licheng Jiao

    Abstract: To overcome the inherent domain gap between remote sensing (RS) images and natural images, some self-supervised representation learning methods have made promising progress. However, they have overlooked the diverse angles present in RS objects. This paper proposes the Masked Angle-Aware Autoencoder (MA3E) to perceive and learn angles during pre-training. We design a \textit{scaling center crop} o… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted by ECCV 2024

  25. arXiv:2408.01937  [pdf

    astro-ph.SR astro-ph.IM

    Inflight Performance and Calibrations of the Lyman-alpha Solar Telescope on board the Advanced Space-based Solar Observatory

    Authors: Bo Chen, Li Feng, Guang Zhang, Hui Li, Lingping He, Kefei Song, Quanfeng Guo, Ying Li, Yu Huang, Jingwei Li, Jie Zhao, Jianchao Xue, Gen Li, Guanglu Shi, Dechao Song, Lei Lu, Beili Ying, Haifeng Wang, Shuang Dai, Xiaodong Wang, Shilei Mao, Peng Wang, Kun Wu, Shuai Ren, Liang Sun , et al. (18 additional authors not shown)

    Abstract: The Lyman-alpha Solar Telescope (LST) on board the Advanced Space-based Solar Observatory (ASO-S) is the first payload to image the full solar disk and the solar corona in both white-light (WL) and ultraviolet (UV) H I Lya, extending up to 2.5 solar radii (Rs). Since the launch of the ASO-S on 9 October 2022, LST has captured various significant solar activities including flares, prominences, coro… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: Solar Physics (ASO-S mission topical collection), accepted

  26. arXiv:2408.01173  [pdf, other

    cs.NI cs.LG

    Sustainable Diffusion-based Incentive Mechanism for Generative AI-driven Digital Twins in Industrial Cyber-Physical Systems

    Authors: Jinbo Wen, Jiawen Kang, Dusit Niyato, Yang Zhang, Shiwen Mao

    Abstract: Industrial Cyber-Physical Systems (ICPSs) are an integral component of modern manufacturing and industries. By digitizing data throughout the product life cycle, Digital Twins (DTs) in ICPSs enable a shift from current industrial infrastructures to intelligent and adaptive infrastructures. Thanks to data process capability, Generative Artificial Intelligence (GAI) can drive the construction and up… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  27. arXiv:2408.01090  [pdf, other

    cs.CL cs.AR cs.NE

    General-purpose Dataflow Model with Neuromorphic Primitives

    Authors: Weihao Zhang, Yu Du, Hongyi Li, Songchen Ma, Rong Zhao

    Abstract: Neuromorphic computing exhibits great potential to provide high-performance benefits in various applications beyond neural networks. However, a general-purpose program execution model that aligns with the features of neuromorphic computing is required to bridge the gap between program versatility and neuromorphic hardware efficiency. The dataflow model offers a potential solution, but it faces hig… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  28. arXiv:2407.21075  [pdf, other

    cs.AI cs.CL cs.LG

    Apple Intelligence Foundation Language Models

    Authors: Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek , et al. (130 additional authors not shown)

    Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  29. arXiv:2407.19174  [pdf, other

    cs.CV

    Reducing Spurious Correlation for Federated Domain Generalization

    Authors: Shuran Ma, Weiying Xie, Daixun Li, Haowei Li, Yunsong Li

    Abstract: The rapid development of multimedia has provided a large amount of data with different distributions for visual tasks, forming different domains. Federated Learning (FL) can efficiently use this diverse data distributed on different client media in a decentralized manner through model sharing. However, in open-world scenarios, there is a challenge: global models may struggle to predict well on ent… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures

  30. arXiv:2407.18961  [pdf, other

    cs.AI

    MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

    Authors: Guoli Yin, Haoping Bai, Shuang Ma, Feng Nan, Yanchao Sun, Zhaoyang Xu, Shen Ma, Jiarui Lu, Xiang Kong, Aonan Zhang, Dian Ang Yap, Yizhe zhang, Karsten Ahnert, Vik Kamath, Mathias Berglund, Dominic Walsh, Tobias Gindele, Juergen Wiest, Zhengfeng Lai, Xiaoming Wang, Jiulong Shan, Meng Cao, Ruoming Pang, Zirui Wang

    Abstract: Recent advances in large language models (LLMs) have increased the demand for comprehensive benchmarks to evaluate their capabilities as human-like agents. Existing benchmarks, while useful, often focus on specific application scenarios, emphasizing task completion but failing to dissect the underlying skills that drive these outcomes. This lack of granularity makes it difficult to deeply discern… ▽ More

    Submitted 15 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  31. arXiv:2407.18171  [pdf, other

    physics.bio-ph cond-mat.soft

    Chemically reactive and aging macromolecular mixtures II: Phase separation and coarsening

    Authors: Ruoyao Zhang, Sheng Mao, Mikko P. Haataja

    Abstract: In a companion paper, we put forth a thermodynamic model for complex formation via a chemical reaction involving multiple macromolecular species, which may subsequently undergo liquid-liquid phase separation and a further transition into a gel-like state. In the present work, we formulate a thermodynamically consistent kinetic framework to study the interplay between phase separation, chemical rea… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures

  32. arXiv:2407.16748  [pdf, other

    astro-ph.GA

    The dispersion measure and rotation measure from fast radio burst host galaxies based on the IllustrisTNG50 simulation

    Authors: Timea Orsolya Kovacs, Sui Ann Mao, Aritra Basu, Yik Ki Ma, Laura G. Spitler, Charles R. H. Walker

    Abstract: Fast radio bursts (FRB) will become important cosmological tools, as the number of observed FRBs is increasing rapidly with more surveys being carried out. A large sample of FRBs with dispersion measures (DM) and rotation measures (RM) can be used to study the intergalactic magnetic field. However, the observed DM and RM of FRBs have multiple contributors which must be quantified to obtain the int… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 24 pages, 15 figures Accepted for publication in A&A

  33. arXiv:2407.15498  [pdf, other

    cs.CL

    Refining Corpora from a Model Calibration Perspective for Chinese Spelling Correction

    Authors: Dingyao Yu, Yang An, Wei Ye, Xiongfeng Xiao, Shaoguang Mao, Tao Ge, Shikun Zhang

    Abstract: Chinese Spelling Correction (CSC) commonly lacks large-scale high-quality corpora, due to the labor-intensive labeling of spelling errors in real-life human writing or typing scenarios. Two data augmentation methods are widely adopted: (1) \textit{Random Replacement} with the guidance of confusion sets and (2) \textit{OCR/ASR-based Generation} that simulates character misusing. However, both metho… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  34. arXiv:2407.15395  [pdf, other

    eess.SP

    FAST-GSC: Fast and Adaptive Semantic Transmission for Generative Semantic Communication

    Authors: Yiru Wang, Wanting Yang, Zehui Xiong, Yuping Zhao, Shiwen Mao, Tony Q. S. Quek, H. Vincent Poor

    Abstract: The rapidly evolving field of generative artificial intelligence technology has introduced innovative approaches for developing semantic communication (SemCom) frameworks, leading to the emergence of a new paradigm-generative SemCom (GSC). However, the complex processes involved in semantic extraction and generative inference may result in considerable latency in resource-constrained scenarios. To… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  35. arXiv:2407.14814  [pdf, other

    cs.LG

    FMamba: Mamba based on Fast-attention for Multivariate Time-series Forecasting

    Authors: Shusen Ma, Yu Kang, Peng Bai, Yun-Bo Zhao

    Abstract: In multivariate time-series forecasting (MTSF), extracting the temporal correlations of the input sequences is crucial. While popular Transformer-based predictive models can perform well, their quadratic computational complexity results in inefficiency and high overhead. The recently emerged Mamba, a selective state space model, has shown promising results in many fields due to its strong temporal… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  36. arXiv:2407.14192  [pdf, other

    cs.CL cs.AI

    LeKUBE: A Legal Knowledge Update BEnchmark

    Authors: Changyue Wang, Weihang Su, Hu Yiran, Qingyao Ai, Yueyue Wu, Cheng Luo, Yiqun Liu, Min Zhang, Shaoping Ma

    Abstract: Recent advances in Large Language Models (LLMs) have significantly shaped the applications of AI in multiple fields, including the studies of legal intelligence. Trained on extensive legal texts, including statutes and legal documents, the legal LLMs can capture important legal knowledge/concepts effectively and provide important support for downstream legal applications such as legal consultancy.… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  37. arXiv:2407.14009  [pdf, other

    cs.CV

    Scale Disparity of Instances in Interactive Point Cloud Segmentation

    Authors: Chenrui Han, Xuan Yu, Yuxuan Xie, Yili Liu, Sitong Mao, Shunbo Zhou, Rong Xiong, Yue Wang

    Abstract: Interactive point cloud segmentation has become a pivotal task for understanding 3D scenes, enabling users to guide segmentation models with simple interactions such as clicks, therefore significantly reducing the effort required to tailor models to diverse scenarios and new categories. However, in the realm of interactive segmentation, the meaning of instance diverges from that in instance segmen… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted by 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems

  38. arXiv:2407.13801  [pdf, other

    physics.comp-ph physics.ao-ph

    Application of a spectral scheme to simulate horizontally slowly varying three-dimensional ocean acoustic propagation

    Authors: Houwang Tu, Yongxian Wang, Xiaolan Zhou, Guojun Xu, Dongbao Gao, Shuqing Ma

    Abstract: Three-dimensional numerical models for underwater sound propagation are popular in computational ocean acoustics. For horizontally slowly varying waveguide environments, an adiabatic mode-parabolic equation hybrid theory can be used for simulation. This theory employs adiabatic modes in the vertical direction, simplifying the solution of the sound pressure to the solution of horizontal refractive… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 34 pages, 16 figures

  39. arXiv:2407.13228  [pdf

    cs.CL cs.CY cs.ET cs.LG

    Evaluating Large Language Models for Anxiety and Depression Classification using Counseling and Psychotherapy Transcripts

    Authors: Junwei Sun, Siqi Ma, Yiran Fan, Peter Washington

    Abstract: We aim to evaluate the efficacy of traditional machine learning and large language models (LLMs) in classifying anxiety and depression from long conversational transcripts. We fine-tune both established transformer models (BERT, RoBERTa, Longformer) and more recent large models (Mistral-7B), trained a Support Vector Machine with feature engineering, and assessed GPT models through prompting. We ob… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  40. arXiv:2407.12867  [pdf, other

    astro-ph.HE gr-qc

    Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run

    Authors: Gayathri Raman, Samuele Ronchini, James Delaunay, Aaron Tohuvavohu, Jamie A. Kennea, Tyler Parsotan, Elena Ambrosi, Maria Grazia Bernardini, Sergio Campana, Giancarlo Cusumano, Antonino D'Ai, Paolo D'Avanzo, Valerio D'Elia, Massimiliano De Pasquale, Simone Dichiara, Phil Evans, Dieter Hartmann, Paul Kuin, Andrea Melandri, Paul O'Brien, Julian P. Osborne, Kim Page, David M. Palmer, Boris Sbarufatti, Gianpiero Tagliaferri , et al. (1797 additional authors not shown)

    Abstract: We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wav… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 50 pages, 10 figures, 4 tables

  41. arXiv:2407.12070  [pdf, other

    cs.LG cs.AI

    Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment

    Authors: Yuhao Ji, Chao Fang, Shaobo Ma, Haikuo Shao, Zhongfeng Wang

    Abstract: Transformer models have revolutionized AI tasks, but their large size hinders real-world deployment on resource-constrained and latency-critical edge devices. While binarized Transformers offer a promising solution by significantly reducing model size, existing approaches suffer from algorithm-hardware mismatches with limited co-design exploration, leading to suboptimal performance on edge devices… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by ICCAD 2024

  42. arXiv:2407.11449  [pdf, other

    cs.CV cs.AI

    Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights

    Authors: Shunqi Mao, Chaoyi Zhang, Hang Su, Hwanjun Song, Igor Shalyminov, Weidong Cai

    Abstract: Contextualized Image Captioning (CIC) evolves traditional image captioning into a more complex domain, necessitating the ability for multimodal reasoning. It aims to generate image captions given specific contextual information. This paper further introduces a novel domain of Controllable Contextualized Image Captioning (Ctrl-CIC). Unlike CIC, which solely relies on broad context, Ctrl-CIC accentu… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  43. arXiv:2407.11372  [pdf, other

    cs.CR cs.CV

    UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

    Authors: Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Hanxi Guo, Shiqing Ma, Xiangyu Zhang

    Abstract: Deep neural networks (DNNs) have demonstrated effectiveness in various fields. However, DNNs are vulnerable to backdoor attacks, which inject a unique pattern, called trigger, into the input to cause misclassification to an attack-chosen target label. While existing works have proposed various methods to mitigate backdoor effects in poisoned models, they tend to be less effective against recent ad… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: The 18th European Conference on Computer Vision ECCV 2024

  44. arXiv:2407.11282  [pdf, other

    cs.CL

    Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models

    Authors: Qingcheng Zeng, Mingyu Jin, Qinkai Yu, Zhenting Wang, Wenyue Hua, Zihao Zhou, Guangyan Sun, Yanda Meng, Shiqing Ma, Qifan Wang, Felix Juefei-Xu, Kaize Ding, Fan Yang, Ruixiang Tang, Yongfeng Zhang

    Abstract: Large Language Models (LLMs) are employed across various high-stakes domains, where the reliability of their outputs is crucial. One commonly used method to assess the reliability of LLMs' responses is uncertainty estimation, which gauges the likelihood of their answers being correct. While many studies focus on improving the accuracy of uncertainty estimations for LLMs, our research investigates… ▽ More

    Submitted 19 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  45. arXiv:2407.10969  [pdf, other

    cs.CL cs.LG

    Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

    Authors: Hongyu Wang, Shuming Ma, Ruiping Wang, Furu Wei

    Abstract: We introduce, Q-Sparse, a simple yet effective approach to training sparsely-activated large language models (LLMs). Q-Sparse enables full sparsity of activations in LLMs which can bring significant efficiency gains in inference. This is achieved by applying top-K sparsification to the activations and the straight-through-estimator to the training. We also introduce Block Q-Sparse for batch traini… ▽ More

    Submitted 24 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: Work in progress

  46. arXiv:2407.10805  [pdf, other

    cs.CL cs.AI

    Think-on-Graph 2.0: Deep and Interpretable Large Language Model Reasoning with Knowledge Graph-guided Retrieval

    Authors: Shengjie Ma, Chengjin Xu, Xuhui Jiang, Muzhi Li, Huaren Qu, Jian Guo

    Abstract: Retrieval-augmented generation (RAG) has significantly advanced large language models (LLMs) by enabling dynamic information retrieval to mitigate knowledge gaps and hallucinations in generated content. However, these systems often falter with complex reasoning and consistency across diverse queries. In this work, we present Think-on-Graph 2.0, an enhanced RAG framework that aligns questions with… ▽ More

    Submitted 6 August, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  47. arXiv:2407.10131  [pdf, other

    cs.CV

    WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

    Authors: Xinjian Wu, Ruisong Zhang, Jie Qin, Shijie Ma, Cheng-Lin Liu

    Abstract: Segmenting and recognizing diverse object parts is crucial in computer vision and robotics. Despite significant progress in object segmentation, part-level segmentation remains underexplored due to complex boundaries and scarce annotated data. To address this, we propose a novel Weakly-supervised Part Segmentation (WPS) setting and an approach called WPS-SAM, built on the large-scale pre-trained v… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  48. arXiv:2407.10046  [pdf, other

    cond-mat.str-el

    Non-Hermitian dynamics of Cooper pair splitter

    Authors: E. S. Ma, Z. Song

    Abstract: We propose a non-Hermitian model for Cooper pair splitters, in which the process of electron tunneling into electrodes is characterized by non-Hermitian terms. We find that across a broad range of parameters, the energy levels consistently remain real, and coalescing states are always present. The Coulomb repulsion between electrons in a quantum dot affects the order of the coalescing states. This… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  49. arXiv:2407.08919  [pdf, other

    cs.NI cs.ET eess.SP

    Redefinition of Digital Twin and its Situation Awareness Framework Designing Towards Fourth Paradigm for Energy Internet of Things

    Authors: Xing He, Yuezhong Tang, Shuyan Ma, Qian Ai, Fei Tao, Robert Qiu

    Abstract: Traditional knowledge-based situation awareness (SA) modes struggle to adapt to the escalating complexity of today's Energy Internet of Things (EIoT), necessitating a pivotal paradigm shift. In response, this work introduces a pioneering data-driven SA framework, termed digital twin-based situation awareness (DT-SA), aiming to bridge existing gaps between data and demands, and further to enhance S… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 16 pages, 15 figures Accepted by IEEE Transactions on Systems, Man and Cybernetics: Systems

  50. arXiv:2407.08424  [pdf, other

    eess.SP

    Semantic Feature Division Multiple Access for Multi-user Digital Interference Networks

    Authors: Shuai Ma, Chuanhui Zhang, Bin Shen, Youlong Wu, Hang Li, Shiyin Li, Guangming Shi, Naofal Al-Dhahir

    Abstract: With the ever-increasing user density and quality of service (QoS) demand,5G networks with limited spectrum resources are facing massive access challenges. To address these challenges, in this paper, we propose a novel discrete semantic feature division multiple access (SFDMA) paradigm for multi-user digital interference networks. Specifically, by utilizing deep learning technology, SFDMA extracts… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.