Showing 1–50 of 2,092 results for author: Yu, Z

  1. arXiv:2409.09724  [pdf, other]

    cs.CV

    MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection

    Authors: Yaning Zhang, Tianyi Wang, Zitong Yu, Zan Gao, Linlin Shen, Shengyong Chen

    Abstract: The rapid development of photo-realistic face generation methods has raised significant concerns in society and academia, highlighting the urgent need for robust and generalizable face forgery detection (FFD) techniques. Although existing approaches mainly capture face forgery patterns using image modality, other modalities like fine-grained noises and texts are not fully explored, which limits th…

    Submitted 15 September, 2024; originally announced September 2024.

  2. arXiv:2409.09628  [pdf, other]

    cs.CV cs.AI

    Can Large Language Models Grasp Event Signals? Exploring Pure Zero-Shot Event-based Recognition

    Authors: Zongyou Yu, Qiang Qu, Xiaoming Chen, Chen Wang

    Abstract: Recent advancements in event-based zero-shot object recognition have demonstrated promising results. However, these methods heavily depend on extensive training and are inherently constrained by the characteristics of CLIP. To the best of our knowledge, this research is the first study to explore the understanding capabilities of large language models (LLMs) for event-based visual content. We demo…

    Submitted 15 September, 2024; originally announced September 2024.

  3. arXiv:2409.08681  [pdf, other]

    cs.RO

    SLIM: Scalable and Lightweight LiDAR Mapping in Urban Environments

    Authors: Zehuan Yu, Zhijian Qiao, Wenyi Liu, Huan Yin, Shaojie Shen

    Abstract: LiDAR point cloud maps are extensively utilized on roads for robot navigation due to their high consistency. However, dense point clouds face challenges of high memory consumption and reduced maintainability for long-term operations. In this study, we introduce SLIM, a scalable and lightweight mapping system for long-term LiDAR mapping in urban environments. The system begins by parameterizing str…

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 20 pages, 16 figures

  4. arXiv:2409.08572  [pdf, other]

    cs.CV

    DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

    Authors: Xinxu Ge, Xin Liu, Zitong Yu, Jingang Shi, Chun Qi, Jie Li, Heikki Kälviäinen

    Abstract: Face anti-spoofing (FAS) plays a vital role in preventing face recognition (FR) systems from presentation attacks. Nowadays, FAS systems face the challenge of domain shift, impacting the generalization performance of existing FAS methods. In this paper, we rethink about the inherence of domain shift and deconstruct it into two factors: image style and image quality. Quality influences the purity o…

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: ECCV 24

  5. arXiv:2409.07800  [pdf, ps, other]

    math.PR

    Large deviation inequalities for the nonlinear unbalanced urn model

    Authors: Jianan Shi, Zhenhong Yu, Yu Miao

    Abstract: In the present paper, we consider the two-color nonlinear unbalanced urn model, under a drawing rule reinforced by an $\mathbb{R}^+$-valued concave function and an unbalanced replacement matrix. The large deviation inequalities for the nonlinear unbalanced urn model are established by using the stochastic approximation theory. As an auxiliary theory, we give a specific large deviation inequality f…

    Submitted 12 September, 2024; originally announced September 2024.

    MSC Class: 60F10; 62L20

  6. arXiv:2409.07761  [pdf]

    physics.med-ph

    CTLESS: A scatter-window projection and deep learning-based transmission-less attenuation compensation method for myocardial perfusion SPECT

    Authors: Zitong Yu, Md Ashequr Rahman, Craig K. Abbey, Richard Laforest, Nancy A. Obuchowski, Barry A. Siegel, Abhinav K. Jha

    Abstract: Attenuation compensation (AC), while being beneficial for visual-interpretation tasks in myocardial perfusion imaging (MPI) by SPECT, typically requires the availability of a separate X-ray CT component, leading to additional radiation dose, higher costs, and potentially inaccurate diagnosis due to SPECT/CT misalignment. To address these issues, we developed a method for cardiac SPECT AC using dee…

    Submitted 12 September, 2024; originally announced September 2024.

  7. arXiv:2409.06882  [pdf, other]

    hep-ph hep-ex hep-lat nucl-ex nucl-th

    Azimuthal modulations and extraction of generalized parton distributions

    Authors: Jian-Wei Qiu, Nobuo Sato, Zhite Yu

    Abstract: Azimuthal modulations are crucial for the phenomenological extraction and separation of various generalized parton distributions. We provide a new choice of frame and corresponding formalism to describe the azimuthal distributions, based on the separation of physics occurring at different momentum scales. We demonstrate that this new description is not only well-suited for experimental analysis, b…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 5 pages, 3 figures, plus a 3-page supplemental material

    Report number: JLAB-THY-24-4181

  8. arXiv:2409.06659  [pdf, other]

    quant-ph

    Amortized Stabilizer Rényi Entropy of Quantum Dynamics

    Authors: Chengkai Zhu, Yu-Ao Chen, Zanqiu Shen, Zhiping Liu, Zhan Yu, Xin Wang

    Abstract: Unraveling the secrets of how much nonstabilizerness a quantum dynamic can generate is crucial for harnessing the power of magic states, the essential resources for achieving quantum advantage and realizing fault-tolerant quantum computation. In this work, we introduce the amortized $α$-stabilizer Rényi entropy, a magic monotone for unitary operations that quantifies the nonstabilizerness generati…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 5 + 7 pages, 2 figures

  9. arXiv:2409.06209  [pdf, other]

    cs.LG cs.AI

    Adaptive Transformer Modelling of Density Function for Nonparametric Survival Analysis

    Authors: Xin Zhang, Deval Mehta, Yanan Hu, Chao Zhu, David Darby, Zhen Yu, Daniel Merlo, Melissa Gresle, Anneke Van Der Walt, Helmut Butzkueven, Zongyuan Ge

    Abstract: Survival analysis holds a crucial role across diverse disciplines, such as economics, engineering and healthcare. It empowers researchers to analyze both time-invariant and time-varying data, encompassing phenomena like customer churn, material degradation and various medical outcomes. Given the complexity and heterogeneity of such data, recent endeavors have demonstrated successful integration of…

    Submitted 10 September, 2024; originally announced September 2024.

  10. arXiv:2409.05522  [pdf, other]

    physics.ins-det eess.SY

    Design and Implementation of TAO DAQ System

    Authors: Shuihan Zhang, Chao Chen, Xiaolu Ji, Fei Li, Yu Peng, Fabrizio Petrucci, Yinhui Wu, Zezhong Yu, Tingxuan Zeng, Kejun Zhu

    Abstract: Purpose: The Taishan Antineutrino Observatory (TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO), also known as JUNO-TAO. Located close to one of the reactors of the Taishan Nuclear Power Plant, TAO will measure the antineutrino energy spectrum precisely as a reference spectrum for JUNO. The data acquisition (DAQ) system is designed to acquire data from the TAO…

    Submitted 9 September, 2024; originally announced September 2024.

  11. arXiv:2409.05086  [pdf, other]

    math.OC eess.SY

    Exploring the Optimal Size of Grid-forming Energy Storage in an Off-grid Renewable P2H System under Multi-timescale Energy Management

    Authors: Jie Zhu, Yiwei Qiu, Yangjun Zeng, Yi Zhou, Shi Chen, Tianlei Zang, Buxiang Zhou, Zhipeng Yu, Jin Lin

    Abstract: Utility-scale off-grid renewable power-to-hydrogen systems (OReP2HSs) typically include photovoltaic plants, wind turbines, electrolyzers (ELs), and energy storage systems. As an island system, OReP2HS requires at least one component, generally the battery energy storage system (BESS), that operates for grid-forming control to provide frequency and voltage references and regulate them through tran…

    Submitted 8 September, 2024; originally announced September 2024.

  12. arXiv:2409.04381  [pdf]

    cs.CV

    Enhancing Skin Lesion Diagnosis with Ensemble Learning

    Authors: Xiaoyi Liu, Zhou Yu, Lianghao Tan, Yafeng Yan, Ge Shi

    Abstract: Skin lesions are an increasingly significant medical concern, varying widely in severity from benign to cancerous. Accurate diagnosis is essential for ensuring timely and appropriate treatment. This study examines the implementation of deep learning methods to assist in the diagnosis of skin lesions using the HAM10000 dataset, which contains seven distinct types of lesions. First, we evaluated thr…

    Submitted 6 September, 2024; originally announced September 2024.

  13. arXiv:2409.04240  [pdf, other]

    q-bio.NC cond-mat.dis-nn nlin.CD physics.data-an

    Network reconstruction may not mean dynamics prediction

    Authors: Zhendong Yu, Haiping Huang

    Abstract: With an increasing amount of observations on the dynamics of many complex systems, it is required to reveal the underlying mechanisms behind these complex dynamics, which is fundamentally important in many scientific fields such as climate, financial, ecological, and neural systems. The underlying mechanisms are commonly encoded into network structures, e.g., capturing how constituents interact wi…

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 27 pages, 9 figures

  14. arXiv:2409.03501  [pdf, other]

    cs.CV

    Towards Data-Centric Face Anti-Spoofing: Improving Cross-domain Generalization via Physics-based Data Synthesis

    Authors: Rizhao Cai, Cecelia Soh, Zitong Yu, Haoliang Li, Wenhan Yang, Alex Kot

    Abstract: Face Anti-Spoofing (FAS) research is challenged by the cross-domain problem, where there is a domain gap between the training and testing data. While recent FAS works are mainly model-centric, focusing on developing domain generalization algorithms for improving cross-domain performance, data-centric research for face anti-spoofing, improving generalization from data quality and quantity, is large…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: Accepted by International Journal of Computer Vision (IJCV) in Sept 2024

  15. arXiv:2409.03368  [pdf, other]

    cs.NE

    Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications

    Authors: Tong Bu, Maohua Li, Zhaofei Yu

    Abstract: Spiking Neural Networks (SNNs) have emerged as a promising substitute for Artificial Neural Networks (ANNs) due to their advantages of fast inference and low power consumption. However, the lack of efficient training algorithms has hindered their widespread adoption. Existing supervised learning algorithms for SNNs require significantly more memory and time than their ANN counterparts. Even common…

    Submitted 5 September, 2024; originally announced September 2024.

  16. arXiv:2409.00803  [pdf]

    physics.optics cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph quant-ph

    Broadband light extraction from near-surface NV centers using crystalline-silicon antennas

    Authors: Minjeong Kim, Maryam Zahedian, Wenxin Wu, Chengyu Fang, Zhaoning Yu, Raymond A. Wambold, Shenwei Yin, David A. Czaplewski, Jennifer T. Choy, Mikhail A. Kats

    Abstract: We use crystalline silicon (Si) antennas to efficiently extract broadband single-photon fluorescence from shallow nitrogen-vacancy (NV) centers in diamond into free space. Our design features relatively easy-to-pattern high-index Si resonators on the diamond surface to boost photon extraction by overcoming total internal reflection and Fresnel reflection at the diamond-air interface, and providing…

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: Main text + supplementary

  17. arXiv:2409.00694  [pdf, other]

    cs.CV

    IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for lesion detection of CT images

    Authors: Qiu Guan, Mengjie Pan, Feng Chen, Zhiqiang Yang, Zhongwen Yu, Qianwei Zhou, Haigen Hu

    Abstract: Effective lesion detection in medical image is not only rely on the features of lesion region, but also deeply relative to the surrounding information. However, most current methods have not fully utilize it. What is more, multi-scale feature fusion mechanism of most traditional detectors are unable to transmit detail information without loss, which makes it hard to detect small and boundary ambiguous l…

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: 2024 IJCNN

  18. arXiv:2409.00366  [pdf, other]

    nucl-ex hep-ex hep-ph nucl-th

    Mini-Proceedings of the "Fourth International Workshop on the Extension Project for the J-PARC Hadron Experimental Facility (HEF-ex 2024)"

    Authors: P. Achenbach, K. Aoki, S. Aoki, C. Curceanu, S. Diehl, T. Doi, M. Endo, M. Fujita, T. Fukuda, H. Garcia-Tecocoatzi, L. S. Geng, T. Gunji, C. Hanhart, M. Harada, T. Harada, S. Hayakawa, B. R. He, E. Hiyama, R. Honda, Y. Ichikawa, M. Isaka, D. Jido, A. Jinno, K. Kamada, Y. Kamiya, et al. (36 additional authors not shown)

    Abstract: The mini proceedings of the "Fourth International Workshop on the Extension Project for the J-PARC Hadron Experimental Facility (HEF-ex 2024) [https://kds.kek.jp/event/46965]" held at J-PARC, February 19-21, 2024, are presented. The workshop was devoted to discussing the physics case that connects both the present and the future Hadron Experimental Facility at J-PARC, covering a wide range of topi…

    Submitted 31 August, 2024; originally announced September 2024.

  19. arXiv:2409.00239  [pdf, other]

    quant-ph

    Quantum algorithms for hypergraph simplex finding

    Authors: Zhiying Yu, Shalev Ben-David

    Abstract: We study the quantum query algorithms for simplex finding, a generalization of triangle finding to hypergraphs. This problem satisfies a rank-reduction property: a quantum query algorithm for finding simplices in rank-$r$ hypergraphs can be turned into a faster algorithm for finding simplices in rank-$(r-1)$ hypergraphs. We then show that every nested Johnson graph quantum walk (with any constant…

    Submitted 30 August, 2024; originally announced September 2024.

    Comments: 31 pages

  20. arXiv:2408.16712  [pdf, ps, other]

    hep-th

    Correlators of long strings on AdS$_3\times$S$^3\times$T$^4$

    Authors: Zhe-fei Yu, Cheng Peng

    Abstract: In this work, we calculate correlators of long strings on AdS$_3\times$S$^3\times$T$^4$ with pure NS-NS flux. We first construct physical vertex operators that correspond to long strings. Due to the GSO projection, they depend on the parity of the spectral flow parameter $w$. For a given $w$, we construct the physical operators that have the lowest space-time weights in both the NS and R sector. T…

    Submitted 3 September, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: 57 pages, 4 tables; v2: typos corrected

  21. arXiv:2408.16455  [pdf, other]

    cs.IT eess.SP

    Addressing the Mutual Interference in Uplink ISAC Receivers: A Projection Method

    Authors: Zhiyuan Yu, Hong Ren, Cunhua Pan, Gui Zhou, Ruizhe Wang, Mengyu Liu, Jiangzhou Wang

    Abstract: Dual function radar and communication (DFRC) is a promising research direction within integrated sensing and communication (ISAC), improving hardware and spectrum efficiency by merging sensing and communication (S&C) functionalities into a shared platform. However, the DFRC receiver (DFRC-R) is tasked with both uplink communication signal detection and simultaneously target-related parameter estim…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 5 pages, 3 figures, accepted by IEEE WCL

  22. arXiv:2408.16200  [pdf, other]

    cs.CV cs.AI

    PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View

    Authors: Zichen Yu, Quanli Liu, Wei Wang, Liyong Zhang, Xiaoguang Zhao

    Abstract: Recently, LSS-based multi-view 3D object detection provides an economical and deployment-friendly solution for autonomous driving. However, all the existing LSS-based methods transform multi-view image features into a Cartesian Bird's-Eye-View(BEV) representation, which does not take into account the non-uniform image information distribution and hardly exploits the view symmetry. In this paper, i…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 11 pages, 6 figures

  23. arXiv:2408.16060  [pdf, other]

    astro-ph.HE astro-ph.GA

    The Remarkable X-ray Spectra and Variability of the Ultraluminous Weak-Line Quasar SDSS J1521+5202

    Authors: Shouyi Wang, W. Niel Brandt, Bin Luo, Zhibo Yu, Fan Zou, Qingling Ni, Fabio Vito

    Abstract: We present a focused X-ray and multiwavelength study of the ultraluminous weak-line quasar (WLQ) SDSS J1521+5202, one of the few X-ray weak WLQs that is amenable to basic X-ray spectral and variability investigations. J1521+5202 shows striking X-ray variability during 2006--2023, by up to a factor of $\approx 32$ in 0.5--2 keV flux, and our new 2023 Chandra observation caught it in its brightest X…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 11 pages, 3 figures, accepted for publication in ApJ

  24. arXiv:2408.15998  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

    Authors: Min Shi, Fuxiao Liu, Shihao Wang, Shijia Liao, Subhashree Radhakrishnan, De-An Huang, Hongxu Yin, Karan Sapra, Yaser Yacoob, Humphrey Shi, Bryan Catanzaro, Andrew Tao, Jan Kautz, Zhiding Yu, Guilin Liu

    Abstract: The ability to accurately interpret complex visual information is a crucial topic of multimodal large language models (MLLMs). Recent work indicates that enhanced visual perception significantly reduces hallucinations and improves performance on resolution-sensitive tasks, such as optical character recognition and document analysis. A number of recent MLLMs achieve this goal using a mixture of vis…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: Github: https://github.com/NVlabs/Eagle, HuggingFace: https://huggingface.co/NVEagle

  25. arXiv:2408.15881  [pdf, other]

    cs.CV

    LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

    Authors: Fangxun Shu, Yue Liao, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu, Hongsheng Li, Hao Jiang

    Abstract: We introduce LLaVA-MoD, a novel framework designed to enable the efficient training of small-scale Multimodal Language Models (s-MLLM) by distilling knowledge from large-scale MLLM (l-MLLM). Our approach tackles two fundamental challenges in MLLM distillation. First, we optimize the network structure of s-MLLM by integrating a sparse Mixture of Experts (MoE) architecture into the language model, s…

    Submitted 28 August, 2024; originally announced August 2024.

  26. arXiv:2408.15772  [pdf, other]

    cs.IT

    220 GHz Urban Microcell Channel Measurement and Characterization on a University Campus

    Authors: Yuanbo Li, Yiqin Wang, Yejian Lyu, Ziming Yu, Chong Han

    Abstract: Owning abundant bandwidth resources, the Terahertz (THz) band (0.1-10~THz) is envisioned as a key technology to realize ultra-high-speed communications in 6G and beyond wireless networks. To realize reliable THz communications in urban microcell (UMi) environments, propagation analysis and channel characterization are still insufficient. In this paper, channel measurement campaigns are conducted i…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 5 pages, 4 figures, 1 table

  27. arXiv:2408.15126  [pdf, other]

    physics.chem-ph cs.LG physics.comp-ph q-bio.BM

    Force-Guided Bridge Matching for Full-Atom Time-Coarsened Dynamics of Peptides

    Authors: Ziyang Yu, Wenbing Huang, Yang Liu

    Abstract: Molecular Dynamics (MD) simulations are irreplaceable and ubiquitous in fields of materials science, chemistry, pharmacology just to name a few. Conventional MD simulations are plagued by numerical stability as well as long equilibration time issues, which limits broader applications of MD simulations. Recently, a surge of deep learning approaches have been devised for time-coarsened dynamics, whi…

    Submitted 3 September, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

  28. arXiv:2408.14087  [pdf, other]

    cs.CV

    LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection

    Authors: Zhongwen Yu, Qiu Guan, Jianmin Yang, Zhiqiang Yang, Qianwei Zhou, Yang Chen, Feng Chen

    Abstract: In existing medical Region of Interest (ROI) detection, there lacks an algorithm that can simultaneously satisfy both real-time performance and accuracy, not meeting the growing demand for automatic detection in medicine. Although the basic YOLO framework ensures real-time detection due to its fast speed, it still faces challenges in maintaining precision concurrently. To alleviate the above probl…

    Submitted 26 August, 2024; originally announced August 2024.

  29. arXiv:2408.13598  [pdf, other]

    astro-ph.HE astro-ph.IM

    Advancing Gamma-Ray Burst Identification through Transfer Learning with Convolutional Neural Networks

    Authors: Peng Zhang, Bing Li, Ren-zhou Gui, Shao-lin Xiong, Yu Wang, Yan-qiu Zhang, Chen-wei Wang, Jia-cong Liu, Wang-chen Xue, Chao Zheng, Zheng-hang Yu, Wen-long Zhang

    Abstract: The Rapid and accurate identification of Gamma-Ray Bursts (GRBs) is crucial for unraveling their origins. However, current burst search algorithms frequently miss low-threshold signals or lack universality for observations. In this study, we propose a novel approach utilizing transfer learning experiment based on convolutional neural network (CNN) to establish a universal GRB identification method…

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 17 pages, 7 figures

  30. arXiv:2408.13564  [pdf, ps, other]

    physics.geo-ph

    Application of first- and second-order adjoint methods to glacial isostatic adjustment incorporating rotational feedbacks

    Authors: Ziheng Yu, David Al-Attar, Frank Syvret, Andrew J. Lloyd

    Abstract: This paper revisits and extends the adjoint theory for glacial isostatic adjustment (GIA) of Crawford et al. (2018). Rotational feedbacks are now incorporated, and the application of the second-order adjoint method is described for the first time. The first-order adjoint method provides an efficient means for computing sensitivity kernels for a chosen objective functional, while the second-order a…

    Submitted 24 August, 2024; originally announced August 2024.

  31. arXiv:2408.13180  [pdf, other]

    eess.IV cs.CV

    Deep Learning for Lung Disease Classification Using Transfer Learning and a Customized CNN Architecture with Attention

    Authors: Xiaoyi Liu, Zhou Yu, Lianghao Tan

    Abstract: Many people die from lung-related diseases every year. X-ray is an effective way to test if one is diagnosed with a lung-related disease or not. This study concentrates on categorizing three distinct types of lung X-rays: those depicting healthy lungs, those showing lung opacities, and those indicative of viral pneumonia. Accurately diagnosing the disease at an early phase is critical. In this pap…

    Submitted 23 August, 2024; originally announced August 2024.

  32. arXiv:2408.13073  [pdf, other]

    cs.LG

    IntelliCare: Improving Healthcare Analysis with Variance-Controlled Patient-Level Knowledge from Large Language Models

    Authors: Zhihao Yu, Yujie Jin, Yongxin Xu, Xu Chu, Yasha Wang, Junfeng Zhao

    Abstract: While pioneering deep learning methods have made great strides in analyzing electronic health record (EHR) data, they often struggle to fully capture the semantics of diverse medical codes from limited data. The integration of external knowledge from Large Language Models (LLMs) presents a promising avenue for improving healthcare predictions. However, LLM analyses may exhibit significant variance…

    Submitted 23 August, 2024; originally announced August 2024.

  33. arXiv:2408.12834  [pdf, other]

    cs.CL cs.AI

    CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition

    Authors: Yafeng Zhang, Zilan Yu, Yuang Huang, Jing Tang

    Abstract: Few-shot Named Entity Recognition (NER), the task of identifying named entities with only a limited amount of labeled data, has gained increasing significance in natural language processing. While existing methodologies have shown some effectiveness, such as enriching label semantics through various prompting modes or employing metric learning techniques, their performance exhibits limited robustn…

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE

  34. arXiv:2408.12504  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    All-Electrical Layer-Spintronics in Altermagnetic Bilayer

    Authors: Rui Peng, Jin Yang, Wee-Liat Ong, Pin Ho, Chit Siong Lau, Zhi-Ming Yu, Yee Sin Ang

    Abstract: Electrical manipulation of spin-polarized current is highly desirable yet tremendously challenging in developing ultracompact spintronic device technology. Here we propose a scheme to realize the all-electrical manipulation of spin-polarized current in an altermagnetic bilayer. Such a bilayer system can host layer-spin locking, in which one layer hosts a spin-polarized current while the other laye…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 20 pages, 5 figures

  35. arXiv:2408.12497  [pdf]

    physics.optics

    Long-Propagating Ghost Phonon Polaritons Enabled by Selective Mode Excitation

    Authors: Manuka P. Suriyage, Qingyi Zhou, Hao Qin, Xueqian Sun, Zhuoyuan Lu, Stefan A. Maier, Zongfu Yu, Yuerui Lu

    Abstract: The precise control of phonon polaritons(PhPs) is essential for advancements in nanophotonic applications like on-chip optical communication and quantum information processing. Ghost hyperbolic phonon polaritons (g-HPs), which have been recently discovered, feature in-plane hyperbolic dispersion and oblique wavefronts, enabling long-range propagation. Despite their potential, controlling the direc…

    Submitted 25 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

  36. arXiv:2408.12364  [pdf, other]

    cs.CV cs.AI cs.ET

    SAM-SP: Self-Prompting Makes SAM Great Again

    Authors: Chunpeng Zhou, Kangjie Ning, Qianqian Shen, Sheng Zhou, Zhi Yu, Haishuai Wang

    Abstract: The recently introduced Segment Anything Model (SAM), a Visual Foundation Model (VFM), has demonstrated impressive capabilities in zero-shot segmentation tasks across diverse natural image datasets. Despite its success, SAM encounters noticeably performance degradation when applied to specific domains, such as medical images. Current efforts to address this issue have involved fine-tuning strategi…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Under Review

  37. arXiv:2408.12141  [pdf, other]

    cs.CV

    TRRG: Towards Truthful Radiology Report Generation With Cross-modal Disease Clue Enhanced Large Language Model

    Authors: Yuhao Wang, Chao Hao, Yawen Cui, Xinqi Su, Weicheng Xie, Tao Tan, Zitong Yu

    Abstract: The vision-language modeling capability of multi-modal large language models has attracted wide attention from the community. However, in medical domain, radiology report generation using vision-language models still faces significant challenges due to the imbalanced data distribution caused by numerous negated descriptions in radiology reports and issues such as rough alignment between radiology…

    Submitted 22 August, 2024; originally announced August 2024.

  38. arXiv:2408.12093  [pdf, other]

    cs.RO cs.CV

    LLM-enhanced Scene Graph Learning for Household Rearrangement

    Authors: Wenhao Li, Zhiyuan Yu, Qijin She, Zhinan Yu, Yuqing Lan, Chenyang Zhu, Ruizhen Hu, Kai Xu

    Abstract: The household rearrangement task involves spotting misplaced objects in a scene and accommodate them with proper places. It depends both on common-sense knowledge on the objective side and human user preference on the subjective side. In achieving such task, we propose to mine object functionality with user preference alignment directly from the scene itself, without relying on human intervention.…

    Submitted 12 September, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: SIGGRAPH ASIA 2024 conference accepted

  39. arXiv:2408.11557  [pdf, other]

    cs.IR

    A Quick, trustworthy spectral detection Q&A system based on the SDAAP Dataset and large language model

    Authors: Jiheng Liang, Ziru Yu, Zujie Xie, Xiangyang Yu

    Abstract: Large Language Model (LLM) has demonstrated significant success in a range of natural language processing (NLP) tasks within general domain. The emergence of LLM has introduced innovative methodologies across diverse fields, including the natural sciences. Researchers aim to implement automated, concurrent process driven by LLM to supplant conventional manual, repetitive and labor-intensive work.…

    Submitted 23 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: 16 pages, 10 figures, 3 tables

  40. arXiv:2408.11424  [pdf, other]

    cs.CV

    EMO-LLaMA: Enhancing Facial Emotion Understanding with Instruction Tuning

    Authors: Bohao Xing, Zitong Yu, Xin Liu, Kaishen Yuan, Qilang Ye, Weicheng Xie, Huanjing Yue, Jingyu Yang, Heikki Kälviäinen

    Abstract: Facial expression recognition (FER) is an important research topic in emotional artificial intelligence. In recent decades, researchers have made remarkable progress. However, current FER paradigms face challenges in generalization, lack semantic information aligned with natural language, and struggle to process both images and videos within a unified framework, making their application in multimo…

    Submitted 21 August, 2024; originally announced August 2024.

  41. arXiv:2408.10883  [pdf, other]

    cs.AI cs.CV

    DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News Detection

    Authors: Xinqi Su, Yawen Cui, Ajian Liu, Xun Lin, Yuhao Wang, Haochen Liang, Wenhui Li, Zitong Yu

    Abstract: In current web environment, fake news spreads rapidly across online social networks, posing serious threats to society. Existing multimodal fake news detection (MFND) methods can be classified into knowledge-based and semantic-based approaches. However, these methods are overly dependent on human expertise and feedback, lacking flexibility. To address this challenge, we propose a Dynamic Analysis…

    Submitted 20 August, 2024; originally announced August 2024.

  42. arXiv:2408.09856  [pdf, other]

    cs.CL cs.AI

    TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

    Authors: Tianwei Lin, Jiang Liu, Wenqiao Zhang, Zhaocheng Li, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Hao Jiang, Siliang Tang, Yueting Zhuang

    Abstract: While Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have effectively addressed GPU memory constraints during fine-tuning, their performance often falls short, especially in multidimensional task scenarios. To address this issue, one straightforward solution is to introduce task-specific LoRA modules as domain experts, leveraging the modeling of multiple experts' capabilities and thus en…

    Submitted 19 August, 2024; originally announced August 2024.

  43. arXiv:2408.09733  [pdf]

    physics.med-ph

    Orientation independent quantification of macromolecular proton fraction in tissues with suppression of residual dipolar coupling

    Authors: Zijian Gao, Ziqiang Yu, Ziqin Zhou, Jian Hou, Baiyan Jiang, Michael Ong, Weitian Chen

    Abstract: Quantitative magnetization transfer (MT) imaging enables non-invasive characterization of the macromolecular environment of tissues. However, recent work has highlighted that the quantification of MT parameters exhibits orientation dependence in ordered tissue structures, potentially confounding its clinical applications. Notably, in tissues with ordered structures, such as articular cartilage and…

    Submitted 19 August, 2024; originally announced August 2024.

  44. arXiv:2408.07675  [pdf, other]

    cs.CV

    G$^2$V$^2$former: Graph Guided Video Vision Transformer for Face Anti-Spoofing

    Authors: Jingyi Yang, Zitong Yu, Xiuming Ni, Jia He, Hui Li

    Abstract: In videos containing spoofed faces, we may uncover the spoofing evidence based on either photometric or dynamic abnormality, even a combination of both. Prevailing face anti-spoofing (FAS) approaches generally concentrate on the single-frame scenario, however, purely photometric-driven methods overlook the dynamic spoofing clues that may be exposed over time. This may lead FAS systems to conclude…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 11 pages, 5 figures

  45. arXiv:2408.07545  [pdf, other]

    cs.LG cs.AI

    $χ$SPN: Characteristic Interventional Sum-Product Networks for Causal Inference in Hybrid Domains

    Authors: Harsh Poonia, Moritz Willig, Zhongjie Yu, Matej Zečević, Kristian Kersting, Devendra Singh Dhami

    Abstract: Causal inference in hybrid domains, characterized by a mixture of discrete and continuous variables, presents a formidable challenge. We take a step towards this direction and propose Characteristic Interventional Sum-Product Network ($χ$SPN) that is capable of estimating interventional distributions in presence of random variables drawn from mixed distributions. $χ$SPN uses characteristic functio…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 17 pages, 11 figures. Accepted as poster at UAI (Uncertainty in Artificial Intelligence) 2024

  46. arXiv:2408.07255  [pdf, other]

    hep-ph hep-ex nucl-ex nucl-th

    Dihadron azimuthal asymmetry and light-quark dipole moments at the Electron-Ion Collider

    Authors: Xin-Kai Wen, Bin Yan, Zhite Yu, C.-P. Yuan

    Abstract: We propose a novel method to probe light-quark dipole moments by examining the azimuthal asymmetries between a collinear pair of hadrons in semi-inclusive deep inelastic lepton scattering off an unpolarized proton target at the Electron-Ion Collider. These asymmetries provide a means to observe transversely polarized quarks, which arise exclusively from the interference between the dipole and the…

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 6 pages, 2 figures

    Report number: JLAB-THY-24-4136, MSUHEP-24-009

  47. arXiv:2408.06701  [pdf, other]

    cs.NI cs.LG

    DiffSG: A Generative Solver for Network Optimization with Diffusion Model

    Authors: Ruihuai Liang, Bo Yang, Zhiwen Yu, Bin Guo, Xuelin Cao, Mérouane Debbah, H. Vincent Poor, Chau Yuen

    Abstract: Diffusion generative models, famous for their performance in image generation, are popular in various cross-domain applications. However, their use in the communication community has been mostly limited to auxiliary tasks like data modeling and feature extraction. These models hold greater promise for fundamental problems in network optimization compared to traditional machine learning methods. Di…

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 8 pages, 5 figures

  48. arXiv:2408.05813  [pdf, other]

    hep-ph

    $X(4630)$ and $Y(4626)$ production in the $B^+$ and $B_s^0$ decays

    Authors: Zhuo Yu, Qi Wu, Dian-Yong Chen

    Abstract: In the present work, we investigate the production of $X(4630)$ and $Y(4626)$ in $B^+$ and $B_s^0$ decays, where $X(4630)$ and $Y(4626)$ are considered as the $C-$ parity pigeon pair in the $D_{s}^{\ast+} D_{s1}(2536)^-$ molecular frame. The branching fractions of $B^+ \to K^+ X(4630)/Y(4626)$ and $B_s^0 \to \eta X(4630)/Y(4626)$ have been evaluated using an effective Lagrangian approach, which are of…

    Submitted 11 August, 2024; originally announced August 2024.

  49. arXiv:2408.04738  [pdf, other]

    cs.RO

    DiPGrasp: Parallel Local Searching for Efficient Differentiable Grasp Planning

    Authors: Wenqiang Xu, Jieyi Zhang, Tutian Tang, Zhenjun Yu, Yutong Li, Cewu Lu

    Abstract: Grasp planning is an important task for robotic manipulation. Though it is a richly studied area, a standalone, fast, and differentiable grasp planner that can work with robot grippers of different DOFs has not been reported. In this work, we present DiPGrasp, a grasp planner that satisfies all these goals. DiPGrasp takes a force-closure geometric surface matching grasp quality metric. It adopts a…

    Submitted 8 August, 2024; originally announced August 2024.

  50. arXiv:2408.03268  [pdf, other]

    stat.ME

    Regression analysis of elliptically symmetric direction data

    Authors: Zehao Yu, Xianzheng Huang

    Abstract: A comprehensive toolkit is developed for regression analysis of directional data based on a flexible class of angular Gaussian distributions. Informative testing procedures for isotropy and covariate effects on the directional response are proposed. Moreover, a prediction region that achieves the smallest volume in a class of ellipsoidal prediction regions of the same coverage probability is const…

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 17 pages, 4 figures

    MSC Class: 62F03; 62J20