Skip to main content

Showing 51–100 of 1,806 results for author: Han, S

.
  1. arXiv:2407.03753  [pdf

    eess.SP

    Low-Complexity SVM Signal Recovery in Bandwidth-Limited 100Gb/s PAM4 PON Upstream

    Authors: Liyan Wu, Yanlu Huang, Kai Jin, Shangya Han, Kun Xu, Yanni Ou

    Abstract: We proposed a low-complexity SVM-based signal recovery algorithm and evaluated it in 100G-PON with 25G-class devices. For the first time, it experimentally achieved 24 dB power budget @ FEC threshold 1E-3 over 40 km SMF, improving receiver sensitivity over 2 dB compared to FFE&DFE.

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2407.03627  [pdf, other

    cs.CL

    DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation

    Authors: Taeho Hwang, Soyeong Jeong, Sukmin Cho, SeungYoon Han, Jong C. Park

    Abstract: Recent advancements in Large Language Models (LLMs) have significantly improved their performance across various Natural Language Processing (NLP) tasks. However, LLMs still struggle with generating non-factual responses due to limitations in their parametric memory. Retrieval-Augmented Generation (RAG) systems address this issue by incorporating external knowledge with a retrieval module. Despite… ▽ More

    Submitted 8 September, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: 20 pages

    Journal ref: KnowledgeNLP@ACL 2024

  3. arXiv:2407.02848  [pdf, other

    cond-mat.stat-mech

    Efficiency bounds for bipartite information-driven thermodynamic systems

    Authors: Shihao Xia, Shuanglong Han, Ousi Pan, Yuzhuo Pan, Jincan Chen, Shanhe Su

    Abstract: This study introduces a novel approach to derive a lower bound for the entropy production rate of a subsystem by utilizing the Cauchy-Schwarz inequality. It extends to establishing comprehensive upper and lower bounds for the efficiency of two subsystems. These bounds are applicable to a wide range of Markovian stochastic processes, which enhances the accuracy in depicting the range of energy conv… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 2 figures

  4. arXiv:2407.02211  [pdf, other

    cs.CL cs.AI cs.LG

    PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning

    Authors: Jiaru Zou, Mengyu Zhou, Tao Li, Shi Han, Dongmei Zhang

    Abstract: Large language models (LLMs) have played a fundamental role in various natural language processing tasks with powerful prompt techniques. However, in real-world applications, there are often similar prompt components for repeated queries, which causes significant computational burdens during inference. Existing prompt compression and direct fine-tuning methods aim to tackle these challenges, yet t… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2407.01093  [pdf, other

    cs.CL cs.AI cs.MA

    IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation

    Authors: Senyu Han, Lu Chen, Li-Min Lin, Zhengshan Xu, Kai Yu

    Abstract: Large language models have demonstrated their capabilities in storyline creation and human-like character role-playing. Current language model agents mainly focus on reasonable behaviors from the level of individuals, and their behaviors might be hard to constraint on the level of the whole storyline. In this paper we introduce IBSEN, a director-actor coordinate agent framework that generates dram… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted by ACL 2024 Main

  6. arXiv:2407.01076  [pdf

    cond-mat.str-el

    Orbital origin of magnetic moment enhancement induced by charge density wave in kagome FeGe

    Authors: Shulun Han, Linyang Li, Chi Sin Tang, Qi Wang, Lingfeng Zhang, Caozheng Diao, Mingwen Zhao, Shuo Sun, Lijun Tian, Mark B. H. Breese, Chuanbing Cai, Milorad V. Milosevic, Yanpeng Qi, Andrew T. S. Wee, Xinmao Yin

    Abstract: Interactions among various electronic states such as CDW, magnetism, and superconductivity are of high significance in strongly correlated systems. While significant progress has been made in understanding the relationship between CDW and superconductivity, the interplay between CDW and magnetic order remains largely elusive. Kagome lattices, which intertwine nontrivial topology, charge order, and… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  7. arXiv:2407.00896  [pdf, other

    eess.SP cs.AI

    Channel Modeling Aided Dataset Generation for AI-Enabled CSI Feedback: Advances, Challenges, and Solutions

    Authors: Yupeng Li, Gang Li, Zirui Wen, Shuangfeng Han, Shijian Gao, Guangyi Liu, Jiangzhou Wang

    Abstract: The AI-enabled autoencoder has demonstrated great potential in channel state information (CSI) feedback in frequency division duplex (FDD) multiple input multiple output (MIMO) systems. However, this method completely changes the existing feedback strategies, making it impractical to deploy in recent years. To address this issue, this paper proposes a channel modeling aided data augmentation metho… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  8. arXiv:2406.19856  [pdf

    eess.SP

    LUT-boosted CDR and Equalization for Burst-mode 50/100 Gbit/s Bandwidth-limited Flexible PON

    Authors: Yanlu Huang, Liyan Wu, Shangya Han, Kai Jin, Kun Xu, Yanni Ou

    Abstract: We proposed and experimentally demonstrated a look-up table boosted fast CDR and equalization scheme for the burst-mode 50/100 Gbps bandwidth-limited flexible PON, requiring no preamble for convergence and achieved the same bit error rate performance as in the case of long preambles.

    Submitted 28 June, 2024; originally announced June 2024.

  9. arXiv:2406.19135  [pdf, other

    eess.AS cs.AI

    DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability

    Authors: Hyun Joon Park, Jin Sob Kim, Wooseok Shin, Sung Won Han

    Abstract: Expressive Text-to-Speech (TTS) using reference speech has been studied extensively to synthesize natural speech, but there are limitations to obtaining well-represented styles and improving model generalization ability. In this study, we present Diffusion-based EXpressive TTS (DEX-TTS), an acoustic model designed for reference-based speech synthesis with enhanced style representations. Based on a… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Preprint

  10. arXiv:2406.18925  [pdf, other

    cs.CL cs.CV

    Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding

    Authors: Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu

    Abstract: Visual arguments, often used in advertising or social causes, rely on images to persuade viewers to do or believe something. Understanding these arguments requires selective vision: only specific visual stimuli within an image are relevant to the argument, and relevance can only be understood within the context of a broader argumentative structure. While visual arguments are readily appreciated by… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  11. arXiv:2406.18819  [pdf, other

    stat.ME stat.AP

    MultiObjMatch: Matching with Optimal Tradeoffs between Multiple Objectives in R

    Authors: Shichao Han, Samuel D. Pimentel

    Abstract: In an observational study, matching aims to create many small sets of similar treated and control units from initial samples that may differ substantially in order to permit more credible causal inferences. The problem of constructing matched sets may be formulated as an optimization problem, but it can be challenging to specify a single objective function that adequately captures all the design c… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  12. arXiv:2406.18510  [pdf, other

    cs.CL

    WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

    Authors: Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Yejin Choi, Nouha Dziri

    Abstract: We introduce WildTeaming, an automatic LLM safety red-teaming framework that mines in-the-wild user-chatbot interactions to discover 5.7K unique clusters of novel jailbreak tactics, and then composes multiple tactics for systematic exploration of novel jailbreaks. Compared to prior work that performed red-teaming via recruited human workers, gradient-based optimization, or iterative revision with… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  13. arXiv:2406.18495  [pdf, other

    cs.CL

    WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

    Authors: Seungju Han, Kavel Rao, Allyson Ettinger, Liwei Jiang, Bill Yuchen Lin, Nathan Lambert, Yejin Choi, Nouha Dziri

    Abstract: We introduce WildGuard -- an open, light-weight moderation tool for LLM safety that achieves three goals: (1) identifying malicious intent in user prompts, (2) detecting safety risks of model responses, and (3) determining model refusal rate. Together, WildGuard serves the increasing needs for automatic safety moderation and evaluation of LLM interactions, providing a one-stop tool with enhanced a… ▽ More

    Submitted 9 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: First two authors contributed equally. Third and fourth authors contributed equally

  14. arXiv:2406.15734  [pdf, other

    cs.CL cs.AI

    RankAdaptor: Hierarchical Dynamic Low-Rank Adaptation for Structural Pruned LLMs

    Authors: Changhai Zhou, Shijie Han, Shiyang Zhang, Shichao Weng, Zekai Liu, Cheng Jin

    Abstract: The efficient compression of large language models (LLMs) is becoming increasingly popular. However, recovering the accuracy of compressed LLMs is still a major challenge. Structural pruning with standard Low-Rank Adaptation (LoRA) is a common technique in current LLM compression. In structural pruning, the model architecture is modified unevenly, resulting in suboptimal performance in various dow… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  15. arXiv:2406.14703  [pdf, other

    cs.CL cs.AI

    Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

    Authors: Seungbeen Lee, Seungwon Lim, Seungju Han, Giyeong Oh, Hyungjoo Chae, Jiwan Chung, Minju Kim, Beong-woo Kwak, Yeonsoo Lee, Dongha Lee, Jinyoung Yeo, Youngjae Yu

    Abstract: The idea of personality in descriptive psychology, traditionally defined through observable behavior, has now been extended to Large Language Models (LLMs) to better understand their behavior. This raises a question: do LLMs exhibit distinct and consistent personality traits, similar to humans? Existing self-assessment personality tests, while applicable, lack the necessary validity and reliabilit… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preprint; Under review

  16. arXiv:2406.14459  [pdf, other

    cs.CL

    Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models

    Authors: Shijie Han, Zhenyu Zhang, Andrei Arsene Simion

    Abstract: Language models like BERT excel at sentence classification tasks due to extensive pre-training on general data, but their robustness to parameter corruption is unexplored. To understand this better, we look at what happens if a language model is "broken", in the sense that some of its parameters are corrupted and then recovered by fine-tuning. Strategically corrupting BERT variants at different le… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  17. arXiv:2406.14380  [pdf, other

    econ.EM cs.LG stat.ME

    Estimating Treatment Effects under Recommender Interference: A Structured Neural Networks Approach

    Authors: Ruohan Zhan, Shichao Han, Yuchen Hu, Zhenling Jiang

    Abstract: Recommender systems are essential for content-sharing platforms by curating personalized content. To evaluate updates to recommender systems targeting content creators, platforms frequently rely on creator-side randomized experiments. The treatment effect measures the change in outcomes when a new algorithm is implemented compared to the status quo. We show that the standard difference-in-means es… ▽ More

    Submitted 5 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  18. arXiv:2406.12904  [pdf, other

    cs.LG physics.comp-ph physics.optics

    Meent: Differentiable Electromagnetic Simulator for Machine Learning

    Authors: Yongha Kim, Anthony W. Jung, Sanmun Kim, Kevin Octavian, Doyoung Heo, Chaejin Park, Jeongmin Shin, Sunghyun Nam, Chanhyung Park, Juho Park, Sangjun Han, Jinmyoung Lee, Seolho Kim, Min Seok Jang, Chan Y. Park

    Abstract: Electromagnetic (EM) simulation plays a crucial role in analyzing and designing devices with sub-wavelength scale structures such as solar cells, semiconductor devices, image sensors, future displays and integrated photonic devices. Specifically, optics problems such as estimating semiconductor device structures and designing nanophotonic devices provide intriguing research topics with far-reachin… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: under review

  19. arXiv:2406.12874  [pdf, other

    physics.ins-det hep-ex

    The Design, Implementation, and Performance of the LZ Calibration Systems

    Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, E. E. Barillier, J. W. Bargemann, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer , et al. (179 additional authors not shown)

    Abstract: LUX-ZEPLIN (LZ) is a tonne-scale experiment searching for direct dark matter interactions and other rare events. It is located at the Sanford Underground Research Facility (SURF) in Lead, South Dakota, USA. The core of the LZ detector is a dual-phase xenon time projection chamber (TPC), designed with the primary goal of detecting Weakly Interacting Massive Particles (WIMPs) via their induced low e… ▽ More

    Submitted 5 September, 2024; v1 submitted 2 May, 2024; originally announced June 2024.

    Journal ref: JINST 19 P08027 (2024)

  20. arXiv:2406.11961  [pdf, other

    hep-ph hep-ex

    Elaborating Higgs to dimuon decay from gluon fusion by decorrelation and jet substructure

    Authors: Subin Han, Hyung Do Kim

    Abstract: Discovery of the Higgs boson decay to dimuon is anticipated soon based on the current evidence. Precise categorization of the events without affecting the invariant mass shape is crucial in the analysis. Decorrelation of the invariant mass and the output of discriminators (the score of discriminators) is essential for consistent and precise analysis. In this paper we use distance correlation as th… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 25 pages, 7 figures, 7 tables

  21. arXiv:2406.11260  [pdf, other

    cs.CL cs.AI

    Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection

    Authors: Sungwon Park, Sungwon Han, Meeyoung Cha

    Abstract: The spread of fake news negatively impacts individuals and is regarded as a significant social challenge that needs to be addressed. A number of algorithmic and insightful features have been identified for detecting fake news. However, with the recent LLMs and their advanced generation capabilities, many of the detectable features (e.g., style-conversion attacks) can be altered, making it more cha… ▽ More

    Submitted 22 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 8 pages

  22. arXiv:2406.10847  [pdf, other

    cs.AI cs.CE cs.CL cs.MA

    TorchOpera: A Compound AI System for LLM Safety

    Authors: Shanshan Han, Yuhang Yao, Zijian Hu, Dimitris Stripelis, Zhaozhuo Xu, Chaoyang He

    Abstract: We introduce TorchOpera, a compound AI system for enhancing the safety and quality of prompts and responses for Large Language Models. TorchOpera ensures that all user prompts are safe, contextually grounded, and effectively processed, while enhancing LLM responses to be relevant and high quality. TorchOpera utilizes the vector database for contextual grounding, rule-based wrappers for flexible mo… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  23. arXiv:2406.10774  [pdf, other

    cs.CL cs.LG

    Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

    Authors: Jiaming Tang, Yilong Zhao, Kan Zhu, Guangxuan Xiao, Baris Kasikci, Song Han

    Abstract: As the demand for long-context large language models (LLMs) increases, models with context windows of up to 128K or 1M tokens are becoming increasingly prevalent. However, long-context LLM inference is challenging since the inference speed decreases significantly as the sequence length grows. This slowdown is primarily caused by loading a large KV cache during self-attention. Previous works have s… ▽ More

    Submitted 26 August, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  24. arXiv:2406.10537  [pdf, other

    cs.LG cs.AI stat.ML

    Scalable Differentiable Causal Discovery in the Presence of Latent Confounders with Skeleton Posterior (Extended Version)

    Authors: Pingchuan Ma, Rui Ding, Qiang Fu, Jiaru Zhang, Shuai Wang, Shi Han, Dongmei Zhang

    Abstract: Differentiable causal discovery has made significant advancements in the learning of directed acyclic graphs. However, its application to real-world datasets remains restricted due to the ubiquity of latent confounders and the requirement to learn maximal ancestral graphs (MAGs). To date, existing differentiable MAG learning algorithms have been limited to small datasets and failed to scale to lar… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  25. arXiv:2406.09799  [pdf, other

    cs.CY

    GeoSEE: Regional Socio-Economic Estimation With a Large Language Model

    Authors: Sungwon Han, Donghyun Ahn, Seungeon Lee, Minhyuk Song, Sungwon Park, Sangyoon Park, Jihee Kim, Meeyoung Cha

    Abstract: Moving beyond traditional surveys, combining heterogeneous data sources with AI-driven inference models brings new opportunities to measure socio-economic conditions, such as poverty and population, over expansive geographic areas. The current research presents GeoSEE, a method that can estimate various socio-economic indicators using a unified pipeline powered by a large language model (LLM). Pre… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  26. arXiv:2406.09076  [pdf, other

    cs.CL

    3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection

    Authors: Thye Shan Ng, Feiqi Cao, Soyeon Caren Han

    Abstract: Esports has rapidly emerged as a global phenomenon with an ever-expanding audience via platforms, like YouTube. Due to the inherent complexity nature of the game, it is challenging for newcomers to comprehend what the event entails. The chaotic nature of online chat, the fast-paced speech of the game commentator, and the game-specific user interface further compound the difficulty for users in com… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  27. arXiv:2406.08777   

    math.AP

    Finite Time Blowup of Integer- and Fractional-Order Time-Delayed Diffusion Equations

    Authors: Christopher N. Angstmann, Stuart-James M. Burney, Daniel S. Han, Bruce I. Henry, Boris Z. Huang, Zhuang Xu

    Abstract: In this work, exact solutions are derived for an integer- and fractional-order time-delayed diffusion equation with arbitrary initial conditions. The solutions are obtained using Fourier transform methods in conjunction with the known properties of delay functions. It is observed that the solutions do not exhibit infinite speed of propagation for smooth initial conditions that are bounded and posi… ▽ More

    Submitted 3 August, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Errors were discovered in the analysis, significant revisions are being made to the manuscript

    MSC Class: 35R25; 35C10; 34K06; 34K37; 33E20; 42A38

  28. arXiv:2406.08301  [pdf, other

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri , et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  29. arXiv:2406.08020  [pdf, other

    cs.CV

    Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model

    Authors: Kyeongjin Ahn, Sungwon Han, Sungwon Park, Jihee Kim, Sangyoon Park, Meeyoung Cha

    Abstract: The increasing frequency and intensity of natural disasters demand more sophisticated approaches for rapid and precise damage assessment. To tackle this issue, researchers have developed various methods on disaster benchmark datasets from satellite imagery to aid in detecting disaster damage. However, the diverse nature of geographical landscapes and disasters makes it challenging to apply existin… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures, 2 tables

  30. arXiv:2406.05649  [pdf, other

    cs.CV cs.AI

    GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement

    Authors: Peiye Zhuang, Songfang Han, Chaoyang Wang, Aliaksandr Siarohin, Jiaxu Zou, Michael Vasilkovsky, Vladislav Shakhrai, Sergey Korolev, Sergey Tulyakov, Hsin-Ying Lee

    Abstract: We propose a novel approach for 3D mesh reconstruction from multi-view images. Our method takes inspiration from large reconstruction models like LRM that use a transformer-based triplane generator and a Neural Radiance Field (NeRF) model trained on multi-view images. However, in our method, we introduce several important modifications that allow us to significantly enhance 3D reconstruction quali… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 19 pages, 17 figures. Project page: https://snap-research.github.io/GTR/

  31. arXiv:2406.05431  [pdf

    cs.CL

    MaTableGPT: GPT-based Table Data Extractor from Materials Science Literature

    Authors: Gyeong Hoon Yi, Jiwoo Choi, Hyeongyun Song, Olivia Miano, Jaewoong Choi, Kihoon Bang, Byungju Lee, Seok Su Sohn, David Buttler, Anna Hiszpanski, Sang Soo Han, Donghun Kim

    Abstract: Efficiently extracting data from tables in the scientific literature is pivotal for building large-scale databases. However, the tables reported in materials science papers exist in highly diverse forms; thus, rule-based extractions are an ineffective approach. To overcome this challenge, we present MaTableGPT, which is a GPT-based table data extractor from the materials science literature. MaTabl… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  32. arXiv:2406.05086  [pdf, other

    math.OC cs.AI cs.GT

    Robust Reward Design for Markov Decision Processes

    Authors: Shuo Wu, Haoxiang Ma, Jie Fu, Shuo Han

    Abstract: The problem of reward design examines the interaction between a leader and a follower, where the leader aims to shape the follower's behavior to maximize the leader's payoff by modifying the follower's reward function. Current approaches to reward design rely on an accurate model of how the follower responds to reward modifications, which can be sensitive to modeling inaccuracies. To address this… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 50 pages, 8 figures

  33. arXiv:2406.05078  [pdf, other

    cs.IT

    Enhancing LEO Mega-Constellations with Inter-Satellite Links: Vision and Challenges

    Authors: Chenyu Wu, Shuai Han, Qian Chen, Yu Wang, Weixiao Meng, Abderrahim Benslimane

    Abstract: Low Earth orbit (LEO) satellites have been envisioned as a significant component of the sixth generation (6G) network architecture for achieving ubiquitous coverage and seamless access. However, the implementation of LEO satellites is largely restricted by the deployment of ground stations. Inter-satellite links (ISLs) have been regarded as a promising technique to fully exploit the potentials of… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures

  34. arXiv:2406.04639  [pdf, other

    cs.LG cs.AI cs.CV

    Cooperative Meta-Learning with Gradient Augmentation

    Authors: Jongyun Shin, Seunjin Han, Jangho Kim

    Abstract: Model agnostic meta-learning (MAML) is one of the most widely used gradient-based meta-learning, consisting of two optimization loops: an inner loop and outer loop. MAML learns the new task from meta-initialization parameters with an inner update and finds the meta-initialization parameters in the outer loop. In general, the injection of noise into the gradient of the model for augmenting the grad… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted to UAI 2024

  35. arXiv:2406.03274  [pdf, other

    eess.AS cs.AI cs.SD

    Enhancing CTC-based speech recognition with diverse modeling units

    Authors: Shiyi Han, Zhihong Lei, Mingbin Xu, Xingyu Na, Zhen Huang

    Abstract: In recent years, the evolution of end-to-end (E2E) automatic speech recognition (ASR) models has been remarkable, largely due to advances in deep learning architectures like transformer. On top of E2E systems, researchers have achieved substantial accuracy improvement by rescoring E2E model's N-best hypotheses with a phoneme-based model. This raises an interesting question about where the improvem… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  36. arXiv:2406.03019  [pdf, other

    cs.CV

    Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction

    Authors: Pengjie Wang, Kaile Zhang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu

    Abstract: Oracle Bone Inscriptions is one of the oldest existing forms of writing in the world. However, due to the great antiquity of the era, a large number of Oracle Bone Inscriptions (OBI) remain undeciphered, making it one of the global challenges in the field of paleography today. This paper introduces a novel approach, namely Puzzle Pieces Picker (P$^3$), to decipher these enigmatic characters throug… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICDAR 2024

  37. arXiv:2406.02441  [pdf, other

    hep-ex

    Probing the Scalar WIMP-Pion Coupling with the first LUX-ZEPLIN data

    Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, E. E. Barillier, J. W. Bargemann, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. J. Bishop, G. M. Blockinger, B. Boxer , et al. (178 additional authors not shown)

    Abstract: Weakly interacting massive particles (WIMPs) may interact with a virtual pion that is exchanged between nucleons. This interaction channel is important to consider in models where the spin-independent isoscalar channel is suppressed. Using data from the first science run of the LUX-ZEPLIN dark matter experiment, containing 60 live days of data in a 5.5~tonne fiducial mass of liquid xenon, we repor… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  38. arXiv:2406.01963  [pdf

    cond-mat.mes-hall physics.app-ph

    Diamond molecular balance: Revolutionizing high-resolution mass spectrometry from MDa to TDa at room temperature

    Authors: Donggeun Lee, Seung-Woo Jeon, Chang-Hwan Yi, Yang-Hee Kim, Yeeun Choi, Sang-Hun Lee, Jinwoong Cha, Seung-Bo Shim, Junho Suh, Il-Young Kim, Dongyeon Daniel Kang, Hojoong Jung, Cherlhyun Jeong, Jae-pyoung Ahn, Hee Chul Park, Sang-Wook Han, Chulki Kim

    Abstract: The significance of mass spectrometry lies in its unparalleled ability to accurately identify and quantify molecules in complex samples, providing invaluable insights into molecular structures and interactions. Here, we leverage diamond nanostructures as highly sensitive mass sensors by utilizing a self-excitation mechanism under an electron beam in a conventional scanning electron microscope (SEM… ▽ More

    Submitted 25 July, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 16 pages, 4 figures

  39. arXiv:2406.01886  [pdf, other

    econ.TH

    Monotone Equilibrium Design for Matching Markets with Signaling

    Authors: Seungjin Han, Alex Sam, Youngki Shin

    Abstract: We study monotone equilibrium design by a planner who chooses an interval of reactions that receivers take before senders and receivers move in matching markets with signaling. Given the convex efficiency frontier over sender surplus and receiver surplus generated by the interval delegation, the optimal reaction interval crucially depends on the ripple effect of its lower bound and on the trade-of… ▽ More

    Submitted 23 July, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 54 pages, 14 figures

  40. arXiv:2406.01681  [pdf, other

    cond-mat.str-el cond-mat.stat-mech hep-th

    Gross-Neveu-Yukawa theory of $\text{SO}(2N)\rightarrow \text{SO}(N) \times \text{SO}(N)$ spontaneous symmetry breaking

    Authors: SangEun Han, Igor F. Herbut

    Abstract: We construct and study the relativistic Gross-Neveu-Yukawa field theory for the $\text{SO}(2N)$ real symmetric second-rank tensor order parameter coupled to $N_f$ flavors of $4N$-component Majorana fermions in 2+1 dimensions. Such a tensor order parameter unifies all Lorentz-invariant mass-gap orders for $N$ two-component Dirac fermions in two dimensions except for the $\text{SO}(2N)$-singlet anom… ▽ More

    Submitted 13 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 13 pages, 5 figures

  41. arXiv:2406.01339  [pdf, other

    cs.HC cs.OS cs.SE

    Recover as It is Designed to Be: Recovering from Compatibility Mobile App Crashes by Reusing User Flows

    Authors: Donghwi Kim, Hyungjun Yoon, Chang Min Park, Sujin Han, Youngjin Kwon, Steven Y. Ko, Sung-Ju Lee

    Abstract: Android OS is severely fragmented by API updates and device vendors' OS customization, creating a market condition where vastly different OS versions coexist. This gives rise to compatibility crash problems where Android apps crash on certain Android versions but not on others. Although well-known, this problem is extremely challenging for app developers to overcome due to the sheer number of Andr… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  42. arXiv:2406.00897  [pdf, other

    math.AP

    Exact Solutions of a Time-Delay Advection Equation and a Fractional Time-Delay Advection Equation

    Authors: Christopher N. Angstmann, Stuart-James M. Burney, Daniel S. Han, Bruce I. Henry, Boris Z. Huang, Zhuang Xu

    Abstract: Exact solutions are derived for a time-delay advection equation and a fractional-order time-delay advection equation with a time-delay in the spatial derivative. Solutions are obtained, for arbitrary separable initial conditions, by incorporating recently introduced delay functions in a separation of variables approach. Examples are provided showing oscillatory and translatory behaviours fundament… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Letter

    MSC Class: 35C10; 35F10; 34K06; 42A38; 33E20

  43. arXiv:2406.00684  [pdf, other

    cs.CV cs.CL

    Deciphering Oracle Bone Language with Diffusion Models

    Authors: Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu

    Abstract: Originating from China's Shang Dynasty approximately 3,000 years ago, the Oracle Bone Script (OBS) is a cornerstone in the annals of linguistic history, predating many established writing systems. Despite the discovery of thousands of inscriptions, a vast expanse of OBS remains undeciphered, casting a veil of mystery over this ancient language. The emergence of modern AI technologies presents a no… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: ACL2024 main conference long paper

  44. arXiv:2405.20610  [pdf, other

    cs.CV

    Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation

    Authors: Wooseok Shin, Hyun Joon Park, Jin Sob Kim, Sung Won Han

    Abstract: In semi-supervised semantic segmentation, the Mean Teacher- and co-training-based approaches are employed to mitigate confirmation bias and coupling problems. However, despite their high performance, these approaches frequently involve complex training pipelines and a substantial computational burden, limiting the scalability and compatibility of these methods. In this paper, we propose a PrevMatc… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures, submitted to IEEE TPAMI. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  45. arXiv:2405.19521  [pdf, other

    cs.LG stat.ML

    Crowdsourcing with Difficulty: A Bayesian Rating Model for Heterogeneous Items

    Authors: Seong Woo Han, Ozan Adıgüzel, Bob Carpenter

    Abstract: In applied statistics and machine learning, the "gold standards" used for training are often biased and almost always noisy. Dawid and Skene's justifiably popular crowdsourcing model adjusts for rater (coder, annotator) sensitivity and specificity, but fails to capture distributional properties of rating data gathered for training, which in turn biases training. In this study, we introduce a gener… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  46. arXiv:2405.19335  [pdf, other

    cs.CV cs.CL cs.LG

    X-VILA: Cross-Modality Alignment for Large Language Model

    Authors: Hanrong Ye, De-An Huang, Yao Lu, Zhiding Yu, Wei Ping, Andrew Tao, Jan Kautz, Song Han, Dan Xu, Pavlo Molchanov, Hongxu Yin

    Abstract: We introduce X-VILA, an omni-modality model designed to extend the capabilities of large language models (LLMs) by incorporating image, video, and audio modalities. By aligning modality-specific encoders with LLM inputs and diffusion decoders with LLM outputs, X-VILA achieves cross-modality understanding, reasoning, and generation. To facilitate this cross-modality alignment, we curate an effectiv… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Technical Report

  47. arXiv:2405.18919  [pdf, other

    cs.IT

    Exploiting Inter-Satellite Links for In-Flight Connectivity Scheme in Space-Air-Ground Integrated Networks

    Authors: Qian Chen, Chenyu Wu, Shuai Han, Weixiao Meng, Tony Q. S. Quek

    Abstract: Space-air-ground integrated networks (SAGIN) are pivotal for achieving uninterrupted in-flight connectivity (IFC). Most existing studies, however, merely treat satellites as transparent forwarding nodes, and overlook their caching capabilities in enhancing the IFC data rate. In this paper, we consider an IFC-oriented SAGIN, where the satellites collaboratively deliver the content to airborne passe… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  48. arXiv:2405.18698  [pdf, other

    cs.LG cs.AI

    Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

    Authors: Dohyeong Kim, Taehyun Cho, Seungyub Han, Hojun Chung, Kyungjae Lee, Songhwai Oh

    Abstract: The field of risk-constrained reinforcement learning (RCRL) has been developed to effectively reduce the likelihood of worst-case scenarios by explicitly handling risk-measure-based constraints. However, the nonlinearity of risk measures makes it challenging to achieve convergence and optimality. To overcome the difficulties posed by the nonlinearity, we propose a spectral risk measure-constrained… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 26 pages

  49. arXiv:2405.16493  [pdf, other

    cs.CV

    Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception

    Authors: Shuangpeng Han, Ziyu Wang, Mengmi Zhang

    Abstract: Biological motion perception (BMP) refers to humans' ability to perceive and recognize the actions of living beings solely from their motion patterns, sometimes as minimal as those depicted on point-light displays. While humans excel at these tasks without any prior training, current AI models struggle with poor generalization performance. To close this research gap, we propose the Motion Perceive… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  50. arXiv:2405.16234  [pdf, other

    cs.CV

    Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities

    Authors: Shiyu Xia, Junyu Xiong, Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Mengyu Zhou, Yeye He, Shi Han, Dongmei Zhang

    Abstract: This paper explores capabilities of Vision Language Models on spreadsheet comprehension. We propose three self-supervised challenges with corresponding evaluation metrics to comprehensively evaluate VLMs on Optical Character Recognition (OCR), spatial perception, and visual format recognition. Additionally, we utilize the spreadsheet table detection task to assess the overall performance of VLMs b… ▽ More

    Submitted 8 August, 2024; v1 submitted 25 May, 2024; originally announced May 2024.