Skip to main content

Showing 51–100 of 1,318 results for author: Wei, J

.
  1. arXiv:2406.06559  [pdf, other

    cs.CL cs.AI cs.LG

    Harnessing Business and Media Insights with Large Language Models

    Authors: Yujia Bao, Ankit Parag Shah, Neeru Narang, Jonathan Rivers, Rajeev Maksey, Lan Guan, Louise N. Barrere, Shelley Evenson, Rahul Basole, Connie Miao, Ankit Mehta, Fabien Boulay, Su Min Park, Natalie E. Pearson, Eldhose Joy, Tiger He, Sumiran Thakur, Koustav Ghosal, Josh On, Phoebe Morrison, Tim Major, Eva Siqi Wang, Gina Escobar, Jiaheng Wei, Tharindu Cyril Weerasooriya , et al. (8 additional authors not shown)

    Abstract: This paper introduces Fortune Analytics Language Model (FALM). FALM empowers users with direct access to comprehensive business analysis, including market trends, company performance metrics, and expert insights. Unlike generic LLMs, FALM leverages a curated knowledge base built from professional journalism, enabling it to deliver precise and in-depth answers to intricate business questions. Users… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  2. arXiv:2406.05688  [pdf, other

    cs.CL cs.AI cs.LG

    Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

    Authors: Cheng Tan, Dongxin Lyu, Siyuan Li, Zhangyang Gao, Jingxuan Wei, Siqi Ma, Zicheng Liu, Stan Z. Li

    Abstract: Large Language Models (LLMs) have demonstrated wide-ranging applications across various fields and have shown significant potential in the academic peer-review process. However, existing applications are primarily limited to static review generation based on submitted papers, which fail to capture the dynamic and iterative nature of real-world peer reviews. In this paper, we reformulate the peer-r… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Under review

  3. arXiv:2406.03880  [pdf, other

    cs.LG cs.AI

    Memorization in deep learning: A survey

    Authors: Jiaheng Wei, Yanjun Zhang, Leo Yu Zhang, Ming Ding, Chao Chen, Kok-Leong Ong, Jun Zhang, Yang Xiang

    Abstract: Deep Learning (DL) powered by Deep Neural Networks (DNNs) has revolutionized various domains, yet understanding the intricacies of DNN decision-making and learning processes remains a significant challenge. Recent investigations have uncovered an interesting memorization phenomenon in which DNNs tend to memorize specific details from examples rather than learning general patterns, affecting model… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2406.03799  [pdf

    cs.CV cs.AI

    Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge

    Authors: Nan Zhang, Xidan Zhang, Jianing Wei, Fangjun Wang, Zhiming Tan

    Abstract: This report describes the winning solution to the WeatherProof Dataset Challenge (CVPR 2024 UG2+ Track 3). Details regarding the challenge are available at https://cvpr2024ug2challenge.github.io/track3.html. We propose an enhanced semantic segmentation pipeline for this challenge. Firstly, we improve semantic segmentation models, using backbone pretrained with Depth Anything to improve UperNet mod… ▽ More

    Submitted 6 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2406.01839  [pdf, other

    physics.ins-det hep-ex

    Simulation of DAMPE silicon microstrip detectors in the $\rm Allpix^{2}$ framework

    Authors: Yu-Xin Cui, Xiang Li, Shen Wang, Chuan Yue, Qiang Wan, Shi-Jun Lei, Guan-Wen Yuan, Yi-Ming Hu, Jia-Ju Wei, Jian-Hua Guo

    Abstract: Silicon strip detectors have been widely utilized in space experiments for gamma-ray and cosmic-ray detections thanks to their high spatial resolution and stable performance. For a silicon micro-strip detector, the Monte Carlo simulation is recognized as a practical and cost-effective approach to verify the detector performance. In this study, a technique for the simulation of the silicon micro-st… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Journal ref: Nuclear Instruments and Methods in Physics Research A 1057 (2023) 168685

  6. arXiv:2406.00993  [pdf

    eess.SP cs.HC q-bio.OT

    Detection of Acetone as a Gas Biomarker for Diabetes Based on Gas Sensor Technology

    Authors: Jiaming Wei, Tong Liu, Jipeng Huang, Xiaowei Li, Yurui Qi, Gangyin Luo

    Abstract: With the continuous development and improvement of medical services, there is a growing demand for improving diabetes diagnosis. Exhaled breath analysis, characterized by its speed, convenience, and non-invasive nature, is leading the trend in diagnostic development. Studies have shown that the acetone levels in the breath of diabetes patients are higher than normal, making acetone a basis for dia… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 14 figures

  7. arXiv:2406.00948  [pdf

    cond-mat.mtrl-sci

    Real-space tilting method for atomic resolution STEM imaging of nanocrystalline materials

    Authors: Jiake Wei, Zhangze Xu, Wenjie Shen, Bin Feng, Ryo Ishikawa, Naoya Shibata, Yuichi Ikuhara, Xuedong Bai

    Abstract: Atomic-resolution scanning transmission electron microscopy (STEM) characterization requires precise tilting of the specimen to high symmetric zone axis, which is usually processed in reciprocal space by following the diffraction patterns. However, for small-sized nanocrystalline materials, their diffraction patterns are too faint to guide the tilting process. Here, a simple and effective tilting… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  8. arXiv:2405.20834  [pdf, other

    cs.CV

    Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning

    Authors: Cheng Tan, Jingxuan Wei, Linzhuang Sun, Zhangyang Gao, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

    Abstract: Large language models equipped with retrieval-augmented generation (RAG) represent a burgeoning field aimed at enhancing answering capabilities by leveraging external knowledge bases. Although the application of RAG with language-only models has been extensively explored, its adaptation into multimodal vision-language models remains nascent. Going beyond mere answer generation, the primary goal of… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Under review

  9. arXiv:2405.19592  [pdf, other

    cs.LG cs.AI cs.CL

    Why Larger Language Models Do In-context Learning Differently?

    Authors: Zhenmei Shi, Junyi Wei, Zhuoyan Xu, Yingyu Liang

    Abstract: Large language models (LLM) have emerged as a powerful tool for AI, with the key ability of in-context learning (ICL), where they can perform well on unseen tasks based on a brief series of task examples without necessitating any adjustments to the model parameters. One recent interesting mysterious observation is that models of different scales may have different ICL behaviors: larger models tend… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  10. arXiv:2405.19266  [pdf, other

    cs.CL

    PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications

    Authors: Dingkang Yang, Jinjie Wei, Dongling Xiao, Shunli Wang, Tong Wu, Gang Li, Mingcheng Li, Shuaibing Wang, Jiawei Chen, Yue Jiang, Qingyao Xu, Ke Li, Peng Zhai, Lihua Zhang

    Abstract: Developing intelligent pediatric consultation systems offers promising prospects for improving diagnostic efficiency, especially in China, where healthcare resources are scarce. Despite recent advances in Large Language Models (LLMs) for Chinese medicine, their performance is sub-optimal in pediatric applications due to inadequate instruction data and vulnerable training procedures. To address the… ▽ More

    Submitted 3 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: A Technical Report on a Chinese Medical Large Language Model

  11. arXiv:2405.16849  [pdf, other

    cs.CV

    Sync4D: Video Guided Controllable Dynamics for Physics-Based 4D Generation

    Authors: Zhoujie Fu, Jiacheng Wei, Wenhao Shen, Chaoyue Song, Xiaofeng Yang, Fayao Liu, Xulei Yang, Guosheng Lin

    Abstract: In this work, we introduce a novel approach for creating controllable dynamics in 3D-generated Gaussians using casually captured reference videos. Our method transfers the motion of objects from reference videos to a variety of generated 3D Gaussians across different categories, ensuring precise and customizable motion transfer. We achieve this by employing blend skinning-based non-parametric shap… ▽ More

    Submitted 7 July, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Our project page: https://sync4dphys.github.io/

  12. arXiv:2405.16209  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Analytical photoresponses of Schottky contact MoS2 phototransistors

    Authors: Jianyong Wei, Yumeng Liu, Yizhuo Wang, Kai Li, Zhentao Lian, Maosong Xie, Xinhan Yang, Seyed Saleh Mousavi Khaleghi, Fuxing Dai, Weida Hu, Xuejiao Gao, Rui Yang, Yaping Dan

    Abstract: High-gain photodetectors based on two-dimensional (2D) semiconductors, in particular those in photoconductive mode, have been extensively investigated in the past decade. However, the classical photoconductive theory was derived on two misplaced assumptions. In this work, we established an explicit analytical device model for Schottky contact MoS2 phototransistors that fits well with experimental… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 15 pages, 6 figures

  13. arXiv:2405.16191  [pdf, other

    cs.AI

    Rocket Landing Control with Grid Fins and Path-following using MPC

    Authors: Junhao Yu, Jiarun Wei

    Abstract: In this project, we attempt to optimize a landing trajectory of a rocket. The goal is to minimize the total fuel consumption during the landing process using different techniques. Once the optimal and feasible trajectory is generated using batch approach, we attempt to follow the path using a Model Predictive Control (MPC) based algorithm, called Trajectory Optimizing Path following Estimation fro… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  14. arXiv:2405.15995  [pdf, other

    cs.CV

    Efficient Temporal Action Segmentation via Boundary-aware Query Voting

    Authors: Peiyao Wang, Yuewei Lin, Erik Blasch, Jie Wei, Haibin Ling

    Abstract: Although the performance of Temporal Action Segmentation (TAS) has improved in recent years, achieving promising results often comes with a high computational cost due to dense inputs, complex model structures, and resource-intensive post-processing requirements. To improve the efficiency while keeping the performance, we present a novel perspective centered on per-segment classification. By harne… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures, 11 tables

  15. arXiv:2405.12851  [pdf

    physics.optics physics.chem-ph

    Ultrafast Broadband Strong-Field Tunnelling in Asymmetric Nanogaps for Time-Resolved Nanoscopy

    Authors: Haoqing Ning, Marios Maimaris, Jiewen Wei, Emilie Gérouville, Evangelos Moutoulas, Zhu Meng, Clement Ferchaud, Dmitry Maslennikov, Navendu Mondal, Tong Wang, Colin Chow, Aleksandar P. Ivanov, Joshua B. Edel, Saif A. Haque, Misha Ivanov, Jon P. Marangos, Dimitra G. Georgiadou, Artem A. Bakulin

    Abstract: Femtosecond-fast and nanometre-size pulses of electrons are emerging as unique probes for ultrafast dynamics at the nanoscale. Presently, such pulses are achievable only in highly sophisticated ultrafast electron microscopes or equally complex setups involving few-cycle-pulsed lasers with stable carrier-envelope phase (CEP) and nanotip probes. Here, we show that the generation of femtosecond pulse… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  16. arXiv:2405.12665  [pdf, other

    astro-ph.CO

    Age of massive galaxies at redshift 8

    Authors: M. Lopez-Corredoira, F. Melia, J. -J. Wei, C. -Y. Gao

    Abstract: Recent James Webb Space Telescope (JWST) data analyses have shown that massive red galaxies existed at redshifts $z>6$, a discovery that is difficult to understand in the context of standard cosmology ($Λ$CDM). Here we analyze these observations more deeply by fitting a stellar population model to the optical and near-infrared photometric data. These fits include a main stellar population in addit… ▽ More

    Submitted 22 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: 16 pages, 8 figures, accepted to be published in ApJ

  17. arXiv:2405.12571  [pdf, other

    cs.RO

    iHERO: Interactive Human-oriented Exploration and Supervision Under Scarce Communication

    Authors: Zhuoli Tian, Yuyang Zhang, Jinsheng Wei, Meng Guo

    Abstract: Exploration of unknown scenes before human entry is essential for safety and efficiency in numerous scenarios, e.g., subterranean exploration, reconnaissance, search and rescue missions. Fleets of autonomous robots are particularly suitable for this task, via concurrent exploration, multi-sensory perception and autonomous navigation. Communication however among the robots can be severely restricte… ▽ More

    Submitted 7 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted at RSS 2024

  18. arXiv:2405.12504  [pdf, other

    astro-ph.HE

    SN 2019tua : A Type IIb Supernova with Multiple Bumps in the Light Curves

    Authors: Xin-Bo Huang, Xiang-Gao Wang, Long Li, Li-Ping Xin, Jing Wang, Tian-Ci Zheng, Qi Wang, Hui-Ya Liu, Zi-Min Zhou, Xiao-meng Lu, jian-yan Wei, En-Wei Liang

    Abstract: We present photometric and spectroscopic observations and analysis of the type IIb supernova (SN) SN 2019tua, which exhibits multiple bumps in its declining light curves between 40 and 65 days after discovery. SN 2019tua shows a time to peak of about 25 days similar to other type IIb SNe. Our observations indicate a decrease in its brightness of about 1 magnitude in the 60 days after the peak. At… ▽ More

    Submitted 23 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in ApJ. 24 pages, 17 figures, 6 tables

  19. arXiv:2405.12031  [pdf, other

    cs.SD eess.AS

    Neighborhood Attention Transformer with Progressive Channel Fusion for Speaker Verification

    Authors: Nian Li, Jianguo Wei

    Abstract: Transformer-based architectures for speaker verification typically require more training data than ECAPA-TDNN. Therefore, recent work has generally been trained on VoxCeleb1&2. We propose a backbone network based on self-attention, which can achieve competitive results when trained on VoxCeleb2 alone. The network alternates between neighborhood attention and global attention to capture local and g… ▽ More

    Submitted 29 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures, 3 tables; added github link

  20. arXiv:2405.11826  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To… ▽ More

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  21. arXiv:2405.10663  [pdf, ps, other

    astro-ph.GA astro-ph.CO

    Instability of Circumnuclear Gas Supply as An Origin of "Changing-look" Phenomenon of Supermassive Blackholes

    Authors: J. Wang, D. W. Xu, Xinwu Cao, C. Gao, C. H. Xie, J. Y. Wei

    Abstract: The origin of the "Changing-look" (CL) phenomenon in supermassive black holes (SMBHs) remains an open issue. This study aims to shed light on this phenomenon by focusing on a sample that encompasses all known repeating CL active galactic nuclei (AGNs). Through the identification of a characteristic time scale for the CL phenomenon, it was observed that larger SMBHs possess shorter characteristic t… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 14 pages, 4 figures and 2 tables, accepted by ApJ

  22. Treatment Effect Estimation for User Interest Exploration on Recommender Systems

    Authors: Jiaju Chen, Wenjie Wang, Chongming Gao, Peng Wu, Jianxiong Wei, Qingsong Hua

    Abstract: Recommender systems learn personalized user preferences from user feedback like clicks. However, user feedback is usually biased towards partially observed interests, leaving many users' hidden interests unexplored. Existing approaches typically mitigate the bias, increase recommendation diversity, or use bandit algorithms to balance exploration-exploitation trade-offs. Nevertheless, they fail to… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted to SIGIR 2024

  23. arXiv:2405.07691  [pdf, other

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  24. arXiv:2405.07690  [pdf, ps, other

    math.NA

    Convergence analysis of three semi-discrete numerical schemes for nonlocal geometric flows including perimeter terms

    Authors: Jiang Wei, Su Chunmei, Zhang Ganghui

    Abstract: We present and analyze three distinct semi-discrete schemes for solving nonlocal geometric flows incorporating perimeter terms. These schemes are based on the finite difference method, the finite element method, and the finite element method with a specific tangential motion. We offer rigorous proofs of quadratic convergence under $H^1$-norm for the first scheme and linear convergence under $H^1$-… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 34 pages, 9 figures

    MSC Class: 65M60; 65M12; 35K55

  25. arXiv:2405.06841  [pdf, other

    cs.CV cs.LG

    Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis

    Authors: Guanyu Hu, Eleni Papadopoulou, Dimitrios Kollias, Paraskevi Tzouveli, Jie Wei, Xinyu Yang

    Abstract: The increasing integration of machine learning algorithms in daily life underscores the critical need for fairness and equity in their deployment. As these technologies play a pivotal role in decision-making, addressing biases across diverse subpopulation groups, including age, gender, and race, becomes paramount. Automatic affect analysis, at the intersection of physiology, psychology, and machin… ▽ More

    Submitted 16 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: accepted at IEEE FG 2024

  26. arXiv:2405.05484  [pdf, ps, other

    math.AP

    Critical Mass Phenomena and Blow-up behavior of Ground States in stationary second order Mean-Field Games systems with decreasing cost

    Authors: Marco Cirant, Fanze Kong, Juncheng Wei, Xiaoyu Zeng

    Abstract: This paper is devoted to the study of Mean-field Games (MFG) systems in the mass critical exponent case. We firstly establish the optimal Gagliardo-Nirenberg type inequality associated with the potential-free MFG system. Then, under some mild assumptions on the potential function, we show that there exists a critical mass $M^*$ such that the MFG system admits a least energy solution if and only if… ▽ More

    Submitted 2 August, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 58 pages; appendix was updated

  27. arXiv:2405.00145  [pdf, other

    cs.SE cs.CV

    GUing: A Mobile GUI Search Engine using a Vision-Language Model

    Authors: Jialiang Wei, Anne-Lise Courbis, Thomas Lambolais, Binbin Xu, Pierre Louis Bernard, Gérard Dray, Walid Maalej

    Abstract: App developers use the Graphical User Interface (GUI) of other apps as an important source of inspiration to design and improve their own apps. In recent years, research suggested various approaches to retrieve GUI designs that fit a certain text query from screenshot datasets acquired through automated GUI exploration. However, such text-to-GUI retrieval approaches only leverage the textual infor… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  28. arXiv:2404.19108  [pdf, other

    cs.CV astro-ph.IM eess.IV

    Real-Time Convolutional Neural Network-Based Star Detection and Centroiding Method for CubeSat Star Tracker

    Authors: Hongrui Zhao, Michael F. Lembeck, Adrian Zhuang, Riya Shah, Jesse Wei

    Abstract: Star trackers are one of the most accurate celestial sensors used for absolute attitude determination. The devices detect stars in captured images and accurately compute their projected centroids on an imaging focal plane with subpixel precision. Traditional algorithms for star detection and centroiding often rely on threshold adjustments for star pixel detection and pixel brightness weighting for… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  29. arXiv:2404.18688  [pdf, other

    cs.IT

    Distributed Source Coding for Parametric and Non-Parametric Regression

    Authors: Jiahui Wei, Elsa Dupraz, Philippe Mary

    Abstract: The design of communication systems dedicated to machine learning tasks is one key aspect of goal-oriented communications. In this framework, this article investigates the interplay between data reconstruction and learning from the same compressed observations, particularly focusing on the regression problem. We establish achievable rate-generalization error regions for both parametric and non-par… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  30. arXiv:2404.17154  [pdf, other

    astro-ph.CO astro-ph.HE gr-qc hep-ph

    Cosmology-independent Photon Mass Limits from Localized Fast Radio Bursts by using Artificial Neural Networks

    Authors: Jing-Yu Ran, Bao Wang, Jun-Jie Wei

    Abstract: A hypothetical photon mass, $m_γ$, can produce a frequency-dependent vacuum dispersion of light, which leads to an additional time delay between photons with different frequencies when they propagate through a fixed distance. The dispersion measure--redshift measurements of fast radio bursts (FRBs) have been widely used to constrain the rest mass of the photon. However, all current studies analyze… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, 3 tables. Accepted for publication in Chinese Physics Letters. Invited article to special issue "FAST"

  31. arXiv:2404.16831  [pdf, other

    cs.CV

    The Third Monocular Depth Estimation Challenge

    Authors: Jaime Spencer, Fabio Tosi, Matteo Poggi, Ripudaman Singh Arora, Chris Russell, Simon Hadfield, Richard Bowden, GuangYuan Zhou, ZhengXin Li, Qiang Rao, YiPing Bao, Xiao Liu, Dohyeong Kim, Jinseong Kim, Myunghyun Kim, Mykola Lavreniuk, Rui Li, Qing Mao, Jiang Wu, Yu Zhu, Jinqiu Sun, Yanning Zhang, Suraj Patni, Aradhye Agarwal, Chetan Arora , et al. (16 additional authors not shown)

    Abstract: This paper discusses the results of the third edition of the Monocular Depth Estimation Challenge (MDEC). The challenge focuses on zero-shot generalization to the challenging SYNS-Patches dataset, featuring complex scenes in natural and indoor settings. As with the previous edition, methods can use any form of supervision, i.e. supervised or self-supervised. The challenge received a total of 19 su… ▽ More

    Submitted 27 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in CVPRW2024

  32. arXiv:2404.16425  [pdf, other

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  33. arXiv:2404.16385  [pdf, other

    cs.CV

    Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models

    Authors: Jiawei Chen, Dingkang Yang, Yue Jiang, Mingcheng Li, Jinjie Wei, Xiaolu Hou, Lihua Zhang

    Abstract: In the realm of Medical Visual Language Models (Med-VLMs), the quest for universal efficient fine-tuning mechanisms remains paramount, especially given researchers in interdisciplinary fields are often extremely short of training resources, yet largely unexplored. Given the unique challenges in the medical domain, such as limited data scope and significant domain-specific requirements, evaluating… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  34. arXiv:2404.14827  [pdf, other

    cs.CL

    Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation

    Authors: Jingxuan Wei, Linzhuang Sun, Yichong Leng, Xu Tan, Bihui Yu, Ruifeng Guo

    Abstract: Knowledge distillation, transferring knowledge from a teacher model to a student model, has emerged as a powerful technique in neural machine translation for compressing models or simplifying training targets. Knowledge distillation encompasses two primary methods: sentence-level distillation and token-level distillation. In sentence-level distillation, the student model is trained to align with t… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  35. arXiv:2404.14676  [pdf, other

    cs.CV cs.GR

    DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance

    Authors: Linxuan Xin, Zheng Zhang, Jinfu Wei, Wei Gao, Duan Gao

    Abstract: Prior material creation methods had limitations in producing diverse results mainly because reconstruction-based methods relied on real-world measurements and generation-based methods were trained on relatively small material datasets. To address these challenges, we propose DreamPBR, a novel diffusion-based generative framework designed to create spatially-varying appearance properties guided by… ▽ More

    Submitted 1 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 16 pages, 17 figures

    ACM Class: I.3.0; I.4.9

  36. arXiv:2404.11808  [pdf, other

    astro-ph.IM astro-ph.HE

    Future Perspectives for Gamma-ray Burst Detection from Space

    Authors: Enrico Bozzo, Lorenzo Amati, Wayne Baumgartner, Tzu-Ching Chang, Bertrand Cordier, Nicolas De Angelis, Akihiro Doi, Marco Feroci, Cynthia Froning, Jessica Gaskin, Adam Goldstein, Diego Götz, Jon E. Grove, Sylvain Guiriec, Margarita Hernanz, C. Michelle Hui, Peter Jenke, Daniel Kocevski, Merlin Kole, Chryssa Kouveliotou, Thomas Maccarone, Mark L. McConnell, Hideo Matsuhara, Paul O'Brien, Nicolas Produit , et al. (13 additional authors not shown)

    Abstract: Since their first discovery in the late 1960s, Gamma-ray bursts have attracted an exponentially growing interest from the international community due to their central role in the most highly debated open questions of the modern research of astronomy, astrophysics, cosmology, and fundamental physics. These range from the intimate nuclear composition of high density material within the core of ultra… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted for publication on Universe. Invited review, contribution to the Universe Special Issue "Recent Advances in Gamma Ray Astrophysics and Future Perspectives", P. Romano eds. (https://www.mdpi.com/journal/universe/special_issues/7299902Z97)

  37. arXiv:2404.11151  [pdf, other

    cs.CV

    REACTO: Reconstructing Articulated Objects from a Single Video

    Authors: Chaoyue Song, Jiacheng Wei, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu

    Abstract: In this paper, we address the challenge of reconstructing general articulated 3D objects from a single video. Existing works employing dynamic neural radiance fields have advanced the modeling of articulated objects like humans and animals from videos, but face challenges with piece-wise rigid general articulated objects due to limitations in their deformation models. To tackle this, we propose Qu… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  38. arXiv:2404.11134  [pdf, ps, other

    math.AP

    Co-existence of Type II blow-ups with multiple blow-up rates for five-dimensional heat equation with critical nonlinear boundary conditions

    Authors: Juncheng Wei, Zikai Ye, Xiaoyu Zeng, Qidi Zhang

    Abstract: We consider the following five-dimensional heat equation with critical boundary condition \begin{equation*} \partial_t u=Δu \mbox{ \ in \ } \mathbb{R}_+^5\times (0,T) , \quad -\partial_{x_5}u =|u|^\frac{2}{3}u \mbox{ \ on \ } \pp \mathbb{R}^5_+ \times (0,T) . \end{equation*} Given $\mathfrak{o}$ distinct boundary points $q^{[i]} \in \partial \mathbb{R}_+^5$, and $\mathfrak{o}$ integers… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 59 pages; comments welcome

  39. arXiv:2404.10352  [pdf, other

    cs.HC

    CanvasPic: An Interactive Tool for Freely Generating Facial Images Based on Spatial Layout

    Authors: Jiafu Wei, Chia-Ming Chang, Xi Yang, Takeo Igarashi

    Abstract: In real-world usage, existing GAN image generation tools come up short due to their lack of intuitive interfaces and limited flexibility. To overcome these limitations, we developed CanvasPic, an innovative tool for flexible GAN image generation. Our tool introduces a novel 2D layout design that allows users to intuitively control image attributes based on real-world images. By interacting with th… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  40. arXiv:2404.07503  [pdf, ps, other

    cs.CL

    Best Practices and Lessons Learned on Synthetic Data

    Authors: Ruibo Liu, Jerry Wei, Fangyu Liu, Chenglei Si, Yanzhe Zhang, Jinmeng Rao, Steven Zheng, Daiyi Peng, Diyi Yang, Denny Zhou, Andrew M. Dai

    Abstract: The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs. Synthetic data has emerged as a promising solution by generating artificial data that mimics real-world patterns. This paper provides an overview of synthetic data research, discussing its applications, challeng… ▽ More

    Submitted 10 August, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: In COLM 2024

  41. arXiv:2404.06836  [pdf, other

    cs.CV

    O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

    Authors: Muer Tie, Julong Wei, Zhengjun Wang, Ke Wu, Shansuai Yuan, Kaizhao Zhang, Jie Jia, Jieru Zhao, Zhongxue Gan, Wenchao Ding

    Abstract: Online construction of open-ended language scenes is crucial for robotic applications, where open-vocabulary interactive scene understanding is required. Recently, neural implicit representation has provided a promising direction for online interactive mapping. However, implementing open-vocabulary scene understanding capability into online neural implicit mapping still faces three challenges: lac… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  42. arXiv:2404.06346  [pdf, other

    cond-mat.mtrl-sci

    Hierarchy of Exchange-Correlation Functionals in Computing Lattice Thermal Conductivities of Rocksalt and Zincblende Semiconductors

    Authors: Jiacheng Wei, Zhonghao Xia, Yi Xia, Jiangang He

    Abstract: Lattice thermal conductivity ($κ_{\rm L}$) is a crucial characteristic of crystalline solids with significant implications for practical applications. While the higher order of anharmonicity of phonon gas model is commonly used for explaining extraordinary heat transfer behaviors in crystals, the impact of exchange-correlation (XC) functionals in DFT on describing anharmonicity has been largely ov… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 13 pages, 8 figures

  43. arXiv:2404.04801  [pdf, ps, other

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  44. arXiv:2404.01548  [pdf, other

    cs.CV cs.AI

    mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning

    Authors: Jingxuan Wei, Nan Xu, Guiyong Chang, Yin Luo, BiHui Yu, Ruifeng Guo

    Abstract: In the fields of computer vision and natural language processing, multimodal chart question-answering, especially involving color, structure, and textless charts, poses significant challenges. Traditional methods, which typically involve either direct multimodal processing or a table-to-text conversion followed by language model analysis, have limitations in effectively handling these complex scen… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  45. arXiv:2403.20168  [pdf, other

    eess.IV cs.CV

    Unsupervised Tumor-Aware Distillation for Multi-Modal Brain Image Translation

    Authors: Chuan Huang, Jia Wei, Rui Li

    Abstract: Multi-modal brain images from MRI scans are widely used in clinical diagnosis to provide complementary information from different modalities. However, obtaining fully paired multi-modal images in practice is challenging due to various factors, such as time, cost, and artifacts, resulting in modality-missing brain images. To address this problem, unsupervised multi-modal brain image translation has… ▽ More

    Submitted 24 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: 8 pages, 5 figures. It has been provisionally accepted for IJCNN 2024

  46. arXiv:2403.20159  [pdf, other

    cs.CV

    HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes

    Authors: Ke Wu, Kaizhao Zhang, Zhiwei Zhang, Shanshuai Yuan, Muer Tie, Julong Wei, Zijun Xu, Jieru Zhao, Zhongxue Gan, Wenchao Ding

    Abstract: Online dense mapping of urban scenes forms a fundamental cornerstone for scene understanding and navigation of autonomous vehicles. Recent advancements in mapping methods are mainly based on NeRF, whose rendering speed is too slow to meet online requirements. 3D Gaussian Splatting (3DGS), with its rendering speed hundreds of times faster than NeRF, holds greater potential in online dense mapping.… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  47. arXiv:2403.19060  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Towards Human-Centered Construction Robotics: An RL-Driven Companion Robot For Contextually Assisting Carpentry Workers

    Authors: Yuning Wu, Jiaying Wei, Jean Oh, Daniel Cardoso Llach

    Abstract: In the dynamic construction industry, traditional robotic integration has primarily focused on automating specific tasks, often overlooking the complexity and variability of human aspects in construction workflows. This paper introduces a human-centered approach with a "work companion rover" designed to assist construction workers within their existing practices, aiming to enhance safety and workf… ▽ More

    Submitted 28 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: 8 pages, 9 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  48. arXiv:2403.18802  [pdf, other

    cs.CL cs.AI cs.LG

    Long-form factuality in large language models

    Authors: Jerry Wei, Chengrun Yang, Xinying Song, Yifeng Lu, Nathan Hu, Jie Huang, Dustin Tran, Daiyi Peng, Ruibo Liu, Da Huang, Cosmo Du, Quoc V. Le

    Abstract: Large language models (LLMs) often generate content that contains factual errors when responding to fact-seeking prompts on open-ended topics. To benchmark a model's long-form factuality in open domains, we first use GPT-4 to generate LongFact, a prompt set comprising thousands of questions spanning 38 topics. We then propose that LLM agents can be used as automated evaluators for long-form factua… ▽ More

    Submitted 3 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  49. arXiv:2403.17024  [pdf, ps, other

    nucl-th

    Dark matter effects on the properties of neutron stars: compactness and tidal deformability

    Authors: Hong-Ming Liu, Jin-Biao Wei, Zeng-Hua Li, G. F. Burgio, H. C. Das, H. -J. Schulze

    Abstract: We systematically study the observable properties of dark-matter admixed neutron stars, employing a realistic nuclear EOS in combination with self-interacting fermionic dark matter respecting constraints on the self-interaction cross section. Deviations from universal relations valid for nucleonic neutron stars are analyzed over the whole parameter space of the model and unequivocal signals for th… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 9 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:2307.11313

  50. arXiv:2403.16519  [pdf, ps, other

    cs.SC

    Two Algorithms for Computing Rational Univariate Representations of Zero-Dimensional Ideals with Parameters

    Authors: Dingkang Wang, Jingjing Wei, Fanghui Xiao, Xiaopeng Zheng

    Abstract: Based on the partition of parameter space, two algorithms for computing the rational univariate representation of zero-dimensional ideals with parameters are presented in the paper. Unlike the rational univariate representation of zero-dimensional ideals without parameters, the number of zeros of zero-dimensional ideals with parameters under various specializations is different, which leads to cho… ▽ More

    Submitted 24 July, 2024; v1 submitted 25 March, 2024; originally announced March 2024.