Skip to main content

Showing 1–50 of 130 results for author: Jeong, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.02052  [pdf, other

    cs.LG

    Robust Fourier Neural Networks

    Authors: Halyun Jeong, Jihun Han

    Abstract: Fourier embedding has shown great promise in removing spectral bias during neural network training. However, it can still suffer from high generalization errors, especially when the labels or measurements are noisy. We demonstrate that introducing a simple diagonal layer after the Fourier embedding layer makes the network more robust to measurement noise, effectively prompting it to learn sparse F… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 31 pages, 9 figures

    MSC Class: 65T40; 62J02; 68T07

  2. arXiv:2408.07900  [pdf, other

    cs.SI physics.soc-ph

    Network analysis reveals news press landscape and asymmetric user polarization

    Authors: Byunghwee Lee, Hyo-sun Ryu, Jae Kook Lee, Hawoong Jeong, Beom Jun Kim

    Abstract: Unlike traditional media, online news platforms allow users to consume content that suits their tastes and to facilitate interactions with other people. However, as more personalized consumption of information and interaction with like-minded users increase, ideological bias can inadvertently increase and contribute to the formation of echo chambers, reinforcing the polarization of opinions. Altho… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 21 pages, 6 figures

  3. arXiv:2407.18941  [pdf, other

    cs.CV cs.LG

    LEMoN: Label Error Detection using Multimodal Neighbors

    Authors: Haoran Zhang, Aparna Balagopalan, Nassim Oufattole, Hyewon Jeong, Yan Wu, Jiacheng Zhu, Marzyeh Ghassemi

    Abstract: Large repositories of image-caption pairs are essential for the development of vision-language models. However, these datasets are often extracted from noisy data scraped from the web, and contain many mislabeled examples. In order to improve the reliability of downstream models, it is important to identify and filter images with incorrect captions. However, beyond filtering based on image-caption… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  4. arXiv:2407.17493  [pdf, other

    cs.CV cs.AI

    ReDiFine: Reusable Diffusion Finetuning for Mitigating Degradation in the Chain of Diffusion

    Authors: Youngseok Yoon, Dainong Hu, Iain Weissburg, Yao Qin, Haewon Jeong

    Abstract: Diffusion models have achieved tremendous improvements in generative modeling for images, enabling high-quality generation that is indistinguishable by humans from real images. The qualities of images have reached a threshold at which we can reuse synthetic images for training machine learning models again. This attracts the area as it can relieve the high cost of data collection and fundamentally… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 27 page

  5. arXiv:2407.11659  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG

    Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution

    Authors: Francesco Pio Ramunno, Hyun-Jin Jeong, Stefan Hackstein, André Csillaghy, Svyatoslav Voloshynovskiy, Manolis K. Georgoulis

    Abstract: Investigating the solar magnetic field is crucial to understand the physical processes in the solar interior as well as their effects on the interplanetary environment. We introduce a novel method to predict the evolution of the solar line-of-sight (LoS) magnetogram using image-to-image translation with Denoising Diffusion Probabilistic Models (DDPMs). Our approach combines "computer science metri… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Conference paper accepted for an oral presentation to the ESA SPAICE CONFERENCE 17 19 September 2024

  6. arXiv:2407.10164  [pdf, other

    cs.CV

    LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

    Authors: Sanmin Kim, Youngseok Kim, Sihwan Hwang, Hyeonjun Jeong, Dongsuk Kum

    Abstract: Recent advancements in camera-based 3D object detection have introduced cross-modal knowledge distillation to bridge the performance gap with LiDAR 3D detectors, leveraging the precise geometric information in LiDAR point clouds. However, existing cross-modal knowledge distillation methods tend to overlook the inherent imperfections of LiDAR, such as the ambiguity of measurements on distant or occ… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  7. arXiv:2407.03289  [pdf, other

    cs.IT cs.CR cs.LG

    Correlated Privacy Mechanisms for Differentially Private Distributed Mean Estimation

    Authors: Sajani Vithana, Viveck R. Cadambe, Flavio P. Calmon, Haewon Jeong

    Abstract: Differentially private distributed mean estimation (DP-DME) is a fundamental building block in privacy-preserving federated learning, where a central server estimates the mean of $d$-dimensional vectors held by $n$ users while ensuring $(ε,δ)$-DP. Local differential privacy (LDP) and distributed DP with secure aggregation (SecAgg) are the most common notions of DP used in DP-DME settings with an u… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  8. arXiv:2406.12319  [pdf, other

    cs.CL

    PRePair: Pointwise Reasoning Enhance Pairwise Evaluating for Robust Instruction-Following Assessments

    Authors: Hawon Jeong, ChaeHun Park, Jimin Hong, Jaegul Choo

    Abstract: Pairwise evaluation using large language models (LLMs) is widely used for evaluating natural language generation (NLG) tasks. However, the reliability of LLMs is often compromised by biases, such as favoring verbosity and authoritative tone. In the study, we focus on the comparison of two LLM-based evaluation approaches, pointwise and pairwise. Our findings demonstrate that pointwise evaluators ex… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  9. arXiv:2406.07736  [pdf, other

    cs.CL

    MultiPragEval: Multilingual Pragmatic Evaluation of Large Language Models

    Authors: Dojun Park, Jiwoo Lee, Seohyun Park, Hyeyun Jeong, Youngeun Koo, Soonha Hwang, Seonwoo Park, Sungeun Lee

    Abstract: As the capabilities of LLMs expand, it becomes increasingly important to evaluate them beyond basic knowledge assessment, focusing on higher-level language understanding. This study introduces MultiPragEval, a robust test suite designed for the multilingual pragmatic evaluation of LLMs across English, German, Korean, and Chinese. Comprising 1200 question units categorized according to Grice's Coop… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 8 pages, under review

  10. arXiv:2406.07693  [pdf

    cs.CY cs.AI cs.CL cs.LG cs.SI

    A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and Other Sources about the 2024 Outbreak of Measles

    Authors: Nirmalya Thakur, Vanessa Su, Mingchen Shao, Kesha A. Patel, Hongseok Jeong, Victoria Knieling, Andrew Bian

    Abstract: The work of this paper presents a dataset that contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. The dataset is available at https://dx.doi.org/10.21227/40s8-xf63. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder… ▽ More

    Submitted 18 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 19 pages

    ACM Class: I.2.7; I.2.8; I.5.4; K.4.2; H.2.8; I.2.6

  11. arXiv:2406.06796  [pdf, other

    cs.CV cs.AI cs.LG cs.RO eess.SP

    FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors

    Authors: Jason Wu, Ziqi Wang, Xiaomin Ouyang, Ho Lyun Jeong, Colin Samplawski, Lance Kaplan, Benjamin Marlin, Mani Srivastava

    Abstract: Localization is a critical technology for various applications ranging from navigation and surveillance to assisted living. Localization systems typically fuse information from sensors viewing the scene from different perspectives to estimate the target location while also employing multiple modalities for enhanced robustness and accuracy. Recently, such systems have employed end-to-end deep neura… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  12. arXiv:2406.00396  [pdf, other

    cs.LG cond-mat.stat-mech cs.AI stat.ML

    Stochastic Restarting to Overcome Overfitting in Neural Networks with Noisy Labels

    Authors: Youngkyoung Bae, Yeongwoo Song, Hawoong Jeong

    Abstract: Despite its prevalence, giving up and starting over may seem wasteful in many situations such as searching for a target or training deep neural networks (DNNs). Our study, though, demonstrates that restarting from a checkpoint can significantly improve generalization performance when training DNNs with noisy labels. In the presence of noisy labels, DNNs initially learn the general patterns of the… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 21 pages, 10 figures

  13. arXiv:2405.13999  [pdf, other

    cs.CV

    Computer-Vision-Enabled Worker Video Analysis for Motion Amount Quantification

    Authors: Hari Iyer, Neel Macwan, Shenghan Guo, Heejin Jeong

    Abstract: The performance of physical workers is significantly influenced by the quantity of their motions. However, monitoring and assessing these motions is challenging due to the complexities of motion sensing, tracking, and quantification. Recent advancements have utilized in-situ video analysis for real-time observation of worker behaviors, enabling data-driven quantification of motion amounts. Neverth… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  14. arXiv:2405.13946  [pdf, other

    cs.IT

    Coded Computing Meets Quantum Circuit Simulation: Coded Parallel Tensor Network Contraction Algorithm

    Authors: Jin Lee, Sofia Gonzalez-Garcia, Zheng Zhang, Haewon Jeong

    Abstract: Parallel tensor network contraction algorithms have emerged as the pivotal benchmarks for assessing the classical limits of computation, exemplified by Google's demonstration of quantum supremacy through random circuit sampling. However, the massive parallelization of the algorithm makes it vulnerable to computer node failures. In this work, we apply coded computing to a practical parallel tensor… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to ISIT2024

  15. arXiv:2405.00689  [pdf

    cs.RO cs.AI

    Anti-Jamming Path Planning Using GCN for Multi-UAV

    Authors: Haechan Jeong

    Abstract: This paper addresses the increasing significance of UAVs (Unmanned Aerial Vehicles) and the emergence of UAV swarms for collaborative operations in various domains. However, the effectiveness of UAV swarms can be severely compromised by jamming technology, necessitating robust antijamming strategies. While existing methods such as frequency hopping and physical path planning have been explored, th… ▽ More

    Submitted 13 March, 2024; originally announced May 2024.

  16. arXiv:2405.00642  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    From Empirical Observations to Universality: Dynamics of Deep Learning with Inputs Built on Gaussian mixture

    Authors: Jaeyong Bae, Hawoong Jeong

    Abstract: This study broadens the scope of theoretical frameworks in deep learning by delving into the dynamics of neural networks with inputs that demonstrate the structural characteristics to Gaussian Mixture (GM). We analyzed how the dynamics of neural networks under GM-structured inputs diverge from the predictions of conventional theories based on simple Gaussian structures. A revelation of our work is… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 19 pages, 9 figures

  17. arXiv:2404.18826  [pdf, other

    cs.SI

    Winning the Social Media Influence Battle: Uncertainty-Aware Opinions to Understand and Spread True Information via Competitive Influence Maximization

    Authors: Qi Zhang, Lance M. Kaplan, Audun Jøsang, Dong Hyun. Jeong, Feng Chen, Jin-Hee Cho

    Abstract: Competitive Influence Maximization (CIM) involves entities competing to maximize influence in online social networks (OSNs). Current Deep Reinforcement Learning (DRL) methods in CIM rely on simplistic binary opinion models (i.e., an opinion is represented by either 0 or 1) and often overlook the complexity of users' behavioral characteristics and their prior knowledge. We propose a novel DRL-based… ▽ More

    Submitted 29 April, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, submitted to ASONAM 2024

  18. arXiv:2404.15155  [pdf, other

    cs.CL cs.AI cs.LG

    Adaptive Collaboration Strategy for LLMs in Medical Decision Making

    Authors: Yubin Kim, Chanwoo Park, Hyewon Jeong, Yik Siu Chan, Xuhai Xu, Daniel McDuff, Cynthia Breazeal, Hae Won Park

    Abstract: Foundation models have become invaluable in advancing the medical field. Despite their promise, the strategic deployment of LLMs for effective utility in complex medical tasks remains an open question. Our novel framework, Medical Decision-making Agents (MDAgents) aims to address this gap by automatically assigning the effective collaboration structure for LLMs. Assigned solo or group collaboratio… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  19. arXiv:2404.10980  [pdf, other

    cs.CV cs.LG

    Hyper Evidential Deep Learning to Quantify Composite Classification Uncertainty

    Authors: Changbin Li, Kangshuo Li, Yuzhe Ou, Lance M. Kaplan, Audun Jøsang, Jin-Hee Cho, Dong Hyun Jeong, Feng Chen

    Abstract: Deep neural networks (DNNs) have been shown to perform well on exclusive, multi-class classification tasks. However, when different classes have similar visual features, it becomes challenging for human annotators to differentiate them. This scenario necessitates the use of composite class labels. In this paper, we propose a novel framework called Hyper-Evidential Neural Network (HENN) that explic… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: In Proceedings of The Twelfth International Conference on Learning Representations, ICLR 2024

  20. arXiv:2404.07431  [pdf, other

    cs.RO eess.SY

    Parameterized Fast and Safe Tracking (FaSTrack) using Deepreach

    Authors: Hyun Joe Jeong, Zheng Gong, Somil Bansal, Sylvia Herbert

    Abstract: Fast and Safe Tracking (FaSTrack) is a modular framework that provides safety guarantees while planning and executing trajectories in real time via value functions of Hamilton-Jacobi (HJ) reachability. These value functions are computed through dynamic programming, which is notorious for being computationally inefficient. Moreover, the resulting trajectory does not adapt online to the environment,… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures, 1 table, to be published in L4DC

  21. arXiv:2403.19425  [pdf, ps, other

    eess.IV cs.CV

    A Robust Ensemble Algorithm for Ischemic Stroke Lesion Segmentation: Generalizability and Clinical Utility Beyond the ISLES Challenge

    Authors: Ezequiel de la Rosa, Mauricio Reyes, Sook-Lei Liew, Alexandre Hutton, Roland Wiest, Johannes Kaesmacher, Uta Hanning, Arsany Hakim, Richard Zubal, Waldo Valenzuela, David Robben, Diana M. Sima, Vincenzo Anania, Arne Brys, James A. Meakin, Anne Mickan, Gabriel Broocks, Christian Heitkamp, Shengbo Gao, Kongming Liang, Ziji Zhang, Md Mahfuzur Rahman Siddiquee, Andriy Myronenko, Pooya Ashtari, Sabine Van Huffel , et al. (33 additional authors not shown)

    Abstract: Diffusion-weighted MRI (DWI) is essential for stroke diagnosis, treatment decisions, and prognosis. However, image and disease variability hinder the development of generalizable AI algorithms with clinical value. We address this gap by presenting a novel ensemble algorithm derived from the 2022 Ischemic Stroke Lesion Segmentation (ISLES) challenge. ISLES'22 provided 400 patient scans with ischemi… ▽ More

    Submitted 3 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  22. arXiv:2403.17165  [pdf, other

    cs.HC

    Building an Open-Source Community to Enhance Autonomic Nervous System Signal Analysis: DBDP-Autonomic

    Authors: Jessilyn Dunn, Varun Mishra, Md Mobashir Hasan Shandhi, Hayoung Jeong, Natasha Yamane, Yuna Watanabe, Bill Chen, Matthew S. Goodwin

    Abstract: Smartphones and wearable sensors offer an unprecedented ability to collect peripheral psychophysiological signals across diverse timescales, settings, populations, and modalities. However, open-source software development has yet to keep pace with rapid advancements in hardware technology and availability, creating an analytical barrier that limits the scientific usefulness of acquired data. We pr… ▽ More

    Submitted 29 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  23. arXiv:2403.16509  [pdf, other

    cs.LG

    Human Understanding AI Paper Challenge 2024 -- Dataset Design

    Authors: Se Won Oh, Hyuntae Jeong, Jeong Mook Lim, Seungeun Chung, Kyoung Ju Noh

    Abstract: In 2024, we will hold a research paper competition (the third Human Understanding AI Paper Challenge) for the research and development of artificial intelligence technologies to understand human daily life. This document introduces the datasets that will be provided to participants in the competition, and summarizes the issues to consider in data processing and learning model development.

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures

    ACM Class: J.7; E.m

  24. arXiv:2403.15249  [pdf, other

    cs.CV cs.AI cs.LG

    Spectral Motion Alignment for Video Motion Transfer using Diffusion Models

    Authors: Geon Yeong Park, Hyeonho Jeong, Sang Wan Lee, Jong Chul Ye

    Abstract: The evolution of diffusion models has greatly impacted video generation and understanding. Particularly, text-to-video diffusion models (VDMs) have significantly facilitated the customization of input video with target appearance, motion, etc. Despite these advances, challenges persist in accurately distilling motion information from video frames. While existing works leverage the consecutive fram… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Project page: https://geonyeong-park.github.io/spectral-motion-alignment/

  25. arXiv:2403.12675  [pdf, other

    cs.CL

    Pragmatic Competence Evaluation of Large Language Models for Korean

    Authors: Dojun Park, Jiwoo Lee, Hyeyun Jeong, Seohyun Park, Sungeun Lee

    Abstract: The current evaluation of Large Language Models (LLMs) predominantly relies on benchmarks focusing on their embedded knowledge by testing through multiple-choice questions (MCQs), a format inherently suited for automated evaluation. Our study extends this evaluation to explore LLMs' pragmatic competence--a facet previously underexamined before the advent of sophisticated LLMs, specifically in the… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 9 pages, submitted for publication

  26. arXiv:2403.12002  [pdf, other

    cs.CV cs.AI

    DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing

    Authors: Hyeonho Jeong, Jinho Chang, Geon Yeong Park, Jong Chul Ye

    Abstract: Text-driven diffusion-based video editing presents a unique challenge not encountered in image editing literature: establishing real-world motion. Unlike existing video editing approaches, here we focus on score distillation sampling to circumvent the standard reverse diffusion process and initiate optimization from videos that already exhibit natural motion. Our analysis reveals that while video… ▽ More

    Submitted 15 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to ECCV 2024, Project page: https://hyeonho99.github.io/dreammotion/

  27. arXiv:2403.02437  [pdf, other

    cs.LG cs.AI cs.DC

    SoK: Challenges and Opportunities in Federated Unlearning

    Authors: Hyejun Jeong, Shiqing Ma, Amir Houmansadr

    Abstract: Federated learning (FL), introduced in 2017, facilitates collaborative learning between non-trusting parties with no need for the parties to explicitly share their data among themselves. This allows training models on user data while respecting privacy regulations such as GDPR and CPRA. However, emerging privacy requirements may mandate model owners to be able to \emph{forget} some learned data, e… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  28. arXiv:2403.01628  [pdf, ps, other

    cs.LG

    Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium

    Authors: Hyewon Jeong, Sarah Jabbour, Yuzhe Yang, Rahul Thapta, Hussein Mozannar, William Jongwon Han, Nikita Mehandru, Michael Wornow, Vladislav Lialin, Xin Liu, Alejandro Lozano, Jiacheng Zhu, Rafal Dariusz Kocielnik, Keith Harrigian, Haoran Zhang, Edward Lee, Milos Vukadinovic, Aparna Balagopalan, Vincent Jeanselme, Katherine Matton, Ilker Demirel, Jason Fries, Parisa Rashidi, Brett Beaulieu-Jones, Xuhai Orson Xu , et al. (18 additional authors not shown)

    Abstract: The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the \ac{ML4H} community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir… ▽ More

    Submitted 5 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: ML4H 2023, Research Roundtables

  29. arXiv:2403.01204  [pdf, ps, other

    cs.LG math.NA stat.ML

    Stochastic gradient descent for streaming linear and rectified linear systems with Massart noise

    Authors: Halyun Jeong, Deanna Needell, Elizaveta Rebrova

    Abstract: We propose SGD-exp, a stochastic gradient descent approach for linear and ReLU regressions under Massart noise (adversarial semi-random corruption model) for the fully streaming setting. We show novel nearly linear convergence guarantees of SGD-exp to the true parameter with up to $50\%$ Massart corruption rate, and with any corruption rate in the case of symmetric oblivious corruptions. This is t… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: Submitted to a journal

    MSC Class: 65F10; 60-XX

  30. arXiv:2402.16827  [pdf, other

    cs.CL cs.LG

    A Survey on Data Selection for Language Models

    Authors: Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang

    Abstract: A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available data may not be optimal (or feasible), as the quality of available text data can vary. Filtering out data can also decrease the carbon footprint and financial costs of training models by reducing the am… ▽ More

    Submitted 2 August, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Paper list available at https://github.com/alon-albalak/data-selection-survey

  31. arXiv:2402.14136  [pdf, other

    cs.RO cs.LG eess.SP

    GDTM: An Indoor Geospatial Tracking Dataset with Distributed Multimodal Sensors

    Authors: Ho Lyun Jeong, Ziqi Wang, Colin Samplawski, Jason Wu, Shiwei Fang, Lance M. Kaplan, Deepak Ganesan, Benjamin Marlin, Mani Srivastava

    Abstract: Constantly locating moving objects, i.e., geospatial tracking, is essential for autonomous building infrastructure. Accurate and robust geospatial tracking often leverages multimodal sensor fusion algorithms, which require large datasets with time-aligned, synchronized data from various sensor types. However, such datasets are not readily available. Hence, we propose GDTM, a nine-hour dataset for… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  32. arXiv:2402.10482  [pdf, other

    cs.LG stat.ML

    Understanding Self-Distillation and Partial Label Learning in Multi-Class Classification with Label Noise

    Authors: Hyeonsu Jeong, Hye Won Chung

    Abstract: Self-distillation (SD) is the process of training a student model using the outputs of a teacher model, with both models sharing the same architecture. Our study theoretically examines SD in multi-class classification with cross-entropy loss, exploring both multi-round SD and SD with refined teacher outputs, inspired by partial label learning (PLL). By deriving a closed-form solution for the stude… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  33. arXiv:2402.01338  [pdf, other

    cond-mat.stat-mech cond-mat.soft cs.LG physics.bio-ph

    Inferring the Langevin Equation with Uncertainty via Bayesian Neural Networks

    Authors: Youngkyoung Bae, Seungwoong Ha, Hawoong Jeong

    Abstract: Pervasive across diverse domains, stochastic systems exhibit fluctuations in processes ranging from molecular dynamics to climate phenomena. The Langevin equation has served as a common mathematical model for studying such systems, enabling predictions of their temporal evolution and analyses of thermodynamic quantities, including absorbed heat, work done on the system, and entropy production. How… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 30 pages, 17 figures

  34. Security and Privacy Issues and Solutions in Federated Learning for Digital Healthcare

    Authors: Hyejun Jeong, Tai-Myoung Chung

    Abstract: The advent of Federated Learning has enabled the creation of a high-performing model as if it had been trained on a considerable amount of data. A multitude of participants and a server cooperatively train a model without the need for data disclosure or collection. The healthcare industry, where security and privacy are paramount, can substantially benefit from this new learning paradigm, as data… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Journal ref: International Conference on Future Data and Security Engineering (2022) 316-331

  35. arXiv:2401.03846  [pdf, other

    cs.CV cs.LG

    UFO: Unidentified Foreground Object Detection in 3D Point Cloud

    Authors: Hyunjun Choi, Hawook Jeong, Jin Young Choi

    Abstract: In this paper, we raise a new issue on Unidentified Foreground Object (UFO) detection in 3D point clouds, which is a crucial technology in autonomous driving in the wild. UFO detection is challenging in that existing 3D object detectors encounter extremely hard challenges in both 3D localization and Out-of-Distribution (OOD) detection. To tackle these challenges, we suggest a new UFO detection fra… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Under review

  36. arXiv:2312.10308  [pdf, other

    cs.LG

    Event-Based Contrastive Learning for Medical Time Series

    Authors: Hyewon Jeong, Nassim Oufattole, Matthew Mcdermott, Aparna Balagopalan, Bryan Jangeesingh, Marzyeh Ghassemi, Collin Stultz

    Abstract: In clinical practice, one often needs to identify whether a patient is at high risk of adverse outcomes after some key medical event. For example, quantifying the risk of adverse outcomes after an acute cardiovascular event helps healthcare providers identify those patients at the highest risk of poor outcomes; i.e., patients who benefit from invasive therapies that can lower their risk. Assessing… ▽ More

    Submitted 8 August, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at Unifying Representations in Neural Models Workshop in NeurIPS 2023, MLHC 2024

    Journal ref: MLHC 2024

  37. arXiv:2312.00845  [pdf, other

    cs.CV cs.AI cs.LG

    VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models

    Authors: Hyeonho Jeong, Geon Yeong Park, Jong Chul Ye

    Abstract: Text-to-video diffusion models have advanced video generation significantly. However, customizing these models to generate videos with tailored motions presents a substantial challenge. In specific, they encounter hurdles in (a) accurately reproducing motion from a target video, and (b) creating diverse visual variations. For example, straightforward extensions of static image customization method… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Project page: https://video-motion-customization.github.io

  38. arXiv:2311.11943  [pdf, other

    cs.DC cs.IT eess.SY

    Coded Computing for Fault-Tolerant Parallel QR Decomposition

    Authors: Quang Minh Nguyen, Iain Weissburg, Haewon Jeong

    Abstract: QR decomposition is an essential operation for solving linear equations and obtaining least-squares solutions. In high-performance computing systems, large-scale parallel QR decomposition often faces node faults. We address this issue by proposing a fault-tolerant algorithm that incorporates `coded computing' into the parallel Gram-Schmidt method, commonly used for QR decomposition. Coded computin… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    MSC Class: 15A23; 94B05; 65Y05

  39. arXiv:2310.20204  [pdf, other

    cs.LG cs.CL

    General-Purpose Retrieval-Enhanced Medical Prediction Model Using Near-Infinite History

    Authors: Junu Kim, Chaeeun Shim, Bosco Seong Kyu Yang, Chami Im, Sung Yoon Lim, Han-Gil Jeong, Edward Choi

    Abstract: Machine learning (ML) has recently shown promising results in medical predictions using electronic health records (EHRs). However, since ML models typically have a limited capability in terms of input sizes, selecting specific medical events from EHRs for use as input is necessary. This selection process, often relying on expert opinion, can cause bottlenecks in development. We propose Retrieval-E… ▽ More

    Submitted 22 July, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: The source codes corresponding to this paper are available at: https://github.com/starmpcc/REMed

    Journal ref: Machine Learning for Healthcare Conference 2024

  40. arXiv:2310.01580  [pdf, other

    cs.HC cs.AI

    Active Learning on Neural Networks through Interactive Generation of Digit Patterns and Visual Representation

    Authors: Dong H. Jeong, Jin-Hee Cho, Feng Chen, Audun Josang, Soo-Yeon Ji

    Abstract: Artificial neural networks (ANNs) have been broadly utilized to analyze various data and solve different domain problems. However, neural networks (NNs) have been considered a black box operation for years because their underlying computation and meaning are hidden. Due to this nature, users often face difficulties in interpreting the underlying mechanism of the NNs and the benefits of using them.… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  41. arXiv:2310.01107  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

    Authors: Hyeonho Jeong, Jong Chul Ye

    Abstract: Recent endeavors in video editing have showcased promising results in single-attribute editing or style transfer tasks, either by training text-to-video (T2V) models on text-video data or adopting training-free methods. However, when confronted with the complexities of multi-attribute editing scenarios, they exhibit shortcomings such as omitting or overlooking intended attribute changes, modifying… ▽ More

    Submitted 24 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024, Project Page: http://ground-a-video.github.io

  42. arXiv:2310.00398  [pdf, other

    math.OC cs.RO eess.SY

    Optimal Impact Angle Guidance via First-Order Optimization under Nonconvex Constraints

    Authors: Gyubin Park, Jiwoo Choi, Da Hoon Jeong, Jong-Han Kim

    Abstract: Most of the optimal guidance problems can be formulated as nonconvex optimization problems, which can be solved indirectly by relaxation, convexification, or linearization. Although these methods are guaranteed to converge to the global optimum of the modified problems, the obtained solution may not guarantee global optimality or even the feasibility of the original nonconvex problems. In this pap… ▽ More

    Submitted 17 March, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: To appear at 2024 American Control Conference

  43. arXiv:2309.16105  [pdf, other

    cs.IT cs.CR cs.DC cs.LG

    Differentially Private Secure Multiplication: Hiding Information in the Rubble of Noise

    Authors: Viveck R. Cadambe, Ateet Devulapalli, Haewon Jeong, Flavio P. Calmon

    Abstract: We consider the problem of private distributed multi-party multiplication. It is well-established that Shamir secret-sharing coding strategies can enable perfect information-theoretic privacy in distributed computation via the celebrated algorithm of Ben Or, Goldwasser and Wigderson (the "BGW algorithm"). However, perfect privacy and accuracy require an honest majority, that is, $N \geq 2t+1$ comp… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Extended version of papers presented in IEEE ISIT 2022, IEEE ISIT 2023 and TPDP 2023

  44. Exploring Cohesive Subgraphs in Hypergraphs: The (k,g)-core Approach

    Authors: Dahee Kim, Junghoon Kim, Sungsu Lim, Hyun Ji Jeong

    Abstract: Identifying cohesive subgraphs in hypergraphs is a fundamental problem that has received recent attention in data mining and engineering fields. Existing approaches mainly focus on a strongly induced subhypergraph or edge cardinality, overlooking the importance of the frequency of co-occurrence. In this paper, we propose a new cohesive subgraph named (k,g)-core, which considers both neighbour and… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 5 pages

  45. arXiv:2308.16483  [pdf, other

    eess.SP cs.HC cs.LG

    Improving Out-of-Distribution Detection in Echocardiographic View Classication through Enhancing Semantic Features

    Authors: Jaeik Jeon, Seongmin Ha, Yeonggul Jang, Yeonyee E. Yoon, Jiyeon Kim, Hyunseok Jeong, Dawun Jeong, Youngtaek Hong, Seung-Ah Lee Hyuk-Jae Chang

    Abstract: In echocardiographic view classification, accurately detecting out-of-distribution (OOD) data is essential but challenging, especially given the subtle differences between in-distribution and OOD data. While conventional OOD detection methods, such as Mahalanobis distance (MD) are effective in far-OOD scenarios with clear distinctions between distributions, they struggle to discern the less obviou… ▽ More

    Submitted 23 November, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  46. arXiv:2308.14595  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Neural Network Training Strategy to Enhance Anomaly Detection Performance: A Perspective on Reconstruction Loss Amplification

    Authors: YeongHyeon Park, Sungho Kang, Myung Jin Kim, Hyeonho Jeong, Hyunkyu Park, Hyeong Seok Kim, Juneho Yi

    Abstract: Unsupervised anomaly detection (UAD) is a widely adopted approach in industry due to rare anomaly occurrences and data imbalance. A desirable characteristic of an UAD model is contained generalization ability which excels in the reconstruction of seen normal patterns but struggles with unseen anomalies. Recent studies have pursued to contain the generalization capability of their UAD models in rec… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 5 pages, 4 figures, 2 tables

  47. arXiv:2308.10609  [pdf, other

    cs.LG

    ST-RAP: A Spatio-Temporal Framework for Real Estate Appraisal

    Authors: Hojoon Lee, Hawon Jeong, Byungkun Lee, Kyungyup Lee, Jaegul Choo

    Abstract: In this paper, we introduce ST-RAP, a novel Spatio-Temporal framework for Real estate APpraisal. ST-RAP employs a hierarchical architecture with a heterogeneous graph neural network to encapsulate temporal dynamics and spatial relationships simultaneously. Through comprehensive experiments on a large-scale real estate dataset, ST-RAP outperforms previous methods, demonstrating the significant bene… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted to CIKM'23

  48. arXiv:2308.04650  [pdf, other

    cs.LG eess.SP q-bio.QM

    Deep Metric Learning for the Hemodynamics Inference with Electrocardiogram Signals

    Authors: Hyewon Jeong, Collin M. Stultz, Marzyeh Ghassemi

    Abstract: Heart failure is a debilitating condition that affects millions of people worldwide and has a significant impact on their quality of life and mortality rates. An objective assessment of cardiac pressures remains an important method for the diagnosis and treatment prognostication for patients with heart failure. Although cardiac catheterization is the gold standard for estimating central hemodynami… ▽ More

    Submitted 10 September, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Journal ref: MLHC 2023

  49. arXiv:2308.01030  [pdf, other

    cs.LG cs.AI cs.CV

    Three Factors to Improve Out-of-Distribution Detection

    Authors: Hyunjun Choi, JaeHo Chung, Hawook Jeong, Jin Young Choi

    Abstract: In the problem of out-of-distribution (OOD) detection, the usage of auxiliary data as outlier data for fine-tuning has demonstrated encouraging performance. However, previous methods have suffered from a trade-off between classification accuracy (ACC) and OOD detection performance (AUROC, FPR, AUPR). To improve this trade-off, we make three contributions: (i) Incorporating a self-knowledge distill… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: Under review

  50. End-to-End Learnable Multi-Scale Feature Compression for VCM

    Authors: Yeongwoong Kim, Hyewon Jeong, Janghyun Yu, Younhee Kim, Jooyoung Lee, Se Yoon Jeong, Hui Yong Kim

    Abstract: The proliferation of deep learning-based machine vision applications has given rise to a new type of compression, so called video coding for machine (VCM). VCM differs from traditional video coding in that it is optimized for machine vision performance instead of human visual quality. In the feature compression track of MPEG-VCM, multi-scale features extracted from images are subject to compressio… ▽ More

    Submitted 8 August, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 13 pages, accepted by IEEE Transactions on Circuits and Systems for Video Technology