
Showing 1–50 of 124 results for author: Ng, Y

Searching in archive cs.
  1. arXiv:2408.16296  [pdf, other]

    cs.CV cs.IR

    Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models

    Authors: Kengo Nakata, Daisuke Miyashita, Youyang Ng, Yasuto Hoshi, Jun Deguchi

    Abstract: In this paper, we rethink sparse lexical representations for image retrieval. By utilizing multi-modal large language models (M-LLMs) that support visual prompting, we can extract image features and convert them into textual data, enabling us to utilize efficient sparse retrieval algorithms employed in natural language processing for image retrieval tasks. To assist the LLM in extracting image fea…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: Accepted to ECCV 2024 Workshops: 2nd Workshop on Traditional Computer Vision in the Age of Deep Learning (TradiCV)

  2. arXiv:2408.06891  [pdf]

    cs.AI cs.CE cs.CV cs.LG

    Automatic Feature Recognition and Dimensional Attributes Extraction From CAD Models for Hybrid Additive-Subtractive Manufacturing

    Authors: Muhammad Tayyab Khan, Wenhe Feng, Lequn Chen, Ye Han Ng, Nicholas Yew Jin Tan, Seung Ki Moon

    Abstract: The integration of Computer-Aided Design (CAD), Computer-Aided Process Planning (CAPP), and Computer-Aided Manufacturing (CAM) plays a crucial role in modern manufacturing, facilitating seamless transitions from digital designs to physical products. However, a significant challenge within this integration is the Automatic Feature Recognition (AFR) of CAD models, especially in the context of hybrid…

    Submitted 14 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: 10 pages, 12 figures. This paper has been accepted for presentation at the ASME IDETC-CIE 2024 conference

  3. arXiv:2408.06494  [pdf, other]

    cs.HC cs.CL cs.CV

    What Color Scheme is More Effective in Assisting Readers to Locate Information in a Color-Coded Article?

    Authors: Ho Yin Ng, Zeyu He, Ting-Hao 'Kenneth' Huang

    Abstract: Color coding, a technique assigning specific colors to cluster information types, has proven advantages in aiding human cognitive activities, especially reading and comprehension. The rise of Large Language Models (LLMs) has streamlined document coding, enabling simple automatic text labeling with various schemes. This has the potential to make color-coding more accessible and benefit more users.…

    Submitted 26 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: This paper will appear at IEEE VIS 2024

  4. arXiv:2408.04567  [pdf, other]

    cs.CV cs.GR

    Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches

    Authors: Yongzhi Xu, Yonhon Ng, Yifu Wang, Inkyu Sa, Yunfei Duan, Yang Li, Pan Ji, Hongdong Li

    Abstract: 3D Content Generation is at the heart of many computer graphics applications, including video gaming, film-making, virtual and augmented reality, etc. This paper proposes a novel deep-learning based approach for automatically generating interactive and playable 3D game scenes, all from the user's casual prompts such as a hand-drawn sketch. Sketch-based input offers a natural, and convenient way to…

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Project Page: https://xrvisionlabs.github.io/Sketch2Scene/

  5. arXiv:2408.00131  [pdf, other]

    stat.ML cs.AI cs.LG q-fin.RM

    Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

    Authors: Patrick Kuiper, Ali Hasan, Wenhao Yang, Yuting Ng, Hoda Bidkhori, Jose Blanchet, Vahid Tarokh

    Abstract: The goal of this paper is to develop distributionally robust optimization (DRO) estimators, specifically for multidimensional Extreme Value Theory (EVT) statistics. EVT supports using semi-parametric models called max-stable distributions built from spatial Poisson point processes. While powerful, these models are only asymptotically valid for large samples. However, since extreme data is by defin…

    Submitted 31 July, 2024; originally announced August 2024.

  6. arXiv:2407.21045  [pdf]

    cs.CL cs.AI

    Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research

    Authors: Boyan Xu, Liang Wen, Zihao Li, Yuxing Yang, Guanlan Wu, Xiongpeng Tang, Yu Li, Zihao Wu, Qingxian Su, Xueqing Shi, Yue Yang, Rui Tong, How Yong Ng

    Abstract: Recent advancements in Large Language Models (LLMs) have sparked interest in their potential applications across various fields. This paper embarked on a pivotal inquiry: Can existing LLMs effectively serve as "water expert models" for water engineering and research tasks? This study was the first to evaluate LLMs' contributions across various water engineering and research tasks by establishing a…

    Submitted 22 July, 2024; originally announced July 2024.

  7. arXiv:2406.13434  [pdf, other]

    cs.RO

    Tactile Aware Dynamic Obstacle Avoidance in Crowded Environment with Deep Reinforcement Learning

    Authors: Yung Chuen Ng, Qi Wen Lim, Chun Ye Tan, Zhen Hao Gan, Meng Yee Chuah

    Abstract: Mobile robots operating in crowded environments require the ability to navigate among humans and surrounding obstacles efficiently while adhering to safety standards and socially compliant mannerisms. This scale of the robot navigation problem may be classified as both a local path planning and trajectory optimization problem. This work presents an array of force sensors that act as a tactile laye…

    Submitted 19 June, 2024; originally announced June 2024.

  8. arXiv:2405.09798  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Many-Shot In-Context Learning in Multimodal Foundation Models

    Authors: Yixing Jiang, Jeremy Irvin, Ji Hun Wang, Muhammad Ahmed Chaudhry, Jonathan H. Chen, Andrew Y. Ng

    Abstract: Large language models are well-known to be effective at few-shot in-context learning (ICL). Recent advancements in multimodal foundation models have enabled unprecedentedly long context windows, presenting an opportunity to explore their capability to perform ICL with many more demonstrating examples. In this work, we evaluate the performance of multimodal foundation models scaling from few-shot t…

    Submitted 16 May, 2024; originally announced May 2024.

  9. arXiv:2404.16398  [pdf, other]

    cs.CV

    Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval

    Authors: Ryoya Nara, Yu-Chieh Lin, Yuji Nozawa, Youyang Ng, Goh Itoh, Osamu Torii, Yusuke Matsui

    Abstract: Many image retrieval studies use metric learning to train an image encoder. However, metric learning cannot handle differences in users' preferences, and requires data to train an image encoder. To overcome these limitations, we revisit relevance feedback, a classic technique for interactive retrieval systems, and propose an interactive CLIP-based image retrieval system with relevance feedback. Ou…

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 20 pages, 8 figures

  10. arXiv:2404.09402  [pdf, other]

    cs.LG cs.AI stat.ML

    Neural McKean-Vlasov Processes: Distributional Dependence in Diffusion Processes

    Authors: Haoming Yang, Ali Hasan, Yuting Ng, Vahid Tarokh

    Abstract: McKean-Vlasov stochastic differential equations (MV-SDEs) provide a mathematical description of the behavior of an infinite number of interacting particles by imposing a dependence on the particle density. As such, we study the influence of explicitly including distributional information in the parameterization of the SDE. We propose a series of semi-parametric methods for representing MV-SDEs, an…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Appears in AISTATS 2024

  11. arXiv:2402.11412  [pdf, other]

    cs.RO

    Predicting Maximum Permitted Process Forces for Object Grasping and Manipulation Using a Deep Learning Regression Model

    Authors: S. Wucherer, R. McMurray, K. Y. Ng, F. Kerber

    Abstract: During the execution of handling processes in manufacturing, it is difficult to measure the process forces with state-of-the-art gripper systems since they usually lack integrated sensors. Thus, the exact state of the gripped object and the actuating process forces during manipulation and handling are unknown. This paper proposes a deep learning regression model to construct a continuous stability…

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures, 3 tables, to be submitted as a conference paper to IEEE CCTA2024

  12. arXiv:2402.10083  [pdf]

    cs.AI

    Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots in Ophthalmology and LLM-based evaluation using GPT-4

    Authors: Ting Fang Tan, Kabilan Elangovan, Liyuan Jin, Yao Jie, Li Yong, Joshua Lim, Stanley Poh, Wei Yan Ng, Daniel Lim, Yuhe Ke, Nan Liu, Daniel Shu Wei Ting

    Abstract: Purpose: To assess the alignment of GPT-4-based evaluation to human clinician experts, for the evaluation of responses to ophthalmology-related patient queries generated by fine-tuned LLM chatbots. Methods: 400 ophthalmology questions and paired answers were created by ophthalmologists to represent commonly asked patient questions, divided into fine-tuning (368; 92%), and testing (40; 8%). We find…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 13 Pages, 1 Figure, 8 Tables

  13. arXiv:2402.08788  [pdf]

    cs.CL cs.SD eess.AS

    Syllable based DNN-HMM Cantonese Speech to Text System

    Authors: Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T. Y. Ng

    Abstract: This paper reports our work on building up a Cantonese Speech-to-Text (STT) system with a syllable based acoustic model. This is a part of an effort in building a STT system to aid dyslexic students who have cognitive deficiency in writing skills but have no problem expressing their ideas through speech. For Cantonese speech recognition, the basic unit of acoustic models can either be the conventi…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages, 3 figures, LREC 2016

    MSC Class: 94-06; ACM Class: I.2.7

  14. arXiv:2401.14486  [pdf, other]

    cs.CV cs.LG

    CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

    Authors: Muhammad Ahmed Chaudhry, Lyna Kim, Jeremy Irvin, Yuzu Ido, Sonia Chu, Jared Thomas Isobe, Andrew Y. Ng, Duncan Watson-Parris

    Abstract: Clouds play a significant role in global temperature regulation through their effect on planetary albedo. Anthropogenic emissions of aerosols can alter the albedo of clouds, but the extent of this effect, and its consequent impact on temperature change, remains uncertain. Human-induced clouds caused by ship aerosol emissions, commonly referred to as ship tracks, provide visible manifestations of t…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 11 pages, 5 figures, submitted to Journal of Machine Learning Research

  15. arXiv:2312.02200  [pdf, other]

    cs.CV cs.AI stat.AP

    An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets

    Authors: Maya Srikanth, Jeremy Irvin, Brian Wesley Hill, Felipe Godoy, Ishan Sabane, Andrew Y. Ng

    Abstract: Major advancements in computer vision can primarily be attributed to the use of labeled datasets. However, acquiring labels for datasets often results in errors which can harm model performance. Recent works have proposed methods to automatically identify mislabeled images, but developing strategies to effectively implement them in real world datasets has been sparsely explored. Towards improved d…

    Submitted 2 December, 2023; originally announced December 2023.

  16. arXiv:2312.02199  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV stat.AP

    USat: A Unified Self-Supervised Encoder for Multi-Sensor Satellite Imagery

    Authors: Jeremy Irvin, Lucas Tao, Joanne Zhou, Yuntao Ma, Langston Nashold, Benjamin Liu, Andrew Y. Ng

    Abstract: Large, self-supervised vision models have led to substantial advancements for automatically interpreting natural images. Recent works have begun tailoring these methods to remote sensing data which has rich structure with multi-sensor, multi-spectral, and temporal information providing massive amounts of self-labeled data that can be used for self-supervised pre-training. In this work, we develop…

    Submitted 2 December, 2023; originally announced December 2023.

  17. arXiv:2311.17449  [pdf, other]

    cs.CV

    Weakly-semi-supervised object detection in remotely sensed imagery

    Authors: Ji Hun Wang, Jeremy Irvin, Beri Kohen Behar, Ha Tran, Raghav Samavedam, Quentin Hsu, Andrew Y. Ng

    Abstract: Deep learning for detecting objects in remotely sensed imagery can enable new technologies for important applications including mitigating climate change. However, these models often require large datasets labeled with bounding box annotations which are expensive to curate, prohibiting the development of models for new tasks and geographies. To address this challenge, we develop weakly-semi-superv…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2023

  18. arXiv:2310.19852  [pdf, other]

    cs.AI

    AI Alignment: A Comprehensive Survey

    Authors: Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen McAleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao

    Abstract: AI alignment aims to make AI systems behave in line with human intentions and values. As AI systems grow more capable, so do risks from misalignment. To provide a comprehensive and up-to-date overview of the alignment field, in this survey, we delve into the core concepts, methodology, and practice of alignment. First, we identify four principles as the key objectives of AI alignment: Robustness,…

    Submitted 1 May, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Continually updated, including weak-to-strong generalization and socio-technical thinking. 58 pages (excluding bibliography), 801 references

  19. arXiv:2310.01720  [pdf, other]

    cs.LG cs.AI

    Perceiver-based CDF Modeling for Time Series Forecasting

    Authors: Cat P. Le, Chris Cannella, Ali Hasan, Yuting Ng, Vahid Tarokh

    Abstract: Transformers have demonstrated remarkable efficacy in forecasting time series data. However, their extensive dependence on self-attention mechanisms demands significant computational resources, thereby limiting their practical applicability across diverse tasks, especially in multimodal problems. In this work, we propose a new architecture, called perceiver-CDF, for modeling cumulative distributio…

    Submitted 24 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted in Winter Simulation Conference 2024

  20. arXiv:2309.08142  [pdf, other]

    cs.RO

    MAVIS: Multi-Camera Augmented Visual-Inertial SLAM using SE2(3) Based Exact IMU Pre-integration

    Authors: Yifu Wang, Yonhon Ng, Inkyu Sa, Alvaro Parra, Cristian Rodriguez, Tao Jun Lin, Hongdong Li

    Abstract: We present a novel optimization-based Visual-Inertial SLAM system designed for multiple partially overlapped camera systems, named MAVIS. Our framework fully exploits the benefits of wide field-of-view from multi-camera systems, and the metric scale measurements provided by an inertial measurement unit (IMU). We introduce an improved IMU pre-integration formulation based on the exponential functio…

    Submitted 16 July, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: OpenMAVIS available at: https://github.com/MAVIS-SLAM/ORB_SLAM3_MULTI

  21. arXiv:2309.01361  [pdf, other]

    cs.ET cs.CV cs.RO

    High Frequency, High Accuracy Pointing onboard Nanosats using Neuromorphic Event Sensing and Piezoelectric Actuation

    Authors: Yasir Latif, Peter Anastasiou, Yonhon Ng, Zebb Prime, Tien-Fu Lu, Matthew Tetlow, Robert Mahony, Tat-Jun Chin

    Abstract: As satellites become smaller, the ability to maintain stable pointing decreases as external forces acting on the satellite come into play. At the same time, reaction wheels used in the attitude determination and control system (ADCS) introduce high frequency jitter which can disrupt pointing stability. For space domain awareness (SDA) tasks that track objects tens of thousands of kilometres away,…

    Submitted 10 September, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  22. An Asynchronous Linear Filter Architecture for Hybrid Event-Frame Cameras

    Authors: Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert Mahony

    Abstract: Event cameras are ideally suited to capture High Dynamic Range (HDR) visual information without blur but provide poor imaging capability for static or slowly varying scenes. Conversely, conventional image sensors measure absolute intensity of slowly changing scenes effectively but do poorly on HDR or quickly changing scenes. In this paper, we present an asynchronous linear filter architecture, fus…

    Submitted 29 August, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: 17 pages, 10 figures. Date of Publication: 04 September 2023

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (Volume: 46, Issue: 2, February 2024). Page(s): 695 - 711

  23. arXiv:2308.10633  [pdf, other]

    cs.CL cs.AI

    RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models

    Authors: Yasuto Hoshi, Daisuke Miyashita, Youyang Ng, Kento Tatsuno, Yasuhiro Morioka, Osamu Torii, Jun Deguchi

    Abstract: Retrieval-augmented large language models (R-LLMs) combine pre-trained large language models (LLMs) with information retrieval systems to improve the accuracy of factual question-answering. However, current libraries for building R-LLMs provide high-level abstractions without sufficient transparency for evaluating and optimizing prompts within specific inference processes such as retrieval and gen…

    Submitted 16 October, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: 18 pages, 2 figures, see https://youtu.be/JYbm75qnfTg for the demonstration screencast, accepted by EMNLP 2023 System Demonstrations

  24. arXiv:2308.03983  [pdf, other]

    cs.CL cs.AI

    SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool

    Authors: Youyang Ng, Daisuke Miyashita, Yasuto Hoshi, Yasuhiro Morioka, Osamu Torii, Tomoya Kodama, Jun Deguchi

    Abstract: Large Language Model (LLM) based Generative AI systems have seen significant progress in recent years. Integrating a knowledge retrieval architecture allows for seamless integration of private data into publicly available Generative AI systems using pre-trained LLM without requiring additional model fine-tuning. Moreover, Retrieval-Centric Generation (RCG) approach, a promising future research dir…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 12 pages, 6 figures

  25. arXiv:2306.11697  [pdf, other]

    stat.ME cs.LG stat.ML

    Treatment Effects in Extreme Regimes

    Authors: Ahmed Aloui, Ali Hasan, Yuting Ng, Miroslav Pajic, Vahid Tarokh

    Abstract: Understanding treatment effects in extreme regimes is important for characterizing risks associated with different interventions. This is hindered by the unavailability of counterfactual outcomes and the rarity and difficulty of collecting extreme data in practice. To address this issue, we propose a new framework based on extreme value theory for estimating treatment effects in extreme regimes. W…

    Submitted 22 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  26. arXiv:2304.02122  [pdf, other]

    cs.CV

    OpenContrails: Benchmarking Contrail Detection on GOES-16 ABI

    Authors: Joe Yue-Hei Ng, Kevin McCloskey, Jian Cui, Vincent R. Meijer, Erica Brand, Aaron Sarna, Nita Goyal, Christopher Van Arsdale, Scott Geraedts

    Abstract: Contrails (condensation trails) are line-shaped ice clouds caused by aircraft and are likely the largest contributor of aviation-induced climate change. Contrail avoidance is potentially an inexpensive way to significantly reduce the climate impact of aviation. An automated contrail detection system is an essential tool to develop and evaluate contrail avoidance systems. In this paper, we present…

    Submitted 20 April, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  27. arXiv:2303.05153  [pdf, other]

    cs.CL cs.AI

    Can a Frozen Pretrained Language Model be used for Zero-shot Neural Retrieval on Entity-centric Questions?

    Authors: Yasuto Hoshi, Daisuke Miyashita, Yasuhiro Morioka, Youyang Ng, Osamu Torii, Jun Deguchi

    Abstract: Neural document retrievers, including dense passage retrieval (DPR), have outperformed classical lexical-matching retrievers, such as BM25, when fine-tuned and tested on specific question-answering datasets. However, it has been shown that the existing dense retrievers do not generalize well not only out of domain but even in domain such as Wikipedia, especially when a named entity in a question i…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted to Workshop on Knowledge Augmented Methods for Natural Language Processing, in conjunction with AAAI 2023

  28. arXiv:2302.10381  [pdf]

    cs.DL

    Electronic Laboratory Notebook on Web2py Framework

    Authors: Yong-Yao Ng, Maurice HT Ling

    Abstract: Proper experimental record-keeping is an important cornerstone in research and development for the purpose of auditing. The gold standard of record-keeping is based on the judicious use of physical, permanent notebooks. However, advances in technology had resulted in large amounts of electronic records making it virtually impossible to maintain a full set of records in physical notebooks. Electron…

    Submitted 20 February, 2023; originally announced February 2023.

    Journal ref: The Python Papers 5(3): 7 (2010)

  29. arXiv:2302.03504  [pdf, other]

    cs.RO eess.IV eess.SP

    Learning to Predict Grip Quality from Simulation: Establishing a Digital Twin to Generate Simulated Data for a Grip Stability Metric

    Authors: Stefanie Wucherer, Robert McMurray, Kok Yew Ng, Florian Kerber

    Abstract: A robust grip is key to successful manipulation and joining of work pieces involved in any industrial assembly process. Stability of a grip depends on geometric and physical properties of the object as well as the gripper itself. Current state-of-the-art algorithms can usually predict if a grip would fail. However, they are not able to predict the force at which the gripped object starts to slip,…

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 7 pages, 7 figures

  30. arXiv:2301.01842  [pdf, other]

    cs.CV cs.CY

    Detecting Neighborhood Gentrification at Scale via Street-level Visual Data

    Authors: Tianyuan Huang, Timothy Dai, Zhecheng Wang, Hesu Yoon, Hao Sheng, Andrew Y. Ng, Ram Rajagopal, Jackelyn Hwang

    Abstract: Neighborhood gentrification plays a significant role in shaping the social and economic well-being of both individuals and communities at large. While some efforts have been made to detect gentrification in cities, existing approaches rely mainly on estimated measures from survey data, require substantial work of human labeling, and are limited in characterizing the neighborhood as a whole. We pro…

    Submitted 4 January, 2023; originally announced January 2023.

  31. arXiv:2211.15322  [pdf, other]

    cs.LG stat.ML

    Transductive Kernels for Gaussian Processes on Graphs

    Authors: Yin-Cong Zhi, Felix L. Opolka, Yin Cheng Ng, Pietro Liò, Xiaowen Dong

    Abstract: Kernels on graphs have had limited options for node-level problems. To address this, we present a novel, generalized kernel for graphs with node feature data for semi-supervised learning. The kernel is derived from a regularization framework by treating the graph and feature data as two Hilbert spaces. We also show how numerous kernel-based models on graphs are instances of our design. A kernel de…

    Submitted 28 November, 2022; originally announced November 2022.

  32. Overcoming Bias: Equivariant Filter Design for Biased Attitude Estimation with Online Calibration

    Authors: Alessandro Fornasier, Yonhon Ng, Christian Brommer, Christoph Böhm, Robert Mahony, Stephan Weiss

    Abstract: Stochastic filters for on-line state estimation are a core technology for autonomous systems. The performance of such filters is one of the key limiting factors to a system's capability. Both asymptotic behavior (e.g., for regular operation) and transient response (e.g., for fast initialization and reset) of such filters are of crucial importance in guaranteeing robust operation of autonomous syst…

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: to be published in Robotics and Automation Letters

  33. arXiv:2209.09508  [pdf, other]

    cs.RO

    Real-time Digital Double Framework to Predict Collapsible Terrains for Legged Robots

    Authors: Garen Haddeler, Hari P. Palanivelu, Yung Chuen Ng, Fabien Colonnier, Albertus H. Adiwahono, Zhibin Li, Chee-Meng Chew, Meng Yee Chuah

    Abstract: Inspired by the digital twinning systems, a novel real-time digital double framework is developed to enhance robot perception of the terrain conditions. Based on the very same physical model and motion control, this work exploits the use of such simulated digital double synchronized with a real robot to capture and extract discrepancy information between the two systems, which provides high dimens…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Preprint version. Accepted June 2022

  34. arXiv:2209.06434  [pdf, other]

    cs.SD cs.CL eess.AS

    ConvNeXt Based Neural Network for Audio Anti-Spoofing

    Authors: Qiaowei Ma, Jinghui Zhong, Yitao Yang, Weiheng Liu, Ying Gao, Wing W. Y. Ng

    Abstract: With the rapid development of speech conversion and speech synthesis algorithms, automatic speaker verification (ASV) systems are vulnerable to spoofing attacks. In recent years, researchers had proposed a number of anti-spoofing methods based on hand-crafted features. However, using hand-crafted features rather than raw waveform will lose implicit information for anti-spoofing. Inspired by the pr…

    Submitted 21 December, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: 6 pages

  35. arXiv:2208.13027  [pdf, other]

    cs.LG cs.AI

    Improving debris flow evacuation alerts in Taiwan using machine learning

    Authors: Yi-Lin Tsai, Jeremy Irvin, Suhas Chundi, Andrew Y. Ng, Christopher B. Field, Peter K. Kitanidis

    Abstract: Taiwan has the highest susceptibility to and fatalities from debris flows worldwide. The existing debris flow warning system in Taiwan, which uses a time-weighted measure of rainfall, leads to alerts when the measure exceeds a predefined threshold. However, this system generates many false alarms and misses a substantial fraction of the actual debris flows. Towards improving this system, we implem…

    Submitted 2 September, 2022; v1 submitted 27 August, 2022; originally announced August 2022.

    Comments: Supplementary information: https://drive.google.com/file/d/1Y17YxXo5rhIbUuZzwLo99pmttbh28v9X/view?usp=sharing

  36. arXiv:2208.01710  [pdf, other]

    cs.RO

    Smart Visual Beacons with Asynchronous Optical Communications using Event Cameras

    Authors: Ziwei Wang, Yonhon Ng, Jack Henderson, Robert Mahony

    Abstract: Event cameras are bio-inspired dynamic vision sensors that respond to changes in image intensity with a high temporal resolution, high dynamic range and low latency. These sensor characteristics are ideally suited to enable visual target tracking in concert with a broadcast visual communication channel for smart visual beacons with applications in distributed robotics. Visual beacons can be constr…

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 7 pages, 8 figures, accepted by IEEE International Conference on Intelligent Robots and Systems (IROS) 2022

  37. arXiv:2207.11166  [pdf, other]

    cs.CV

    METER-ML: A Multi-Sensor Earth Observation Benchmark for Automated Methane Source Mapping

    Authors: Bryan Zhu, Nicholas Lui, Jeremy Irvin, Jimmy Le, Sahil Tadwalkar, Chenghao Wang, Zutao Ouyang, Frankie Y. Liu, Andrew Y. Ng, Robert B. Jackson

    Abstract: Reducing methane emissions is essential for mitigating global warming. To attribute methane emissions to their sources, a comprehensive dataset of methane source infrastructure is necessary. Recent advancements with deep learning on remotely sensed imagery have the potential to identify the locations and characteristics of methane sources, but there is a substantial lack of publicly available data…

    Submitted 15 August, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Workshop on Complex Data Challenges in Earth Observation at IJCAI-ECAI 2022

  38. arXiv:2206.02679  [pdf, other]

    cs.RO cs.AI

    Real2Sim or Sim2Real: Robotics Visual Insertion using Deep Reinforcement Learning and Real2Sim Policy Adaptation

    Authors: Yiwen Chen, Xue Li, Sheng Guo, Xian Yao Ng, Marcelo Ang

    Abstract: Reinforcement learning has shown a wide usage in robotics tasks, such as insertion and grasping. However, without a practical sim2real strategy, the policy trained in simulation could fail on the real task. There are also wide researches in the sim2real strategies, but most of those methods rely on heavy image rendering, domain randomization training, or tuning. In this work, we solve the insertio…

    Submitted 6 June, 2022; originally announced June 2022.

  39. arXiv:2205.14025  [pdf, other]

    stat.ME cs.LG stat.ML

    Inference and Sampling for Archimax Copulas

    Authors: Yuting Ng, Ali Hasan, Vahid Tarokh

    Abstract: Understanding multivariate dependencies in both the bulk and the tails of a distribution is an important problem for many applications, such as ensuring algorithms are robust to observations that are infrequent but have devastating effects. Archimax copulas are a family of distributions endowed with a precise representation that allows simultaneous modeling of the bulk and the tails of a distribut…

    Submitted 20 September, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Yuting Ng and Ali Hasan contributed equally to this work. This work has been accepted at NeurIPS 2022

  40. arXiv:2205.08090  [pdf, other]

    cs.CV

    A Linear Comb Filter for Event Flicker Removal

    Authors: Ziwei Wang, Dingran Yuan, Yonhon Ng, Robert Mahony

    Abstract: Event cameras are bio-inspired sensors that capture per-pixel asynchronous intensity change rather than the synchronous absolute intensity frames captured by a classical camera sensor. Such cameras are ideal for robotics applications since they have high temporal resolution, high dynamic range and low latency. However, due to their high temporal resolution, event cameras are particularly sensitive…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 10 pages, 7 figures, published in IEEE International Conference on Robotics and Automation (ICRA), 2022

  41. arXiv:2205.05963  [pdf, other]

    cs.RO cs.AI cs.CV

    Economical Precise Manipulation and Auto Eye-Hand Coordination with Binocular Visual Reinforcement Learning

    Authors: Yiwen Chen, Sheng Guo, Zedong Zhang, Lei Zhou, Xian Yao Ng, Marcelo H. Ang Jr

    Abstract: Precision robotic manipulation tasks (insertion, screwing, precisely pick, precisely place) are required in many scenarios. Previous methods achieved good performance on such manipulation tasks. However, such methods typically require tedious calibration or expensive sensors. 3D/RGB-D cameras and torque/force sensors add to the cost of the robotic application and may not always be economical. In t…

    Submitted 15 September, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: 12 pages, 16 figures

  42. arXiv:2204.01186  [pdf, ps, other]

    cs.CV

    Revisiting a kNN-based Image Classification System with High-capacity Storage

    Authors: Kengo Nakata, Youyang Ng, Daisuke Miyashita, Asuka Maki, Yu-Chieh Lin, Jun Deguchi

    Abstract: In existing image classification systems that use deep neural networks, the knowledge needed for image classification is implicitly stored in model parameters. If users want to update this knowledge, then they need to fine-tune the model parameters. Moreover, users cannot verify the validity of inference results or evaluate the contribution of knowledge to the results. In this paper, we investigat…

    Submitted 28 July, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Accepted to ECCV 2022 (Oral)

  43. Equivariant Filter Design for Inertial Navigation Systems with Input Measurement Biases

    Authors: Alessandro Fornasier, Yonhon Ng, Robert Mahony, Stephan Weiss

    Abstract: Inertial Navigation Systems (INS) are a key technology for autonomous vehicles applications. Recent advances in estimation and filter design for the INS problem have exploited geometry and symmetry to overcome limitations of the classical Extended Kalman Filter (EKF) approach that formed the mainstay of INS systems since the mid-twentieth century. The industry standard INS filter, the Multiplicati…

    Submitted 4 February, 2022; originally announced February 2022.

  44. arXiv:2201.02437  [pdf, other]

    cs.RO

    Continuous-time Radar-inertial Odometry for Automotive Radars

    Authors: Yin Zhi Ng, Benjamin Choi, Robby Tan, Lionel Heng

    Abstract: We present an approach for radar-inertial odometry which uses a continuous-time framework to fuse measurements from multiple automotive radars and an inertial measurement unit (IMU). Adverse weather conditions do not have a significant impact on the operating performance of radar sensors unlike that of camera and LiDAR sensors. Radar's robustness in such conditions and the increasing prevalence of…

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  45. arXiv:2201.01449  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia

    Authors: Jon Braatz, Pranav Rajpurkar, Stephanie Zhang, Andrew Y. Ng, Jeanne Shen

    Abstract: In recent years, deep learning has successfully been applied to automate a wide variety of tasks in diagnostic histopathology. However, fast and reliable localization of small-scale regions-of-interest (ROI) has remained a key challenge, as discriminative morphologic features often occupy only a small fraction of a gigapixel-scale whole-slide image (WSI). In this paper, we propose a sparse WSI ana…

    Submitted 4 January, 2022; originally announced January 2022.

  46. arXiv:2112.04963  [pdf, other]

    cs.LG physics.ao-ph

    Model-Agnostic Hybrid Numerical Weather Prediction and Machine Learning Paradigm for Solar Forecasting in the Tropics

    Authors: Nigel Yuan Yun Ng, Harish Gopalan, Venugopalan S. G. Raghavan, Chin Chun Ooi

    Abstract: Numerical weather prediction (NWP) and machine learning (ML) methods are popular for solar forecasting. However, NWP models have multiple possible physical parameterizations, which requires site-specific NWP optimization. This is further complicated when regional NWP models are used with global climate models with different possible parameterizations. In this study, an alternative approach is prop…

    Submitted 9 December, 2021; originally announced December 2021.

  47. Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception

    Authors: Ziwei Wang, Liyuan Pan, Yonhon Ng, Zheyu Zhuang, Robert Mahony

    Abstract: Stereo camera systems play an important role in robotics applications to perceive the 3D world. However, conventional cameras have drawbacks such as low dynamic range, motion blur and latency due to the underlying frame-based mechanism. Event cameras address these limitations as they report the brightness changes of each pixel independently with a fine temporal resolution, but they are unable to a…

    Submitted 11 May, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 10 pages, 6 figures, accepted for presentation at International Conference on Intelligent Robots and Systems (IROS), 2021

  48. Epileptic Seizure Classification Using Combined Labels and a Genetic Algorithm

    Authors: Scot Davidson, Niamh McCallan, Kok Yew Ng, Pardis Biglarbeigi, Dewar Finlay, Boon Leong Lan, James McLaughlin

    Abstract: Epilepsy affects 50 million people worldwide and is one of the most common serious neurological disorders. Seizure detection and classification is a valuable tool for diagnosing and maintaining the condition. An automated classification algorithm will allow for accurate diagnosis. Utilising the Temple University Hospital (TUH) Seizure Corpus, six seizure types are compared; absence, complex partia…

    Submitted 28 April, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: 6 pages, 3 figures, accepted for publication at the 21st IEEE Mediterranean Electrotechnical Conference (MELECON 2022)

    Journal ref: 2022 IEEE 21st Mediterranean Electrotechnical Conference (MELECON)

  49. arXiv:2109.14828  [pdf, other]

    cs.RO

    Uncertainty Estimation of Dense Optical-Flow for Robust Visual Navigation

    Authors: Yonhon Ng, Hongdong Li, Jonghyuk Kim

    Abstract: This paper presents a novel dense optical-flow algorithm to solve the monocular simultaneous localization and mapping (SLAM) problem for ground or aerial robots. Dense optical flow can effectively provide the ego-motion of the vehicle while enabling collision avoidance with the potential obstacles. Existing work has not fully utilized the uncertainty of the optical flow -- at most an isotropic Gau…

    Submitted 29 September, 2021; originally announced September 2021.

  50. arXiv:2108.01764  [pdf, other]

    cs.CL cs.AI

    Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management

    Authors: Cécile Logé, Emily Ross, David Yaw Amoah Dadey, Saahil Jain, Adriel Saporta, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Recent advances in Natural Language Processing (NLP), and specifically automated Question Answering (QA) systems, have demonstrated both impressive linguistic fluency and a pernicious tendency to reflect social biases. In this study, we introduce Q-Pain, a dataset for assessing bias in medical QA in the context of pain management, one of the most challenging forms of clinical decision-making. Alon…

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks