Skip to main content

Showing 1–50 of 361 results for author: Chen, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10162  [pdf, other

    cs.RO cs.LG

    Physics-Aware Combinatorial Assembly Planning using Deep Reinforcement Learning

    Authors: Ruixuan Liu, Alan Chen, Weiye Zhao, Changliu Liu

    Abstract: Combinatorial assembly uses standardized unit primitives to build objects that satisfy user specifications. Lego is a widely used platform for combinatorial assembly, in which people use unit primitives (ie Lego bricks) to build highly customizable 3D objects. This paper studies sequence planning for physical combinatorial assembly using Lego. Given the shape of the desired object, we want to find… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2408.09945  [pdf, other

    cs.CL cs.AI

    Benchmarking LLMs for Translating Classical Chinese Poetry:Evaluating Adequacy, Fluency, and Elegance

    Authors: Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang

    Abstract: Large language models (LLMs) have shown remarkable performance in general translation tasks. However, the increasing demand for high-quality translations that are not only adequate but also fluent and elegant. To assess the extent to which current LLMs can meet these demands, we introduce a suitable benchmark for translating classical Chinese poetry into English. This task requires not only adequa… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Work in progress

  3. arXiv:2408.09030  [pdf, other

    cs.CL cs.HC

    Studying the Effects of Collaboration in Interactive Theme Discovery Systems

    Authors: Alvin Po-Chun Chen, Dananjay Srinivas, Alexandra Barry, Maksim Seniw, Maria Leonor Pacheco

    Abstract: NLP-assisted solutions have gained considerable traction to support qualitative data analysis. However, there does not exist a unified evaluation framework that can account for the many different settings in which qualitative researchers may employ them. In this paper, we take a first step in this direction by proposing an evaluation framework to study the way in which different tools may result i… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  4. arXiv:2408.06634  [pdf, other

    q-fin.CP cs.CL cs.LG q-fin.ST

    Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach

    Authors: Haowei Ni, Shuchen Meng, Xupeng Chen, Ziqing Zhao, Andi Chen, Panfeng Li, Shiyao Zhang, Qifu Yin, Yuanqing Wang, Yuxi Chan

    Abstract: Accurate stock market predictions following earnings reports are crucial for investors. Traditional methods, particularly classical machine learning models, struggle with these predictions because they cannot effectively process and interpret extensive textual data contained in earnings reports and often overlook nuances that influence market movements. This paper introduces an advanced approach b… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: Accepted by 2024 6th International Conference on Data-driven Optimization of Complex Systems

  5. arXiv:2408.06478  [pdf, other

    cs.CR cs.PL

    Theorem-Carrying-Transaction: Runtime Certification to Ensure Safety for Smart Contract Transactions

    Authors: Nikolaj S. Bjørner, Ashley J. Chen, Shuo Chen, Yang Chen, Zhongxin Guo, Tzu-Han Hsu, Peng Liu, Nanqing Luo

    Abstract: Security bugs and trapdoors in smart contracts have been impacting the Ethereum community since its inception. Conceptually, the 1.45-million Ethereum's contracts form a single "gigantic program" whose behaviors are determined by the complex reference-topology between the contracts. Can the Ethereum community be assured that this gigantic program conforms to its design-level safety properties, des… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  6. arXiv:2408.04752  [pdf, other

    cs.HC

    A Multi-Level Task Framework for Event Sequence Analysis

    Authors: Kazi Tasnim Zinat, Saimadhav Naga Sakhamuri, Aaron Sun Chen, Zhicheng Liu

    Abstract: Despite the development of numerous visual analytics tools for event sequence data across various domains, including but not limited to healthcare, digital marketing, and user behavior analysis, comparing these domain-specific investigations and transferring the results to new datasets and problem areas remain challenging. Task abstractions can help us go beyond domain-specific details, but existi… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Task Abstraction, Event Sequence Data

  7. arXiv:2408.04268  [pdf, other

    cs.CV

    Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods

    Authors: Yiming Zhou, Zixuan Zeng, Andi Chen, Xiaofan Zhou, Haowei Ni, Shiyao Zhang, Panfeng Li, Liangxi Liu, Mengyao Zheng, Xupeng Chen

    Abstract: Exploring the capabilities of Neural Radiance Fields (NeRF) and Gaussian-based methods in the context of 3D scene reconstruction, this study contrasts these modern approaches with traditional Simultaneous Localization and Mapping (SLAM) systems. Utilizing datasets such as Replica and ScanNet, we assess performance based on tracking accuracy, mapping fidelity, and view synthesis. Findings reveal th… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Accepted by 2024 6th International Conference on Data-driven Optimization of Complex Systems

  8. X.509 Information Security Certification Based on Post-Quantum Cryptography

    Authors: Abel C. H. Chen

    Abstract: In recent years, with the advancement of quantum computing, mainstream asymmetric cryptographic methods in the current Public Key Infrastructure (PKI) systems are gradually being threatened. Therefore, this study explores X.509 security certificates based on Post-Quantum Cryptography (PQC) and discusses implemented solutions. This study compares mainstream asymmetric cryptographic methods (includi… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: The manuscript was submitted to arXiv on 6 May 2024, but it was rejected on 11 July 2024. The appeal was submitted on 11 July 2024, and it was accepted on 2 August 2024. The manuscript is written in Chinese language

  9. arXiv:2407.19342  [pdf, other

    cs.LG cs.CL

    Parameter-Efficient Fine-Tuning via Circular Convolution

    Authors: Aochuan Chen, Ziqi Gao, Zijing Liu, Yu Li, Jia Li

    Abstract: Low-Rank Adaptation (LoRA) has gained popularity for fine-tuning large foundation models, leveraging low-rank matrices $\mathbf{A}$ and $\mathbf{B}$ to represent weight changes (\textit{i.e.,} $Δ\mathbf{W} = \mathbf{B} \mathbf{A}$). This method reduces trainable parameters and mitigates heavy memory consumption associated with full delta matrices by sequentially multiplying $\mathbf{A}$ and… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: Work in progress

  10. arXiv:2407.18397  [pdf, other

    cs.LG stat.ML

    Gaussian Process Kolmogorov-Arnold Networks

    Authors: Andrew Siyuan Chen

    Abstract: In this paper, we introduce a probabilistic extension to Kolmogorov Arnold Networks (KANs) by incorporating Gaussian Process (GP) as non-linear neurons, which we refer to as GP-KAN. A fully analytical approach to handling the output distribution of one GP as an input to another GP is achieved by considering the function inner product of a GP function sample with the input distribution. These GP ne… ▽ More

    Submitted 17 August, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: related code: https://github.com/siyuan0/gp-kan Under review at IEEE

  11. arXiv:2407.17676  [pdf, other

    cs.DC

    Empowering the Quantum Cloud User with QRIO

    Authors: Shmeelok Chakraborty, Yuewen Hou, Ang Chen, Gokul Subramanian Ravi

    Abstract: Quantum computing is moving swiftly from theoretical to practical applications, making it crucial to establish a significant quantum advantage. Despite substantial investments, access to quantum devices is still limited, with users facing issues like long wait times and inefficient resource management. Unlike the mature cloud solutions for classical computing, quantum computing lacks effective inf… ▽ More

    Submitted 25 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: To appear at the IEEE International Symposium on Workload Characterization, 2024

  12. arXiv:2407.15359  [pdf

    cs.CL

    UF-HOBI at "Discharge Me!": A Hybrid Solution for Discharge Summary Generation Through Prompt-based Tuning of GatorTronGPT Models

    Authors: Mengxian Lyu, Cheng Peng, Daniel Paredes, Ziyi Chen, Aokun Chen, Jiang Bian, Yonghui Wu

    Abstract: Automatic generation of discharge summaries presents significant challenges due to the length of clinical documentation, the dispersed nature of patient information, and the diverse terminology used in healthcare. This paper presents a hybrid solution for generating discharge summary sections as part of our participation in the "Discharge Me!" Challenge at the BioNLP 2024 Shared Task. We developed… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: BIONLP 2024 and Shared Tasks @ ACL 2024

    Journal ref: BIONLP 2024 and Shared Tasks @ ACL 2024

  13. arXiv:2407.12939  [pdf, other

    cs.CV

    GenRC: Generative 3D Room Completion from Sparse Image Collections

    Authors: Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen, Chi Liu, Yu-Lun Liu, Albert Y. C. Chen, Cheng-Hao Kuo, Min Sun

    Abstract: Sparse RGBD scene completion is a challenging task especially when considering consistent textures and geometries throughout the entire scene. Different from existing solutions that rely on human-designed text prompts or predefined camera trajectories, we propose GenRC, an automated training-free pipeline to complete a room-scale 3D mesh with high-fidelity textures. To achieve this, we first proje… ▽ More

    Submitted 1 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  14. arXiv:2407.12810  [pdf

    cs.NI

    A Study on the Situation of Connected Car Patent Portfolios

    Authors: Abel C. H. Chen, Chia-Shen Chang

    Abstract: In recent years, the countries of the world have drafted the specifications of connected cars; for instance, the Security Credential Management System (SCMS) has been proposed by United States Department of Transportation (USDOT), and the Cooperative Intelligent Transportation System (C-ITS) Credential Management System (CCMS) has been proposed by European Union (EU). Therefore, several companies… ▽ More

    Submitted 26 June, 2024; originally announced July 2024.

    Comments: in Chinese language

  15. Performance Comparison of Various Modes of Advanced Encryption Standard

    Authors: Abel C. H. Chen

    Abstract: With the maturation of quantum computing technology, many cryptographic methods are gradually facing threats from quantum computing. Although the Grover algorithm can accelerate search speeds, current research indicates that the Advanced Encryption Standard (AES) method can still enhance security by increasing the length of the secret key. However, the AES method involves multiple modes in impleme… ▽ More

    Submitted 21 May, 2024; originally announced July 2024.

    Comments: in Chinese language

  16. arXiv:2407.07457  [pdf, other

    cs.LG cs.CL

    GLBench: A Comprehensive Benchmark for Graph with Large Language Models

    Authors: Yuhan Li, Peisong Wang, Xiao Zhu, Aochuan Chen, Haiyun Jiang, Deng Cai, Victor Wai Kin Chan, Jia Li

    Abstract: The emergence of large language models (LLMs) has revolutionized the way we interact with graphs, leading to a new paradigm called GraphLLM. Despite the rapid development of GraphLLM methods in recent years, the progress and understanding of this field remain unclear due to the lack of a benchmark with consistent experimental protocols. To bridge this gap, we introduce GLBench, the first comprehen… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.10280 by other authors

  17. arXiv:2407.04699  [pdf, other

    cs.CV cs.AI

    LaRa: Efficient Large-Baseline Radiance Fields

    Authors: Anpei Chen, Haofei Xu, Stefano Esposito, Siyu Tang, Andreas Geiger

    Abstract: Radiance field methods have achieved photorealistic novel view synthesis and geometry reconstruction. But they are mostly applied in per-scene optimization or small-baseline settings. While several recent works investigate feed-forward reconstruction with large baselines by utilizing transformers, they all operate with a standard global attention mechanism and hence ignore the local nature of 3D r… ▽ More

    Submitted 15 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: Project Page: https://apchenstu.github.io/LaRa/

  18. arXiv:2407.04264  [pdf, ps, other

    cs.LG math.OC

    Langevin Dynamics: A Unified Perspective on Optimization via Lyapunov Potentials

    Authors: August Y. Chen, Ayush Sekhari, Karthik Sridharan

    Abstract: We study the problem of non-convex optimization using Stochastic Gradient Langevin Dynamics (SGLD). SGLD is a natural and popular variation of stochastic gradient descent where at each step, appropriately scaled Gaussian noise is added. To our knowledge, the only strategy for showing global convergence of SGLD on the loss function is to show that SGLD can sample from a stationary distribution whic… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  19. arXiv:2407.02666  [pdf, other

    cs.RO cs.AI

    Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models

    Authors: Annie S. Chen, Alec M. Lessing, Andy Tang, Govind Chada, Laura Smith, Sergey Levine, Chelsea Finn

    Abstract: Legged robots are physically capable of navigating a diverse variety of environments and overcoming a wide range of obstructions. For example, in a search and rescue mission, a legged robot could climb over debris, crawl through gaps, and navigate out of dead ends. However, the robot's controller needs to respond intelligently to such varied obstacles, and this requires handling unexpected and unu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 27 pages

  20. arXiv:2406.13121  [pdf, other

    cs.CL cs.AI cs.IR

    Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

    Authors: Jinhyuk Lee, Anthony Chen, Zhuyun Dai, Dheeru Dua, Devendra Singh Sachan, Michael Boratko, Yi Luan, Sébastien M. R. Arnold, Vincent Perot, Siddharth Dalmia, Hexiang Hu, Xudong Lin, Panupong Pasupat, Aida Amini, Jeremy R. Cole, Sebastian Riedel, Iftekhar Naim, Ming-Wei Chang, Kelvin Guu

    Abstract: Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging LCLMs' ability to natively ingest and process entire corpora of information offers numerous advantages. It enhances user-friendliness by eliminating the need for specialized knowledge of tools, provides robust end-to-… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 29 pages. Dataset available at https://github.com/google-deepmind/loft

  21. arXiv:2406.12177  [pdf, other

    cs.CV cs.LG

    Location-based Radiology Report-Guided Semi-supervised Learning for Prostate Cancer Detection

    Authors: Alex Chen, Nathan Lay, Stephanie Harmon, Kutsev Ozyoruk, Enis Yilmaz, Brad J. Wood, Peter A. Pinto, Peter L. Choyke, Baris Turkbey

    Abstract: Prostate cancer is one of the most prevalent malignancies in the world. While deep learning has potential to further improve computer-aided prostate cancer detection on MRI, its efficacy hinges on the exhaustive curation of manually annotated images. We propose a novel methodology of semisupervised learning (SSL) guided by automatically extracted clinical information, specifically the lesion locat… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 4 page paper accepted to IEEE International Symposium on Biomedical Imaging (ISBI 2024)

  22. arXiv:2406.12123  [pdf, other

    cs.RO cs.AI cs.LG

    ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke

    Authors: Jingxi Xu, Runsheng Wang, Siqi Shang, Ava Chen, Lauren Winterbottom, To-Liang Hsu, Wenxi Chen, Khondoker Ahmed, Pedro Leandro La Rotta, Xinyue Zhu, Dawn M. Nilsen, Joel Stein, Matei Ciocarlie

    Abstract: Intent inferral on a hand orthosis for stroke patients is challenging due to the difficulty of data collection from impaired subjects. Additionally, EMG signals exhibit significant variations across different conditions, sessions, and subjects, making it hard for classifiers to generalize. Traditional approaches require a large labeled dataset from the new condition, session, or subject to train i… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 8 pages

  23. arXiv:2406.11309  [pdf, other

    cs.CV

    BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models

    Authors: Xuefeng Hu, Ke Zhang, Min Sun, Albert Chen, Cheng-Hao Kuo, Ram Nevatia

    Abstract: Large-scale pretrained vision-language models like CLIP have demonstrated remarkable zero-shot image classification capabilities across diverse domains. To enhance CLIP's performance while preserving the zero-shot paradigm, various test-time prompt tuning methods have been introduced to refine class embeddings through unsupervised learning objectives during inference. However, these methods often… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Preprint updated from our earlier manuscript submitted to ICLR 2024 (https://openreview.net/forum?id=KNtcoAM5Gy)

  24. arXiv:2406.07232  [pdf, other

    cs.CL cs.AI

    DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms

    Authors: Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang

    Abstract: Recently, large language models (LLMs) enhanced by self-reflection have achieved promising performance on machine translation. The key idea is guiding LLMs to generate translation with human-like feedback. However, existing self-reflection methods lack effective feedback information, limiting the translation performance. To address this, we introduce a DUAL-REFLECT framework, leveraging the dual l… ▽ More

    Submitted 21 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main conference

  25. arXiv:2406.05810  [pdf, other

    cs.CV

    ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving

    Authors: Chen Ma, Ningfei Wang, Zhengyu Zhao, Qian Wang, Qi Alfred Chen, Chao Shen

    Abstract: Recent research in adversarial machine learning has focused on visual perception in Autonomous Driving (AD) and has shown that printed adversarial patches can attack object detectors. However, it is important to note that AD visual perception encompasses more than just object detection; it also includes Multiple Object Tracking (MOT). MOT enhances the robustness by compensating for object detectio… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  26. arXiv:2406.05800   

    cs.CV cs.CR

    SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving

    Authors: Chen Ma, Ningfei Wang, Zhengyu Zhao, Qi Alfred Chen, Chao Shen

    Abstract: Autonomous Driving (AD) systems critically depend on visual perception for real-time object detection and multiple object tracking (MOT) to ensure safe driving. However, high latency in these visual perception components can lead to significant safety risks, such as vehicle collisions. While previous research has extensively explored latency attacks within the digital realm, translating these meth… ▽ More

    Submitted 19 July, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: This submission was made without all contributors' consent

  27. arXiv:2406.04784  [pdf, other

    cs.CL cs.AI

    SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

    Authors: Ruihan Yang, Jiangjie Chen, Yikai Zhang, Siyu Yuan, Aili Chen, Kyle Richardson, Yanghua Xiao, Deqing Yang

    Abstract: Language agents powered by large language models (LLMs) are increasingly valuable as decision-making tools in domains such as gaming and programming. However, these agents often face challenges in achieving high-level goals without detailed instructions and in adapting to environments where feedback is delayed. In this paper, we present SelfGoal, a novel automatic approach designed to enhance agen… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Preprint

  28. arXiv:2406.04531  [pdf, other

    cs.SE

    TESTEVAL: Benchmarking Large Language Models for Test Case Generation

    Authors: Wenhan Wang, Chenyuan Yang, Zhijie Wang, Yuheng Huang, Zhaoyang Chu, Da Song, Lingming Zhang, An Ran Chen, Lei Ma

    Abstract: Testing plays a crucial role in the software development cycle, enabling the detection of bugs, vulnerabilities, and other undesirable behaviors. To perform software testing, testers need to write code snippets that execute the program under test. Recently, researchers have recognized the potential of large language models (LLMs) in software testing. However, there remains a lack of fair compariso… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  29. arXiv:2405.20774  [pdf, other

    cs.CR cs.AI

    Exploring Backdoor Attacks against Large Language Model-based Decision Making

    Authors: Ruochen Jiao, Shaoyuan Xie, Justin Yue, Takami Sato, Lixu Wang, Yixuan Wang, Qi Alfred Chen, Qi Zhu

    Abstract: Large Language Models (LLMs) have shown significant promise in decision-making tasks when fine-tuned on specific applications, leveraging their inherent common sense and reasoning abilities learned from vast amounts of data. However, these systems are exposed to substantial safety and security risks during the fine-tuning phase. In this work, we propose the first comprehensive framework for Backdo… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 27 pages, including main paper, references, and appendix

  30. arXiv:2405.19534  [pdf, other

    cs.LG cs.AI cs.CL

    Preference Learning Algorithms Do Not Learn Preference Rankings

    Authors: Angelica Chen, Sadhika Malladi, Lily H. Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho

    Abstract: Preference learning algorithms (e.g., RLHF and DPO) are frequently used to steer LLMs to produce generations that are more preferred by humans, but our understanding of their inner workings is still limited. In this work, we study the conventional wisdom that preference learning trains models to assign higher likelihoods to more preferred outputs than less preferred outputs, measured via… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  31. arXiv:2405.16798  [pdf, other

    cs.LG

    Exploring Fairness in Educational Data Mining in the Context of the Right to be Forgotten

    Authors: Wei Qian, Aobo Chen, Chenxu Zhao, Yangyi Li, Mengdi Huai

    Abstract: In education data mining (EDM) communities, machine learning has achieved remarkable success in discovering patterns and structures to tackle educational challenges. Notably, fairness and algorithmic bias have gained attention in learning analytics of EDM. With the increasing demand for the right to be forgotten, there is a growing need for machine learning models to forget sensitive data and its… ▽ More

    Submitted 29 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  32. arXiv:2405.03003  [pdf, other

    cs.LG cs.AI cs.CL

    Parameter-Efficient Fine-Tuning with Discrete Fourier Transform

    Authors: Ziqi Gao, Qichao Wang, Aochuan Chen, Zijing Liu, Bingzhe Wu, Liang Chen, Jia Li

    Abstract: Low-rank adaptation~(LoRA) has recently gained much interest in fine-tuning foundation models. It effectively reduces the number of trainable parameters by incorporating low-rank matrices $A$ and $B$ to represent the weight change, i.e., $ΔW=BA$. Despite LoRA's progress, it faces storage challenges when handling extensive customization adaptations or larger base models. In this work, we aim to fur… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  33. arXiv:2405.02517  [pdf, other

    cs.CL

    Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought Prompt Optimization

    Authors: Alvin Po-Chun Chen, Ray Groshan, Sean von Bayern

    Abstract: Extensive research exists on the performance of large language models on logic-based tasks, whereas relatively little has been done on their ability to generate creative solutions on lateral thinking tasks. The BrainTeaser shared task tests lateral thinking and uses adversarial datasets to prevent memorization, resulting in poor performance for out-of-the-box models. We propose a system for iterat… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 13 pages, 2 figures, to be published in Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

  34. arXiv:2405.01798  [pdf, other

    cs.IT econ.GN

    The Economy and Public Diplomacy: An Analysis of RT's Economic Content and Context on Facebook

    Authors: Ayse D. Lokmanoglu, Carol K. Winkler, Kareem El Damanhoury, Virginia Massignan, Esteban Villa-Turek, Keyu Alexander Chen

    Abstract: With globalization's rise, economic interdependence's impacts have become a prominent factor affecting personal lives, as well as national and international dynamics. This study examines RT's public diplomacy efforts on its non-Russian Facebook accounts over the past five years to identify the prominence of economic topics across language accounts. Computational analysis, including word embeddings… ▽ More

    Submitted 5 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 14 pages, 6 figures

  35. arXiv:2405.00565  [pdf, other

    cs.SE

    Leveraging Stack Traces for Spectrum-based Fault Localization in the Absence of Failing Tests

    Authors: Lorena Barreto Simedo Pacheco, An Ran Chen, Jinqiu Yang, Tse-Hsun, Chen

    Abstract: Bug fixing is a crucial task in software maintenance to hold user trust. Although various automated fault localization techniques exist, they often require specific conditions to be effective. For example, Spectrum-Based Fault Localization (SBFL) techniques need at least one failing test to identify bugs, which may not always be available. Bug reports, particularly those with stack traces, provide… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  36. arXiv:2404.18231  [pdf, other

    cs.CL cs.AI

    From Persona to Personalization: A Survey on Role-Playing Language Agents

    Authors: Jiangjie Chen, Xintao Wang, Rui Xu, Siyu Yuan, Yikai Zhang, Wei Shi, Jian Xie, Shuang Li, Ruihan Yang, Tinghui Zhu, Aili Chen, Nianqi Li, Lida Chen, Caiyu Hu, Siye Wu, Scott Ren, Ziquan Fu, Yanghua Xiao

    Abstract: Recent advancements in large language models (LLMs) have significantly boosted the rise of Role-Playing Language Agents (RPLAs), i.e., specialized AI systems designed to simulate assigned personas. By harnessing multiple advanced abilities of LLMs, including in-context learning, instruction following, and social intelligence, RPLAs achieve a remarkable sense of human likeness and vivid role-playin… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Preprint

  37. LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing

    Authors: Zeyang Ma, An Ran Chen, Dong Jae Kim, Tse-Hsun Chen, Shaowei Wang

    Abstract: Logs are important in modern software development with runtime information. Log parsing is the first step in many log-based analyses, that involve extracting structured information from unstructured log data. Traditional log parsers face challenges in accurately parsing logs due to the diversity of log formats, which directly impacts the performance of downstream log-analysis tasks. In this paper,… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  38. arXiv:2404.16837  [pdf

    cs.CR cs.CY cs.SE

    The Security Performance Analysis of Blockchain System Based on Post-Quantum Cryptography -- A Case Study of Cryptocurrency Exchanges

    Authors: Abel C. H. Chen

    Abstract: The current blockchain system for cryptocurrency exchanges primarily employs elliptic curve cryptography (ECC) for generating key pairs in wallets, and elliptic curve digital signature algorithms (ECDSA) for generating signatures in transactions. Consequently, with the maturation of quantum computing technology, the current blockchain system faces the risk of quantum computing attacks. Quantum com… ▽ More

    Submitted 23 January, 2024; originally announced April 2024.

    Comments: in Chinese language

  39. arXiv:2404.12824  [pdf, other

    cs.RO cs.LG cs.MA

    MAexp: A Generic Platform for RL-based Multi-Agent Exploration

    Authors: Shaohao Zhu, Jiacheng Zhou, Anjun Chen, Mingming Bai, Jiming Chen, Jinming Xu

    Abstract: The sim-to-real gap poses a significant challenge in RL-based multi-agent exploration due to scene quantization and action discretization. Existing platforms suffer from the inefficiency in sampling and the lack of diversity in Multi-Agent Reinforcement Learning (MARL) algorithms across different scenarios, restraining their widespread applications. To fill these gaps, we propose MAexp, a generic… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  40. arXiv:2404.10194  [pdf, other

    cs.SE cs.HC

    Impostor Syndrome in Final Year Computer Science Students: An Eye Tracking and Biometrics Study

    Authors: Alyssia Chen, Carol Wong, Katy Tarrit, Anthony Peruma

    Abstract: Imposter syndrome is a psychological phenomenon that affects individuals who doubt their skills and abilities, despite possessing the necessary competencies. This can lead to a lack of confidence and poor performance. While research has explored the impacts of imposter syndrome on students and professionals in various fields, there is limited knowledge on how it affects code comprehension in softw… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted at: 18th International Conference, AC 2024, Held as Part of the 26th HCI International Conference, HCII 2024

  41. arXiv:2404.09993  [pdf, other

    cs.CV

    No More Ambiguity in 360° Room Layout via Bi-Layout Estimation

    Authors: Yu-Ju Tsai, Jin-Cheng Jhang, Jingjing Zheng, Wei Wang, Albert Y. C. Chen, Min Sun, Cheng-Hao Kuo, Ming-Hsuan Yang

    Abstract: Inherent ambiguity in layout annotations poses significant challenges to developing accurate 360° room layout estimation models. To address this issue, we propose a novel Bi-Layout model capable of predicting two distinct layout types. One stops at ambiguous regions, while the other extends to encompass all visible areas. Our model employs two global context embeddings, where each embedding is des… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, Project page: https://liagm.github.io/Bi_Layout/

  42. arXiv:2404.04496  [pdf, other

    cs.SE

    Towards Better Graph Neural Network-based Fault Localization Through Enhanced Code Representation

    Authors: Md Nakhla Rafi, Dong Jae Kim, An Ran Chen, Tse-Hsun Chen, Shaowei Wang

    Abstract: Automatic software fault localization plays an important role in software quality assurance by pinpointing faulty locations for easier debugging. Coverage-based fault localization, a widely used technique, employs statistics on coverage spectra to rank code based on suspiciousness scores. However, the rigidity of statistical approaches calls for learning-based techniques. Amongst all, Grace, a gra… ▽ More

    Submitted 30 April, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

  43. arXiv:2404.02733  [pdf, other

    cs.CV

    InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation

    Authors: Haofan Wang, Matteo Spinelli, Qixun Wang, Xu Bai, Zekui Qin, Anthony Chen

    Abstract: Tuning-free diffusion-based models have demonstrated significant potential in the realm of image personalization and customization. However, despite this notable progress, current models continue to grapple with several complex challenges in producing style-consistent image generation. Firstly, the concept of style is inherently underdetermined, encompassing a multitude of elements such as color,… ▽ More

    Submitted 4 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Technical Report

  44. arXiv:2404.00095  [pdf, other

    cs.CV

    GDA: Generalized Diffusion for Robust Test-time Adaptation

    Authors: Yun-Yun Tsai, Fu-Chen Chen, Albert Y. C. Chen, Junfeng Yang, Che-Chun Su, Min Sun, Cheng-Hao Kuo

    Abstract: Machine learning models struggle with generalization when encountering out-of-distribution (OOD) samples with unexpected distribution shifts. For vision tasks, recent studies have shown that test-time adaptation employing diffusion models can achieve state-of-the-art accuracy improvements on OOD samples by generating new samples that align with the model's domain without the need to modify the mod… ▽ More

    Submitted 2 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

  45. arXiv:2403.19669  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Analyzing the Roles of Language and Vision in Learning from Limited Data

    Authors: Allison Chen, Ilia Sucholutsky, Olga Russakovsky, Thomas L. Griffiths

    Abstract: Does language help make sense of the visual world? How important is it to actually see the world rather than having it described with words? These basic questions about the nature of intelligence have been difficult to answer because we only had one example of an intelligent system -- humans -- and limited access to cases that isolated language or vision. However, the development of sophisticated… ▽ More

    Submitted 10 May, 2024; v1 submitted 15 February, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  46. arXiv:2403.19374  [pdf, other

    cs.ET eess.SY

    A noise-tolerant, resource-saving probabilistic binary neural network implemented by the SOT-MRAM compute-in-memory system

    Authors: Yu Gu, Puyang Huang, Tianhao Chen, Chenyi Fu, Aitian Chen, Shouzhong Peng, Xixiang Zhang, Xufeng Kou

    Abstract: We report a spin-orbit torque(SOT) magnetoresistive random-access memory(MRAM)-based probabilistic binary neural network(PBNN) for resource-saving and hardware noise-tolerant computing applications. With the presence of thermal fluctuation, the non-destructive SOT-driven magnetization switching characteristics lead to a random weight matrix with controllable probability distribution. In the meanwh… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 5 pages, 10 figures

    MSC Class: 94C60 ACM Class: B.2.4; B.3.0

  47. 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

    Authors: Binbin Huang, Zehao Yu, Anpei Chen, Andreas Geiger, Shenghua Gao

    Abstract: 3D Gaussian Splatting (3DGS) has recently revolutionized radiance field reconstruction, achieving high quality novel view synthesis and fast rendering speed without baking. However, 3DGS fails to accurately represent surfaces due to the multi-view inconsistent nature of 3D Gaussians. We present 2D Gaussian Splatting (2DGS), a novel approach to model and reconstruct geometrically accurate radiance… ▽ More

    Submitted 9 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 13 pages, 12 figures

  48. arXiv:2403.13147  [pdf, other

    cs.RO

    Meta-Learning for Fast Adaptation in Intent Inferral on a Robotic Hand Orthosis for Stroke

    Authors: Pedro Leandro La Rotta, Jingxi Xu, Ava Chen, Lauren Winterbottom, Wenxi Chen, Dawn Nilsen, Joel Stein, Matei Ciocarlie

    Abstract: We propose MetaEMG, a meta-learning approach for fast adaptation in intent inferral on a robotic hand orthosis for stroke. One key challenge in machine learning for assistive and rehabilitative robotics with disabled-bodied subjects is the difficulty of collecting labeled training data. Muscle tone and spasticity often vary significantly among stroke subjects, and hand function can even change acr… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  49. arXiv:2403.12982  [pdf

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Knowledge-Reuse Transfer Learning Methods in Molecular and Material Science

    Authors: An Chen, Zhilong Wang, Karl Luigi Loza Vidaurre, Yanqiang Han, Simin Ye, Kehao Tao, Shiwei Wang, Jing Gao, Jinjin Li

    Abstract: Molecules and materials are the foundation for the development of modern advanced industries such as energy storage systems and semiconductor devices. However, traditional trial-and-error methods or theoretical calculations are highly resource-intensive, and extremely long R&D (Research and Development) periods cannot meet the urgent need for molecules/materials in industrial development. Machine… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 42 pages, 10 figures

  50. arXiv:2403.10548  [pdf, other

    eess.AS cs.SD

    Two-sided Acoustic Metascreen for Broadband and Individual Reflection and Transmission Control

    Authors: Ao Chen, Xin Zhang

    Abstract: Acoustic wave modulation plays a pivotal role in various applications, including sound-field reconstruction, wireless communication, and particle manipulation, among others. However, current acoustic metamaterial and metasurface designs typically focus on controlling either reflection or transmission waves, often overlooking the coupling between amplitude and phase of acoustic waves. To fulfill th… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.