Showing 1–9 of 9 results for author: Hossain, S B

  1. arXiv:2405.18649

    cs.CL cs.AI cs.SE

    Training LLMs to Better Self-Debug and Explain Code

    Authors: Nan Jiang, Xiaopeng Li, Shiqi Wang, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma, Anoop Deoras

    Abstract: In the domain of code generation, self-debugging is crucial. It allows LLMs to refine their generated code based on execution feedback. This is particularly important because generating correct solutions in one attempt proves challenging for complex tasks. Prior works on self-debugging mostly focus on prompting methods by providing LLMs with few-shot examples, which work poorly on small open-sourc…

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2405.03786

    cs.SE

    TOGLL: Correct and Strong Test Oracle Generation with LLMs

    Authors: Soneya Binta Hossain, Matthew Dwyer

    Abstract: Test oracles play a crucial role in software testing, enabling effective bug detection. Despite initial promise, neural-based methods for automated test oracle generation often result in a large number of false positives and weaker test oracles. While LLMs have demonstrated impressive effectiveness in various software engineering tasks, including code generation, test case creation, and bug fixing…

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2404.11595

    cs.SE

    A Deep Dive into Large Language Models for Automated Bug Localization and Repair

    Authors: Soneya Binta Hossain, Nan Jiang, Qiang Zhou, Xiaopeng Li, Wen-Hao Chiang, Yingjun Lyu, Hoan Nguyen, Omer Tripp

    Abstract: Large language models (LLMs) have shown impressive effectiveness in various software engineering tasks, including automated program repair (APR). In this study, we take a deep dive into automated bug fixing utilizing LLMs. In contrast to many deep learning-based APR methods that assume known bug locations, rely on line-level localization tools, or address bug prediction and fixing in one step, our…

    Submitted 10 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  4. arXiv:2402.01208

    cs.LG cs.AI

    Location Agnostic Adaptive Rain Precipitation Prediction using Deep Learning

    Authors: Md Shazid Islam, Md Saydur Rahman, Md Saad Ul Haque, Farhana Akter Tumpa, Md Sanzid Bin Hossain, Abul Al Arabi

    Abstract: Rain precipitation prediction is a challenging task as it depends on weather and meteorological features which vary from location to location. As a result, a prediction model that performs well at one location does not perform well at other locations due to the distribution shifts. In addition, due to global warming, the weather patterns are changing very rapidly year by year which creates the pos…

    Submitted 2 February, 2024; originally announced February 2024.

  5. arXiv:2402.01206

    cs.LG

    Comparative Evaluation of Weather Forecasting using Machine Learning Models

    Authors: Md Saydur Rahman, Farhana Akter Tumpa, Md Shazid Islam, Abul Al Arabi, Md Sanzid Bin Hossain, Md Saad Ul Haque

    Abstract: Gaining a deeper understanding of weather and being able to predict its future conduct have always been considered important endeavors for the growth of our society. This research paper explores the advancements in understanding and predicting nature's behavior, particularly in the context of weather forecasting, through the application of machine learning algorithms. By leveraging the power of ma…

    Submitted 2 February, 2024; originally announced February 2024.

  6. arXiv:2308.00316

    cs.SE

    Artifact: Measuring and Mitigating Gaps in Structural Testing

    Authors: Soneya Binta Hossain, Matthew B. Dwyer, Sebastian Elbaum, Anh Nguyen-Tuong

    Abstract: The artifact used for evaluating the experimental results of Measuring and Mitigating Gaps in Structural Testing is publicly available on GitHub, Software Heritage and figshare, and is reusable. The artifact consists of necessary data, tools, scripts, and detailed documentation for running the experiments and reproducing the results shown in the paper. We have also provided a VirtualBox VM image a…

    Submitted 1 August, 2023; originally announced August 2023.

  7. Neural-Based Test Oracle Generation: A Large-scale Evaluation and Lessons Learned

    Authors: Soneya Binta Hossain, Antonio Filieri, Matthew B. Dwyer, Sebastian Elbaum, Willem Visser

    Abstract: Defining test oracles is crucial and central to test development, but manual construction of oracles is expensive. While recent neural-based automated test oracle generation techniques have shown promise, their real-world effectiveness remains a compelling question requiring further exploration and understanding. This paper investigates the effectiveness of TOGA, a recently developed neural-based…

    Submitted 25 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

  8. arXiv:2212.06118

    cs.SE

    A Brief Survey on Oracle-based Test Adequacy Metrics

    Authors: Soneya Binta Hossain, Matthew B. Dwyer

    Abstract: Code coverage is a popular and widespread test adequacy metric that measures the percentage of program codes executed by a test suite. Despite its popularity, code coverage has several limitations. One of the major limitations is that it does not provide any insights into the quality or quantity of test oracles, a core component of testing. Due to this limitation, several studies have suggested th…

    Submitted 15 February, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 7 pages

  9. arXiv:1902.01544

    cs.SD cs.LG eess.AS stat.ML

    An Ensemble SVM-based Approach for Voice Activity Detection

    Authors: Jayanta Dey, Md Sanzid Bin Hossain, Mohammad Ariful Haque

    Abstract: Voice activity detection (VAD), used as the front end of speech enhancement, speech and speaker recognition algorithms, determines the overall accuracy and efficiency of the algorithms. Therefore, a VAD with low complexity and high accuracy is highly desirable for speech processing applications. In this paper, we propose a novel training method on large dataset for supervised learning-based VAD sy…

    Submitted 4 February, 2019; originally announced February 2019.