Skip to main content

Showing 1–50 of 154 results for author: Islam, M S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.07220  [pdf, other

    cs.CV cs.AI cs.CY cs.HC

    Handwritten Code Recognition for Pen-and-Paper CS Education

    Authors: Md Sazzad Islam, Moussa Koulako Bala Doumbouya, Christopher D. Manning, Chris Piech

    Abstract: Teaching Computer Science (CS) by having students write programs by hand on paper has key pedagogical advantages: It allows focused learning and requires careful thinking compared to the use of Integrated Development Environments (IDEs) with intelligent support tools or "just trying things out". The familiar environment of pens and paper also lessens the cognitive load of students with no prior ex… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  2. arXiv:2408.05346  [pdf, other

    cs.CL

    DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts

    Authors: Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty

    Abstract: Data-driven storytelling is a powerful method for conveying insights by combining narrative techniques with visualizations and text. These stories integrate visual aids, such as highlighted bars and lines in charts, along with textual annotations explaining insights. However, creating such stories requires a deep understanding of the data and meticulous narrative planning, often necessitating huma… ▽ More

    Submitted 13 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  3. arXiv:2407.16721  [pdf

    q-bio.QM cs.LG

    Machine Learning Models for the Identification of Cardiovascular Diseases Using UK Biobank Data

    Authors: Sheikh Mohammed Shariful Islam, Moloud Abrar, Teketo Tegegne, Liliana Loranjo, Chandan Karmakar, Md Abdul Awal, Md. Shahadat Hossain, Muhammad Ashad Kabir, Mufti Mahmud, Abbas Khosravi, George Siopis, Jeban C Moses, Ralph Maddison

    Abstract: Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 19 pages, 3 figures

  4. arXiv:2407.07124  [pdf, other

    cs.DC cs.AI cs.LG

    FedClust: Tackling Data Heterogeneity in Federated Learning through Weight-Driven Client Clustering

    Authors: Md Sirajul Islam, Simin Javaherian, Fei Xu, Xu Yuan, Li Chen, Nian-Feng Tzeng

    Abstract: Federated learning (FL) is an emerging distributed machine learning paradigm that enables collaborative training of machine learning models over decentralized devices without exposing their local data. One of the major challenges in FL is the presence of uneven data distributions across client devices, violating the well-known assumption of independent-and-identically-distributed (IID) training sa… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  5. arXiv:2406.14856  [pdf, other

    cs.CV cs.HC cs.LG

    Accessible, At-Home Detection of Parkinson's Disease via Multi-task Video Analysis

    Authors: Md Saiful Islam, Tariq Adnan, Jan Freyberg, Sangwu Lee, Abdelrahman Abdelkader, Meghan Pawlik, Cathe Schwartz, Karen Jaffe, Ruth B. Schneider, E Ray Dorsey, Ehsan Hoque

    Abstract: Limited access to neurological care leads to missed diagnoses of Parkinson's disease (PD), leaving many individuals unidentified and untreated. We trained a novel neural network-based fusion architecture to detect Parkinson's disease (PD) by analyzing features extracted from webcam recordings of three tasks: finger tapping, facial expression (smiling), and speech (uttering a sentence containing al… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  6. arXiv:2406.12954  [pdf, other

    cs.CV cs.AI cs.LG

    Skin Cancer Images Classification using Transfer Learning Techniques

    Authors: Md Sirajul Islam, Sanjeev Panta

    Abstract: Skin cancer is one of the most common and deadliest types of cancer. Early diagnosis of skin cancer at a benign stage is critical to reducing cancer mortality. To detect skin cancer at an earlier stage an automated system is compulsory that can save the life of many patients. Many previous studies have addressed the problem of skin cancer diagnosis using various deep learning and transfer learning… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.08534  [pdf, ps, other

    cs.NE cs.AI

    Optimizing Container Loading and Unloading through Dual-Cycling and Dockyard Rehandle Reduction Using a Hybrid Genetic Algorithm

    Authors: Md. Mahfuzur Rahman, Md Abrar Jahin, Md. Saiful Islam, M. F. Mridha

    Abstract: This paper addresses the optimization of container unloading and loading operations at ports, integrating quay-crane dual-cycling with dockyard rehandle minimization. We present a unified model encompassing both operations: ship container unloading and loading by quay crane, and the other is reducing dockyard rehandles while loading the ship. We recognize that optimizing one aspect in isolation ca… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  8. arXiv:2406.07707  [pdf, other

    cs.CV cs.LG

    A Deep Learning Approach to Detect Complete Safety Equipment For Construction Workers Based On YOLOv7

    Authors: Md. Shariful Islam, SM Shaqib, Shahriar Sultan Ramit, Shahrun Akter Khushbu, Mr. Abdus Sattar, Dr. Sheak Rashed Haider Noori

    Abstract: In the construction sector, ensuring worker safety is of the utmost significance. In this study, a deep learning-based technique is presented for identifying safety gear worn by construction workers, such as helmets, goggles, jackets, gloves, and footwears. The recommended approach uses the YOLO v7 (You Only Look Once) object detection algorithm to precisely locate these safety items. The dataset… ▽ More

    Submitted 13 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  9. arXiv:2406.00257  [pdf, other

    cs.CL

    Are Large Vision Language Models up to the Challenge of Chart Comprehension and Reasoning? An Extensive Investigation into the Capabilities and Limitations of LVLMs

    Authors: Mohammed Saidul Islam, Raian Rahman, Ahmed Masry, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, Enamul Hoque

    Abstract: Natural language is a powerful complementary modality of communication for data visualizations, such as bar and line charts. To facilitate chart-based reasoning using natural language, various downstream tasks have been introduced recently such as chart question answering, chart summarization, and fact-checking with charts. These tasks pose a unique challenge, demanding both vision-language reason… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  10. arXiv:2405.17206  [pdf, other

    cs.SD cs.LG

    A Novel Fusion Architecture for PD Detection Using Semi-Supervised Speech Embeddings

    Authors: Tariq Adnan, Abdelrahman Abdelkader, Zipei Liu, Ekram Hossain, Sooyong Park, MD Saiful Islam, Ehsan Hoque

    Abstract: We present a framework to recognize Parkinson's disease (PD) through an English pangram utterance speech collected using a web application from diverse recording settings and environments, including participants' homes. Our dataset includes a global cohort of 1306 participants, including 392 diagnosed with PD. Leveraging the diversity of the dataset, spanning various demographic properties (such a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 25 pages, 5 figures, and 4 tables

  11. From CNNs to Transformers in Multimodal Human Action Recognition: A Survey

    Authors: Muhammad Bilal Shaikh, Syed Mohammed Shamsul Islam, Douglas Chai, Naveed Akhtar

    Abstract: Due to its widespread applications, human action recognition is one of the most widely studied research problems in Computer Vision. Recent studies have shown that addressing it using multimodal data leads to superior performance as compared to relying on a single data modality. During the adoption of deep learning for visual modelling in the last decade, action recognition approaches have mainly… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 23 pages, 5 figures and 3 Tables. To appear in ACM Trans. Multimedia Comput. Commun. Appl.(TOMM) 2024

    ACM Class: A.1; I.2.10

  12. arXiv:2405.13219  [pdf, other

    cs.AI cs.CL

    How Reliable AI Chatbots are for Disease Prediction from Patient Complaints?

    Authors: Ayesha Siddika Nipu, K M Sajjadul Islam, Praveen Madiraju

    Abstract: Artificial Intelligence (AI) chatbots leveraging Large Language Models (LLMs) are gaining traction in healthcare for their potential to automate patient interactions and aid clinical decision-making. This study examines the reliability of AI chatbots, specifically GPT 4.0, Claude 3 Opus, and Gemini Ultra 1.0, in predicting diseases from patient complaints in the emergency department. The methodolo… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 24th IEEE International Conference on Information Reuse and Integration (IEEE IRI 2024), San Jose, CA, USA

  13. arXiv:2405.11188  [pdf, other

    cs.LG

    Wind Power Prediction across Different Locations using Deep Domain Adaptive Learning

    Authors: Md Saiful Islam Sajol, Md Shazid Islam, A S M Jahid Hasan, Md Saydur Rahman, Jubair Yusuf

    Abstract: Accurate prediction of wind power is essential for the grid integration of this intermittent renewable source and aiding grid planners in forecasting available wind capacity. Spatial differences lead to discrepancies in climatological data distributions between two geographically dispersed regions, consequently making the prediction task more difficult. Thus, a prediction model that learns from th… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  14. Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT

    Authors: Aisha Khatun, Anisur Rahman, Md Saiful Islam, Hemayet Ahmed Chowdhury, Ayesha Tasnim

    Abstract: Authorship Attribution is the task of creating an appropriate characterization of text that captures the authors' writing style to identify the original author of a given piece of text. With increased anonymity on the internet, this task has become increasingly crucial in various security and plagiarism detection fields. Despite significant advancements in other languages such as English, Spanish,… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted in ACM TALLIP August 2022

  15. arXiv:2403.04144  [pdf, other

    cs.DC cs.LG

    FedClust: Optimizing Federated Learning on Non-IID Data through Weight-Driven Client Clustering

    Authors: Md Sirajul Islam, Simin Javaherian, Fei Xu, Xu Yuan, Li Chen, Nian-Feng Tzeng

    Abstract: Federated learning (FL) is an emerging distributed machine learning paradigm enabling collaborative model training on decentralized devices without exposing their local data. A key challenge in FL is the uneven data distribution across client devices, violating the well-known assumption of independent-and-identically-distributed (IID) training samples in conventional machine learning. Clustered fe… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  16. arXiv:2402.18600  [pdf

    eess.IV cs.AI q-bio.TO

    Artificial Intelligence and Diabetes Mellitus: An Inside Look Through the Retina

    Authors: Yasin Sadeghi Bazargani, Majid Mirzaei, Navid Sobhi, Mirsaeed Abdollahi, Ali Jafarizadeh, Siamak Pedrammehr, Roohallah Alizadehsani, Ru San Tan, Sheikh Mohammed Shariful Islam, U. Rajendra Acharya

    Abstract: Diabetes mellitus (DM) predisposes patients to vascular complications. Retinal images and vasculature reflect the body's micro- and macrovascular health. They can be used to diagnose DM complications, including diabetic retinopathy (DR), neuropathy, nephropathy, and atherosclerotic cardiovascular disease, as well as forecast the risk of cardiovascular events. Artificial intelligence (AI)-enabled s… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 44 Pages, 6 figures, 1 table, 166 references

    ACM Class: J.3.2; J.3.3

  17. arXiv:2402.09975  [pdf

    eess.IV cs.CV

    Current and future roles of artificial intelligence in retinopathy of prematurity

    Authors: Ali Jafarizadeh, Shadi Farabi Maleki, Parnia Pouya, Navid Sobhi, Mirsaeed Abdollahi, Siamak Pedrammehr, Chee Peng Lim, Houshyar Asadi, Roohallah Alizadehsani, Ru-San Tan, Sheikh Mohammad Shariful Islam, U. Rajendra Acharya

    Abstract: Retinopathy of prematurity (ROP) is a severe condition affecting premature infants, leading to abnormal retinal blood vessel growth, retinal detachment, and potential blindness. While semi-automated systems have been used in the past to diagnose ROP-related plus disease by quantifying retinal vessel features, traditional machine learning (ML) models face challenges like accuracy and overfitting. R… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 28 pages, 8 figures, 2 tables, 235 references, 1 supplementary table

    ACM Class: J.3.2; J.3.3

  18. arXiv:2402.01804  [pdf, other

    cs.CY cs.AI

    Analysis of Internet of Things Implementation Barriers in the Cold Supply Chain: An Integrated ISM-MICMAC and DEMATEL Approach

    Authors: Kazrin Ahmad, Md. Saiful Islam, Md Abrar Jahin, M. F. Mridha

    Abstract: Integrating Internet of Things (IoT) technology inside the cold supply chain can enhance transparency, efficiency, and quality, optimizing operating procedures and increasing productivity. The integration of IoT in this complicated setting is hindered by specific barriers that need a thorough examination. Prominent barriers to IoT implementation in the cold supply chain are identified using a two-… ▽ More

    Submitted 27 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  19. arXiv:2402.01208  [pdf, other

    cs.LG cs.AI

    Location Agnostic Adaptive Rain Precipitation Prediction using Deep Learning

    Authors: Md Shazid Islam, Md Saydur Rahman, Md Saad Ul Haque, Farhana Akter Tumpa, Md Sanzid Bin Hossain, Abul Al Arabi

    Abstract: Rain precipitation prediction is a challenging task as it depends on weather and meteorological features which vary from location to location. As a result, a prediction model that performs well at one location does not perform well at other locations due to the distribution shifts. In addition, due to global warming, the weather patterns are changing very rapidly year by year which creates the pos… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  20. arXiv:2402.01206  [pdf, other

    cs.LG

    Comparative Evaluation of Weather Forecasting using Machine Learning Models

    Authors: Md Saydur Rahman, Farhana Akter Tumpa, Md Shazid Islam, Abul Al Arabi, Md Sanzid Bin Hossain, Md Saad Ul Haque

    Abstract: Gaining a deeper understanding of weather and being able to predict its future conduct have always been considered important endeavors for the growth of our society. This research paper explores the advancements in understanding and predicting nature's behavior, particularly in the context of weather forecasting, through the application of machine learning algorithms. By leveraging the power of ma… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  21. arXiv:2401.17574  [pdf, other

    cs.CL cs.LG

    Scavenging Hyena: Distilling Transformers into Long Convolution Models

    Authors: Tokiniaina Raharison Ralambomihanta, Shahrad Mohammadzadeh, Mohammad Sami Nur Islam, Wassim Jabbour, Laurence Liang

    Abstract: The rapid evolution of Large Language Models (LLMs), epitomized by architectures like GPT-4, has reshaped the landscape of natural language processing. This paper introduces a pioneering approach to address the efficiency concerns associated with LLM pre-training, proposing the use of knowledge distillation for cross-architecture transfer. Leveraging insights from the efficient Hyena mechanism, ou… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures

  22. arXiv:2401.16350  [pdf, other

    cs.LG cs.AI cs.CY cs.DC

    FedFair^3: Unlocking Threefold Fairness in Federated Learning

    Authors: Simin Javaherian, Sanjeev Panta, Shelby Williams, Md Sirajul Islam, Li Chen

    Abstract: Federated Learning (FL) is an emerging paradigm in machine learning without exposing clients' raw data. In practical scenarios with numerous clients, encouraging fair and efficient client participation in federated learning is of utmost importance, which is also challenging given the heterogeneity in data distribution and device properties. Existing works have proposed different client-selection m… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  23. arXiv:2401.15299  [pdf, other

    cs.LG cs.AI cs.IR eess.SY stat.AP

    SupplyGraph: A Benchmark Dataset for Supply Chain Planning using Graph Neural Networks

    Authors: Azmine Toushik Wasi, MD Shafikul Islam, Adipto Raihan Akib

    Abstract: Graph Neural Networks (GNNs) have gained traction across different domains such as transportation, bio-informatics, language processing, and computer vision. However, there is a noticeable absence of research on applying GNNs to supply chain networks. Supply chain networks are inherently graph-like in structure, making them prime candidates for applying GNN methodologies. This opens up a world of… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 9 pages, 8 figures; Accepted to 4th workshop on Graphs and more Complex structures for Learning and Reasoning, colocated with AAAI 2024

    ACM Class: I.2.1; I.2.8; E.0; J.2; H.3.7

  24. arXiv:2401.14422  [pdf, other

    cs.LG

    Location Agnostic Source-Free Domain Adaptive Learning to Predict Solar Power Generation

    Authors: Md Shazid Islam, A S M Jahid Hasan, Md Saydur Rahman, Jubair Yusuf, Md Saiful Islam Sajol, Farhana Akter Tumpa

    Abstract: The prediction of solar power generation is a challenging task due to its dependence on climatic characteristics that exhibit spatial and temporal variability. The performance of a prediction model may vary across different places due to changes in data distribution, resulting in a model that works well in one region but not in others. Furthermore, as a consequence of global warming, there is a no… ▽ More

    Submitted 6 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  25. arXiv:2401.06088  [pdf, other

    cs.CL cs.AI cs.LG

    Autocompletion of Chief Complaints in the Electronic Health Records using Large Language Models

    Authors: K M Sajjadul Islam, Ayesha Siddika Nipu, Praveen Madiraju, Priya Deshpande

    Abstract: The Chief Complaint (CC) is a crucial component of a patient's medical record as it describes the main reason or concern for seeking medical care. It provides critical information for healthcare providers to make informed decisions about patient care. However, documenting CCs can be time-consuming for healthcare providers, especially in busy emergency departments. To address this issue, an autocom… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: IEEE BigData 2023 - Sorrento, Italy. 10 Pages, 4 Figures, 5 Tables

  26. arXiv:2312.15160  [pdf, other

    cs.AI cs.HC cs.LG cs.MA

    Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning

    Authors: Md Saiful Islam, Srijita Das, Sai Krishna Gottipati, William Duguay, Clodéric Mars, Jalal Arabneydi, Antoine Fagette, Matthew Guzdial, Matthew-E-Taylor

    Abstract: Recent advances in reinforcement learning (RL) and Human-in-the-Loop (HitL) learning have made human-AI collaboration easier for humans to team with AI agents. Leveraging human expertise and experience with AI in intelligent systems can be efficient and beneficial. Still, it is unclear to what extent human-AI collaboration will be successful, and how such teaming performs compared to humans or AI… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Submitted to Neural Computing and Applications

  27. An Explainable Machine Learning Framework for the Accurate Diagnosis of Ovarian Cancer

    Authors: Asif Newaz, Abdullah Taharat, Md Sakibul Islam, A. G. M. Fuad Hasan Akanda

    Abstract: Ovarian cancer (OC) is one of the most prevalent types of cancer in women. Early and accurate diagnosis is crucial for the survival of the patients. However, the majority of women are diagnosed in advanced stages due to the lack of effective biomarkers and accurate screening tools. While previous studies sought a common biomarker, our study suggests different biomarkers for the premenopausal and p… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  28. arXiv:2312.05780  [pdf, other

    cs.CV cs.LG

    PULSAR: Graph based Positive Unlabeled Learning with Multi Stream Adaptive Convolutions for Parkinson's Disease Recognition

    Authors: Md. Zarif Ul Alam, Md Saiful Islam, Ehsan Hoque, M Saifur Rahman

    Abstract: Parkinson's disease (PD) is a neuro-degenerative disorder that affects movement, speech, and coordination. Timely diagnosis and treatment can improve the quality of life for PD patients. However, access to clinical diagnosis is limited in low and middle income countries (LMICs). Therefore, development of automated screening tools for PD can have a huge social impact, particularly in the public hea… ▽ More

    Submitted 16 February, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  29. arXiv:2312.05407  [pdf, other

    cs.CV

    Active Learning Guided Federated Online Adaptation: Applications in Medical Image Segmentation

    Authors: Md Shazid Islam, Sayak Nag, Arindam Dutta, Miraj Ahmed, Fahim Faisal Niloy, Amit K. Roy-Chowdhury

    Abstract: Data privacy, storage, and distribution shifts are major bottlenecks in medical image analysis. Data cannot be shared across patients, physicians, and facilities due to privacy concerns, usually requiring each patient's data to be analyzed in a discreet setting at a near real-time pace. However, one would like to take advantage of the accumulated knowledge across healthcare facilities as the compu… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  30. arXiv:2311.12654  [pdf, other

    cs.HC cs.AI

    PARK: Parkinson's Analysis with Remote Kinetic-tasks

    Authors: Md Saiful Islam, Sangwu Lee, Abdelrahman Abdelkader, Sooyong Park, Ehsan Hoque

    Abstract: We present a web-based framework to screen for Parkinson's disease (PD) by allowing users to perform neurological tests in their homes. Our web framework guides the users to complete three tasks involving speech, facial expression, and finger movements. The task videos are analyzed to classify whether the users show signs of PD. We present the results in an easy-to-understand manner, along with pe… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  31. arXiv:2311.00983  [pdf, other

    cs.LG cs.AI eess.SY math.OC stat.ML

    Optimizing Inventory Routing: A Decision-Focused Learning Approach using Neural Networks

    Authors: MD Shafikul Islam, Azmine Toushik Wasi

    Abstract: Inventory Routing Problem (IRP) is a crucial challenge in supply chain management as it involves optimizing efficient route selection while considering the uncertainty of inventory demand planning. To solve IRPs, usually a two-stage approach is employed, where demand is predicted using machine learning techniques first, and then an optimization algorithm is used to minimize routing costs. Our expe… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 3 Pages, 2 figures, New in ML Workshop at NeurIPS 2023. Openreview forum: https://openreview.net/forum?id=r0fzjB8f7f&

    Journal ref: New in Machine Learning Workshop, NeurIPS 2023

  32. arXiv:2310.06848  [pdf, other

    cs.CV cs.AI cs.LG

    DeepTriNet: A Tri-Level Attention Based DeepLabv3+ Architecture for Semantic Segmentation of Satellite Images

    Authors: Tareque Bashar Ovi, Shakil Mosharrof, Nomaiya Bashree, Md Shofiqul Islam, Muhammad Nazrul Islam

    Abstract: The segmentation of satellite images is crucial in remote sensing applications. Existing methods face challenges in recognizing small-scale objects in satellite images for semantic segmentation primarily due to ignoring the low-level characteristics of the underlying network and due to containing distinct amounts of information by different feature maps. Thus, in this research, a tri-level attenti… ▽ More

    Submitted 5 September, 2023; originally announced October 2023.

    Comments: Keywords: Attention mechanism, Deep learning, Satellite image, DeepLabv3+, Segmentation

  33. DANet: Enhancing Small Object Detection through an Efficient Deformable Attention Network

    Authors: Md Sohag Mia, Abdullah Al Bary Voban, Abu Bakor Hayat Arnob, Abdu Naim, Md Kawsar Ahmed, Md Shariful Islam

    Abstract: Efficient and accurate detection of small objects in manufacturing settings, such as defects and cracks, is crucial for ensuring product quality and safety. To address this issue, we proposed a comprehensive strategy by synergizing Faster R-CNN with cutting-edge methods. By combining Faster R-CNN with Feature Pyramid Network, we enable the model to efficiently handle multi-scale features intrinsic… ▽ More

    Submitted 13 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: ICCD-23

    Report number: 10.1109/ICCD59681.2023

    Journal ref: International Conference on the Cognitive Computing and Complex Data (ICCD) 2023

  34. ViTs are Everywhere: A Comprehensive Study Showcasing Vision Transformers in Different Domain

    Authors: Md Sohag Mia, Abu Bakor Hayat Arnob, Abdu Naim, Abdullah Al Bary Voban, Md Shariful Islam

    Abstract: Transformer design is the de facto standard for natural language processing tasks. The success of the transformer design in natural language processing has lately piqued the interest of researchers in the domain of computer vision. When compared to Convolutional Neural Networks (CNNs), Vision Transformers (ViTs) are becoming more popular and dominant solutions for many vision problems. Transformer… ▽ More

    Submitted 13 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: ICCD-2023. arXiv admin note: substantial text overlap with arXiv:2208.04309 by other authors

    Journal ref: International Conference on the Cognitive Computing and Complex Data (ICCD) 2023

  35. arXiv:2309.15132  [pdf, other

    q-bio.QM cs.LG

    Genetic InfoMax: Exploring Mutual Information Maximization in High-Dimensional Imaging Genetics Studies

    Authors: Yaochen Xie, Ziqian Xie, Sheikh Muhammad Saiful Islam, Degui Zhi, Shuiwang Ji

    Abstract: Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits. When applied to high-dimensional medical imaging data, a key step is to extract lower-dimensional, yet informative representations of the data as traits. Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS in compari… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 17 pages, 7 figures

  36. arXiv:2309.13173  [pdf, other

    cs.CL

    BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP

    Authors: Mohsinul Kabir, Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, M Saiful Bari, Enamul Hoque

    Abstract: Large Language Models (LLMs) have emerged as one of the most important breakthroughs in NLP for their impressive skills in language generation and other language-specific tasks. Though LLMs have been evaluated in various tasks, mostly in English, they have not yet undergone thorough evaluation in under-resourced languages such as Bengali (Bangla). To this end, this paper introduces BenLLM-Eval, wh… ▽ More

    Submitted 19 March, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: Accepted by LREC-COLING 2024. The first two authors contributed equally

  37. arXiv:2309.12350  [pdf, other

    cs.HC

    Exploring Internet of Things Adoption Challenges in Manufacturing Firms: A Delphi Fuzzy Analytical Hierarchy Process Approach

    Authors: Hasan Shahriar, Md. Saiful Islam, Md Abrar Jahin, Istiyaque Ahmed Ridoy, Raihan Rafi Prottoy, Adiba Abid, M. F. Mridha

    Abstract: Innovation is crucial for sustainable success in today's fiercely competitive global manufacturing landscape. Bangladesh's manufacturing sector must embrace transformative technologies like the Internet of Things (IoT) to thrive in this environment. This article addresses the vital task of identifying and evaluating barriers to IoT adoption in Bangladesh's manufacturing industry. Through synthesiz… ▽ More

    Submitted 11 December, 2023; v1 submitted 30 August, 2023; originally announced September 2023.

  38. arXiv:2309.11157  [pdf, other

    cs.CV

    Learning Deformable 3D Graph Similarity to Track Plant Cells in Unregistered Time Lapse Images

    Authors: Md Shazid Islam, Arindam Dutta, Calvin-Khang Ta, Kevin Rodriguez, Christian Michael, Mark Alber, G. Venugopala Reddy, Amit K. Roy-Chowdhury

    Abstract: Tracking of plant cells in images obtained by microscope is a challenging problem due to biological phenomena such as large number of cells, non-uniform growth of different layers of the tightly packed plant cells and cell division. Moreover, images in deeper layers of the tissue being noisy and unavoidable systemic errors inherent in the imaging process further complicates the problem. In this pa… ▽ More

    Submitted 21 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  39. arXiv:2308.03741  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    MAiVAR-T: Multimodal Audio-image and Video Action Recognizer using Transformers

    Authors: Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar

    Abstract: In line with the human capacity to perceive the world by simultaneously processing and integrating high-dimensional inputs from multiple modalities like vision and audio, we propose a novel model, MAiVAR-T (Multimodal Audio-Image to Video Action Recognition Transformer). This model employs an intuitive approach for the combination of audio-image and video modalities, with a primary aim to escalate… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 6 pages, 7 figures, 4 tables, Peer reviewed, Accepted @ The 11th European Workshop on Visual Information Processing (EUVIP) will be held on 11th-14th September 2023, in Gjøvik, Norway. arXiv admin note: text overlap with arXiv:2103.15691 by other authors

  40. arXiv:2308.02588  [pdf, other

    eess.IV cs.CV cs.LG

    Unmasking Parkinson's Disease with Smile: An AI-enabled Screening Framework

    Authors: Tariq Adnan, Md Saiful Islam, Wasifur Rahman, Sangwu Lee, Sutapa Dey Tithi, Kazi Noshin, Imran Sarker, M Saifur Rahman, Ehsan Hoque

    Abstract: Parkinson's disease (PD) diagnosis remains challenging due to lacking a reliable biomarker and limited access to clinical care. In this study, we present an analysis of the largest video dataset containing micro-expressions to screen for PD. We collected 3,871 videos from 1,059 unique participants, including 256 self-reported PD patients. The recordings are from diverse sources encompassing partic… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  41. arXiv:2307.12906  [pdf, other

    cs.LG cs.AI quant-ph

    QAmplifyNet: Pushing the Boundaries of Supply Chain Backorder Prediction Using Interpretable Hybrid Quantum-Classical Neural Network

    Authors: Md Abrar Jahin, Md Sakib Hossain Shovon, Md. Saiful Islam, Jungpil Shin, M. F. Mridha, Yuichi Okuyama

    Abstract: Supply chain management relies on accurate backorder prediction for optimizing inventory control, reducing costs, and enhancing customer satisfaction. However, traditional machine-learning models struggle with large-scale datasets and complex relationships, hindering real-world data collection. This research introduces a novel methodological framework for supply chain backorder prediction, address… ▽ More

    Submitted 15 October, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  42. arXiv:2306.06452  [pdf, other

    cs.CY cs.AI

    BlockTheFall: Wearable Device-based Fall Detection Framework Powered by Machine Learning and Blockchain for Elderly Care

    Authors: Bilash Saha, Md Saiful Islam, Abm Kamrul Riad, Sharaban Tahora, Hossain Shahriar, Sweta Sneha

    Abstract: Falls among the elderly are a major health concern, frequently resulting in serious injuries and a reduced quality of life. In this paper, we propose "BlockTheFall," a wearable device-based fall detection framework which detects falls in real time by using sensor data from wearable devices. To accurately identify patterns and detect falls, the collected sensor data is analyzed using machine learni… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted to publish in The 1st IEEE International Workshop on Digital and Public Health

  43. arXiv:2305.10698  [pdf

    cs.IR cs.CY cs.LG

    Ranking the locations and predicting future crime occurrence by retrieving news from different Bangla online newspapers

    Authors: Jumman Hossain, Rajib Chandra Das, Md. Ruhul Amin, Md. Saiful Islam

    Abstract: There have thousands of crimes are happening daily all around. But people keep statistics only few of them, therefore crime rates are increasing day by day. The reason behind can be less concern or less statistics of previous crimes. It is much more important to observe the previous crime statistics for general people to make their outing decision and police for catching the criminals are taking s… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 9 pages

  44. arXiv:2305.01484  [pdf, other

    cs.ET

    Powering Disturb-Free Reconfigurable Computing and Tunable Analog Electronics with Dual-Port Ferroelectric FET

    Authors: Zijian Zhao, Shan Deng, Swetaki Chatterjee, Zhouhang Jiang, Muhammad Shaffatul Islam, Yi Xiao, Yixin Xu, Scott Meninger, Mohamed Mohamed, Rajiv Joshi, Yogesh Singh Chauhan, Halid Mulaosmanovic, Stefan Duenkel, Dominik Kleimaier, Sven Beyer, Hussam Amrouch, Vijaykrishnan Narayanan, Kai Ni

    Abstract: Single-port ferroelectric FET (FeFET) that performs write and read operations on the same electrical gate prevents its wide application in tunable analog electronics and suffers from read disturb, especially to the high-threshold voltage (VTH) state as the retention energy barrier is reduced by the applied read bias. To address both issues, we propose to adopt a read disturb-free dual-port FeFET w… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 32 pages

  45. arXiv:2305.00640  [pdf, other

    cs.CV cs.LG physics.geo-ph

    Inferring the past: a combined CNN-LSTM deep learning framework to fuse satellites for historical inundation mapping

    Authors: Jonathan Giezendanner, Rohit Mukherjee, Matthew Purri, Mitchell Thomas, Max Mauerman, A. K. M. Saiful Islam, Beth Tellman

    Abstract: Mapping floods using satellite data is crucial for managing and mitigating flood risks. Satellite imagery enables rapid and accurate analysis of large areas, providing critical information for emergency response and disaster management. Historical flood data derived from satellite imagery can inform long-term planning, risk management strategies, and insurance-related decisions. The Sentinel-1 sat… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: CVPR 2023: Earthvision Workshop

  46. arXiv:2303.17573  [pdf, other

    cs.LG cs.AI

    Using AI to Measure Parkinson's Disease Severity at Home

    Authors: Md Saiful Islam, Wasifur Rahman, Abdelrahman Abdelkader, Phillip T. Yang, Sangwu Lee, Jamie L. Adams, Ruth B. Schneider, E. Ray Dorsey, Ehsan Hoque

    Abstract: We present an artificial intelligence system to remotely assess the motor performance of individuals with Parkinson's disease (PD). Participants performed a motor task (i.e., tapping fingers) in front of a webcam, and data from 250 global participants were rated by three expert neurologists following the Movement Disorder Society Unified Parkinson's Disease Rating Scale (MDS-UPDRS). The neurologis… ▽ More

    Submitted 17 August, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

  47. arXiv:2303.15430  [pdf, other

    cs.CL cs.LG

    TextMI: Textualize Multimodal Information for Integrating Non-verbal Cues in Pre-trained Language Models

    Authors: Md Kamrul Hasan, Md Saiful Islam, Sangwu Lee, Wasifur Rahman, Iftekhar Naim, Mohammed Ibrahim Khan, Ehsan Hoque

    Abstract: Pre-trained large language models have recently achieved ground-breaking performance in a wide variety of language understanding tasks. However, the same model can not be applied to multimodal behavior understanding tasks (e.g., video sentiment/humor detection) unless non-verbal features (e.g., acoustic and visual) can be integrated with language. Jointly modeling multiple modalities significantly… ▽ More

    Submitted 29 March, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  48. HARDC : A novel ECG-based heartbeat classification method to detect arrhythmia using hierarchical attention based dual structured RNN with dilated CNN

    Authors: Md Shofiqul Islam, Khondokar Fida Hasan, Sunjida Sultana, Shahadat Uddin, Pietro Lio, Julian M. W. Quinn, Mohammad Ali Moni

    Abstract: In this paper have developed a novel hybrid hierarchical attention-based bidirectional recurrent neural network with dilated CNN (HARDC) method for arrhythmia classification. This solves problems that arise when traditional dilated convolutional neural network (CNN) models disregard the correlation between contexts and gradient dispersion. The proposed HARDC fully exploits the dilated CNN and bidi… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 23 pages

  49. arXiv:2302.11559  [pdf, other

    cs.CV

    Word level Bangla Sign Language Dataset for Continuous BSL Recognition

    Authors: Md Shamimul Islam, A. J. M. Akhtarujjaman Joha, Md Nur Hossain, Sohaib Abdullah, Ibrahim Elwarfalli, Md Mahedi Hasan

    Abstract: An robust sign language recognition system can greatly alleviate communication barriers, particularly for people who struggle with verbal communication. This is crucial for human growth and progress as it enables the expression of thoughts, feelings, and ideas. However, sign recognition is a complex task that faces numerous challenges such as same gesture patterns for multiple signs, lighting, clo… ▽ More

    Submitted 9 April, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

  50. arXiv:2301.08888  [pdf, other

    eess.IV cs.CV cs.LG

    Pre-text Representation Transfer for Deep Learning with Limited Imbalanced Data : Application to CT-based COVID-19 Detection

    Authors: Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar

    Abstract: Annotating medical images for disease detection is often tedious and expensive. Moreover, the available training samples for a given task are generally scarce and imbalanced. These conditions are not conducive for learning effective deep neural models. Hence, it is common to 'transfer' neural networks trained on natural images to the medical image domain. However, this paradigm lacks in performanc… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: Best paper at IVCNZ