Skip to main content

Showing 1–14 of 14 results for author: Amin, M R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.16638  [pdf, other

    cs.CL cs.AI

    Breaking Free Transformer Models: Task-specific Context Attribution Promises Improved Generalizability Without Fine-tuning Pre-trained LLMs

    Authors: Stepan Tytarenko, Mohammad Ruhul Amin

    Abstract: Fine-tuning large pre-trained language models (LLMs) on particular datasets is a commonly employed strategy in Natural Language Processing (NLP) classification tasks. However, this approach usually results in a loss of models generalizability. In this paper, we present a framework that allows for maintaining generalizability, and enhances the performance on the downstream task by utilizing task-sp… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages, 3 figures, 5 tables, To be published in 2024 AAAI workshop on Responsible Language Models (ReLM)

    ACM Class: I.2.7; I.2.4

  2. arXiv:2311.03078  [pdf

    cs.CL

    BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer

    Authors: Sadia Afrin, Md. Shahad Mahmud Chowdhury, Md. Ekramul Islam, Faisal Ahamed Khan, Labib Imam Chowdhury, MD. Motahar Mahtab, Nazifa Nuha Chowdhury, Massud Forkan, Neelima Kundu, Hakim Arif, Mohammad Mamun Or Rashid, Mohammad Ruhul Amin, Nabeel Mohammed

    Abstract: Lemmatization holds significance in both natural language processing (NLP) and linguistics, as it effectively decreases data density and aids in comprehending contextual meaning. However, due to the highly inflected nature and morphological richness, lemmatization in Bangla text poses a complex challenge. In this study, we propose linguistic rules for lemmatization and utilize a dictionary along w… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  3. SentiGOLD: A Large Bangla Gold Standard Multi-Domain Sentiment Analysis Dataset and its Evaluation

    Authors: Md. Ekramul Islam, Labib Chowdhury, Faisal Ahamed Khan, Shazzad Hossain, Sourave Hossain, Mohammad Mamun Or Rashid, Nabeel Mohammed, Mohammad Ruhul Amin

    Abstract: This study introduces SentiGOLD, a Bangla multi-domain sentiment analysis dataset. Comprising 70,000 samples, it was created from diverse sources and annotated by a gender-balanced team of linguists. SentiGOLD adheres to established linguistic conventions agreed upon by the Government of Bangladesh and a Bangla linguistics committee. Unlike English and other languages, Bangla lacks standard sentim… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted in KDD 2023 Applied Data Science Track; 12 pages, 14 figures

  4. arXiv:2305.10698  [pdf

    cs.IR cs.CY cs.LG

    Ranking the locations and predicting future crime occurrence by retrieving news from different Bangla online newspapers

    Authors: Jumman Hossain, Rajib Chandra Das, Md. Ruhul Amin, Md. Saiful Islam

    Abstract: There have thousands of crimes are happening daily all around. But people keep statistics only few of them, therefore crime rates are increasing day by day. The reason behind can be less concern or less statistics of previous crimes. It is much more important to observe the previous crime statistics for general people to make their outing decision and police for catching the criminals are taking s… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 9 pages

  5. arXiv:2210.07286  [pdf, other

    cs.HC

    Augmenting Online Classes with an Attention Tracking Tool May Improve Student Engagement

    Authors: Arnab Sen Sharma, Mohammad Ruhul Amin, Muztaba Fuad

    Abstract: Online remote learning has certain advantages, such as higher flexibility and greater inclusiveness. However, a caveat is the teachers' limited ability to monitor student interaction during an online class, especially while teachers are sharing their screens. We have taken feedback from 12 teachers experienced in teaching undergraduate-level online classes on the necessity of an attention tracking… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 18 pages, 10 figures,

  6. arXiv:2206.00372  [pdf

    cs.CL

    BD-SHS: A Benchmark Dataset for Learning to Detect Online Bangla Hate Speech in Different Social Contexts

    Authors: Nauros Romim, Mosahed Ahmed, Md. Saiful Islam, Arnab Sen Sharma, Hriteshwar Talukder, Mohammad Ruhul Amin

    Abstract: Social media platforms and online streaming services have spawned a new breed of Hate Speech (HS). Due to the massive amount of user-generated content on these sites, modern machine learning techniques are found to be feasible and cost-effective to tackle this problem. However, linguistically diverse datasets covering different social contexts in which offensive language is typically used are requ… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  7. arXiv:2112.04298  [pdf, other

    cs.CV cs.LG

    GCA-Net : Utilizing Gated Context Attention for Improving Image Forgery Localization and Detection

    Authors: Sowmen Das, Md. Saiful Islam, Md. Ruhul Amin

    Abstract: Forensic analysis of manipulated pixels requires the identification of various hidden and subtle features from images. Conventional image recognition models generally fail at this task because they are biased and more attentive toward the dominant local and spatial features. In this paper, we propose a novel Gated Context Attention Network (GCA-Net) that utilizes non-local attention in conjunction… ▽ More

    Submitted 7 April, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at the CVPR 2022 Media Forensics Workshop

  8. arXiv:2112.01902  [pdf, other

    cs.CL

    HS-BAN: A Benchmark Dataset of Social Media Comments for Hate Speech Detection in Bangla

    Authors: Nauros Romim, Mosahed Ahmed, Md Saiful Islam, Arnab Sen Sharma, Hriteshwar Talukder, Mohammad Ruhul Amin

    Abstract: In this paper, we present HS-BAN, a binary class hate speech (HS) dataset in Bangla language consisting of more than 50,000 labeled comments, including 40.17% hate and rest are non hate speech. While preparing the dataset a strict and detailed annotation guideline was followed to reduce human annotation bias. The HS dataset was also preprocessed linguistically to extract different types of slang c… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: Submitted to ICON 21 (Rejected)

  9. arXiv:2110.05906  [pdf, other

    cs.NI eess.SP

    Energy-cost aware off-grid base stations with IoT devices for developing a green heterogeneous network

    Authors: Khondoker Ziaul Islam, MD. Sanwar Hossain, B. M. Ruhul Amin, Ferdous Sohel

    Abstract: Heterogeneous network (HetNet) is a specified cellular platform to tackle the rapidly growing anticipated data traffic. From communications perspective, data loads can be mapped to energy loads that are generally placed on the operator networks. Meanwhile, renewable energy aided networks offer to curtail fossil fuel consumption, so to reduce environmental pollution. This paper proposes a renewable… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

  10. arXiv:2107.14095  [pdf, other

    cs.CY

    Exploring the Scope and Potential of Local Newspaper-based Dengue Surveillance in Bangladesh

    Authors: Nazia Tasnim, Md. Istiak Hossain Shihab, Moqsadur Rahman, Sheikh Rabiul Islam, Mohammad Ruhul Amin

    Abstract: Dengue fever has been considered to be one of the global public health problems of the twenty-first century, especially in tropical and subtropical countries of the global south. The high morbidity and mortality rates of Dengue fever impose a huge economic and health burden for middle and low-income countries. It is so prevalent in such regions that enforcing a granular level of surveillance is qu… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 5 Pages, Joint KDD 2021 Health Day and 2021 KDD Workshop on Applied Data Science for Healthcare

  11. arXiv:2102.09603  [pdf, other

    cs.CV cs.AI cs.LG

    Towards Solving the DeepFake Problem : An Analysis on Improving DeepFake Detection using Dynamic Face Augmentation

    Authors: Sowmen Das, Selim Seferbekov, Arup Datta, Md. Saiful Islam, Md. Ruhul Amin

    Abstract: The creation of altered and manipulated faces has become more common due to the improvement of DeepFake generation methods. Simultaneously, we have seen detection models' development for differentiating between a manipulated and original face from image or video content. In this paper, we focus on identifying the limitations and shortcomings of existing deepfake detection frameworks. We identified… ▽ More

    Submitted 25 August, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

  12. arXiv:2012.07538  [pdf, other

    cs.CL

    Sentiment analysis in Bengali via transfer learning using multi-lingual BERT

    Authors: Khondoker Ittehadul Islam, Md. Saiful Islam, Md Ruhul Amin

    Abstract: Sentiment analysis (SA) in Bengali is challenging due to this Indo-Aryan language's highly inflected properties with more than 160 different inflected forms for verbs and 36 different forms for noun and 24 different forms for pronouns. The lack of standard labeled datasets in the Bengali domain makes the task of SA even harder. In this paper, we present manually tagged 2-class and 3-class SA datas… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 5 pages

  13. arXiv:1610.00369  [pdf

    cs.CL cs.IR cs.LG cs.NE

    Sentiment Analysis on Bangla and Romanized Bangla Text (BRBT) using Deep Recurrent models

    Authors: A. Hassan, M. R. Amin, N. Mohammed, A. K. A. Azad

    Abstract: Sentiment Analysis (SA) is an action research area in the digital age. With rapid and constant growth of online social media sites and services, and the increasing amount of textual data such as - statuses, comments, reviews etc. available in them, application of automatic SA is on the rise. However, most of the research works on SA in natural language processing (NLP) are based on English languag… ▽ More

    Submitted 23 November, 2016; v1 submitted 2 October, 2016; originally announced October 2016.

  14. arXiv:1401.6082  [pdf

    cs.IT

    Performance Evaluation of Two-Hop Wireless Link under Nakagami-m Fading

    Authors: Afsana Nadia, Arifur Rahim Chowdhury, Md. Shoayeb Hossain, Md. Imdadul Islam, M. R. Amin

    Abstract: Now-a-days, intense research is going on two-hop wireless link under different fading conditions with its remedial measures. In this paper work, a two-hop link under three different conditions is considered: (i) MIMO on both hops, (ii) MISO in first hop and SIMO in second hop and finally (iii) SIMO in first hop and MISO in second hop. The three models used here give the flexibility of using STBC (… ▽ More

    Submitted 21 December, 2013; originally announced January 2014.

    Journal ref: IJACSA,Vol. 4,No. 7,July 2013