Hello, My name is
Md. Shahidul Salim
Lecturer, CSE, KUET
And I'm a
My CV My LinkedIn Profile My Github Profile My Google Scholar Profile My Hugging Face Profile
Updates

  • 🌐(06.11.2024) Research paper accepted(ICRPSET-2024): Automated Classification of Gastrointestinal Polyps from Endoscopic Images Using a Deep Learning Approach
  • 🌐(24.10.2024) Revision(Data in Brief): Climate Data Dynamics: A High-Volume Real World Structured Weather Dataset
  • 🌐(20.10.2024) Research paper revision submitted(SoftwareX): LLM based QA chatbot builder (https://github.com/shahidul034/LLM-based-QA-chatbot-builder)
  • 🌐Research paper Submitted: Deep learning models for dermoscopic skin lesion hair segmentation: An extensive experimental study
  • 🌐(02.08.2024) Research paper accepted(ICCA 2024): Suggesting bengali words using masked language model
  • 🌐(29.07.2024) Revision(EMNLP 2024): BCoQA: Benchmark and Resources for Bangla Context-based Conversational Question Answering
  • 🌐(20.07.2024) Research paper accepted(Data in Brief): Bangla news article dataset
  • 🌐Ongoing research(EMNLP): Comparing Prompt Based and Standard Fine Tuning for Bangla Text Classification
  • 🌐Ongoing research: ProGAN and Diffusion-Based Hair Mask Generation

Blogs

About me

I'm Md. Shahidul Salim. My nick name is Shakib and I'm a

I am currently employed as a Lecturer in the Department of Computer Science and Engineering at Khulna University of Engineering & Technology, Bangladesh. I graduated from Khulna University of Engineering and Technology with a computer science and engineering degree, earning a CGPA of 3.86 out of 4.0 and achieving the fourth position out of 121. My scholarly contributions include several published journal articles as well as a substantial body of work presented through conference papers, including EMNLP. Additionally, I currently have several journals under review, including Data in Brief, Engineering Applications of Artificial Intelligence, SoftwareX, and Annals of Data Science. During my undergraduate studies, I worked with Professor Dr. K. M. Azharul Hasan, and I am now working with Dr. Sheikh Imran Hossain, Associate Professor. I am actively engaged in ongoing research initiatives.
Research Interests: My research primarily revolves around Natural Language Processing, focusing on LLM Quantization, lightweight LLM testing and analysis, Transformers, text generation, text summarization, and conversational question answering. Additionally, I work on Machine Learning, emphasizing Convolutional Neural Networks, time series analysis, and multimodal (text+image) applications. In Artificial Intelligence, my interests include AI vs. Human text classification, Generative AI, with a particular focus on Generative Adversarial Networks (GANs) and Diffusion models.

I love playing FIFA, traveling to new places, and watching movies and TV series—especially in the sci-fi and thriller genres. Some of my favorite movies are Interstellar and the Avengers series, and my top TV shows are Dark, Stranger Things, and Prison Break.

Academic Timeline and Working Experience

Research Area

Machine Learning, Deep learning
✅Bioinformatics
✅Convolutional Neural Networks
✅Time series analysis
Natural Language Processing
✅LLM- Quantization, Light weight LLM testing and analysis
✅Transformer
✅Bangla stemmer
✅Text generation
✅Text summarization
✅Conversational question answering
Generative AI
✅Generative Adversarial Network (GAN)
✅Diffusion

Publication and Under Review

Natural Language Processing

  1. Md. Shahidul Salim, Sk Imran Hossain, An Applied Statistics dataset for human vs AI-generated answer classification, Data in Brief, 2024, 110240, ISSN 2352-3409,https://doi.org/10.1016/j.dib.2024.110240. 🌐paper 🔗Github

  2. Bose, D., & Salim, M. S. (2024). Suggesting Bengali words using masked language model. In 3rd international conference on computing advancements (ICCA).

  3. Asif Mohammed Saad, Umme Niraj Mahi, Md. Shahidul Salim, Sk Imran Hossain, Bangla news article dataset, Data in Brief, 2024, 110874, ISSN 2352-3409, https://doi.org/10.1016/j.dib.2024.110874.

  4. BCoQA: Benchmark and Resources for Bangla Context-based Conversational Question Answering(Submitted to EMNLP 2024(Revision completed)) 🌐OpenReview

  5. Nabil, A., Das, d., Salim, M. S., Arifeen, S., & Fattah, H. M. A. (2023). Bangla emergency post classification on social media using transformer based bert models. In 6th international conference on electrical information and communication technology (EICT 2023). (Accepted) 🌐paper

  6. Salim, M. S., Murad, H., Das, D., & Ahmed, F. (2023). "BanglaGPT: A Generative Pretrained Transformer-Based Model for Bangla Language," 2023 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), Dhaka, Bangladesh, 2023, pp. 56-59, doi: 10.1109/ICICT4SD59951.2023.10303383.🌐paper

  7. S. Salim, T. Islam, R. Zannat, N. Mia, M. Fuad and H. Murad, "Towards Developing a Transformer-Based Bangla Typing Error Correction Model: A Deep Learning-Based Approach," 2023 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), Dhaka, Bangladesh, 2023, pp. 75-78, doi: 10.1109/ICICT4SD59951.2023.10303361.🌐paper

  8. T. Ahmed, S. Hossain, M. S. Salim, A. Anjum and K. M. Azharul Hasan, "Gold Dataset for the Evaluation of Bangla Stemmer," 2021 5th International Conference on Electrical Information and Communication Technology (EICT), Khulna, Bangladesh, 2021, pp. 1-6, doi:10.1109/EICT54103.2021.9733662.🌐paper

  9. M. S. Salim Shakib, T. Ahmed and K. M. Azharul Hasan, "Designing a Bangla Stemmer using rule based approach," 2019 International Conference on Bangla Speech and Language Processing (ICBSLP), Sylhet, Bangladesh, 2019, pp. 1-4, doi: 10.1109/ICBSLP47725.2019.201533.🌐paper 🔗Github

Machine Learning/Deep Learning

  1. "Trisha, Shahid, Md. Shahidul Salim , Jeba and Mahbub, "Automated Classification of Gastrointestinal Polyps from Endoscopic Images Using a Deep Learning Approach" 2024 International Conference on Recent Progresses in Science, Engineering and Technology (Accepted)

  2. R. T. H. Promi, R. A. Nazri, M. S. Salim and S. M. T. U. Raju, "A Deep Learning Approach for Non-Invasive Hypertension Classification from PPG Signal," 2023 International Conference on Next-Generation Computing, IoT and Machine Learning (NCIM), Gazipur, Bangladesh, 2023, pp. 1-5, doi: 10.1109/NCIM59001.2023.10212940.🌐paper

  3. Hossain, L., Hossain, I., Salim, M. S., Raju, S. M. T. U., & Saha, J. (2023). A novel technique for classification of motor imagery EEG signal based on deep learning approaches. In Proceedings of the 2nd international conference on big data, IoT and machine learning (bim 2023). (Accepted)🌐paper

  4. Ashiqussalehin, M., Jahan, K., Rahaman, M., & Salim, M. (2022). Human Abnormal Behavior Detection Using Convolution Neural Network. Specialusis Ugdymas, 1(43), 4076–4083.🌐paper

Under Review

  1. LLM based QA chatbot builder: A generative AI-based chatbot for question answering(Submitted to softwareX)
  2. 📝This software describes the development of a web application called the LLM QA Builder, designed to streamline the creation of LLM-based interactive chatbots for organizational information retrieval. The application integrates various development phases including data collection and preprocessing, LLM fine-tuning, testing, inference, and chat interface development. It supports fine-tuning multiple LLMs such as Zephyr, Mistral, Llama-3, Phi, Flan-T5, and user-provided models, with enhanced retrieval capabilities via retrieval-augmented generation (RAG). It also includes an automatic web crawling RAG data scraper and a human evaluation feature for model quality assessment. The system's capabilities are demonstrated through a university information chatbot, with comparative analysis of different LLMs using a benchmark crowd-sourced dataset.
  3. Agricultural Recommendation System based on Multivariate Weather Forecasting Model(Submitted to Annals of Data Science) 🌐PRE-PRINT 🔗Github
  4. 📝This paper proposes a context-based crop recommendation system using a weather forecast model to improve farming practices in Bangladesh. The multivariate Stacked Bi-LSTM Network is used for accurate weather prediction, including rainfall, temperature, humidity, and sunshine. The system guides farmers in making informed decisions about planting, irrigation, harvesting, and more. It also alerts farmers about extreme weather conditions and provides knowledge-based crop recommendations for flood and drought-prone areas.
  5. A Suffix Independent Algorithm for Stemming Bangla Words using Finite State Transducer
  6. 📝This study proposes and evaluates a suffix-independent stemming algorithm for Bangla language using a finite state transducer (FST)-based framework. The algorithm creates a dictionary of root words implemented as an FST, achieving high speed and no memory usage for vocabulary keeping. A novel stemmer dataset was developed to evaluate the algorithm's performance, resulting in 96.58% detection accuracy and 96.34% stemming accuracy. The proposed scheme outperforms existing methods and demonstrates effectiveness through experiments.
  7. Detecting AI-Generated Assignments in Educational Evaluation: A Transformer-Based Approach
  8. 📝This research work presents a transformer-based model to detect whether an assignment is AI-generated or human-written. The model was trained on a dataset of 5410 assignments, with 2742 being AI-generated and 2668 being human-written. Among the explored transformer-based architectures, DistilBERT provided the highest accuracy of 92%.

Ongoing Research

My projects

My skills

My creative skills & experiences.

I have experience in programming using C++ and Python. I have finished several projects, such as a chatbot for university information, websites for evaluating students, a personal voice assistant, and a file locker that uses an encryption algorithm. Recently, I have been working on LLM and langchain. I am trying to retrain and finetuned the LLM models using the PEFT library. I have also completed machine learning projects such as fake note detection, human abnormality detection, brain tumour detection, and an intelligent chatbot. I have also finished some natural language processing projects, like an intelligent chatbot that uses LSTM, Bangla text summarization using a transformer model, Bangla keyboard error correction using a transformer model, a Bangla stemmer design, a Bangla word-to-vector conversion, and a chatbot that uses the seq2seq model. I also have experience in hardware projects such as Home Automation using IoT and an image processing project, Fake Bangladeshi currency detection using OpenCV. Furthermore, I have experience in database management using MySQL.

Read more
Pytorch,Tensorflow
90%
Python
90%
C++
90%
Machine learning, NLP
80%
Image processing and Secruity
60%
Data structure and algorithm
70%
JavaScript
80%
Java,C#,swift
60%
SQL
90%
MySQL,HTML,CSS
70%

Contact me

Get in Touch

For any information ,you can contact with me.

Name
Md. Shahidul Salim
Present Address
CSE, KUET, Khulna, Bangladesh
Permanent Address
Bromottor, Rangunia, Chattogram, Bangladesh
Email
shahidulshakib034@gmail.com
ss@cse.kuet.ac.bd
Social Media