Efficient Prediction of Water Quality Index (WQI) Using Machine Learning Algorithms

Jan 28, 2025

Liked by India Ambler

Explore the Research

Every impactful research project has a story, and for our team, this story began with the question: how can technology improve water quality monitoring to ensure better health and environmental outcomes? Our paper, "Efficient Prediction of Water Quality Index (WQI) Using Machine Learning Algorithms," addressed this question and earned the 2022 Best Paper Award from Human-Centric Intelligent Systems.

The Process and Methodology

The foundation of our research was built on a comprehensive analysis of water quality data sourced from India's diverse water bodies. The dataset included essential parameters such as dissolved oxygen (DO), biological oxygen demand (BOD), pH, and total coliform (TC). To ensure a reliable and replicable process, we designed a robust workflow for data preparation and modeling, as depicted in Figure 1.

Working diagram of proposed model.

Figure 1: Research Workflow
This figure illustrates the sequence of steps followed in the study:

Data Collection: Acquiring datasets from Kaggle, focusing on key water quality parameters.
Data Preprocessing: Addressing missing data using Random Forest imputation and applying Min-Max normalization for scaling.
Feature Selection: Identifying critical variables using a correlation matrix.
Machine Learning Models: Training and testing five algorithms (Neural Network, Random Forest, Multinomial Logistic Regression, Support Vector Machine, and Bagged Tree Model).
Performance Evaluation: Comparing model accuracies and identifying the best performer.

This structured approach not only streamlined our study but also ensured replicability, a cornerstone of rigorous research.

Key Findings and Insights

The performance of the machine learning algorithms was assessed using metrics such as accuracy and kappa values.

The Multinomial Logistic Regression (MLR) model achieved the highest accuracy of 99.83%, setting a benchmark for water quality prediction systems.
Random Forest (RF) followed closely with an accuracy of 98.99%, demonstrating its strength in handling complex datasets.
Other models, including Neural Network (98.65%), Bagged Tree Model (98.99%), and Support Vector Machine (96.98%), also performed well, though slightly lower than MLR.

The chart underscores the reliability of MLR in WQI prediction, making it an ideal choice for real-world applications.

Practical Implications

Our study's results provide a roadmap for developing efficient, data-driven systems for water quality monitoring. The insights gained can support policymakers, environmental agencies, and researchers in implementing proactive measures to ensure safe water access.

Looking forward, we aim to build a software application using our proposed model, enabling real-time water quality predictions. Such a tool could revolutionize water resource management, particularly in regions facing acute water quality challenges.

Final Thoughts

Winning the Best Paper Award has been a tremendous honor, motivating us to continue exploring the potential of machine learning in solving critical environmental problems. We extend our heartfelt thanks to the editorial board of Human-Centric Intelligent Systems for this recognition and to our research team at VRD Research Lab for their dedication and collaboration.

Md. Mehedi Hassan

MD. MEHEDI HASSAN (Member, IEEE) received the B.Sc. degree in computer science and engineering from North Western University, Khulna, Bangladesh, in 2022, where he excelled in his studies and demonstrated a strong aptitude for research. He is completed the M.Sc. degree in computer science and engineering with Khulna University, Khulna, in 2024. He is a dedicated and accomplished Researcher. As the Founder and the CEO of The Virtual BD IT Firm and the VRD Research Laboratory, Bangladesh, he has established himself as a highly respected leader in the fields of biomedical engineering, data science, and expert systems. He is a member of the prestigious Institute of Electrical and Electronics Engineers (IEEE). He is highly skilled in association rule mining, predictive analysis, machine learning, and data analysis, with a particular focus on the biomedical sciences. As a Young Researcher, he has published 76 articles in various international top journals and conferences, which is a remarkable achievement. His work has been well-received by the research community and has significantly contributed to the advancement of knowledge in his field. He is a highly motivated and a skilled researcher with a strong commitment to improving human health and well-being through cutting-edge scientific research. His accomplishments to date are impressive, and his potential for future contributions to his field is very promising. He has filed more than three patents out of which two are granted to his name. His research interests include broad and include important human diseases, such as oncology, cancer, hepatitis, human behavior analysis, and mental health. He serves as a reviewer for 60 prestigious journals.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Soil and Water Protection

Technology and Engineering > Civil Engineering > Environmental Civil Engineering > Soil and Water Protection

Human-Centric Intelligent Systems

Human-Centric Intelligent Systems

Human-Centric Intelligent Systems is an open access, international journal, dedicated to disseminating latest research findings on theoretical and practical applications in human-centric intelligent systems, providing theoretical and algorithmic insights in human-centric computing and analytics.

More about the journal

Related Collections

With Collections, you can get published faster and increase your visibility.

Human-Centric Digital Innovation for Energy Systems

The global transformation of energy systems toward sustainability, resilience, and efficiency is increasingly enabled by digital innovation. Emerging technologies such as artificial intelligence, digital twins, advanced analytics, and intelligent control are are reshaping the future of energy systems. These advances are accelerating the transition to low-carbon energy futures and opening new opportunities for flexibility, reliability, and integration across diverse sectors.

However, unlocking the full potential of smart energy systems requires more than technological progress, it calls for human-centric design and innovation that address user needs, deliver effective decision support, foster social acceptance, and align with policy and governance frameworks. Embedding human perspectives within digital solutions is crucial for energy systems that are not only technologically advanced but also equitable, trusted, and widely adopted. This special issue offers a high-visibility platform linked to the 2nd International Conference on Digital Intelligence for Energy Systems (ICDIES 2026), providing global reach and interdisciplinary exposure. It welcomes original research articles, case studies, and comprehensive reviews that investigate the convergence of digital intelligence and human-centric approaches in energy systems. Topics of interest include, but are not limited to:

• Human-centric AI, digital twins, and data-driven modelling designed to enhance transparency, interpretability, and user engagement in energy systems

• Decision-support tools that empower operators, prosumers, and policymakers to make informed, user-aligned energy choices

• Digital solutions that prioritise usability, accessibility, trust, and social acceptance in the adoption of smart energy technologies

• Human-in-the-loop approaches to system optimization, adaptive control, and resilience that integrate user feedback and participation

• Interdisciplinary perspectives that align digital innovation with human values

• Human-oriented applications across buildings, transport, industry, and integrated energy networks, highlighting real-world adoption and societal impact

By placing the human dimension at the center of digital innovation, this special issue seeks to advance the development of smart energy systems that are not only intelligent and efficient but also inclusive, user-focused, and sustainable. It aligns with SDG 7: Affordable and Clean Energy; SDG 9: Industry, Innovation & Infrastructure; and SDG 11: Sustainable Cities & Communities. We invite contributions from researchers and practitioners to shape this emerging field and to highlight the transformative potential of human-centric digital solutions for energy futures.

Publishing Model: Open Access

Deadline: Sep 30, 2026

Explore this Collection

Human-Centric Intelligent Systems for Sustainable Innovation, Industry, and Economic Growth

Aims and Scope:

The advancement of human-centric intelligent systems plays a pivotal role in fostering sustainable industry innovation (SDG 9) and driving decent work and economic growth (SDG 8). Intelligent systems that prioritize human needs, ethical AI, and adaptive technologies have the potential to revolutionize industries, improve workplace productivity, and create new job opportunities while ensuring inclusive and sustainable economic development.

This special issue explores cutting-edge research in AI-driven, human-centered solutions that enhance industrial automation, optimize labor productivity, and promote responsible digital transformation. By integrating artificial intelligence (AI), machine learning (ML), human-computer interaction (HCI), and intelligent automation, aiming to highlight how these technologies can contribute to:

• AI-driven decision support for workers, adaptive learning technologies, and intelligent workplace environments.

• AI-enhanced automation, ethical and responsible AI in industrial applications, and human-in-the-loop machine intelligence.

• AI-driven skill development, workforce augmentation, and ethical automation frameworks ensuring job security.

• Explainable AI (XAI), trust in AI, and reducing bias in intelligent decision-making systems.

This special issue focuses on the intersection of intelligent systems and human needs, aiming to bridge the gap between technological innovation and sustainable industrial and economic progress.

This collection supports United Nations Sustainable Development Goal 8: Decent Work and Economic Growth and Sustainable Development Goal 9: Industry, Innovation and Infrastructure.

Publishing Model: Open Access

Deadline: Feb 28, 2026

Explore this Collection

A fuzzy set-based hybrid SWARA-CoCoSo-William Fine framework for safety risk assessment in a ceramic granule preparation unit

Behind the Paper

Comprehensive risk profiling of occupational harmful factors in the ceramic industry: a case study from Iran

Behind the Paper

How to select the best candidate or the key factors? Hierarchical topological clustering can help

Behind the Paper

REM-related obstructive sleep apnoea in neuromuscular diseases: A 10-year retrospective cohort study

Behind the Paper

Insights into hyperuricemia amelioration mechanisms of Lactobacillus rhamnosus GG may enable probiotics therapy

Cookies

We use cookies to ensure the functionality of our website, to personalize content and advertising, to provide social media features, and to analyze our traffic. If you allow us to do so, we also inform our social media, advertising and analysis partners about your use of our website. You can decide for yourself which categories you want to deny or allow. Please note that based on your settings not all functionalities of the site are available.

Further information can be found in our privacy policy.

Efficient Prediction of Water Quality Index (WQI) Using Machine Learning Algorithms

Share this post

Share with...

...or copy the link