Abstract

The present study predicts, cross validate and classify the data of COVID-19 based on four machine learning algorithm with four major parameters namely confirmed cases, recoveries, deaths and active cases. The secondary sources of database were collected from Ministry of Health and Family Welfare Department (MHFWD), from Indian State and Union Territories up to March, 2021. Based on these background, the database classified and predicted various machine learning Algorithm, like SVM, k NN, Random Forest and Logistic Regression. Initially, the k-mean clustering analysis is used to perform and identified five meaningful clusters and is labeled as Very Low, Low, Moderate, High and Very High of four major parameters based on their average values. In addition the five clusters are cross validated using four machine learning algorithm and affected states were visualized with help of prediction and probabilities. The different machine learning models achieved cross validation accuracy of 88%, 97%, 91% and 91%. . Delhi, Uttar Pradesh and West Bengal were Moderately Affected States, Assam, Bihar, Chhattisgarh, Haryana, Gujarat, Madhya Pradesh, Odisha, Punjab, Rajasthan and Telangana are Low Affected States, wherein Tamil Nadu, Kerala, Andhra Pradesh and Karnataka are highly affected States. and Maharashtra the Very Highly Affected State. Rest of the States and Union Territories has Very Low affected Covid-19 Cases is clearly identified.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call