DataSpeak: Data Extraction, Aggregation, and Classification Using Big Data Novel Algorithm

Venkatesh Gauri Shankar,Bali Devi,Sumit Srivastava

doi:10.1007/978-981-13-1513-8_16

Abstract

A huge amount of data is coming due to large set of computing devices. As a birth of the variety of data, data processing and analysis is a big issue in big data analytics. On other hand, data consistency and scalability is also a major problem in the large set of data. Our research and proposed algorithm aims to data extraction, aggregation, and classification based on novel approach as “DataSpeak”. We have used k-Nearest Neighbors with Spark as reference and produced a novel approach with modified algorithm. We have analyzed our approach on the large dataset from travel and tourism, placement papers, movies and historical, smartphone, etc., domains. As for ability and accuracy of our algorithm, we have used cross validation, precision, recall, and comparative statistical analysis with the existing algorithm. Our approach returns with the fast accessing of data with efficient data extraction in a minimal time when compared to the existing algorithm in same domain. As concerned with the data aggregation and classification, our approach returns 98% of data aggregation and classification based on the data structure.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DataSpeak: Data Extraction, Aggregation, and Classification Using Big Data Novel Algorithm

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Evaluating Pattern Classification Techniques of Neural Network Using k-Means Clustering Algorithm
Swati Sah ... Manu Pratap Singh
-
Swati Sah, et. al.Swati Sah ... Manu Pratap Singh
21 Nov 2017
21 Nov 2017

Synthetization of bicycle route data from aggregate GPS-based cycling data and its utility for bicycle route choice analysis
Stefan Huber
-
Stefan HuberStefan Huber
16 Jun 2021
16 Jun 2021

Machine learning in pain research.
Jörn Lötsch ... Alfred Ultsch
Pain | VOL. 159
Jörn Lötsch, et. al.Jörn Lötsch ... Alfred Ultsch
24 Nov 2017
Pain | VOL. 159

Machine learning for Big Data analytics in plants.
Chuang Ma ... Xiangfeng Wang
Trends in Plant Science | VOL. 19
Chuang Ma, et. al.Chuang Ma ... Xiangfeng Wang
14 Sep 2014
Trends in Plant Science | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DataSpeak: Data Extraction, Aggregation, and Classification Using Big Data Novel Algorithm

Abstract

Talk to us

Similar Papers