Enhancing data quality to mine credible patterns

Muhammad Imran,Adnan Ahmad

doi:10.1177/01655515211013693

Abstract

The importance of big data is widely accepted in various fields. Organisations spend a lot of money to collect, process and mine the data to identify patterns. These patterns facilitate their future decision-making process to improve the organisational performance and profitability. However, among discovered patterns, there are some meaningless and misleading patterns which restrict the effectiveness of decision-making process. The presence of data discrepancies, noise and outliers also impacts the quality of discovered patterns and leads towards missing strategic goals and objectives. Quality inception of these discovered patterns is vital before utilising them in making predictions, decision-making process or strategic planning. Mining useful and credible patterns over social media is a challenging task. Often, people spread targeted content for character assassination or defamation of brands. Recently, some studies have evaluated the credibility of information over social media based on users’ surveys, experts’ judgement and manually annotating Twitter tweets to predict credibility. Unfortunately, due to the large volume and exponential growth of data, these surveys and annotation-based information credibility techniques are not efficiently applicable. This article presents a data quality and credibility evaluation framework to determine the quality of individual data instances. This framework provides a way to discover useful and credible patterns using credibility indicators. Moreover, a new Twitter bot detection algorithm is proposed to classify tweets generated by Twitter bots and real users. The results of conducted experiments showed that the proposed model generates a positive impact on improving classification accuracy and quality of discovered patterns.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing data quality to mine credible patterns

Abstract

Talk to us

Similar Papers

More From: Journal of Information Science

Lead the way for us

Journal: Journal of Information Science	Publication Date: Jun 6, 2021
Citations: 5

Similar Papers

Framework for Integral Data Quality and Security Evaluation in Smartphones
Igor Khokhlov ... Sergey Chuprov
IEEE Systems Journal | VOL. 15
Igor Khokhlov, et. al.Igor Khokhlov ... Sergey Chuprov
01 Jun 2021
IEEE Systems Journal | VOL. 15

Assessing Data Quality in the Age of Digital Social Research: A Systematic Review
Jessica Daikeler ... Bernd Weiß
Social Science Computer Review | VOL. -
Jessica Daikeler, et. al.Jessica Daikeler ... Bernd Weiß
27 Apr 2024
Social Science Computer Review | VOL. -

Using Business Data in Customs Risk Management: Data Quality and Data Value Perspective
Wout Hofman ... Jonathan Migeotte
-
Wout Hofman, et. al.Wout Hofman ... Jonathan Migeotte
01 Jan 2020
01 Jan 2020

Intelligent Monitoring of Data Quality Based on Multiple Data Structures
Yanhong Bai
Procedia Computer Science | VOL. 243
Yanhong BaiYanhong Bai
01 Jan 2024
Procedia Computer Science | VOL. 243

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing data quality to mine credible patterns

Abstract

Talk to us

Similar Papers

More From: Journal of Information Science