Identification of approximately duplicate material records in ERP systems

Wei Zong,Feng Wu,Lap-Keung Chu,Domenic Sculli

doi:10.1080/17517575.2015.1065513

Abstract

ABSTRACTThe quality of master data is crucial for the accurate functioning of the various modules of an enterprise resource planning (ERP) system. This study addresses specific data problems arising from the generation of approximately duplicate material records in ERP databases. Such problems are mainly due to the firm’s lack of unique and global identifiers for the material records, and to the arbitrary assignment of alternative names for the same material by various users. Traditional duplicate detection methods are ineffective in identifying such approximately duplicate material records because these methods typically rely on string comparisons of each field. To address this problem, a machine learning-based framework is developed to recognise semantic similarity between strings and to further identify and reunify approximately duplicate material records – a process referred to as de-duplication in this article. First, the keywords of the material records are extracted to form vectors of discriminating words. Second, a machine learning method using a probabilistic neural network is applied to determine the semantic similarity between these material records. The approach was evaluated using data from a real case study. The test results indicate that the proposed method outperforms traditional algorithms in identifying approximately duplicate material records.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Identification of approximately duplicate material records in ERP systems

Abstract

Talk to us

Similar Papers

More From: Enterprise Information Systems

Lead the way for us

Journal: Enterprise Information Systems	Publication Date: Jul 9, 2015
Citations: 8

Similar Papers

Assessing Critical Success Factors of ERP Implementation
Leopoldo Colmenares
-
Leopoldo ColmenaresLeopoldo Colmenares
01 Jan 2009
01 Jan 2009

Assessing Critical Success Factors of ERP Implementation
Leopoldo Colmenares
-
Leopoldo ColmenaresLeopoldo Colmenares
01 Jan 2009
01 Jan 2009

SELECTING ENTERPRISE RESOURCE PLANNING SYSTEM USING OF FUZZY ANALYTIC HIERARCHY PROCESS APPROACH

-

24 Dec 2015
24 Dec 2015

Investigating ERP Misalignment between ERP Systems and Implementing Organizations in Developing Countries
Nkosinathi Bitsini
Journal of Enterprise Resource Planning Studies | VOL. -
Nkosinathi BitsiniNkosinathi Bitsini
02 Apr 2015
Journal of Enterprise Resource Planning Studies | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification of approximately duplicate material records in ERP systems

Abstract

Talk to us

Similar Papers

More From: Enterprise Information Systems