Abstract

With the rapid growth in number and dimension of databases and database applications in business, administrative, industrial and other fields, it is necessary to examine the automatic extraction of knowledge from these large databases. Due to knowledge extraction from databases, these have become rich and safe sources for generating and verification of knowledge, and the knowledge discovery can be applied in software management, querying process, making decisions, process control and many other fields of interest. At the same time, there is a challenge in managing unstructured data. Among organizations with large concentration of unstructured information, there is a greater tendency to devote more resources to this kind of data. The acquisition of knowledge from unstructured data is often difficult and expensive. Some possible solutions on extracting useful information (knowledge) from unstructured data are provided. Knowledge extraction is the process of creation of knowledge from structured, unstructured and semi-structured data. The objective of this paper is to present the possibilities of extracting knowledge from unstructured and semi-structured data particularly. The theories and tools for knowledge extraction are the subject of the emerging field of knowledge discovery in databases (KDD). Definitions of KDD are provided and the general multistep KDD process is outlined. A brief summary of recent KDD real-world applications is also provided. Finally, the article enumerates challenges for future research and development in KDD systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.