Mining based on Extraction and Importance Evaluation Using Multi-Measures Methods for Electronic Documents

Wen Xiong, Zi-Hui Ding, L Long, H Yang, Y Li, Y Dai, X Li

doi:10.1051/itmconf/20171205018

Open Access

https://doi.org/10.1051/itmconf/20171205018

Abstract

Mining the implicit knowledge in electronic documents is a critical task in text analysis and data mining. A knowledge-based view of electronic documents can be obtained not only by topic-based clustering but also by clustering based on extracted content. This paper therefore presents a novel method that clusters electronic documents, summarizes the full text from the extracted segments, and evaluates importance to the document using multiple measures. The method extracts eighteen kinds of named entities and two kinds of syntactic phrases and uses them for text clustering. A novel similarity equation is proposed for computing similarity over the extractions. In addition, three measures of importance to the document are proposed; they provide a different view of the document's content and recommend what users should check first. The method can thus improve the efficiency of knowledge discovery and enhance document management for large document collections.
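The abstract does not state the similarity equation or the clustering procedure, so the following is only a minimal sketch of the general idea of clustering documents by their extracted entities and phrases. It assumes a plain Jaccard overlap as a stand-in for the paper's similarity equation and a greedy single-pass grouping as a stand-in for its clustering step. The function names, the threshold value, and the sample extractions are all hypothetical.

```python
from typing import Dict, List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two extraction sets.

    Assumption: a stand-in measure, NOT the similarity equation from the paper.
    """
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_by_extractions(
    docs: Dict[str, Set[str]], threshold: float = 0.3
) -> List[List[str]]:
    """Greedy single-pass clustering over extraction sets.

    Each document joins the first cluster whose first member (its seed)
    is at least `threshold` similar; otherwise it starts a new cluster.
    """
    clusters: List[List[str]] = []
    for doc_id, items in docs.items():
        for cluster in clusters:
            seed_items = docs[cluster[0]]
            if jaccard(items, seed_items) >= threshold:
                cluster.append(doc_id)
                break
        else:
            # No existing cluster was similar enough.
            clusters.append([doc_id])
    return clusters


# Hypothetical extraction output: document id -> extracted entities and phrases.
docs = {
    "d1": {"Beijing", "State Grid", "power transformer"},
    "d2": {"Beijing", "State Grid", "maintenance schedule"},
    "d3": {"Shanghai", "annual report"},
}
clusters = cluster_by_extractions(docs)
print(clusters)
```

In this toy input, `d1` and `d2` share two of four distinct extractions and fall into one cluster, while `d3` shares none and forms its own. A real system would replace both the similarity function and the grouping strategy with the ones defined in the paper.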

Journal: ITM Web of Conferences	Publication Date: Jan 1, 2017
Citations: 1	License type: cc-by
