Concept-based document classification using Wikipedia and value function

Pekka Malo, Ankur Sinha, Pekka Korhonen, Jyrki Wallenius

doi:10.1002/asi.21596

Abstract

In this article, we propose a new concept-based method for document classification. The conceptual knowledge associated with the words is drawn from Wikipedia. The purpose is to utilize the abundant semantic relatedness information available in Wikipedia in an efficient value function-based query learning algorithm. The procedure learns the value function by solving a simple linear programming problem formulated from the training documents. Learning is a step-wise iterative process that builds a value function over an appropriate set of concepts (dimensions) chosen from a larger collection of concepts. Once the value function is formulated, it is used to decide whether a document is relevant or irrelevant. The value that the function assigns to a document can further be used to rank documents by relevance. Reuters newswire documents are used to evaluate the efficacy of the procedure, and an extensive comparison with other frameworks is performed. The results are promising.
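The decision and ranking steps described in the abstract can be illustrated with a minimal sketch. It assumes a linear additive value function over concept scores; the abstract does not state the functional form. The concept names, weights, threshold, and function names are hypothetical and set by hand here, whereas in the article the value function is learned by linear programming from training documents.

```python
from typing import Dict, List

# A document is represented by its relatedness scores to a set of concepts.
ConceptVector = Dict[str, float]


def document_value(doc: ConceptVector, weights: Dict[str, float]) -> float:
    """Assumed linear value function: weighted sum of concept scores."""
    return sum(w * doc.get(concept, 0.0) for concept, w in weights.items())


def is_relevant(doc: ConceptVector, weights: Dict[str, float],
                threshold: float) -> bool:
    """Decide relevance by comparing the value against a cut-off."""
    return document_value(doc, weights) >= threshold


def rank_documents(docs: Dict[str, ConceptVector],
                   weights: Dict[str, float]) -> List[str]:
    """Order document ids from highest to lowest value."""
    return sorted(docs, key=lambda d: document_value(docs[d], weights),
                  reverse=True)


# Hypothetical weights; the article would obtain these from the LP step.
weights = {"Finance": 0.6, "Stock market": 0.4, "Sports": -0.5}
threshold = 0.3  # hypothetical relevance cut-off

docs = {
    "d1": {"Finance": 0.8, "Stock market": 0.5},
    "d2": {"Sports": 0.9, "Finance": 0.1},
    "d3": {"Stock market": 0.5},
}

ranking = rank_documents(docs, weights)
relevant = [d for d in ranking if is_relevant(docs[d], weights, threshold)]
```

With these illustrative numbers, `d1` scores highest and is the only document above the cut-off, while `d2` is pushed to the bottom by its negative-weight concept.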

Journal: Journal of the American Society for Information Science and Technology
Publication Date: Sep 21, 2011
Citations: 25
