Algorithm of Text Categorization Based on Cloud Computing

Li Qin Huang, Li Qun Lin, Yan Huang Liu

doi:10.4028/www.scientific.net/amm.311.158

Journal: Applied Mechanics and Materials
Publication Date: Feb 1, 2013
Citations: 1

Affiliation: Fuzhou University

Keywords: Cloud Computing, Multi-class Support Vector Machines

Abstract

The MapReduce framework of cloud computing offers an effective way to categorize massive text collections. This paper designs a distributed parallel text training algorithm for the cloud computing environment, based on multi-class Support Vector Machines (SVM). Map tasks distribute the samples of the various classes, and Reduce tasks perform the actual SVM training. Experimental results show that the execution time of text training decreases as the number of Reduce tasks increases. A parallel text classification algorithm based on cloud computing is also designed and implemented to classify texts of unknown type. Experimental results show that the classification speed increases as the number of Map tasks increases.
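
The division of labor described in the abstract can be illustrated with a minimal single-process sketch. It is not the authors' implementation. The abstract does not say how the multi-class problem is decomposed, so the sketch assumes a one-vs-rest scheme. It also assumes a bias-free linear SVM trained by subgradient descent on the regularized hinge loss, and it simulates the Map, shuffle and Reduce phases with plain Python functions instead of a real Hadoop cluster. All function names, hyperparameters and the toy term-frequency vectors are hypothetical.

```python
from collections import defaultdict


def map_sample(label, vec, classes):
    """Map step: route one labelled sample to every per-class training task."""
    for c in classes:
        # One-vs-rest (an assumption): +1 for the sample's own class, -1 otherwise.
        yield c, (vec, 1 if label == c else -1)


def shuffle(pairs):
    """Group mapper output by key, as the framework does between Map and Reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_train(samples, epochs=50, lr=0.1, lam=0.01):
    """Reduce step: fit one binary linear SVM (no bias term) on the hinge loss."""
    w = [0.0] * len(samples[0][0])
    for _ in range(epochs):
        for x, y in samples:
            margin = y * sum(wi * xi for wi, xi in zip(w, x))
            # L2 shrinkage on every step.
            w = [wi * (1 - lr * lam) for wi in w]
            # Hinge subgradient step only when the margin is violated.
            if margin < 1:
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
    return w


def classify(vec, models):
    """Map-only step: score one unknown text against every class model."""
    def score(c):
        return sum(wi * xi for wi, xi in zip(models[c], vec))
    return max(models, key=score)


# Toy term-frequency vectors over a hypothetical six-word vocabulary.
train = [
    ("sports", [3, 2, 0, 0, 0, 0]),
    ("sports", [2, 3, 0, 0, 0, 0]),
    ("finance", [0, 0, 3, 2, 0, 0]),
    ("finance", [0, 0, 2, 3, 0, 0]),
    ("science", [0, 0, 0, 0, 3, 2]),
    ("science", [0, 0, 0, 0, 2, 3]),
]
classes = sorted({label for label, _ in train})

# Map phase, shuffle, then one Reduce call per class key.
pairs = [kv for label, vec in train for kv in map_sample(label, vec, classes)]
groups = shuffle(pairs)
models = {c: reduce_train(samples) for c, samples in groups.items()}

# Classification phase: each unknown text is scored independently.
unknown = [[2, 2, 0, 0, 0, 0], [0, 0, 2, 2, 0, 0], [0, 0, 0, 0, 2, 2]]
predictions = [classify(vec, models) for vec in unknown]
print(predictions)
```

In this sketch each Reduce call is independent of the others, so per-class training could run on separate reducers, and each unknown text is scored on its own, so classification could be spread over mappers. This matches the scaling behavior the abstract reports, though the actual key design and SVM solver used in the paper may differ.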