Abstract

The feature extraction is the most critical technology of text categorization. The method of feature extraction from Chinese text based on CILIN is different from the conventional feature extraction, which uses two feature extraction methods. This method is good at dealing with synonyms and polysemes, and reducing the dimension. Firstly, it uses the method of feature extraction from Chinese text based on CILIN to analyze the meaning of key words. Secondly, use the mutual information to extract the feature, it can give the relation between class and lemma. The experiment results proposed that comprehend to the meaning of key words can distinctively improve the text classification precision.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call