Abstract
Public information exchange channels such as email, Instant Messaging (IM), and Short Message Service (SMS) carry large amounts of advertising, obscene, illegal, and other spam content, most of which is text. From a computational linguistics perspective, textual information from different sources can be processed in a similar way, so processing models and systems should be portable across information types. This paper introduces a unified spam filtering model for multi-source information and proposes an approximate method for estimating the model's portability. Based on the proposed model, a support vector machine (SVM) is used to classify the information. Experimental results show that the unified spam filtering model can be applied to multi-source information and that the SVM classifier achieves encouraging performance.
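The idea of a single text pipeline shared by all sources, followed by SVM classification, can be illustrated with a minimal sketch. The abstract does not specify the features, kernel, or training procedure, so everything here is an assumption: bag-of-words counts as features, a linear SVM without a bias term, and Pegasos-style subgradient training on the hinge loss. All function names and the toy messages are hypothetical and are not taken from the paper.

```python
import re
from collections import Counter

def featurize(text):
    """Bag-of-words counts; the same function is used for every source."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def score(weights, feats):
    """Linear decision value w . x over sparse features."""
    return sum(weights.get(tok, 0.0) * n for tok, n in feats.items())

def train_linear_svm(samples, epochs=50, lam=0.01):
    """Assumed trainer: hinge-loss subgradient descent, step size 1/(lam*t)."""
    weights = {}
    t = 0
    for _ in range(epochs):
        for feats, label in samples:
            t += 1
            eta = 1.0 / (lam * t)
            # Check the margin with the current weights before updating.
            violated = label * score(weights, feats) < 1.0
            # Regularization step: shrink every weight by (1 - eta*lam) = 1 - 1/t.
            shrink = 1.0 - 1.0 / t
            for tok in weights:
                weights[tok] *= shrink
            # Hinge-loss step: move toward the sample if its margin is too small.
            if violated:
                for tok, n in feats.items():
                    weights[tok] = weights.get(tok, 0.0) + eta * label * n
    return weights

def predict(weights, text):
    """Return +1 for spam, -1 for legitimate text."""
    return 1 if score(weights, featurize(text)) > 0 else -1

# Hypothetical toy corpus: (source, text, label) with +1 = spam, -1 = legitimate.
TRAIN = [
    ("email", "Win a free prize now, click here", 1),
    ("email", "Meeting agenda attached for Monday", -1),
    ("sms", "Free prize! Claim cash now", 1),
    ("sms", "Running late, see you at lunch", -1),
    ("im", "Cheap pills, click to claim", 1),
    ("im", "Can you review my report today", -1),
]

# The source tag is kept only as metadata; features ignore where a message came from.
samples = [(featurize(text), label) for _source, text, label in TRAIN]
weights = train_linear_svm(samples)

print(predict(weights, "Claim your free cash"))
print(predict(weights, "Agenda for the Monday meeting"))
```

Because the feature extractor never looks at the source tag, one set of weights serves email, SMS, and IM alike, which is the sense of "unified" assumed in this sketch. A real system would use a much larger corpus and would likely rely on an established SVM library.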