Abstract

Salford Predictive Modeler (SPM) developed many data mining tools. In this research paper, Random Forest (RF) model classification has recently been used as new tool which involves extensive research in various disciplines. Application of RF method has increased considerably in areas of data mining and classification problems in the field of Text Mining over the most recent decade. In this research paper, an attempt is completed to deal with the authorship attribution problems using a RF model for qualifying the articles of unknown authorship to one of the existing writers of the same period. A set of stylistic variables such as function words is made use of for classification purposes. After that the results of authorship attribution are discussed. The database is based on blocks of articles, in which each block contains many sentences. Initially, the three scholars writing styles are identified in the previous studies, and they had three different writing styles. The group of articles of anonymous authorship to the articles written by contemporary Tamil scholars of the similar period, namely Mahakavi Bharathiar (MB), Subramaniya Iyer (SI) and T. V. Kalyanasundaram (TVK) is dealt in the current research paper. The above said three accepted scholars had written a number of articles on India's Freedom Movement during the pre-independence time. Originally, all the three writers contributed their articles by attributing their names. The oppressive attitude of the then British administration forced all the three patriots to write articles on the same theme for unknown publications without mentioning their names.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.