Abstract
Authorship identification is a data mining and classification problem. Numerous methods and algorithms have been published to understand its nature. However, researchers are still searching for simple and effective solutions because of its heterogeneous and multilingual characteristics. This study introduces a new authorship identification process based on an artificial neural network (ANN) model using embedded stylistic features. It is well known that stylistic features depend largely on the topic or genre of the article. Our dataset contains 22,000 Turkish newspaper articles belonging to different genres. The experimental results indicate that a 97% success rate was achieved with a Levenberg-Marquardt based classifier. We conclude that the corpus, presented for the first time in this work, might contribute not only to authorship identification but also to other identification purposes.