Abstract

Twitter usage is outgrowing each day and so is the ever changing trend. The data flow keeps on increasing each day through tweets and thus analysis on the latest trends are done by using most widely used framework for processing large datasets-Apache Hadoop. This paper presents an overview of Twitter Trend analysis using Apache PIG in Hadoop framework and its performance will be tested using lexical analysis. Hadoop is preferred over other frameworks due to its robustness, scalability, cost effectiveness and high speed which makes parallel processing over distributed network easy. The paper also shows results obtained through lexical analysis done on the Hadoop cluster to show the efficiency of the program.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.