Software development and associated data is the most critical factor these days. Currently, people are living in an internet world where data and related artifacts are major sets of information these days. The data is correlated with real-world data. The analysis of large datasets was done as part of the experimental analysis. The dataset for online social media like Facebook and Twitter was taken for the identification of influential nodes. The analysis of the dataset provides an overview and observation of the dataset for Facebook or Twitter. Here, in the current activity, an overview of cloud computing and big data technologies are discussed along with effective methods and approaches to resolve the problem statement. Particularly, big data technologies such as Hadoop provided by Apache for processing and analysis of Gigabyte(GB) or petabyte(PB) scale datasets are discussed for processing data in distributed and parallel data fashion. Here, the processing of large datasets is done by big data technology by implementing Apache Hadoop in online social media.