Abstract

Background/Objectives: The development of various communication media has generated few problems in retrieving information. The objective of the study is to analyze the performance of retrieving heterogeneous data. Methods/Statistical Analysis: A model was run to simulate the process of retrieving heterogeneous data from several servers. The information was distributed with different load and the servers were randomly selected. The performance had been analysed based on response time and CPU utilisation. A few types of load balancing techniques were applied to distribute the loads among the servers. The impacts on the overall system performance were discussed. Findings: Retrieving data requires high speed, where the response time must be very fast. The performance of retrieving heterogeneous data is a challenge, when servers have high load. When the load balancing techniques were not applied, some of the servers handle the entire load and the other servers have not been fully utilised. The results showed the response time decrease drastically when high load of data were applied to the server. When the load balancing was applied, the results were compared and presented. The results showed an improvement in the overall performance. Improvements/Applications: The load balancing techniques were applied based on several approaches. It allows an improvement in distributing the server load, which results in improvement in the performance. Keywords: Analysing Performance, Big Data Environment, Heterogeneous Information

Highlights

  • Since the past few years, the Internet plays an important role in our lives

  • Response time for web applications is influent by many factors including server capabilities, protocol, and network characteristics

  • The response time is affected by the situation where information may be placed on a single server

Read more

Summary

Introduction

Since the past few years, the Internet plays an important role in our lives. Many of our activities are searching for something via Internet, watching videos, writing in social media even make video or voice call. Information can be retrieved frequently as a different type of documents. When considering heterogeneous information retrieval system, the potential users with their information needs are important to be discussed. If the types of results to be used by the system can be identified, the responses from the retrieval process would be more relevant to the user needs. The performance in retrieving information for heterogeneous data in big data environments has become a significant issue. Information within documents, which includes searching unstructured and structured information. A study in information retrieval domain has been gradually increasing since the year 2000. This reflects the growing needs for studies of retrieving information in heterogeneousdata

Web Search
System Models
Response Time
Server Load
Results and Discussion
Effect on Response Time
Effect on CPU Utilisation
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call