Performance optimization of computing task scheduling based on the Hadoop big data platform

Yang Li,Xinhong Hei

doi:10.1007/s00521-022-08114-3

Abstract

AbstractHadoop, a distributed computing framework that can efficiently process large-scale datasets, has been used by an increasing number of organizations as the basic computing framework to build cloud computing platforms. Improving its execution efficiency is a hot research direction in the industry, and the scheduling problem is a key factor affecting the execution efficiency of Hadoop. It is very important to identify its shortcomings and improve them. This paper examines and analyses the optimization of computing task scheduling performance based on the Hadoop big data platform. This paper first analyses Hadoop big data processing. Hadoop has high scalability. Computing nodes can be added at any time, and they can participate in cluster work through simple configuration. The paper discusses the improvement in the Hadoop resource scheduling algorithm. The task scheduling algorithm in the Hadoop-based data task localization proposed in this paper is compared with the default algorithm used in the Hadoop task scheduling algorithm. The former shows better local data in all four jobs, there are more data localization tasks, and the expected goal is achieved. The effectiveness of the algorithm is verified, and the performance is improved by 30%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Neural Computing and Applications	Publication Date: Dec 25, 2022
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Performance optimization of computing task scheduling based on the Hadoop big data platform

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications

Lead the way for us

Similar Papers

Railway Traffic Volume Prediction Method Based on Hadoop Big Data Platform
Pei Su
-
Pei SuPei Su
01 Jan 2021
01 Jan 2021

Bounds on Multiprocessing Timing Anomalies
R L Graham
SIAM Journal on Applied Mathematics | VOL. 17
R L GrahamR L Graham
01 Mar 1969
SIAM Journal on Applied Mathematics | VOL. 17

Design and Implementation of Vehicle Scheduling Optimization for Smart Logistics Platform Powered by Hadoop Big Data
Guangtian Yu ... Wangtianhua Yu
Scalable Computing: Practice and Experience | VOL. 24
Guangtian Yu, et. al.Guangtian Yu ... Wangtianhua Yu
17 Nov 2023
Scalable Computing: Practice and Experience | VOL. 24

Optimization Method for Human Resource Decision Based on Hadoop Big Data Platform
Xueqi Zhang
-
Xueqi ZhangXueqi Zhang
26 Jun 2024
26 Jun 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance optimization of computing task scheduling based on the Hadoop big data platform

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications