A Study on the Application of Distributed System Technology-Guided Machine Learning in Malware Detection.

Shi Jin,Zhaofeng Guo,Yanhua Yang,Dongli Liu

doi:10.1155/2022/4977898

Abstract

In recent years, with the development of information technology, the Internet has become an essential tool for human daily life. However, as the popularity and scale of the Internet continue to expand, malware has also emerged as an increasingly widespread trend, and its development has brought many negative impacts to the society. As the number of types of malware is getting enormous, the attacks are constantly updated, and at the same time, the spread is very fast, causing more and more damage to the network, the requirements and standards for malware detection are constantly rising. How to effectively detect malware is a research trend; in order to tackle the new needs and problems arising from the development of malware, this paper proposes to guide machine learning algorithms to implement malware detection in a distributed environment: firstly, each detection node in the distributed network performs anomaly detection on the captured software information and data, then performs feature analysis to discover unknown malware and obtain its samples, updates the new malware features to all feature detection nodes in the whole distributed network, and trains the random forest-based machine learning algorithm for malware classification and detection, thus completing the global response processing capability for malware. By building a distributed system framework, the global capture capability of malware detection is enhanced to robustly respond to the increasing and rapid spread of malware, and machine learning algorithms are integrated into it to achieve effective detection of malware. Extended experiments on the Ember 2017 and Ember 2018 databases show that our proposed approach achieves advanced performance and effectively addresses the problem of malware detection.

Highlights

In recent years, with the development of information technology, the Internet has become an essential tool for human daily life
How to effectively detect malware is a research trend; in order to tackle the new needs and problems arising from the development of malware, this paper proposes to guide machine learning algorithms to implement malware detection in a distributed environment: firstly, each detection node in the distributed network performs anomaly detection on the captured software information and data, performs feature analysis to discover unknown malware and obtain its samples, updates the new malware features to all feature detection nodes in the whole distributed network, and trains the random forest-based machine learning algorithm for malware classification and detection, completing the global response processing capability for malware
For the subnodes in the distributed system, we describe in detail their algorithms for performing feature extraction and random forest-based malware detection

Summary

Distributed Architecture

Earlier detection of computer malware was done on the host computer in a completely isolated and controlled environment that did not require collaboration. In the face of the growing demand for big data, systematic research on the mining architecture of big data and its core mining models and algorithms under the related architecture becomes a problem that must be faced. Amer and Zelinka [14] designed the statistical information of microcluster-like data into a tree structure that grows with time to maintain it Both of these works are oriented to single data stream mining. Based on the above two points, considering the high predictive capability and better robust performance of the integrated learning technique, this paper will study the malware detection methods suitable for the distributed approach by drawing on the existing integrated learning technique

Based on Machine Learning Methods

Node Detection

Distributed Topology

Analysis

Mechanisms for Collaboration

Detection Algorithm

Datasets

Experimental Setup and Evaluation Metrics

Detection Performance Comparison

Distributed System Performance Verification

Findings

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computational intelligence and neuroscience	Publication Date: Feb 23, 2022
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Study on the Application of Distributed System Technology-Guided Machine Learning in Malware Detection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computational intelligence and neuroscience

Lead the way for us

Similar Papers

Assessing the efficacy of machine learning algorithms for syncope classification: A systematic review
Choon-Hian Goh ... Maw Pin Tan
MethodsX | VOL. 12
Choon-Hian Goh, et. al.Choon-Hian Goh ... Maw Pin Tan
06 Dec 2023
MethodsX | VOL. 12

Cardiovascular Signal Processing: State of the Art and Algorithms
Hiwot Birhanu ... Amare Kassaw
-
Hiwot Birhanu, et. al.Hiwot Birhanu ... Amare Kassaw
01 Jan 2020
01 Jan 2020

Enhancement of text categorization results via an ensemble learning technique
Wasf A Taha ... Suhad A Yousif
-
Wasf A Taha, et. al.Wasf A Taha ... Suhad A Yousif
01 Jan 2023
01 Jan 2023

Time-interval temporal patterns can beat and explain the malware
Ido Finder ... Nir Nissim
Knowledge-Based Systems | VOL. 241
Ido Finder, et. al.Ido Finder ... Nir Nissim
29 Jan 2022
Knowledge-Based Systems | VOL. 241

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Study on the Application of Distributed System Technology-Guided Machine Learning in Malware Detection.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computational intelligence and neuroscience