Concordance and Term Frequency in Analyzing API Calls for Malware Behavior Detection

Nur Hilda Amira Abd Wahab,Balaji Rajendran,Masnizah Mohd,Ravie Chandren Muniyandi,Gopinath Palaniappan

doi:10.3844/jcssp.2019.1307.1319

Nur Hilda Amira Abd Wahab, Balaji Rajendran + Show 3 more

Open Access

https://doi.org/10.3844/jcssp.2019.1307.1319

Copy DOI

Abstract

Application Programming Interface (API) is used for the software to interact with an operating system to do certain task such as opening file, deleting file and many more. Programmers use this API to make it easier for their program to communicate with the operating system without having the knowledge of the hardware of the target system. Malware author is an attacker that may belong to an organization or work for themselves. Some malware author has the capabilities to write their own malware, uses the same kind of APIs that is used to create normal programs to create malware. There are many researches done in this field, however, most researchers used n-gram to detect the sequence of API calls and although it gave good results, it is time consuming to process through all the output. This is the reason why this paper proposed to use Concordance to search for the API call sequence of a malware because it uses KWIC (Key Word in Context), thus only displayed the output based on the queried keyword. After that, Term Frequency (TF) is used to search for the most commonly used APIs in the dataset. The results of the experiment show that concordance can be used to search for API call sequence as we manage to identify six malicious behaviors (Install Itself at Startup, Enumerate All Process, Privilege Escalation, Terminate Process, Process Hollowing and Ant debugging) using this method. And based on the TF score, the most commonly used API in the dataset is the Reg Close Key (TF: 1.388), which on its own is not a dangerous API, hence we can infer that most API is not malicious in nature, it is how they were implemented is making them dangerous.

Highlights

Nowadays, with a new variant of malware being discovered, we can see that malware is becoming more sophisticated in design
The Application Programming Interface (API) calls used in this step are chosen randomly and not based on the categories of the APIs. This is because the Term Frequency (TF) is used to show which of the malicious or suspicious APIs is favorable by the malware in the dataset
The Key Word In Context (KWIC) concordance method is easier to use than n-gram because n-gram listed all possible outcomes based on n value, meaning there will be a lot of output being displayed as compared to this method who will only display results based on the queried keywords

Summary

Introduction

With a new variant of malware being discovered, we can see that malware is becoming more sophisticated in design. According to Cisco (2018), security breaches can cause significant economic damages to an organization as it takes considerable time to fix the damages done. More than half of the breaches cost more than $500,000 in financial damages. This shows how severe it is the effect of the malware attack on an organization. Take example the WannaCry ransomware outbreak in 2017 which shows how dangerous modern malware is. This ransomware affected more than 200 000 computers in over 150 countries worldwide and cause huge financial damages to its victims (Business Advantage, 2017)

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Concordance and Term Frequency in Analyzing API Calls for Malware Behavior Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Computer Science

Lead the way for us

Journal: Journal of Computer Science	Publication Date: Sep 1, 2019
License type: cc-by

Similar Papers

Exploring the API Calls for Malware Behavior Detection using Concordance and Document Frequency
Dr.G.S.N Murth* ... Dr.T.P.R Vital
International Journal of Engineering and Advanced Technology | VOL. 8
Dr.G.S.N Murth*, et. al.Dr.G.S.N Murth* ... Dr.T.P.R Vital
30 Aug 2019
International Journal of Engineering and Advanced Technology | VOL. 8

How Did Governments Address the Needs of People With Disabilities During the COVID-19 Pandemic? An Analysis of 14 Countries' Policies Based on the UN Convention on the Rights of Persons With Disabilities.
Keiko Shikako ... Raphael Lencucha
International journal of health policy and management | VOL. 12
Keiko Shikako, et. al.Keiko Shikako ... Raphael Lencucha
17 May 2023
International journal of health policy and management | VOL. 12

Comparing concordances of language patterns and words by ESL intermediate learners: a preliminary experiment with two mobile concordancers
Zhi Quan ... Darryl Hocking
Computer Assisted Language Learning | VOL. 37
Zhi Quan, et. al.Zhi Quan ... Darryl Hocking
23 May 2022
Computer Assisted Language Learning | VOL. 37

A Web-Based Visualization System for Interdisciplinary Research Using Japanese Local Political Corpus
Hokuto Ototake ... Keiichi Takamaru
-
Hokuto Ototake, et. al.Hokuto Ototake ... Keiichi Takamaru
24 Aug 2017
24 Aug 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Concordance and Term Frequency in Analyzing API Calls for Malware Behavior Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Computer Science