Probabilistic suffix models for API sequence analysis of Windows XP applications

Geoffrey Mazeroff,Jens Gregor,Michael Thomason,Richard Ford

doi:10.1016/j.patcog.2007.04.006

Abstract

Given the pervasive nature of malicious mobile code (viruses, worms, etc.), developing statistical/structural models of code execution is of considerable importance. We investigate using probabilistic suffix trees (PSTs) and associated suffix automata (PSAs) to build models of benign application behavior with the goal of subsequently being able to detect malicious applications as anything that deviates therefrom. We describe these probabilistic suffix models and present new generic analysis and manipulation algorithms. The models and the algorithms are then used in the context of API (i.e., system call) sequences realized by Windows XP applications. The analysis algorithms, when applied to traces (i.e., sequences of API calls) of benign and malicious applications, aid in choosing an appropriate modeling strategy in terms of distance metrics and consequently provide classification measures in terms of sequence-to-model matching. We give experimental results based on classification of unobserved traces of benign and malicious applications against a suffix model trained solely from traces generated by a small set of benign applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Probabilistic suffix models for API sequence analysis of Windows XP applications

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: May 3, 2007
Citations: 44

Similar Papers

Sub-curve HMM: A malware detection approach based on partial analysis of API call sequences
Jakapan Suaboot ... Wei Li
Computers & Security | VOL. 92
Jakapan Suaboot, et. al.Jakapan Suaboot ... Wei Li
22 Feb 2020
Computers & Security | VOL. 92

Detecting binary theft via static major-path birthmarks
Seongsoo Park ... Hwansoo Han
-
Seongsoo Park, et. al.Seongsoo Park ... Hwansoo Han
05 Oct 2014
05 Oct 2014

Protein family classification using sparse markov transducers.
Eleazar Eskin ... Yoram Singer
Journal of computational biology : a journal of computational molecular cell biology | VOL. 10
Eleazar Eskin, et. al.Eleazar Eskin ... Yoram Singer
01 Apr 2003
Journal of computational biology : a journal of computational molecular cell biology | VOL. 10

정적 주요 경로 API 시퀀스를 이용한 소프트웨어 유사성 검사
Seongsoo Park ... Hwansoo Han
Journal of KIISE | VOL. 41
Seongsoo Park, et. al.Seongsoo Park ... Hwansoo Han
15 Dec 2014
Journal of KIISE | VOL. 41

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Probabilistic suffix models for API sequence analysis of Windows XP applications

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition