LTmatch: A Method to Abstract Pattern from Unstructured Log

Xiaodong Wang,Haili Xiao,Xiaoning Wang,Yining Zhao,Xuebin Chi

doi:10.3390/app11115302

Xiaodong Wang, Haili Xiao + Show 3 more

Open Access

PDF Available

https://doi.org/10.3390/app11115302

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Logs record valuable data from different software and systems. Execution logs are widely available and are helpful in monitoring, examination, and system understanding of complex applications. However, log files usually contain too many lines of data for a human to deal with, therefore it is important to develop methods to process logs by computers. Logs are usually unstructured, which is not conducive to automatic analysis. How to categorize logs and turn into structured data automatically is of great practical significance. In this paper, LTmatch algorithm is proposed, which implements a log pattern extracting algorithm based on a weighted word matching rate. Compared with our preview work, this algorithm not only classifies the logs according to the longest common subsequence(LCS) but also gets and updates the log template in real-time. Besides, the pattern warehouse of the algorithm uses a fixed deep tree to store the log patterns, which optimizes the matching efficiency of log pattern extraction. To verify the advantages of the algorithm, we applied the proposed algorithm to the open-source data set with different kinds of labeled log data. A variety of state-of-the-art log pattern extraction algorithms are used for comparison. The result shows our method is improved by 2.67% in average accuracy when compared with the best result in all the other methods.

Highlights

Logs are an essential part of computer systems
In order to illustrate the rationality of the weight-based log word matching rate design in the LTmatch algorithm, the ratio of the number of constants and variables in log templates of all the templates in the log dataset 1 is counted
Log pattern extraction algorithm is critical for downstream log analysis tasks

Summary

Introduction

Logs are an essential part of computer systems. The main purpose of logging is to record the necessary information generated during the running of programs and systems, and logs are widely used for runtime state recovering, performance analyzing, failure tracing and anomaly detecting. Due to the importance of logs, the vast majority of public released software and systems have certain types of log services. For large-scale applications, such as software and systems running in distributed systems and environments, the generated log files may contain a very large amount of data. The traditional manual method for analyzing such a big number of logs has become a time-consuming and errorprone task. How to automatically analyze the logs are of great significance for reducing system maintainers’ workload and tracing the causes of failures and anomalies

Methods

Results

Discussion

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jun 7, 2021
Citations: 6	License type: CC BY 4.0

R Discovery Prime

LTmatch: A Method to Abstract Pattern from Unstructured Log

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Discovering Business Process Model from Unstructured Activity Logs
Rahul Kumar ... Chiranjib Bhattacharyya
-
Rahul Kumar, et. al.Rahul Kumar ... Chiranjib Bhattacharyya
01 Jul 2010
01 Jul 2010

Process Trace Identification from Unstructured Execution Logs
Nirmit Desai ... Anuradha Bhamidipaty
-
Nirmit Desai, et. al.Nirmit Desai ... Anuradha Bhamidipaty
01 Jul 2010
01 Jul 2010

The set-set LCS problem
D S Hirschberg ... L L Larmore
Algorithmica | VOL. 4
D S Hirschberg, et. al.D S Hirschberg ... L L Larmore
01 Jun 1989
Algorithmica | VOL. 4

Computing a Longest Common Palindromic Subsequence
Shihabur Rahman Chowdhury ... Md Mahbubul Hasan
-
Shihabur Rahman Chowdhury, et. al.Shihabur Rahman Chowdhury ... Md Mahbubul Hasan
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

LTmatch: A Method to Abstract Pattern from Unstructured Log

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Applied Sciences