Log Template Research Articles

Logs are pervasive in modern computing systems, and are valuable to service and system management. Nevertheless, with the rapidly growing size and complexity of computing systems, the log volume is exploding, which makes automatic log analysis imperative. Generally, in automatic log analysis, the first and fundamental step is log parsing, to which a lot of effort has been devoted. However, in most existing log parsing methods, log messages are merely treated as plain text. In natural language processing (NLP) area, it is a common practice to represent words and sentences with vectors, then the similarity between two words or sentences can be measured by the distance between their vectors. Inspired by these, we put forward a novel log parsing framework, named LPV (Log Parser based on Vectorization), which performs log parsing by converting log messages and log templates into vectors, with the help of a vectorization method in NLP. LPV incorporates offline and online log parsing. In the offline log parsing, the central idea is to first represent log messages with vectors, so that the similarity between two log messages can be measured by the distance between their vectors, then we cluster log messages via clustering the vectors, and finally we extract log templates from the resultant clusters. By the end of the offline log parsing, each log template is assigned with an average vector, so that in the online log parsing, the similarity between an incoming log message and each log template can also be measured by the distance between their vectors. Extensive experiments have been conducted based on several public log datasets to evaluate LPV with three different vectorization methods. The results demonstrate that, with a proper vectorization method, LPV performs competitive with state-of-the-art log parsing methods, in both effectiveness and efficiency.

Read full abstract

SummaryOne of the ways to analyze unstructured log messages from large‐scale IT systems is to classify log messages with log templates generated by template generation methods. However, there is currently no common knowledge pertained to the comparison and practical use of log template generation methods because they are implemented on the basis of diverse environments. To this end, we design and implement amulog, a general log analysis framework for comparing and combining diverse log template generation methods. Amulog consists of three key functions: (1) parsing log messages into headers and segmented messages, (2) classifying the log messages using a scalable template‐matching method, and (3) storing the structured data in a database. This framework helps us easily utilize time‐series data corresponding to the log templates for further analysis. We evaluate amulog with a log dataset collected from a nation‐wide academic network and demonstrate that it classifies the log data in a reasonable amount of time even with over 100,000 log template candidates. The template‐matching method in amulog also reduces 75% processing time for template generation and keeps the accuracy when combined with an existing structure‐based template generation method. In order to show the effectiveness of amulog in comparing log template generation methods, we demonstrate that the appropriate template generation methods and accuracy metrics largely depend on the purpose of further analysis by comparing the accuracy of six existing log template generation methods with 10 different accuracy metrics on amulog.

Read full abstract

Log Template Research Articles

Articles published on Log Template

TPLAD: Template-Parsed Log Anomaly Detection for Electrical Database Systems

XDrain: Effective log parsing in log streams using fixed-depth forest

DLLog: An Online Log Parsing Approach for Large-Scale System

Polo: Adaptive Trie-Based Log Parser for Anomaly Detection

An Intelligent Framework for Log Anomaly Detection Based on Log Template Extraction

LPV: A Log Parsing Framework Based on Vectorization

Brain: Log Parsing With Bidirectional Parallel Tree

PVE: A log parsing method based on VAE using embedding vectors

LogPal: A Generic Anomaly Detection Scheme of Heterogeneous Logs for Network Systems

MDFULog: Multi-Feature Deep Fusion of Unstable Log Anomaly Detection Model

PatCluster: A Top-Down Log Parsing Method Based on Frequent Words

Power monitoring system log analysis method based on K-Means clustering

Amulog: A general log analysis framework for comparison and combination of diverse template generation methods*

Log Sequence Anomaly Detection Based on Local Information Extraction and Globally Sparse Transformer Model

LTmatch: A Method to Abstract Pattern from Unstructured Log

HitAnomaly: Hierarchical Transformers for Anomaly Detection in System Log

Log Template Extraction Algorithm Based on Normalized Feature Discrimination

Priolog: Mining Important Logs via Temporal Analysis and Prioritization

An efficient real-time data collection framework on petascale systems

An online log template extraction method based on hierarchical clustering

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Log Template Research Articles

Articles published on Log Template

TPLAD: Template-Parsed Log Anomaly Detection for Electrical Database Systems

XDrain: Effective log parsing in log streams using fixed-depth forest

DLLog: An Online Log Parsing Approach for Large-Scale System

Polo: Adaptive Trie-Based Log Parser for Anomaly Detection

An Intelligent Framework for Log Anomaly Detection Based on Log Template Extraction

LPV: A Log Parsing Framework Based on Vectorization

Brain: Log Parsing With Bidirectional Parallel Tree

PVE: A log parsing method based on VAE using embedding vectors

LogPal: A Generic Anomaly Detection Scheme of Heterogeneous Logs for Network Systems

MDFULog: Multi-Feature Deep Fusion of Unstable Log Anomaly Detection Model

PatCluster: A Top-Down Log Parsing Method Based on Frequent Words

Power monitoring system log analysis method based on K-Means clustering

Amulog: A general log analysis framework for comparison and combination of diverse template generation methods*

Log Sequence Anomaly Detection Based on Local Information Extraction and Globally Sparse Transformer Model

LTmatch: A Method to Abstract Pattern from Unstructured Log

HitAnomaly: Hierarchical Transformers for Anomaly Detection in System Log

Log Template Extraction Algorithm Based on Normalized Feature Discrimination

Priolog: Mining Important Logs via Temporal Analysis and Prioritization

An efficient real-time data collection framework on petascale systems

An online log template extraction method based on hierarchical clustering