Detailed Information on Publication Record
2016
Practical Multi-pattern Matching Approach for Fast and Scalable Log Abstraction
TOVARŇÁK, DanielBasic information
Original name
Practical Multi-pattern Matching Approach for Fast and Scalable Log Abstraction
Authors
TOVARŇÁK, Daniel (203 Czech Republic, guarantor, belonging to the institution)
Edition
Lisbon, Portugal, ICSOFT-EA 2016 - Proceedings of the 11th International Joint Conference on Software Technologies, p. 319-329, 11 pp. 2016
Publisher
SCITEPRESS
Other information
Language
English
Type of outcome
Stať ve sborníku
Field of Study
10201 Computer sciences, information science, bioinformatics
Confidentiality degree
není předmětem státního či obchodního tajemství
Publication form
printed version "print"
References:
RIV identification code
RIV/00216224:14330/16:00091170
Organization unit
Faculty of Informatics
ISBN
978-989-758-194-6
UT WoS
000391095600037
Keywords in English
Log Processing; Pattern Matching; Log Abstraction; Big Data
Změněno: 13/5/2020 19:19, RNDr. Pavel Šmerk, Ph.D.
Abstract
V originále
Log abstraction, i.e. the separation of static and dynamic part of log message, is becoming an indispensable task when processing logs generated by large enterprise systems and networks. In practice, the log message types are described via regex matching patterns that are in turn used to actually facilitate the abstraction process. Although the area of multi-regex matching is well studied, there is a lack of suitable practical implementations available for common programming languages. In this paper we present an alternative approach to multi-pattern matching for the purposes of log abstraction that is based on a trie-like data structure we refer to as regex trie. REtrie is easy to implement and the real world experiments show its scalability and good performance even for thousands of matching patterns.