2016
Practical Multi-pattern Matching Approach for Fast and Scalable Log Abstraction
TOVARŇÁK, DanielZákladní údaje
Originální název
Practical Multi-pattern Matching Approach for Fast and Scalable Log Abstraction
Autoři
TOVARŇÁK, Daniel (203 Česká republika, garant, domácí)
Vydání
Lisbon, Portugal, ICSOFT-EA 2016 - Proceedings of the 11th International Joint Conference on Software Technologies, od s. 319-329, 11 s. 2016
Nakladatel
SCITEPRESS
Další údaje
Jazyk
angličtina
Typ výsledku
Stať ve sborníku
Obor
10201 Computer sciences, information science, bioinformatics
Utajení
není předmětem státního či obchodního tajemství
Forma vydání
tištěná verze "print"
Odkazy
Kód RIV
RIV/00216224:14330/16:00091170
Organizační jednotka
Fakulta informatiky
ISBN
978-989-758-194-6
UT WoS
000391095600037
Klíčová slova anglicky
Log Processing; Pattern Matching; Log Abstraction; Big Data
Změněno: 13. 5. 2020 19:19, RNDr. Pavel Šmerk, Ph.D.
Anotace
V originále
Log abstraction, i.e. the separation of static and dynamic part of log message, is becoming an indispensable task when processing logs generated by large enterprise systems and networks. In practice, the log message types are described via regex matching patterns that are in turn used to actually facilitate the abstraction process. Although the area of multi-regex matching is well studied, there is a lack of suitable practical implementations available for common programming languages. In this paper we present an alternative approach to multi-pattern matching for the purposes of log abstraction that is based on a trie-like data structure we refer to as regex trie. REtrie is easy to implement and the real world experiments show its scalability and good performance even for thousands of matching patterns.