On Selection of Efficient Sequential Pattern Mining Algorithm
Based on Characteristics of Data

PESCHEL, Jakub, Michal BATKO a Pavel ZEZULA. On Selection of Efficient Sequential Pattern Mining Algorithm Based on Characteristics of Data. Online. In 2022 IEEE International Symposium on Multimedia (ISM). Neuveden: IEEE, 2022, s. 202-205. ISBN 978-1-6654-7173-2. Dostupné z: https://dx.doi.org/10.1109/ISM55400.2022.00044.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	On Selection of Efficient Sequential Pattern Mining Algorithm Based on Characteristics of Data
Autoři	PESCHEL, Jakub (203 Česká republika, garant, domácí), Michal BATKO (203 Česká republika, domácí) a Pavel ZEZULA (203 Česká republika, domácí).
Vydání	Neuveden, 2022 IEEE International Symposium on Multimedia (ISM), od s. 202-205, 4 s. 2022.
Nakladatel	IEEE

Další údaje
Originální jazyk	angličtina
Typ výsledku	Stať ve sborníku
Obor	10200 1.2 Computer and information sciences
Utajení	není předmětem státního či obchodního tajemství
Forma vydání	elektronická verze "online"
WWW	URL
Kód RIV	RIV/00216224:14330/22:00127166
Organizační jednotka	Fakulta informatiky
ISBN	978-1-6654-7173-2
Doi	http://dx.doi.org/10.1109/ISM55400.2022.00044
UT WoS	000964457800037
Klíčová slova anglicky	Sequential Pattern Mining; GSP; SPAM; Prefix-span
Štítky	DISA, firank_B
Příznaky	Mezinárodní význam, Recenzováno
Změnil	Změnil: RNDr. Pavel Šmerk, Ph.D., učo 3880. Změněno: 16. 8. 2023 13:26.

Anotace

Sequential pattern mining, which is one of the core tasks in data mining, allows to gain insight into datasets with complex sequential data. As the task is computationally intensive, there are many different approaches that are suitable for various types of data. We explore the possibility of optimising the analysis of sequences based on the characteristic (quickly obtainable) properties of the analysed data. In this paper, we propose five such characteristics and explore the efficiency of three algorithms that are representatives of the three main approaches to sequential pattern mining. We discovered that it is possible to save up to 21% of the search time compared to the best-performing representative. We trained a decision tree model with 87% accuracy of choosing the best algorithm for selected data based on these characteristics.

Návaznosti
EF16_019/0000822, projekt VaV	Název: Centrum excelence pro kyberkriminalitu, kyberbezpečnost a ochranu kritických informačních infrastruktur

VytisknoutZobrazeno: 19. 7. 2024 12:19

On Selection of Efficient Sequential Pattern Mining Algorithm Based on Characteristics of Data

Další aplikace