Detailed Information on Publication Record
2022
Human-Generated Web Data Disentanglement for Complex Event Processing
BLANCO SÁNCHEZ, José Miguel, Mouzhi GE and Tomáš PITNERBasic information
Original name
Human-Generated Web Data Disentanglement for Complex Event Processing
Authors
BLANCO SÁNCHEZ, José Miguel (724 Spain, guarantor, belonging to the institution), Mouzhi GE and Tomáš PITNER (203 Czech Republic, belonging to the institution)
Edition
Verona, Italy, 26th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, KES 2022, p. 1341-1349, 9 pp. 2022
Publisher
Elsevier
Other information
Language
English
Type of outcome
Stať ve sborníku
Field of Study
10201 Computer sciences, information science, bioinformatics
Country of publisher
Netherlands
Confidentiality degree
není předmětem státního či obchodního tajemství
Publication form
printed version "print"
RIV identification code
RIV/00216224:14330/22:00126674
Organization unit
Faculty of Informatics
ISSN
Keywords in English
Complex Event Processing; Semantic Web; Data Disentanglement; Web Data Preprocessing;
Změněno: 6/4/2023 07:36, RNDr. Pavel Šmerk, Ph.D.
Abstract
V originále
In social media, human-generated web data from real-world events have become exponentially complex due to the chaotic and spontaneous features of natural language. This may create an information overload for the information consumers, and in turn not easily digest a large amount of information in a limited time. To tackle this issue, we propose to use Complex Event Processing (CEP) and semantic web reasoners to disentangle the human-generated data and present users with only relevant and important data. However, one of the key obstacles is that the human-generated data can have no structured meaning sometimes even for the speaker, hindering the output of the CEP. Therefore, in order to adapt to the CEP inputs, we present two different techniques that allow for the discrimination and digestion of value of human-generated data. The first one relies on the Variable Sharing Property that was developed for relevance logics, while the second one is based on semantic equivalence and natural language processing. The results can be given to CEP for further semantic reasoning and generate digested information for users.