k 2023

Maximising the Power of Semantic Textual Data : CASTEMO Data Collection and the InkVisitor Application

ZBÍRAL, David; Robert Laurence John SHAW; Tomáš HAMPEJS a Adam MERTEL

Základní údaje

Originální název

Maximising the Power of Semantic Textual Data : CASTEMO Data Collection and the InkVisitor Application

Název anglicky

Maximising the Power of Semantic Textual Data : CASTEMO Data Collection and the InkVisitor Application

Vydání

DH2023: Collaboration as Opportunity, 10.-14. 07. 2023, Graz, 2023

Další údaje

Typ výsledku

Prezentace na konferencích

Utajení

není předmětem státního či obchodního tajemství

Klíčová slova česky

sémantické modelování textu; digital humanities; inkvizice

Klíčová slova anglicky

semantic text modelling; digital humanities; inquisition

Příznaky

Mezinárodní význam, Recenzováno
Změněno: 15. 12. 2023 10:19, Mgr. Jolana Navrátilová

Anotace

V originále

The authors present Computer-Assisted Semantic Text Modelling (CASTEMO), a novel but now well-developed approach to transformation of textual resources into rich structured data, CASTEMO knowledge graphs, stored in JSON-based document databases. They also introduce the open-source InkVisitor research environment which assists in CASTEMO data collection workflow. Both the workflow and the environment were developed within the ERC-funded Dissident Networks Project (DISSINET] but are now made available to use by other researchers and projects. The CASTEMO data collection approach aims to preserve the rich qualitative texture of texts and at the same time produce structured data suitable for computational analysis. It preserves the contextual embeddedness of knowledge and the natural features of human knowledge, such as conflicting evidence and information given in a non-indicative modality, e.g. questions and conditional sentences. It thus answers a significant challenge in the digital study of texts, where a decision must often be taken to prefer extracting content or analysing discursive features, as well as whether to focus on distant or close reading. With CASTEMO, these levels can be readily interwoven into “scalable reading”. This presentation introduces the essential data modelling principles of CASTEMO, as well as its use cases and advantages for certain types of study. It also introduces the InkVisitor research environment.

Anglicky

The authors present Computer-Assisted Semantic Text Modelling (CASTEMO), a novel but now well-developed approach to transformation of textual resources into rich structured data, CASTEMO knowledge graphs, stored in JSON-based document databases. They also introduce the open-source InkVisitor research environment which assists in CASTEMO data collection workflow. Both the workflow and the environment were developed within the ERC-funded Dissident Networks Project (DISSINET] but are now made available to use by other researchers and projects. The CASTEMO data collection approach aims to preserve the rich qualitative texture of texts and at the same time produce structured data suitable for computational analysis. It preserves the contextual embeddedness of knowledge and the natural features of human knowledge, such as conflicting evidence and information given in a non-indicative modality, e.g. questions and conditional sentences. It thus answers a significant challenge in the digital study of texts, where a decision must often be taken to prefer extracting content or analysing discursive features, as well as whether to focus on distant or close reading. With CASTEMO, these levels can be readily interwoven into “scalable reading”. This presentation introduces the essential data modelling principles of CASTEMO, as well as its use cases and advantages for certain types of study. It also introduces the InkVisitor research environment.

Návaznosti

101000442, interní kód MU
Název: Networks of Dissent: Computational Modelling of Dissident and Inquisitorial Cultures in Medieval Europe (Akronym: DISSINET)
Investor: Evropská unie, Networks of Dissent: Computational Modelling of Dissident and Inquisitorial Cultures in Medieval Europe, ERC (Excellent Science)