J 2023

Better than XML: Towards a lexicographic markup language

MĚCHURA, Michal

Basic information

Original name

Better than XML: Towards a lexicographic markup language

Authors

MĚCHURA, Michal (203 Czech Republic, guarantor, belonging to the institution)

Edition

Data & Knowledge Engineering, Elsevier, 2023, 0169-023X

Other information

Language

English

Type of outcome

Článek v odborném periodiku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Netherlands

Confidentiality degree

není předmětem státního či obchodního tajemství

References:

Impact factor

Impact factor: 2.500 in 2022

RIV identification code

RIV/00216224:14330/23:00131137

Organization unit

Faculty of Informatics

UT WoS

001054114900001

Keywords in English

XML; JSON; YAML; Verbose markup;Data models; Lexicography

Tags

International impact, Reviewed
Změněno: 8/4/2024 06:40, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

This article takes a critical look at how XML is used in lexicography and asks the question, why do dictionary entries often end up looking so complex when encoded in XML? The main reason for the perceived complexity of XML-encoded dictionaries is purely structural markup: XML elements which contain other XML elements instead of human-readable text. The overabundance of purely structural markup in lexicography is caused by the nature of lexicographic content, much of which is inherently headed. XML has no support for headedness and neither do other commonly used languages such as JSON and YAML. In this article we propose a number of constraints and extensions to XML, JSON and YAML which add support for headedness into these languages.