Semi-automatic mining of correlated data from a complex
database: Correlation network visualization

LEXA, Matej a Radovan LAPÁR. Semi-automatic mining of correlated data from a complex database: Correlation network visualization. Online. In Computational Advances in Bio and Medical Sciences (ICCABS), 2016 IEEE 6th International Conference on. New York: IEEE, 2016, s. 1-2. ISBN 978-1-5090-4199-2. Dostupné z: https://dx.doi.org/10.1109/ICCABS.2016.7802783.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	Semi-automatic mining of correlated data from a complex database: Correlation network visualization
Autoři	LEXA, Matej (703 Slovensko, garant, domácí) a Radovan LAPÁR (703 Slovensko, domácí).
Vydání	New York, Computational Advances in Bio and Medical Sciences (ICCABS), 2016 IEEE 6th International Conference on, od s. 1-2, 2 s. 2016.
Nakladatel	IEEE

Další údaje
Originální jazyk	angličtina
Typ výsledku	Stať ve sborníku
Obor	10201 Computer sciences, information science, bioinformatics
Stát vydavatele	Spojené státy
Utajení	není předmětem státního či obchodního tajemství
Forma vydání	elektronická verze "online"
WWW	URL URL
Kód RIV	RIV/00216224:14330/16:00092617
Organizační jednotka	Fakulta informatiky
ISBN	978-1-5090-4199-2
ISSN	2473-4659
Doi	http://dx.doi.org/10.1109/ICCABS.2016.7802783
UT WoS	000392416700019
Klíčová slova anglicky	data mining; biomedical database; denormalization; visualization; correlation network
Příznaky	Mezinárodní význam, Recenzováno
Změnil	Změnil: RNDr. Pavel Šmerk, Ph.D., učo 3880. Změněno: 13. 5. 2020 19:33.

Anotace

In previous work we have addressed the issue of frequent ad-hoc queries in deeply-structured databases. We wrote a library of functions AutodenormLib.py for issuing proper JOIN commands to denormalize an arbitrary subset of stored data for downstream processing. This may include statistical analysis, visualization or machine learning. Here, we visualize the content of the Thalamoss biomedical database as a correlation network. The network is created by calculating pairwise correlations through all pairs of variables, whether they be numerical, ordinal or nominal. We subsequently construct the network over the entire set of variables, clustering variables with similar effects to discover group relationships between the various biomedical characteristics. We use a semi-automatic procedure that makes the selection of all pairs possible and discuss issues of dealing with different types of variables. This is done either by limiting the analysis to numerical and ordinal ones, or by binning their values into intervals of values. Knowledge extracted from the data in this mode can be used to select variables for statistical models, or as markers of medically interesting conditions.

Návaznosti
7E13011, projekt VaV	Název: THALAssaemia MOdular Stratification System for personalized therapy of beta-thalassemia (Akronym: THALAMOSS)
7E13011, projekt VaV	Investor: Ministerstvo školství, mládeže a tělovýchovy ČR, THALAssaemia MOdular Stratification System for personalized therapy of beta-thalassemia

VytisknoutZobrazeno: 24. 8. 2024 10:35

Semi-automatic mining of correlated data from a complex database: Correlation network visualization

Další aplikace