D 2011

Is On-Line Data Analysis Safety? Pitfalls Steaming from Automated Processing of Heterogeneous Environmental Data and Possible Solutions

JARKOVSKÝ, Jiří; Ladislav DUŠEK a Eva JANOUŠOVÁ

Základní údaje

Originální název

Is On-Line Data Analysis Safety? Pitfalls Steaming from Automated Processing of Heterogeneous Environmental Data and Possible Solutions

Vydání

Neuveden, Environmental Software Systems: Frameworks of Environment, IFIP Advances in Information and Communication Technology, vol. 359, od s. 486-490, 5 s. 2011

Nakladatel

Springer

Další údaje

Jazyk

angličtina

Typ výsledku

Stať ve sborníku

Obor

30000 3. Medical and Health Sciences

Stát vydavatele

Německo

Utajení

není předmětem státního či obchodního tajemství

Forma vydání

paměťový nosič (CD, DVD, flash disk)

Označené pro přenos do RIV

Ano

Kód RIV

RIV/00216224:14110/11:00052824

Organizační jednotka

Lékařská fakulta

ISBN

978-3-642-22284-9

ISSN

UT WoS

000306579200052

Klíčová slova anglicky

classification; nonparametric multivariate analysis; heterogeneous data
Změněno: 22. 1. 2014 23:58, RNDr. Eva Koriťáková, Ph.D.

Anotace

V originále

The current situation in environmental monitoring is characterized by increasing amount of data from monitoring networks together with increasing requirements on joining of these data from various sources in comprehensive databases and their usage for decision support in environmental protection and management. The automated analysis of such a heterogeneous datasets is a complicated process, rich in statistical pitfalls. There is a number of methods for multivariate classification of objects, e.g. logistic regression, discriminant analysis or neural networks; however, most of commonly used classification techniques have prerequisites about distribution of data, are computationally demanding or their model can be considered as “black box”. Keeping these facts in mind, we attempted to develop a robust multivariate method suitable for classification of unknown cases with minimum sensitivity to data distribution problems; and thus, suitable for routine use in practice.