TICHÝ, Lubomír, Milan CHYTRÝ and Petr ŠMARDA. Evaluating the stability of the classification of community data. Ecography. 2011, vol. 34, No 5, p. 807-813. ISSN 0906-7590. Available from: https://dx.doi.org/10.1111/j.1600-0587.2010.06599.x.
Basic information
Original name Evaluating the stability of the classification of community data
Name in Czech Hodnocení stability klasifikace dat o společenstvech
Authors TICHÝ, Lubomír (203 Czech Republic, guarantor, belonging to the institution), Milan CHYTRÝ (203 Czech Republic, belonging to the institution) and Petr ŠMARDA (203 Czech Republic, belonging to the institution).
Edition Ecography, 2011, 0906-7590.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10600 1.6 Biological sciences
Country of publisher United States of America
Confidentiality degree is not subject to a state or trade secret
WWW Fulltext on Wiley Online Library
Impact factor 4.188
RIV identification code RIV/00216224:14310/11:00050300
Organization unit Faculty of Science
Doi http://dx.doi.org/10.1111/j.1600-0587.2010.06599.x
UT WoS 000296972200011
Keywords in English Clustering methods; Vegetation classification strategies; Validation; Bootstrap; Algorithm; Fidelity
Tags AKR, rivok
Tags International impact, Reviewed
Changed by prof. RNDr. Milan Chytrý, Ph.D., učo 871. Changed: 3/1/2012 11:01.
Abstract
We propose a method for a posteriori evaluation of classification stability which compares the classification of sites in the original data set (a matrix of species by sites) with classifications of subsets of its sites created by without-replacement bootstrap resampling. Site assignments to clusters of the original classification and to clusters of the classification of each subset are compared using Goodman-Kruskal's lambda index. Many resampled subsets are classified and the mean of lambda values calculated for the classifications of these subsets is used as an estimation of classification stability. Furthermore, the mean of the lambda values based on different resampled subsets, calculated for each site of the data set separately, can be used as a measure of the influence of particular sites on classification stability. This method was tested on several artificial data sets classified by commonly used clustering methods and on a real data set of forest vegetation plots.
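The procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions made for illustration only: the symmetric form of Goodman-Kruskal's lambda, a subset size of 80 % of the sites, 100 resampled subsets, per-site values averaged over the subsets that contain the site, and a toy one-dimensional classifier (`split_at_largest_gap`) standing in for a real clustering method applied to a species-by-sites matrix. All function and parameter names are hypothetical.

```python
import random
from collections import Counter


def goodman_kruskal_lambda(a, b):
    """Symmetric Goodman-Kruskal lambda between two label sequences (assumed form)."""
    n = len(a)
    table = Counter(zip(a, b))          # contingency table of the two partitions
    max_row_total = max(Counter(a).values())
    max_col_total = max(Counter(b).values())
    row_max, col_max = {}, {}
    for (r, c), k in table.items():
        row_max[r] = max(row_max.get(r, 0), k)
        col_max[c] = max(col_max.get(c, 0), k)
    denom = 2 * n - max_row_total - max_col_total
    if denom == 0:
        # Both partitions consist of a single cluster; lambda is undefined.
        # Returning 1.0 is a convention chosen for this sketch only.
        return 1.0
    numer = (sum(row_max.values()) + sum(col_max.values())
             - max_row_total - max_col_total)
    return numer / denom


def classification_stability(sites, classify, n_subsets=100, fraction=0.8, seed=0):
    """Return (mean lambda, per-site mean lambda) over resampled subsets.

    `classify` is any function mapping a list of sites to a list of cluster labels.
    """
    rng = random.Random(seed)
    original = classify(sites)
    n = len(sites)
    size = max(2, round(fraction * n))
    lambdas = []
    site_sum = [0.0] * n
    site_count = [0] * n
    for _ in range(n_subsets):
        # Draw a subset of sites without replacement and classify it again.
        idx = sorted(rng.sample(range(n), size))
        subset_labels = classify([sites[i] for i in idx])
        lam = goodman_kruskal_lambda([original[i] for i in idx], subset_labels)
        lambdas.append(lam)
        for i in idx:
            site_sum[i] += lam
            site_count[i] += 1
    mean_lambda = sum(lambdas) / len(lambdas)
    per_site = [s / c if c else None for s, c in zip(site_sum, site_count)]
    return mean_lambda, per_site


def split_at_largest_gap(values):
    """Toy classifier: split 1-D values into two clusters at the largest gap."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    gaps = [values[order[k + 1]] - values[order[k]] for k in range(len(order) - 1)]
    cut = gaps.index(max(gaps))
    labels = [0] * len(values)
    for rank, i in enumerate(order):
        labels[i] = 0 if rank <= cut else 1
    return labels


# Hypothetical data: two well-separated groups of five sites each.
toy_sites = [0.1, 0.2, 0.3, 0.4, 0.5, 5.1, 5.2, 5.3, 5.4, 5.5]
mean_lambda, per_site = classification_stability(toy_sites, split_at_largest_gap)
print(mean_lambda)
```

In this sketch a mean lambda close to 1 indicates that the subsets reproduce the original clusters, while lower values indicate an unstable classification. Replacing `split_at_largest_gap` with any clustering routine that returns one label per site leaves the rest of the sketch unchanged.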
Links
GA206/09/0329, research and development project
Name: Vegetation of the Czech Republic: completion of the national survey of plant communities (Vegetace České republiky: dokončení národního přehledu rostlinných společenstev)
Investor: Czech Science Foundation
MSM0021622416, plan (intention)
Name: Diversity of Biotic Communities and Populations: Causal Analysis of variation in space and time (Diverzita biotických společenstev a populací: kauzální analýza variability v prostoru a čase)
Investor: Ministry of Education, Youth and Sports of the CR