BARTOŇ, Vojtěch and Helena SKUTKOVA. Data Transformation for Clustering Utilization for Feature Detection in Mass Spectrometry. In Rojas I., Valenzuela O., Rojas F., Herrera L.J., Ortuño F. Bioinformatics and Biomedical Engineering (IWBBIO 2022) : Lecture Notes in Computer Science, vol 13347. Cham: Springer, 2022, p. 288-299. ISBN 978-3-031-07801-9. Available from: https://dx.doi.org/10.1007/978-3-031-07802-6_24.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Data Transformation for Clustering Utilization for Feature Detection in Mass Spectrometry
Authors BARTOŇ, Vojtěch (203 Czech Republic, guarantor, belonging to the institution) and Helena SKUTKOVA.
Edition Cham, Bioinformatics and Biomedical Engineering (IWBBIO 2022) : Lecture Notes in Computer Science, vol 13347, p. 288-299, 12 pp. 2022.
Publisher Springer
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Switzerland
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL
Impact factor Impact factor: 0.402 in 2005
RIV identification code RIV/00216224:14310/22:00128788
Organization unit Faculty of Science
ISBN 978-3-031-07801-9
ISSN 0302-9743
Doi http://dx.doi.org/10.1007/978-3-031-07802-6_24
UT WoS 000871766000024
Keywords in English Mass spectrometry; Clustering; Feature identification
Tags rivok
Tags International impact, Reviewed
Changed by Changed by: Mgr. Marie Šípková, DiS., učo 437722. Changed: 1/3/2023 10:11.
Abstract
Feature detection and peak detection are one of the first steps of mass spectrometry data processing. This data comes in large volumes; thus, the processing needs to be optimized, not overloaded. State-of-the-art clustering algorithms can not perform feature detection for several reasons. First issue is the volume of the data, second is the disparity of the sampling frequency in the MZ and RT axis. Here we show the data transformation to utilize the clustering algorithms without the need to redefine its kernel. Data are first pre-clustered to obtain regions that can be processed independently. Then we transform the data so that the numerical differences between consecutive points should be the same in both space axes. We applied a set of clustering algorithms for each region to find the features, and we compared the result with the Gridmass peak detector. These findings may facilitate better utilization of the 2D clustering method as feature detectors for mass spectra.
Links
EF17_043/0009632, research and development projectName: CETOCOEN Excellence
LM2018121, research and development projectName: Výzkumná infrastruktura RECETOX (Acronym: RECETOX RI)
Investor: Ministry of Education, Youth and Sports of the CR, RECETOX RI
PrintDisplayed: 29/6/2024 15:38