Efficient Combination of Classifiers for 3D Action Recognition

SEDMIDUBSKÝ, Jan and Pavel ZEZULA. Efficient Combination of Classifiers for 3D Action Recognition. Multimedia Systems. Springer, 2021, vol. 27, No 5, p. 941-952. ISSN 0942-4962. Available from: https://dx.doi.org/10.1007/s00530-021-00767-9.

Other formats: BibTeX LaTeX RIS

Basic information
Original name	Efficient Combination of Classifiers for 3D Action Recognition
Authors	SEDMIDUBSKÝ, Jan (203 Czech Republic, guarantor, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition	Multimedia Systems, Springer, 2021, 0942-4962.

Other information
Original language	English
Type of outcome	Article in a journal
Field of Study	10200 1.2 Computer and information sciences
Country of publisher	United States of America
Confidentiality degree	is not subject to a state or trade secret
WWW	URL
Impact factor	Impact factor: 2.603
RIV identification code	RIV/00216224:14330/21:00118857
Organization unit	Faculty of Informatics
Doi	http://dx.doi.org/10.1007/s00530-021-00767-9
UT WoS	000628724200001
Keywords in English	action recognition;skeleton sequence;fusion;augmentation;normalization
Tags	DISA
Tags	International impact, Reviewed
Changed by	Changed by: doc. RNDr. Jan Sedmidubský, Ph.D., učo 60474. Changed: 19/4/2022 21:42.

Abstract

The popular task of 3D human action recognition is almost exclusively solved by training deep-learning classifiers. To achieve high recognition accuracy, input 3D actions are often pre-processed by various normalization or augmentation techniques. However, it is not computationally feasible to train a classifier for each possible variant of training data in order to select the best-performing combination of pre-processing techniques for a given dataset. In this paper, we propose an evaluation procedure that determines the best combination in a very efficient way. In particular, we only train one independent classifier for each available pre-processing technique and estimate the accuracy of a specific combination by efficient fusion of the corresponding classification results based on a strict majority vote rule. In addition, for the best-ranked combination, we can retrospectively apply the normalized/augmented variants of input data to train only a single classifier. This enables to decide whether it is generally better to train a single model, or rather a set of independent classifiers whose results are fused within the classification phase. We evaluate the experiments on single-subject as well as person-interaction datasets of 3D skeleton sequences and all combinations of up to 16 normalization and augmentation techniques, some of them also proposed in this paper.

Links
GA19-02033S, research and development project	Name: Vyhledávání, analytika a anotace datových toků lidských pohybů
GA19-02033S, research and development project	Investor: Czech Science Foundation

PrintDisplayed: 27/4/2024 12:58

Efficient Combination of Classifiers for 3D Action Recognition

Other applications