ELIÁŠ, Petr, Jan SEDMIDUBSKÝ and Pavel ZEZULA. Understanding the Gap between 2D and 3D Skeleton-Based Action Recognition. Online. In 21st IEEE International Symposium on Multimedia (ISM). Neuveden: IEEE Computer Society, 2019, p. 192-195. ISBN 978-1-72815-606-4. Available from: https://dx.doi.org/10.1109/ISM46123.2019.00041.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Understanding the Gap between 2D and 3D Skeleton-Based Action Recognition
Authors ELIÁŠ, Petr (203 Czech Republic, belonging to the institution), Jan SEDMIDUBSKÝ (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition Neuveden, 21st IEEE International Symposium on Multimedia (ISM), p. 192-195, 4 pp. 2019.
Publisher IEEE Computer Society
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10200 1.2 Computer and information sciences
Confidentiality degree is not subject to a state or trade secret
Publication form electronic version available online
RIV identification code RIV/00216224:14330/19:00107709
Organization unit Faculty of Informatics
ISBN 978-1-72815-606-4
Doi http://dx.doi.org/10.1109/ISM46123.2019.00041
UT WoS 000528909200030
Keywords in English 2D skeleton data;3D skeleton data;action recognition;LSTM;motion data understanding
Tags DISA, firank_B
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 28/4/2020 00:07.
Abstract
Large volumes of RGB video data are recorded and processed every day. One of the embedded data modality within these videos is the information about human motions. Up to now, this information has been almost unfeasible to extract, and thus human-motion understanding research has been mainly limited to 3D skeleton data captured by dedicated hardware only. However, with recent advances in computer vision, it is possible to estimate 2D skeleton sequences from ordinary videos quite accurately. Such 2D skeleton data possess an excellent potential for future motion understanding applications. In this paper, we adopt a state-of-the-art bidirectional LSTM network to analyze the accuracy gap in the expressive power of 2D and 3D skeleton data recorded simultaneously on a high number of 20k human actions. We further examine how the missing depth information and fluctuations in 2D skeleton sizes influence the recognition rate. We also demonstrate the suitability of 2D skeleton data for general daily activity recognition by reporting baselines on the PKU-MMD dataset.
Links
GA19-02033S, research and development projectName: Vyhledávání, analytika a anotace datových toků lidských pohybů
Investor: Czech Science Foundation
PrintDisplayed: 27/4/2024 04:38