D 2023

SegmentCodeList: Unsupervised Representation Learning for Human Skeleton Data Retrieval

SEDMIDUBSKÝ, Jan, Fabio CARRARA and Giuseppe AMATO

Basic information

Original name

SegmentCodeList: Unsupervised Representation Learning for Human Skeleton Data Retrieval

Authors

SEDMIDUBSKÝ, Jan (203 Czech Republic, guarantor, belonging to the institution), Fabio CARRARA and Giuseppe AMATO

Edition

Cham, 45th European Conference on Information Retrieval (ECIR), p. 110-124, 15 pp. 2023

Publisher

Springer

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10200 1.2 Computer and information sciences

Country of publisher

Czech Republic

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

electronic version available online

Impact factor

Impact factor: 0.402 in 2005

RIV identification code

RIV/00216224:14330/23:00130177

Organization unit

Faculty of Informatics

ISBN

978-3-031-28237-9

ISSN

UT WoS

000995489700008

Keywords in English

3D skeleton sequence;segment similarity;unsupervised feature learning;Variational AutoEncoder;segment code list;action retrieval

Tags

International impact, Reviewed
Změněno: 7/4/2024 22:39, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

Recent progress in pose-estimation methods enables the extraction of sufficiently-precise 3D human skeleton data from ordinary videos, which offers great opportunities for a wide range of applications. However, such spatio-temporal data are typically extracted in the form of a continuous skeleton sequence without any information about semantic segmentation or annotation. To make the extracted data reusable for further processing, there is a need to access them based on their content. In this paper, we introduce a universal retrieval approach that compares any two skeleton sequences based on temporal order and similarities of their underlying segments. The similarity of segments is determined by their content-preserving low-dimensional code representation that is learned using the Variational AutoEncoder principle in an unsupervised way. The quality of the proposed representation is validated in retrieval and classification scenarios; our proposal outperforms the state-of-the-art approaches in effectiveness and reaches speed-ups up to 64x on common skeleton sequence datasets.

Links

EF16_019/0000822, research and development project
Name: Centrum excelence pro kyberkriminalitu, kyberbezpečnost a ochranu kritických informačních infrastruktur