J 2022

Tracking subjects and detecting relationships in crowded city videos

ELIÁŠ, Petr, Matúš MACKO, Jan SEDMIDUBSKÝ and Pavel ZEZULA

Basic information

Original name

Tracking subjects and detecting relationships in crowded city videos

Authors

ELIÁŠ, Petr (203 Czech Republic, guarantor, belonging to the institution), Matúš MACKO (703 Slovakia, belonging to the institution), Jan SEDMIDUBSKÝ (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution)

Edition

Multimedia Tools and Applications, Springer, 2022, 1380-7501

Other information

Language

English

Type of outcome

Článek v odborném periodiku

Field of Study

10200 1.2 Computer and information sciences

Country of publisher

United States of America

Confidentiality degree

není předmětem státního či obchodního tajemství

References:

Impact factor

Impact factor: 3.600

RIV identification code

RIV/00216224:14330/22:00129021

Organization unit

Faculty of Informatics

UT WoS

000743891900007

Keywords in English

Multi-subject tracking;Relationship detection;2D skeleton sequences;Video analysis;Smart cities

Tags

Tags

International impact, Reviewed
Změněno: 10/3/2023 12:58, doc. RNDr. Jan Sedmidubský, Ph.D.

Abstract

V originále

Multi-subject tracking in crowded videos is an established yet challenging research direction in computer vision and information processing. High applicability of multi-subject tracking is demonstrated in smart cities (e.g., public safety, crowd management, urban planning), autonomous driving vehicles, robotic vision, or psychology (e.g., social interaction and crowd behavior understanding). In this work, we propose a real-time approach that reveals tracks of subjects in ordinary videos, captured in highly populated pedestrian areas, such as squares, malls, and stations. The tracks are discovered based on the proximity of detected bounding boxes of subjects in consecutive video frames. The reduction of track fragmentation and identity switching is achieved by the re-identification phase that uses caching of unassociated detections and mutual projection of interrupted tracks. As the proposed approach does not require time-consuming extraction of appearance-based features, the superior tracking speed is achieved. In addition, we demonstrate tracker usability and applicability by extracting valuable information about body-joint positions from discovered tracks, which opens promising possibilities for detecting human relationships and interactions. We demonstrate accurate detection of couples based on their holding hand activity and families based on children's body proportions. The discovery of these entitative groups is especially challenging in crowded city scenes where many subjects appear in each frame.

Links

GA19-02033S, research and development project
Name: Vyhledávání, analytika a anotace datových toků lidských pohybů
Investor: Czech Science Foundation