Research and education for brain imaging, information, computing, communication sciences

Balazia2025VUCHA

Title
Video Understanding of Complex Human Activities
View PDF | Save PDF
Authors
Michal Balazia
Affiliations
INRIA Université Côte d’Azur, Sophia Antipolis, France.
Abstract
This report presents a comprehensive collection of novel methods and datasets advancing multimodal AI for behavioral analysis in social interactions, medical training, emotion recognition, and psychiatric phenotyping. Contributions include the MultiMediate'25 challenge for cross-cultural engagement estimation using expanded NOXI and NOXI-J corpora, a Go-ELAN YOLOv9 model for real-time surgical instrument detection in cataract videos, the CM3T adapter framework for efficient multimodal transfer learning on inhomogeneous datasets, the BLEMORE dataset with relative salience annotations for blended emotions, and MEPHESTO analyses revealing context-aware synchrony for therapeutic alliance, temporal variability for depression-schizophrenia classification, and trauma-modulated speech patterns in depression via MADRS/BDI-II assessments. These works of the INRIA-STARS team on video understanding of complex human activities bridge gaps in multimodal behavioral AI, supporting applications from assistive systems to personalized psychiatry.
KeyPhrases
Video understanding, pattern recognition, human behavior analysis, social interactions, psychiatric phenotyping.
Dates
Created 2025-10-02, presented 2025-10-09, updated 2025-12- 13, published 2025-12-13.
Citation
Brainiacs Journal 2025 Volume 6 Issue 3 Edoc F3DAD390E
DOI: 10.48085/F3DAD390E
NPDS: LINKS/Brainiacs/Balazia2025VUCHA
Brainiacs Journal | Current Issue | Archived Volumes
2025v6i3 | 2025v6i2 | 2025v6i1 | 2024v5i2 | 2024v5i1 | 2023v4i2 | 2023v4i1 | 2022v3i2 | 2022v3i1 | 2021v2i1 | 2020v1i1