Loose Social-Interaction Recognition in Real-World Therapy Scenarios

Keywords: FOS: Computer and information sciences, [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

Ali, Abid; Dai, Rui; Marisetty, Ashish; Astruc, Guillaume; Thonnat, Monique; Odobez, Jean-Marc; Thümmler, Susanne; Bremond, Francois

arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1109/wacv61...

Article . 2025 . Peer-reviewed

License: STM Policy #29

Data sources: Crossref

INRIA2

Conference object . 2025

License: CC BY

Data sources: INRIA2

INRIA a CCSD electronic archive server

Conference object . 2025

License: CC BY

Data sources: INRIA a CCSD electronic archive server

https://dx.doi.org/10.48550/ar...

Article . 2024

License: CC BY

Data sources: Datacite

Publication . Article, Preprint, Conference object . 26 Feb 2025
Embargo end date: 01 Jan 2024
Publisher: IEEE
Journal: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Authors: Ali, Abid; Dai, Rui; Marisetty, Ashish; Astruc, Guillaume; Thonnat, Monique; Odobez, Jean-Marc; Thümmler, Susanne; Bremond, Francois

doi: 10.1109/wacv61041.2025.00504, 10.48550/arxiv.2409.20270

arXiv: 2409.20270

Abstract

The computer vision community has explored dyadic interactions for atomic actions such as pushing and carrying-object. However, with the advancement of deep learning models, there is a need to explore more complex dyadic situations such as loose interactions. These are interactions in which two people perform certain atomic activities to complete a global action irrespective of temporal synchronisation and physical engagement, for example cooking-together. Analysing these types of dyadic interactions has several useful applications in the medical domain for social-skills development and mental health diagnosis. To this end, we propose a novel dual-path architecture to capture the loose interaction between two individuals. Our model learns global abstract features from each stream via a CNN backbone and fuses them using a new Global-Layer-Attention module based on a cross-attention strategy. We evaluate our model on real-world autism diagnosis data, namely our Loose-Interaction dataset and the publicly available Autism dataset for loose interactions. Our network achieves baseline results on the Loose-Interaction dataset and SOTA results on the Autism dataset. Moreover, we study different social interactions by experimenting on a publicly available dataset, NTU-RGB+D (interactive classes from both NTU-60 and NTU-120). We find that different interactions require different network designs. We also compare a slightly different version of our method that incorporates time information to address tight interactions, achieving SOTA results.
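
The fusion idea in the abstract (two per-person feature streams combined through cross-attention) can be illustrated with a minimal sketch. This is generic scaled dot-product cross-attention in pure Python, not the paper's Global-Layer-Attention module, whose details are not given in this record. The function names (`cross_attention`, `fuse_streams`), the mean-pool-and-concatenate step, and the toy feature vectors are all assumptions made for illustration; in the paper the features would come from a CNN backbone.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention.

    Queries come from one person's stream; keys and values come from the
    other person's stream. Each output vector is a weighted average of
    the value vectors.
    """
    d = len(queries[0])
    out = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        # Weighted sum of value vectors, one output dimension at a time.
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out


def mean_pool(rows):
    """Average a list of equal-length vectors into a single vector."""
    return [sum(col) / len(rows) for col in zip(*rows)]


def fuse_streams(feats_a, feats_b):
    """Hypothetical fusion: each stream attends to the other, and the two
    attended results are mean-pooled and concatenated."""
    a_attends_b = cross_attention(feats_a, feats_b, feats_b)
    b_attends_a = cross_attention(feats_b, feats_a, feats_a)
    return mean_pool(a_attends_b) + mean_pool(b_attends_a)


# Toy stand-ins for per-person backbone features (made-up numbers).
person_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
person_b = [[0.5, 0.5], [2.0, 0.0]]
fused = fuse_streams(person_a, person_b)
print(len(fused))  # one fused vector of length 2 + 2
```

Because each stream queries the other, the fused vector does not depend on the two people acting at the same time step, which is one plausible reading of why cross-attention suits loose interactions.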

Related Organizations

Université Côte d'Azur
France
French Institute for Research in Computer Science and Automation
France

Impact by BIP!

Selected citations: 0. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Green

Related to Research communities

INRIA