Multimodal Behaviour in an Online Environment: The GEHM Zoom Corpus Collection

Patrizia Paggio, Manex Agirrezabal, Costanza Navarretta

Publication: Conference article in proceedings or book/report chapter › Conference contribution in proceedings › Research › Peer-reviewed

Abstract

This paper introduces a novel multimodal corpus consisting of 12 video recordings of Zoom meetings held in English by an international group of researchers from September 2021 to March 2023. The meetings have an average duration of about 40 minutes each, for a total of 8 hours. The number of participants varies from 5 to 9 per meeting. The participants’ speech was transcribed automatically using WhisperX, while visual coordinates of several keypoints on the participants’ heads, shoulders and wrists were extracted using OpenPose. The audio-visual recordings will be distributed together with the orthographic transcription and the visual coordinates. In the paper, we describe how the corpus was collected, transcribed and enriched with the visual coordinates, give descriptive statistics for both the speech transcription and the visual keypoint values, and present and discuss visualisations of these values. Finally, we carry out a short preliminary analysis of the role of feedback in the meetings, and show how visualising the coordinates extracted via OpenPose can be used to see how gestural behaviour supports the use of feedback words during the interaction.
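As a rough illustration of what working with such keypoint coordinates might look like, the sketch below picks head, shoulder and wrist keypoints out of one frame of OpenPose JSON output. It rests on assumptions the abstract does not confirm: that the coordinates follow OpenPose's standard per-frame JSON layout (`people` entries with a flat `pose_keypoints_2d` list of x, y, confidence triples) and the BODY_25 model. The function name `select_keypoints` and the `min_conf` threshold are hypothetical and do not come from the paper or the corpus distribution.

```python
# Sketch only: assumes OpenPose's standard JSON output and the BODY_25 model.
# The abstract does not state which model or file layout the corpus uses.

# BODY_25 indices of the head, shoulder and wrist keypoints.
BODY_25 = {
    "Nose": 0, "Neck": 1,
    "RShoulder": 2, "RWrist": 4,
    "LShoulder": 5, "LWrist": 7,
    "REye": 15, "LEye": 16, "REar": 17, "LEar": 18,
}


def select_keypoints(frame, min_conf=0.1):
    """Return one {name: (x, y) or None} dict per person detected in a frame.

    `frame` is a parsed OpenPose JSON object. Keypoints whose confidence is
    below `min_conf` (a hypothetical threshold) are mapped to None.
    """
    people = []
    for person in frame.get("people", []):
        # Flat list laid out as x0, y0, c0, x1, y1, c1, ...
        flat = person["pose_keypoints_2d"]
        selected = {}
        for name, idx in BODY_25.items():
            x, y, conf = flat[3 * idx: 3 * idx + 3]
            selected[name] = (x, y) if conf >= min_conf else None
        people.append(selected)
    return people


# Synthetic frame with one person: 25 keypoints, all undetected by default.
flat = [0.0] * 75
flat[0:3] = [100.0, 50.0, 0.9]     # Nose, detected with high confidence
flat[12:15] = [30.0, 200.0, 0.05]  # RWrist, below the confidence threshold
example_frame = {"people": [{"pose_keypoints_2d": flat}]}

result = select_keypoints(example_frame)
print(result[0]["Nose"])    # (100.0, 50.0)
print(result[0]["RWrist"])  # None
```

Applying such a function to every frame of a recording would give per-participant coordinate series that can be plotted over time and aligned with the transcript timestamps, in the spirit of the visualisations the abstract mentions.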
Original language: English
Title: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Number of pages: 11
Publisher: ELRA and ICCL
Publication date: May 2024
Pages: 11890–11900
Status: Published - May 2024
Published externally: Yes
Event: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation - Lingotto Conference Centre, Torino, Italy
Duration: 20 May 2024 to 25 May 2024

Conference

Conference: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location: Lingotto Conference Centre
Country/Territory: Italy
City: Torino
Period: 20/05/2024 to 25/05/2024
