2009
DOI: 10.1007/978-3-642-04793-0_3

Creating and Exploiting Multimodal Annotated Corpora: The ToMA Project

Abstract: The paper presents a project aiming at collecting, annotating and exploiting a dialogue corpus from a multimodal perspective. The goal of the project is the description of the different parameters involved in a natural interaction process. Describing such a complex mechanism requires corpora annotated in different domains. This paper first presents the corpus and the scheme used in order to annotate the different domains that have to be taken into consideration, namely phonetics, morphology, syntax, pr…

citations
Cited by 19 publications
(13 citation statements)
references
References 15 publications
“…We then drew up a list of occurrences of similar events in our dataset, i.e., a collection of examples, as recommended in CA (Mondada, 2013). However, unlike a CA-style transcription that exhibits all the relevant cues (prosodic, gestural, phonetic) on the same line (see (Selting et al, 1998) for the GAT system, adapted from the Jefferson-style transcription system), the transcription used in this work is one of the levels of the annotation process elaborated within the framework of the OTIM project (Blache et al, 2009). Using Praat (Boersma and Weenink, 2009), rich and systematic annotations related to the different linguistic domains, from phonetic to gestural, are provided.…”
Section: Methods (mentioning)
confidence: 99%
“…Unfortunately, annotating multimodal data is still problematic for both theoretical and technical reasons: first, there is a lack of linguistic and cognitive theories taking into consideration all the different aspects of multimodality; second, we need to specify a standardized way of representing multimodal information in order to give access to large multimodal corpora, as richly annotated as possible (Blache et al 2009). So there is still a long way to go in this field.…”
Section: Multimodal Corpora (mentioning)
confidence: 99%
“…Because of this ambivalence, the interlocutors have to be very close to be able to perceive and/or 8 hours of audio-video recorded dialogues. The interactions are transcribed and annotated to be useful for different fields: phonetics, prosody, syntax, discourse and non-verbal analysis (see the ToMA project (OTIM in French), Blache et al 2009). This goal required a particular protocol of experiment, which allowed us to obtain dialogues interesting both for the interactional level and the phonetic and prosodic approach.…”
Section: Conversational Humor (mentioning)
confidence: 99%