Berlin Dialogue Corpus (BeDiaCo): Version 2 Dataset uri icon

abstract

  • BeDiaCo contains topic-led and task-led spontaneous dialogues of 36 participants as well as read word lists. The main corpus BeDiaCom contains 16 subjects not known to each other in a co-present face-to-face situation. The subcorpus BeDiaCov contains 20 subjects known and familiar to each other in a co-present face-to-face situation and, additionally, in a spatially separated videoconference situation. The corpus contains audio and TextGrid files and is annotated by employing a multi-layer architecture. In addition to a diplomatic transliteration and its phonetic segmentation, further annotation levels contain, for example, annotation values for filler particles, intonation phrases, dialogue structure, word types, and the realization of inflection.

publication date

  • 2021