PL EN


2004 | 65 | 4 | 243-270
Article title

Mluvená čeština v televizních debatách: korpus DIALOG

Content
Title variants
EN
SPOKEN CZECH IN TELEVISION DEBATES: THE DIALOG CORPUS
Languages of publication
CS
Abstracts
EN
The DIALOG corpus is one of two collections of spoken language gathered in the audio-visual studio at the Czech Language Institute of the Czech Academy of Sciences. The article begins by recalling the establishment of the corpus in 1997 as part of the project 'Dialogue in a World of People and Machines', defines the aim motivating the collection of data for this corpus, formulates distinctive criteria for this corpus as a specifically 'spoken' one in terms of time, interaction and genre and partially even as topic-specific, and attempts to define the types of spoken dialogues which the corpus can aid in analysing. It characterizes speech in the media, which makes up a focal point here, and details the procedures for storing audio and video recordings of this speech and the resulting transcriptions. The second part provides an overview of the fundamentals of transcription systems and offers theoretical support for transcription method selection as determined by the aim of capturing segmental, supra-segmental, sequential, para-linguistic and extra-linguistic phenomena, including several examples of practical solutions. The third part reports on how this corpus has been thus far utilized in linguistic research, both in the creation of a contemporary Czech theory of dialogue and in the analysis of specific features of spoken Czech. The article concludes by detailing the prospects for further use of this corpus.
Contributors
author
author
  • Svetla Cmejrkova, Ustav pro jazyk cesky AV CR, v.v.i., Letenska 4, Praha 1, 118 51, Czech Republic, http://dlib.lib.cas.cz/2814/
References
Document Type
Publication order reference
Identifiers
CEJSH db identifier
09CZAAAA057211
YADDA identifier
bwmeta1.element.2f2a2014-b250-35a2-9a64-407b34d5d6c3
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.