Korpus spontánní mluvené češtiny ORAL2013

Benešová, Lucie; Křen, Michal; Waclawičová, Martina

Article details

Journal

Časopis pro moderní filologii (Journal for Modern Philology)

2015 | 97 | 1 | 42-50

Article title

Korpus spontánní mluvené češtiny ORAL2013

Authors

Benešová Lucie , Křen Michal , Waclawičová Martina

Content

Full texts:

Download

Title variants

EN

THE CORPUS OF SPONTANEOUS SPOKEN CZECH ORAL2013

Languages of publication

CS

Abstracts

EN

The paper presents a corpus of spontaneous spoken Czech called ORAL2013, its design principles and practical solutions adopted during the data collection. The corpus is designed to represent contemporary spontaneous spoken language used in informal, real-life situations across the whole of the Czech Republic. The corpus consists of audio recordings and their transcriptions aligned with time stamps; it features manual annotation and broad regional coverage with a large variety of speakers. ORAL2013 contains 835 recordings from the period 2008 to 2011 made with 2,544 speakers (of whom 1,297 speakers are unique); the total length of the audio tracks is almost 300 hours and the total size of the transcriptions exceeds 3.28 million tokens. ORAL2013 is made publicly available by the Czech National Corpus at http://www.korpus.cz/.

Keywords

CS

jazykový korpus složení korpusu spontánní mluvený jazyk čeština transkripce

EN

language corpus corpus design spontaneous spoken language Czech transcription

Year

2015

Volume

97

Issue

1

Pages

42-50

Physical description

Contributors

author

Benešová Lucie

lucie.benesova@ff.cuni.cz

Ústav Českého národního korpusu, FFUK | nám. J. Palacha 2, 116 38 Praha 1

author

Křen Michal

michal.kren@ff.cuni.cz

Ústav Českého národního korpusu, FFUK | nám. J. Palacha 2, 116 38 Praha 1

author

Waclawičová Martina

martina.waclawicova@ff.cuni.cz

Ústav Českého národního korpusu, FFUK | nám. J. Palacha 2, 116 38 Praha 1

References

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.desklight-7e69fcd9-a698-40b1-9353-b21dc64d3bdf

Article details

Journal

Časopis pro moderní filologii (Journal for Modern Philology)

Article title

Korpus spontánní mluvené češtiny ORAL2013

Authors

Content

Title variants

Languages of publication

Abstracts

Keywords

Discipline

Publisher

Journal

Year

Volume

Issue

Pages

Physical description

Contributors

References

Document Type

Publication order reference

Identifiers

YADDA identifier