Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2014 | 97 | 4-5 | 208-215

Article title

Diachronní složka Českého národního korpusu a hranice možností korpusového výzkumu vývoje češtiny:

Authors

Content

Title variants

EN
The diachronic part of the Czech National Corpus: limitations of corpus research into the history of Czech

Languages of publication

CS

Abstracts

EN
The paper reviews the present state of the diachronic part of the Czech National Corpus, with the focus on the two-million-word unannotated pivotal corpus Diakorp and its limitations in relation to corpus-based research into the history of Czech. A minimum 1,000,000-token growth, lemmatization and morphological tagging are cited as near-future enhancements to the corpus. A series of thoroughly structured monitoring diachronic corpora to be built from 2017 on is considered as a future basis for research into long-term trends in the history of Czech, thus complementing the quantity-oriented Diakorp.

Year

Volume

97

Issue

4-5

Pages

208-215

Physical description

Document type

ARTICLE

Contributors

  • Naše řeč, redakce, Ústav pro jazyk český AV ČR, v.v.i., Letenská 4, 118 51 Praha 1, Czech Republic

References

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.bbf9ea7b-df1d-463a-9f47-adc89a5a042d
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.