Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2013 | 74 | 2 | 139-146

Article title

Ke klasifikaci morfologickych variant

Authors

Content

Title variants

EN
ON THE CLASSIFICATION OF MORPHOLOGICAL VARIANTS

Languages of publication

CS

Abstracts

EN
After briefly discussing the heterogeneities inherent to language production and how they influence corpus evidence, we describe a scale for the classification of individual morphological variants by their relative frequencies that has recently been independently proposed in Mluvnice současné češtiny (2010) (A Grammar of Contemporary Czech, hereafter GCCz), of which we are co-authors, and in Bermel & Knittl (2012). Those variants with relative frequency (roughly) within 1% and 10% are classified by the respective authors as “sparse” and “marked”, and those occurring in (roughly) less than 1% cases as “unexpected” and “isolated”. Another feature of the scale is the “equipollence” of variants of a doublet having relative frequencies within (roughly) 1/3 and 2/3 (for this criterion see also Štícha 2009). The scale in GCCz is heuristically based on Shannon entropy and valid for synchronic functionally equivalent variants. Recently, R. Čech (2012) has claimed to have revealed “a serious statistical deficiency” in GCCz. We show that this is a misunderstanding stemming from his not distinguishing between the null-hypothesis statistical significance testing and the effect size evaluation. We end with a brief note on the structureof the resources employed in GCCz.

Contributors

author
  • Slovo a slovesnost, redakce, Ústav pro jazyk český AV ČR, v.v.i., Letenská 4, 118 51 Praha 1, Czech Republic

References

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.cejsh-eeec97b6-f4e2-4807-b66b-41ee01a96819
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.