2014 | 97 | 4-5 | 194-207

Article title

Problémy automatické morfologické disambiguace češtiny:

Title variants

EN
Problems of automatic morphological disambiguation of Czech:

Languages of publication

CS

Abstracts

EN
The article focuses on some of the main problems in the current automatic morphological disambiguation of Czech. Following a description of the methods used for disambiguating Czech texts and of their accuracy, the author discusses the main reasons why correct morphological disambiguation of the Czech texts contained in the corpora of the SYN series of the Czech National Corpus project is very difficult to achieve, and why, notwithstanding an improvement in disambiguation (e.g. the SYN2013PUB corpus is tagged better than the SYN2000 corpus), there is still a lot of work to be done. The author concentrates exclusively on the problems of rule-based disambiguation rather than stochastic disambiguation, trying to identify areas where disambiguation could be improved in the future. The necessity of reliable disambiguation of Czech texts as a key prerequisite for their successful subsequent syntactic analysis is also stressed.

Year

2014

Volume

97

Issue

4-5

Pages

194-207

Document type

ARTICLE

Contributors

  • Naše řeč, redakce, Ústav pro jazyk český AV ČR, v.v.i., Letenská 4, 118 51 Praha 1, Czech Republic

Identifiers

YADDA identifier

bwmeta1.element.370e2dc9-bff7-49eb-897c-f332a704cbdf