Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2019 | 102 | 1-2 | 64-75

Article title

K efektivitě manuální a poloautomatické excerpce neologismů

Authors

Content

Title variants

EN
On the efficiency of manual and semi-automatic detection of neologisms

Languages of publication

CS

Abstracts

EN
The paper presents a simple semi-automatic neologism detection procedure: a trivial Python script processes a text file, making use of a Czech morphological tagger, and extracts all words unrecognized by the tagger as potential neologisms. The list of these candidates has to be checked by a human (hence the label semi-automatic). This method was applied to a set of texts that were also analyzed in a more traditional way, by the “reading and marking” technique (i.e. the current practice). The comparison of the two methods has revealed that the semi-automatic procedure clearly outperforms the current practice both in speed and in efficiency.

Year

Volume

102

Issue

1-2

Pages

64-75

Physical description

Document type

ARTICLE

Contributors

  • Naše řeč, redakce, Ústav pro jazyk český AV ČR, v.v.i., Letenská 4, 118 51 Praha 1, Czech Republic

References

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.f02ee5c8-f959-440d-8733-2efc788a69d0
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.