POLISH-UKRAINIAN PARALLEL CORPUS (POLUKR)
Professional translators and learners of either language currently face a considerable shortage of Polish-Ukrainian electronic language resources, especially large dictionaries and parallel corpora. A morpho-syntactically annotated, aligned corpus with a parallel concordancer can in many cases serve as a traditional dictionary. One of its advantages is that it provides an up-to-date database of numerous examples of language use, which is helpful when dealing with neologisms and new lexical meanings. The authors' aim is to create such a corpus and make it publicly available on the Internet. A prototype of the parallel Polish-Ukrainian corpus can currently be consulted online at http://corpus.domeczek.pl. It contains ca. 50 parallel bitexts compliant with the XCES standard. Not all sentence segments have been tagged yet, and work on morphological annotation is still ongoing. At present the corpus is aligned only at the paragraph level. The authors are convinced that such a parallel corpus, equipped with an efficient search engine, a flexible query language and a user-friendly interface, will be one of the most powerful tools for all Polish and Ukrainian researchers, translators and students.
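To illustrate how a parallel concordancer can stand in for a dictionary lookup over paragraph-aligned bitexts, the following is a minimal Python sketch. It is not the POLUKR search engine: the `BITEXT` sample, its sentences and the `concordance` function are all hypothetical and invented for illustration. It also matches exact word forms only, whereas a morphologically annotated corpus would allow lemma-based queries.

```python
import re
from typing import List, Tuple

# Hypothetical paragraph-aligned bitext (invented sample, not POLUKR data):
# each entry pairs a Polish paragraph with its Ukrainian counterpart.
BITEXT: List[Tuple[str, str]] = [
    ("To jest nowy słownik.", "Це новий словник."),
    ("Korpus jest dostępny w Internecie.", "Корпус доступний в Інтернеті."),
    ("Słownik i korpus uzupełniają się.", "Словник і корпус доповнюють один одного."),
]


def concordance(bitext: List[Tuple[str, str]], query: str, side: int = 0) -> List[Tuple[str, str]]:
    """Return aligned pairs whose paragraph on `side` contains `query`.

    side=0 searches the Polish paragraphs, side=1 the Ukrainian ones.
    Matching is case-insensitive and restricted to whole words; inflected
    forms are not recognised (an assumption of this sketch).
    """
    pattern = re.compile(r"\b" + re.escape(query) + r"\b", re.IGNORECASE)
    return [pair for pair in bitext if pattern.search(pair[side])]


if __name__ == "__main__":
    # Print every aligned pair in which the Polish side contains the query.
    for polish, ukrainian in concordance(BITEXT, "słownik"):
        print(polish, "|", ukrainian)
```

Because the alignment is at the paragraph level, each hit returns the whole aligned paragraph pair, so the user still has to locate the equivalent expression within the target paragraph.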