SOME PROBLEMS IN MULTILINGUAL DIGITAL DICTIONARIES

Dimitrova, Ludmila; Koseska-Toszewa, Violetta

Article details

Journal

Cognitive Studies

2008 | 8 | 237-254

Languages of publication

EN

Abstracts

EN

The article discusses some observations from the joint work of Polish and Bulgarian research groups on the digital Bulgarian-Polish and Polish-Ukrainian dictionaries, as well as the projected multilingual (initially Bulgarian-Polish-Ukrainian) dictionary. The researchers are currently working on a parallel corpus of Bulgarian and Polish texts distributed over the Internet, in which the translation correspondence is one-to-one. They are also developing a comparable corpus of Bulgarian and Polish texts (excerpts from newspapers, literary works, and Internet documents) whose text sizes are comparable across the two languages. The two corpora, parallel and comparable, form the first Bulgarian-Polish corpus, which will be prepared in CES format, manually or using ad hoc tools, and annotated at the paragraph and sentence levels in accordance with international text annotation standards. This bilingual corpus will provide a sample of the vocabulary to be included in an initial experimental version of the Bulgarian-Polish digital dictionary. Bi- and multilingual digital dictionaries face more limitations, and they require all the more that the description of the headword's language-specific features in each entry be simple and at the same time more comprehensive. The fact that a lexical form in any language may have several meanings that do not overlap across the compared languages also has to be addressed. Considerable difficulties must be overcome for a dictionary to satisfy the needs of a translator, a language researcher, or an everyday user.

Keywords

EN

BULGARIAN-POLISH DIGITAL DICTIONARY (PREPARATORY WORK)

Discipline

PHILOLOGY & LINGUISTICS

Publisher

Polska Akademia Nauk. Instytut Slawistyki PAN

Year

2008

Issue

8

Pages

237-254

Document type

ARTICLE

Contributors

author

Dimitrova Ludmila

author

Koseska-Toszewa Violetta

Ludmila Dimitrova, Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria
