Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2020 | 777 | 8 | 66-80

Article title

Analiza fleksyjna tekstów historycznych i zmienność fleksji polskiej z perspektywy danych korpusowych

Content

Title variants

Languages of publication

Abstracts

EN
The subject matter of this paper is Chronofleks, a computer system (http://chronofleks.nlp.ipipan.waw.pl/) modelling Polish inflection based on a corpus material. The system visualises changes of inflectional paradigms of individual lexemes over time and enables examination of the variability of the frequency of inflected form groups distinguished based on various criteria. Feeding Chronofleks with corpus data required development of IT tools to ensure an inflectional processing sequence of texts analogous to the ones used for modern language; they comprise a transcriber, a morphological analyser, and a tagger. The work was performed on data from three historical periods (1601–1772, 1830–1918, and modern ones) elaborated in independent projects. Therefore, finding a common manner of describing data from the individual periods was a significant element of the work.

Year

Volume

777

Issue

8

Pages

66-80

Physical description

Dates

published
2020

Contributors

  • Instytut Podstaw Informatyki Polskiej Akademii Nauk
  • Instytut Podstaw Informatyki Polskiej Akademii Nauk

References

  • J. Bilińska, M. Derwojedowa, W. Kieraś, M. Kwiecień, 2016, Mikrokorpus polszczyzny 1830–1918 [w:] Ł. Karpiński, P. Michałowski, Komunikacja Specjalistyczna 11, s. 149–161.
  • W. Gruszczyński, R. Bronikowska, 2018, Instrukcja korzystania z wyszukiwarki do Elektronicznego Korpusu Tekstów Polskich z XVII i XVIII wieku (do 1772 r.), https://www.korba.edu.pl/manual.
  • T. Erjavec, 2015, The IMP Historical Slovene Language Resources, „Language Resources and Evaluation” 49(3), s. 753–75; https://doi.org/10.1007/s10579-015-9294-7.
  • W. Kier aś, D. Komosińska, E. Modrzejewski, M. Woliński, 2017, Morphosyntactic annotation of historical texts. The making of the baroque corpus of Polish [w:] K. Ekštein, V. Matoušek (red.), Text, Speech, and Dialogue 20th International Conference, TSD 2017, Prague, Czech Republic, August 27–31, „Lecture Notes in Computer Science” 10415, s. 308–316.
  • W. Kieraś, M. Woliński, 2018, Manually annotated corpus of Polish texts published between 1830 and 1918 [w:] N. Calzolari i in. (red.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Paris, s. 3854–3859.
  • Ł. Kobyliński, W. Kieraś, 2016, Part of speech tagging for Polish: State of the art and future perspectives [w:] Proceedings of the 17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2016), Konya.
  • K. Krasnowska-Kieraś, 2017, Morphosyntactic disambiguation for Polish with bi-LSTM neural networks [w:] Z. Vetulani, P. Paroubek (red.), Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Poznań, s. 367–371.
  • M. Król, M. Derwojedowa, R.L. Górski, W. Gruszczyński, K.W. Opaliński, P. Potoniec, M. Woliński, W. Kieraś, M. Eder, 2019, Narodowy Korpus Diachroniczny Polszczyzny. Projekt, „Język Polski” XCIX(1), s. 92–101.
  • A. Przepiórkowski, M. Bańko, R.L. Górski, B. Lewandowska-Tomaszczyk (red.), 2012, Narodowy Korpus Języka Polskiego, Warszawa.
  • Z. Saloni, 1988, O tzw. formach nieosobowych rzeczowników męskoosobowych we współczesnej polszczyźnie, „Biuletyn Polskiego Towarzystwa Językoznawczego” XLI, Kraków, s. 155–166.
  • Z. Saloni, M. Woliński, R. Wołosz, W. Gruszczyński, D. Skowrońska, 2015, Słownik gramatyczny języka polskiego, wyd. III on-line, Warszawa; http://sgjp.pl.
  • J. Waszczuk, W. Kieraś, M. Woliński, 2018, Morphosyntactic disambiguation and segmentation for historical Polish with graph-based conditional random fields [w:] P. Sojka, A. Horák, I. Kopeček, K. Pala (red.), Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno, Czech Republic, s. 188–196.
  • M. Woliński, 2014, Morfeusz reloaded [w:] N. Calzolari i in. (red.), Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, Reykjavík, Iceland, s. 1106–1111.

Document Type

Publication order reference

Identifiers

Biblioteka Nauki
1630443

YADDA identifier

bwmeta1.element.ojs-doi-10_33896_PorJ_2020_8_5
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.