This paper concerns the vocabulary of the Church’s law and administration in Latin texts written during the Middle Ages in Poland and its automatic extraction using the methods of corpus linguistics. The first part of this article considers the basic theoretical assumptions of the automatic extraction of this specialized vocabulary and the main characteristics of the Electronic Corpus of Polish Medieval Latin. In the second part presents the methods and results of term extraction. For the purpose of this research, a specialized subcorpus, including synodal statues and documents of ecclesiastical chapters, was created and then compared with the reference corpus. As a result, a list of lexemes, which appeared relatively frequently in the subcorpus and rarely in the reference corpus, was obtained. This difference in relative frequency was the main criterion for the recognition of potential terminological units. Verification on the basis of lexicographic data demonstrated the effectiveness of the adopted methods. The aim of this paper was to present the usefulness of the Electronic Corpus of Polish Medieval Latin for the research and analysis of specialized vocabulary.
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.