PL EN


2014 | 14 |
Article title

The IMPACT project Polish Ground-Truth texts as a Djvu corpus

Authors
Content
Title variants
Languages of publication
EN
Abstracts
EN
The IMPACT project Polish Ground-Truth texts as a Djvu corpusThe purpose of the paper is twofold. First, to describe the already implemented idea of DjVu corpora, i.e. corpora which consist of both scanned images and a transcription of the texts with the words associated with their occurrences in the scans. Secondly, to present a case study of a corpus consisting of almost 5 000 pages of Polish historical texts dating from 1570 to 1756 (it is practically the very first corpus of historical Polish). The tools described have universal character and are freely available under the GNU GPL license, hence they can be used also for other purposes.
Year
Issue
14
Physical description
Dates
published
2014
online
2014-09-04
Contributors
References
Document Type
Publication order reference
Identifiers
YADDA identifier
bwmeta1.element.ojs-doi-10_11649_cs_2014_008
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.