Słowa znaczące, słowa kluczowe, słowozbiory – o statystycznych metodach wyszukiwania wyrazów istotnych

Eder, Maciej

Article details

Journal

Przegląd Humanistyczny

2016 | 60(3 (454)) | 31-44

Article title

Słowa znaczące, słowa kluczowe, słowozbiory – o statystycznych metodach wyszukiwania wyrazów istotnych

Authors

Eder, Maciej

Content

Full texts:

Download

Title variants

Languages of publication

Abstracts

PL

This article discusses automatic extraction of relevant words from sets of texts. The author briefly presents three methods aimed to extract the words from the corpus of words with regard to their frequency, or words whose occurrence next to each other is not random. First, he focuses on the keyword analysis method, then he discusses the Zeta method developed by John Burrows and Hugh Craig, and the third method covered in the article is the topic modelling method, which is becoming very popular recently, and consists in finding clusters of words co-occurring in similar contexts. Topic modelling was intended for a quick content search in large collections of documents. On the basis of 100 Polish novels, the article presents how this method can be used for linguistic studies.

Keywords

PL

quantitative linguistics stylometry keywords Zeta method topic modelling wordlist

Publisher

Wydawnictwa Uniwersytetu Warszawskiego

Journal

Przegląd Humanistyczny

Year

2016

Volume

60(3 (454))

Pages

31-44

Physical description

Contributors

author

Eder, Maciej

References

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.ceon.element-04af8a39-6a7d-37df-8e1c-70a6699583ba

Article details

Journal

Przegląd Humanistyczny

Article title

Słowa znaczące, słowa kluczowe, słowozbiory – o statystycznych metodach wyszukiwania wyrazów istotnych

Authors

Content

Title variants

Languages of publication

Abstracts

Keywords

Publisher

Journal

Year

Volume

Pages

Physical description

Contributors

References

Document Type

Publication order reference

Identifiers

YADDA identifier