PL EN


2016 | 3 (454) | 31-44
Article title

Słowa znaczące, słowa kluczowe, słowozbiory – o statystycznych metodach wyszukiwania wyrazów istotnych

Authors
Title variants
EN
Significant Words, Keywords, Wordlists – on Statistical Methods of Searching for Relevant Terms
Languages of publication
PL EN
Abstracts
EN
This article discusses automatic extraction of relevant words from sets of texts. The author briefly presents three methods aimed to extract the words from the corpus of words with regard to their frequency, or words whose occurrence next to each other is not random. First, he focuses on the keyword analysis method, then he discusses the Zeta method developed by John Burrows and Hugh Craig, and the third method covered in the article is the topic modelling method, which is becoming very popular recently, and consists in finding clusters of words co-occurring in similar contexts. Topic modelling was intended for a quick content search in large collections of documents. On the basis of 100 Polish novels, the article presents how this method can be used for linguistic studies.
Year
Issue
Pages
31-44
Physical description
Contributors
author
  • Instytut Języka Polskiego PAN, Uniwersytet Pedagogiczny
References
Document Type
Publication order reference
Identifiers
YADDA identifier
bwmeta1.element.desklight-d4b777d1-c640-42bf-9629-f388b9f99114
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.