Results found: 4

Search results

Search in the keywords: WSD

PL
Report on the Activities of the WSD (Wyższe Seminarium Duchowne, Higher Theological Seminary) in Sandomierz in the Academic Year 2015/2016
PL
List of Master's Theses Defended at the Catholic University of Lublin (Katolicki Uniwersytet Lubelski) by Deacons of the Wyższe Seminarium Duchowne (Higher Theological Seminary) in 2016
EN
Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources
Lexical resources can be applied in many different Natural Language Engineering tasks, but the most fundamental task is the recognition of word senses used in text contexts. The problem is difficult and not yet fully solved, and different lexical resources provide varied support for it. Polish CLARIN lexical semantic resources are based on plWordNet, a very large wordnet for Polish, as a central structure that serves as the basis for linking together several resources of different types. In this paper, several Word Sense Disambiguation (henceforth WSD) methods developed for Polish that utilise plWordNet are discussed. Textual sense descriptions in a traditional lexicon can be compared with text contexts using Lesk's algorithm in order to find the best matching senses. In the case of a wordnet, lexico-semantic relations provide the main description of word senses. Thus, first, we adapted and applied to Polish a WSD method based on PageRank. In this method, text words are mapped onto their senses in the plWordNet graph, and the PageRank algorithm is run to find the senses with the highest scores. The method yields results that are lower than, but comparable to, those reported for English. The error analysis showed that the main problems are fine-grained sense distinctions in plWordNet and a limited number of connections between words of different parts of speech. In the second approach, plWordNet expanded with a mapping onto SUMO ontology concepts was used. Two scenarios for WSD were investigated: two-step disambiguation and disambiguation based on combined networks of plWordNet and SUMO. In the former scenario, words are first assigned SUMO concepts, and plWordNet senses are disambiguated next. In the latter, plWordNet and SUMO are combined into one large network, which is then used for the disambiguation of senses. The additional knowledge sources used in WSD improved the performance. The results obtained and potential further lines of development are discussed.
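
The graph-based method described in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the sense labels, the edges, and the names `SENSES`, `EDGES`, `personalized_pagerank` and `disambiguate` are hypothetical, and a tiny English toy graph stands in for plWordNet. The sketch assumes that the random walk restarts at the senses of the context words (a personalised variant of PageRank) and that each word receives its highest-scoring sense.

```python
# Toy sense inventory and relation graph (hypothetical, not plWordNet data).
SENSES = {
    "bank": ["bank.finance", "bank.river"],
    "money": ["money.1"],
    "deposit": ["deposit.1"],
    "water": ["water.1"],
}
EDGES = [
    ("bank.finance", "money.1"),
    ("bank.finance", "deposit.1"),
    ("money.1", "deposit.1"),
    ("bank.river", "water.1"),
]


def personalized_pagerank(graph, teleport, damping=0.85, iterations=50):
    """Power iteration over an undirected graph given as node -> set of neighbours.

    `teleport` maps nodes to restart probabilities that sum to 1.
    """
    nodes = list(graph)
    rank = {n: teleport.get(n, 0.0) for n in nodes}
    for _ in range(iterations):
        # Restart mass goes to the senses of the context words.
        new_rank = {n: (1.0 - damping) * teleport.get(n, 0.0) for n in nodes}
        for n in nodes:
            neighbours = graph[n]
            if not neighbours:
                # An isolated sense returns its mass to the restart distribution.
                for m in nodes:
                    new_rank[m] += damping * rank[n] * teleport.get(m, 0.0)
                continue
            share = damping * rank[n] / len(neighbours)
            for m in neighbours:
                new_rank[m] += share
        rank = new_rank
    return rank


def disambiguate(context_words, senses, edges):
    """Return a dict word -> best-scoring sense for the known context words."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    for candidates in senses.values():
        for sense in candidates:
            graph.setdefault(sense, set())

    known = [w for w in context_words if w in senses]
    if not known:
        return {}

    # Each context word contributes equal restart mass, split over its senses.
    teleport = {}
    for word in known:
        for sense in senses[word]:
            weight = 1.0 / (len(known) * len(senses[word]))
            teleport[sense] = teleport.get(sense, 0.0) + weight

    rank = personalized_pagerank(graph, teleport)
    return {word: max(senses[word], key=lambda s: rank[s]) for word in known}


if __name__ == "__main__":
    print(disambiguate(["bank", "money", "deposit"], SENSES, EDGES))
    print(disambiguate(["bank", "water"], SENSES, EDGES))
```

In this toy setting, the context words decide which sense of "bank" accumulates more score, because score flows only along relation edges. This also suggests why sparse connections between senses, one of the error sources named in the abstract, would hurt such a method.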
PL
The aim of the article is to present several problems encountered by the editor compiling the Corpus of Wrocław University of Technology. The problems are connected with applying the method of disambiguating the meanings of words in text (Word Sense Disambiguation, WSD) to material of a phraseological character. Any researcher using this method confronts the lack of semantic independence of words that enter into fixed word combinations. The need to develop a dedicated procedure for this type of word prompted the author of the article to experiment with isolating the independent meanings of words that co-form phraseologisms and to attempt a typology of them. Isolating such meanings has proved possible, although the accuracy of their description varies considerably: in some cases the motivation of the semantic changes that the meaning of a word undergoes under the influence of "being petrified" in a set phrase is transparent; in other cases, however, we can speak only of a hypothetical source of changes rather than a definite modification of meaning. It is worth mentioning the rigour imposed by the adopted WSD marking method: the set of all meanings that a given word form may assume in the analysed text must be entered into the database. That is why the problem of defining the words that form phraseologisms is vital from the point of view of a linguist dealing with machine processing of language.