Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2018 | XX/2 | 75-97

Article title

Korpus internetowy jako źródło informacji lingwistycznej: ograniczenia*

Title variants

EN
Internet corpus as a source of linguistic information: some limitations

Languages of publication

PL

Abstracts

EN
The aim of present analysis is to show the use of Internet corpus in syntactic studies of Slavic languages (especially Russian). Corpus analysis is treated as a research tool, useful in describing linguistic system as well as linguistic activity. The information coming from the corpus allows to determine the frequency of occurrence of units and their combinations in texts as well as the regularity of occurrences of features/properties in the paradigmatic classes. Corpus analysis also provides the ability to verify whether a particular valence property is characteristic for a given word or not. The author shows that the use of Internet corpus in the syntactic research has its limitations. In the case of frequent phenomena, corpus analysis is effective, but does not always allow to document less typical phenomena (for example occasional and potential combinations of tokens). One of the author’s conclusions is that corpus analysis should be configured with introspection and qualitative analysis.

Year

Issue

Pages

75-97

Physical description

References

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.mhp-5b90c815-a2a0-4306-a49e-5a4bd29d6691
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.