Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2020 | 11 | 2 | 63-84

Article title

Víceslovné lexémy v syntaktickém kontextu

Content

Title variants

EN
Multi-word lexemes in syntactic context

Languages of publication

CS

Abstracts

EN
We start with the assumption that (i) a corpus represents the use of language, i.e. linguistic performance, (ii) a rule-based grammar represents language as a system, i.e. linguistic competence, and (iii) corpus annotation represents the interface between the two. To detect and diagnose mismatches between the language use and the language system we use a constraint-based grammar run as a constraint solver on texts tagged and dependency-parsed by stochastic tools. The texts also have MWEs (multi-word expressions) identified and transformed into a constituency-based format before the grammar is applied. We describe the role and results of the grammar, and its use to check texts annotated with morphosyntactic categories, syntactic structure and information about the status of relevant expressions as MWEs. The grammar also employs lexical resources such as a valency lexicon and a database of MWEs to make the checking more accurate and the annotation more informative. The results are represented as typed feature structures where MWE-related information can be shared by lexical and phrasal nodes. This allows for the annotation of MWEs as lexical units, independently of their analysis in terms of syntactic structure. Focusing on the interplay of MWEs with their syntactic context we analyse a number of representative examples, pointing out the pros and cons of specific solutions and the whole approach.

Contributors

  • Ústav teoretické a komputační lingvistiky FF UK
  • Ústav teoretické a komputační lingvistiky FF UK
  • Ústav informatiky a chemie, Vysoká škola chemicko-technologická v Praze

References

Document Type

Publication order reference

Identifiers

YADDA identifier

bwmeta1.element.desklight-469035ef-490a-4822-becc-6f994b9a318d
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.