Search results

1

Gramatyka konfrontatywna języków polskiego i bułgarskiego

100%

Koseska-Toszewa V.

LingVaria : Półrocznik Wydziału Polonistyki UJ poświęcony zagadnieniom języka i językoznawstwa

|

2011

|

issue 1(11)

27-39

XX

Polish-Bulgarian confrontative grammar (PBCG) is the first, and so far the only one in the world, so extensive an attempt at semantic confrontation with a gradually developed interlanguage. It comprises nine tomes which have been published in twelve volumes. The decision has been taken that the description in the Grammar will lead in the direction from content to form. The semantic interlanguage allowed for the unfolding of two equivalent grammars: that of modern Bulgarian and that of modern Polish. The analysis of semantic categories employed in PBCG ensures a coherent confrontative description, regardless of whether the languages being described do or do not possess grammatical markers of the meanings. PBCG is a part of modern theoretical confrontative research based on the logical theory of quantification, the modern process theory called the Petri net, and on the theory of logical predicate-argument structures. Our study, thanks to the removal of a strict division between the grammatical and lexical levels, contributes many new observations about the phenomena under examination. Universal semantic categories, important for the description of language and unanalyzed so far, have been chosen here, viz. the basic categories of language such as tense, modality, definiteness/ indefiniteness and semantic case, none of which has as yet been exhaustively described in the academic grammars of Polish or Bulgarian.

2

Introduction

100%

Koseska-Toszewa V.

Cognitive Studies

|

2015

|

issue 15

3

Definiteness-indefiniteness category and logical quantification

100%

Koseska-Toszewa V.

Cognitive Studies

|

2011

|

issue 11

EN

Definiteness-indefiniteness category and logical quantificationThis work has been written within the “Logic and Language” project, realized under a cooperation agreement between the Institute of Slavic Studies of the Polish Academy of Sciences and the Institute of Bulgarian Language of the Bulgarian Academy of Sciences. The publication of a many-volume Bulgarian Polish Grammar (12 monographs), and of a Polish-Bulgarian Contrastive Grammar, from now on: Synthesis, is completed. Theories of mathematical logic are satisfied in perfect way in a natural language, which is shown by our understanding of the semantic definiteness / indefiniteness category as a system of scope-based quantification notions. A natural language complements and modifies logical theories. For example, understanding of the definiteness / indefiniteness category as a sentence category follows from the language material, especially in article languages. In such languages, quantification of the predicate is a necessary prerequisite for interpretation. Also introduction of the notions of strong and weak quantification meanings is motivated, for example, by the need to distinguish between the meanings of expressions like Pol. pewien x, jakiś x / Bulg. njakoj x, njakakyv x on the nominal phrase level, and by understanding of the ambiguity of quantification expressions like Bulg. vinagi, ponjakoga / Pol. zawsze, czasami. The fact of treating the definiteness / indefiniteness category as a sentence category expands understanding of the incomplete quantification notion also with respect to the verbal phrase, rather than, as in Ajdukiewicz’s approach, with respect to the nominal phrase only. Similarly, a proper name in a natural language is not a unique name, like in mathematical logic, but requires additional unique quantification. A network-based description of time is the first attempt at this type of description in the subject literature. In the present article, I stress that fact that quantification of states and events understood as in Petri net theory reveals the dependence between the aspectual and temporal meanings of verb, and proves that the information on time and aspect need not be dealt with separately.

4

Introduction

100%

Koseska-Toszewa V.

Cognitive Studies

|

2011

|

issue 11

5

Introduction

100%

Koseska-Toszewa V.

Cognitive Studies

|

2012

|

issue 12

6

About Certain Semantic Annotation in Parallel Corpora

100%

Koseska-Toszewa V.

Cognitive Studies

|

2013

|

issue 13

EN

About Certain Semantic Annotation in Parallel CorporaThe semantic notation analyzed in this works is contained in the second stream of semantic theories presented here – in the direct approach semantics. We used this stream in our work on the Bulgarian-Polish Contrastive Grammar. Our semantic notation distinguishes quantificational meanings of names and predicates, and indicates aspectual and temporal meanings of verbs. It relies on logical scope-based quantification and on the contemporary theory of processes, known as “Petri nets”. Thanks to it, we can distinguish precisely between a language form and its contents, e.g. a perfective verb form has two meanings: an event or a sequence of events and states, finally ended with an event. An imperfective verb form also has two meanings: a state or a sequence of states and events, finally ended with a state. In turn, names are quantified universally or existentially when they are “undefined”, and uniquely (using the iota operator) when they are “defined”. A fact worth emphasizing is the possibility of quantifying not only names, but also the predicate, and then quantification concerns time and aspect. This is a novum in elaborating sentence-level semantics in parallel corpora. For this reason, our semantic notation is manual. We are hoping that it will raise the interest of computer scientists working on automatic methods for processing the given natural languages. Semantic annotation defined like in this work will facilitate contrastive studies of natural languages, and this in turn will verify the results of those studies, and will certainly facilitate human and machine translations.

7

Introduction

100%

Koseska-Toszewa V.

Cognitive Studies

|

2013

|

issue 13

8

From the Editorial Committee

100%

Koseska-Toszewa V.

Cognitive Studies

|

2010

|

issue 10

9

Form, its meaning, and dictionary entries

100%

Koseska-Toszewa V.

Cognitive Studies

|

2010

|

issue 10

EN

Form, its meaning, and dictionary entriesAs we know, a language form is a unit which plays a specific form in the language, e.g. a semantic or syntactical one. We establish the function of a form based on its use (occurrence), i.e. its relation with the meanings of other forms in speech or in a text. The meaning of a form is the value of its function. In the traditional grammar, form is opposed to its meaning. However, various grammar schools have big problems with distinguishingbetween a form and its function. For example, the present tense form has a number of basic temporal meanings in Bulgarian as well as in Polish and Russian, and in none of those languages this is only the present time, (see past, future and habituality expressed using the present tense form). It is a big mistake not to distinguish between the meanings of article in article languages. For example, in Bulgarian the same form of article canexpress both uniqueness and universality (or, respectively: definiteness and indefiniteness). In the quoted book (Koseska-Toszewa 1982), I put forward a hypothesis on the development of the meaning of Bulgarian article. In my opinion, initially the article expressed uniqueness of an element (object), and then started to express also uniqueness of a set, which later, due to equalling two completely different semantically-logical structures, i.e. structures with universal and unique quantification, lead to a homonymy and to the article expressing also universality, i.e. indefiniteness. Similarly in English, French, Rumanian or Albanian, where the same form of article can express either uniqueness or universality. This proves that the above homonymy is of a general rather than typological (e.g. Balkan) character. Naturally, in the above languages the definite article form can also express uniqueness of an object or a set, so it also expresses definiteness. Ambiguity of the definite article form is a phenomenon exceeding the area of Balkan languages, and the only Balkanism is the position of the article - speaking more precisely,its postpositiveness (postpositive position). However, that position gives us no right to treat it differently than the English or French article. In Bulgarian, Rumanian and Albanian the postpositive article is written together with the name its concerns, but it is neither a unit belonging to the root of the word nor the ending of the word.The above observations, based first of all on the semantically-logical aspects of the definiteness category, have been confirmed by the language material from the Suprasl Code, where Bulgarian article does not occur in universally quantified nominal structures, but in uniquely quantified nominal expressions, denoting satisfaction of the predicate either by one element of the sentence or by the whole set treated as the only one.It is worth stressing that distinguishing between the form and its meaning in comparing the material 6 languages belonging to three different groups of Slavic languages (as is the case in the MONDILEX Project) will allow us to avoid numeorus substantiva mistakes and erroneous conclusions. Hence dictionary entries should be verified and made uniform in that respect before they are “digitalized”... Distinction between the form and its meaning in a dictionary entry is fully possible, as shown by works of Z. Saloni (2002) and A.Przepiórkowski (2008).

10

Wielotomowa gramatyka konfrontatywna języka bułgarskiego i polskiego

100%

Koseska-Toszewa V.

Slavia Meridionalis

|

2010

|

vol. 10

EN

Multi-volume Polish-Bulgarian Confrontative GrammarPolish-Bulgarian Confrontative Grammar (GKBP) is the first and so far the only expanded attempt to make the semantic confrontation with the gradually developed interlanguage. GKBP consists of 9 volumes which were published in 12 volumen. It was decided to arrange the description of Polish-Bulgarian confrontative grammar in the direction from the contents to the form. The semantic interlanguage enabled the emergence of two equivalent grammars: the grammar of the modern Bulgarian language and the grammar of the modern Polish language. The analysis of semantic categories, which was applied in GKBP, ensures the coherent confrontative description, irrespective of whether the described languages have grammatical exponent of meanings or not. GKBP falls into the stream of modern theoretical confrontative research based on the logical theory of quantification, on the modern theory of processes titled „Petri nets”, and on the theory of logical predicate-argument structures. Our research removes the exact division into grammatical and lexical levels, thanks to which our research has introduced a lot of new observations of the examined phenomena. Universal semantic linguistic categories such as the time, the modality, the definiteness / the indefiniteness and the semantic case – which are essential for the language description, but have not yet been examined and have not been sufficiently described in the academic grammars of the Polish and Bulgarian languages – have been selected. The sequence of description in this synthesis was established not on the basis of order of developed volumes of GKBP, but on the basis of the generally accepted order of elements of the semantic structure of the sentence. The most external in the semantic structure of the sentence is its modal characteristics. Thereafter, the time, quantifiers and their order in the semantic structure of the sentence as well as the predicate-argument items are placed. Therefore, it is not an abbreviated summary of the issues analyzed in GKBP volumes. It is the description of selected semantic categories organized in accordance with the semantic order of the semantic structure of the Polish and Bulgarian sentences.

11

Multilingualism and Dictionaries

63%

Sosnowski W. P., Koseska-Toszewa V.

Cognitive Studies

|

2015

|

issue 15

EN

Multilingualism and DictionariesThe Russian-Bulgarian-Polish dictionary that we (Wojciech Sosnowski, Violetta Koseska-Toszewa and Anna Kisiel) are currently developing has no precedent as far as its theoretical foundations and its structure are concerned. The dictionary offers a unique combination of three Slavic languages that belong to three different groups: a West Slavic language (Polish), a South Slavic language (Bulgarian) and an East Slavic language (Russian). The dictionary describes semantic and syntactic equivalents of words between the languages. When completed, the dictionary will contain around 30,000 entries. The principle we build the dictionary on is that every language should be given equal status. Many of our data come from the Parallel Polish-Bulgarian-Russian corpus developed by us as part of the CLARIN-PL initiative. In the print version, the entries come in the order of the Cyrillic alphabet and they are not numbered (except for homonyms, which are disambiguated with Roman numbers). We selected the lemmas for the dictionary on the basis of their frequency in the corpus. Our dictionary is the first dictionary to include forms of address and most recent neologisms in the three languages. Faithful to the recent developments in contrastive linguistics, we begin with a form from the dictionary’s primary language and we define it in Polish. Subsequently, based on this definition, we try to find an equivalent in the second and the third language. Therefore, the meaning comes first and only then we look for the form (i.e. the equivalent) that corresponds to this meaning. This principle, outlined in Gramatyka konfrontatywna języków polskiego i bułgarskiego (GKBP), allows us to treat data from multiple languages as equal. In the dictionary, we draw attention to the correct choice of equivalents in translation; we also provide categorisers that indicate the meaning of verbal tenses and aspects. The definitions of states, events and their different configurations follow those outlined in the net model of verbal tense and aspect. The transitive vs. intransitive categorisers are vital for the languages in question, since they belong to two different types: synthetic (Bulgarian) and analytic (Polish and Russian). We predict that the equal status of every language in the dictionary will facilitate easier and faster development of an electronic version in the future.

12

On Semantic Annotation in Clarin-PL Parallel Corpora

63%

Koseska-Toszewa V., Roszko R.

Cognitive Studies

|

2015

|

issue 15

EN

On Semantic Annotation in Clarin-PL Parallel CorporaIn the article, the authors present a proposal for semantic annotation in Clarin-PL parallel corpora: Polish-Bulgarian-Russian and Polish-Lithuanian ones. Semantic annotation of quantification is a novum in developing sentence level semantics in multilingual parallel corpora. This is why our semantic annotation is manual. The authors hope it will be interesting to IT specialists working on automatic processing of the given natural languages. Semantic annotation defined the way it is defined here will make contrastive studies of natural languages more efficient, which in turn will help verify the results of those studies, and will certainly improve human and machine translations.

13

About meanings as a rule not included in dictionaries

63%

Koseska-Toszewa V., Duszkin M.

Cognitive Studies

|

2012

|

issue 12

EN

About meanings as a rule not included in dictionariesThe article discusses the problem of ambiguity of the exponents of logical existential quantification occurring in the verbal group of Bulgarian, Polish and Russian sentences: Bulg. отвреме навреме, понякога, Pol. od czasu do czasu, czasem, czasami, Rus. время от времени, иногда. This ambiguity can be explained by strong and weak existential quantification. Lexical semantics did not distinguish between these two types of existentiality. Here we present a description of selected exponents of logical existential quantification in Bulgarian, Polish and Russian.

14

Bulgarian-Polish parallel digital corpus and quantification of time

63%

Dimitrova L., Koseska-Toszewa V.

Cognitive Studies

|

2012

|

issue 12

EN

Bulgarian-Polish parallel digital corpus and quantification of timeThe paper presents the current state of the first Bulgarian-Polish parallel and aligned corpus, prepared in the frame of the joint research project “Semantics and Contrastive linguistics with a focus on a bilingual electronic dictionary” between the Institute of Mathematics and Informatics, Bulgarian Academy of Sciences and the Institute of Slavic Studies, Polish Academy of Sciences, coordinated by L. Dimitrova and V. Koseska-Toszewa. In particular, problems related to tense quantification are also discussed

15

Dialog between a Lexicographer and a Translator

63%

Kit M., Koseska-Toszewa V.

Cognitive Studies

|

2013

|

issue 13

EN

Dialog between a Lexicographer and a TranslatorThe discussion between the authors of the paper concerns the most pressing issues encountered in natural language semantics, as well as in corpus linguistics and computational linguistics. A broad range of knowledge, allowing linguists and information scientists to work together, is required in these areas. The paper describes some primary problems of human and machine translation caused by gaps between different fields of knowledge. The authors suggest that interdisciplinary approach is required when it comes to contrastive studies in linguistics.

16

Bulgarian-Polish Language Resources (Current State and Future Development)

63%

Dimitrova L., Koseska-Toszewa V.

Cognitive Studies

|

2013

|

issue 13

EN

Bulgarian-Polish Language Resources (Current State and Future Development)The paper briefly reviews the first Bulgarian-Polish digital bilingual resources: corpora and dictionaries, which are currently developed under bilateral collaboration between IMI-BAS and ISS-PAS: joint research project “Semantics and contrastive linguistics with a focus on a bilingual electronic dictionary”, coordinated by L. Dimitrova (IMI-BAS) and V. Koseska (ISS-PAS).

17

Constructing catalogue of temporal situations

63%

Koseska-Toszewa V., Mazurkiewicz A.

Cognitive Studies

|

2010

|

issue 10

EN

Constructing catalogue of temporal situationsThe paper is aiming to create a common basis for description, comparing, and analysis natural languages. As a subject of comparison we have chosen temporal structures of some languages. For such a choice there exists a perfect tool, describing basic temporal phenomena, namely an ordering of states and events in time, certainty and uncertainty, independency of histories of separate objects, necessity and possibility. This tool is supported by the Petri nets formalism, which seems to be well suited for expressing the above mentioned phenomena. Petri nets are built form three primitive notions: of states, of events that begin or end the states, and so-called flow relation indicating succession of states and events. This simple constituents give rise to many possibilities of representing temporal phenomena; it turns out that such representations are sufficient for many (clearly, not necessarily all) temporal situations appearing in natural languages.In description formalisms used till now there is no possibility of expressing such reality phenomena as temporal dependencies in compound statement, or combination of temporality and modality. Moreover, using these formalisms one cannot distinguish between two different sources of uncertainty of the speaker while describing the reality: one, due to the lack of knowledge of the speaker what is going on in outside world, the second, due to objective impossibility of foreseen ways in which some conflict situations will be (or already have been) resolved. Petri net formalism seems to be perfectly suited for such differentiations.There are two main description principles that encompassed this paper. First, that assigns meaning to names of grammatical structures in different languages may lead to misunderstanding. Two grammatical structures with apparently close names may describe different reality. Additionally, some grammatical terms used in one language may be absent and not understandable in the other. It leads to assign meanings to situations, rather than to linguistic forms used for their expression. The second principle is limit the discussed issues to such a piece of reality that can be possible for precise description. The third is to avoid introducing such information to the described reality that is not explicitly mentioned by linguistic means. The authors try to following these principles in the present paper.The paper is organized as follows. First, some samples of situations related to present tense are given together with examples of their expressions in four languages: English, (as a reference language) and three Slavic languages, representing South slavonic languages (Bulgarian), West slavonic languages (Polish), and East slavonic languages (Russian). Within the same framework the next parts of the paper are constructed, supplying samples of using Past tenses and, finally, future tenses and modalities.The formal tools for description purposes are introduced stepwise, according to needs caused be the described reality. There are mainly Petri nets, equipped additionally with inscriptions or labeling in order to keep proper assignations of description units to described objects.

18

Języki słowiańskie i litewski w korpusach równoległych Clarin-PL

63%

Koseska-Toszewa V., Roszko R.

Studia z Filologii Polskiej i Słowiańskiej

|

2016

|

vol. 51

EN

Slavic languages and the Lithuanian language in the Clarin-PL parallel corporaThe Clarin Eric and Clarin-PL strategic scientific purpose is to support humanistic research in a multicultural and multilingual Europe. Polish researchers put the emphasis on building a bridge between the Polish language and Polish linguistic technologies and other European languages and their linguistic technologies. So far, the Polish scientific community has mainly focused on Polish-English connections. Clarin-PL has been developing the first and only multilingual corpora of the Polish language in conjunction with other Slavic languages and the Lithuanian language: the Polish-Bulgarian-Russian Parallel Corpus and the Polish- Lithuanian Parallel Corpus. The parallel corpora created by the ISS PAS Corpus Linguistics and Semantics Team break through the existing “canons” and allow scientists access to interlinked multilingual language resources – in the first phase limited to the languages of the three Slavic groups and the Lithuanian language. In the article, the authors present very detailed information on their original system of the semantic annotation of scope quantification in multilingual parallel corpora, hitherto unused in the subject literature. Due to the system’s originality, the semantic annotation is carried out manually. Identification of particular values of scope quantification in a sentence and the hereby presented attempts of its recording are supported by long-term research conducted by an international team of linguists and computer scientists / mathematicians developing the issue of quantification of names, time and aspect in natural languages. Języki słowiańskie i litewski w korpusach równoległych Clarin-PLStrategicznym celem naukowym Clarin ERIC i Clarin-PL jest wspieranie badań humanistycznych w wielokulturowej i wielojęzycznej Europie. Dla polskich badaczy ważna jest budowa pomostu między językiem polskim, polskimi technologiami językowymi a innymi językami europejskimi i na ich rzecz opracowanymi technologiami językowymi. Dotychczas w nauce polskiej największy nacisk był kładziony na powiązania polsko-angielskie. Clarin-PL opracowuje zatem pierwsze jak dotąd wielojęzyczne korpusy języka polskiego w zestawieniu z innymi językami słowiańskimi oraz z językiem litewskim: Korpus równoległy polsko-bułgarsko-rosyjski i Korpus równoległy polsko-litewski. Tworzone przez Zespół Lingwistyki Korpusowej i Semantyki (IS PAN) korpusy równoległe przełamują dotychczasowe „kanony” i udostępniają nauce powiązane wielojęzyczne zasoby – w pierwszym etapie ograniczone do języków trzech grup słowiańskich oraz języka litewskiego. W artykule autorzy przedstawiają bardzo szczegółową informację o zastosowanej po raz pierwszy w literaturze przedmiotu anotacji semantycznej dotyczącej kwantyfikacji zakresowej w wielojęzycznych korpusach równoległych. Z powodu swojego rozległego zakresu i nowatorstwa ta anotacja semantyczna jest nanoszona ręcznie. Identyfikacja poszczególnych wartości kwantyfikacji zakresowej w zdaniu oraz przedstawiane tu próby jej zapisu są poparte wieloletnimi badaniami międzynarodowego zespołu lingwistów i matematyków-informatyków opracowujących zagadnienie kwantyfikacji imion, czasu i aspektu w językach naturalnych.

19

Web presentation of bilingual corpora (Slovak-Bulgarian and Bulgarian-Polish)

51%

Garabík R., Dimitrova L., Koseska-Toszewa V.

Cognitive Studies

|

2011

|

issue 11

EN

Web presentation of bilingual corpora (Slovak-Bulgarian and Bulgarian-Polish)In this paper we focus on the web-presentation of bilingual corpora in three Slavic languages and their possible applications. Slovak-Bulgarian and Bulgarian-Polish corpora are collected and developed as results of the collaboration in the frameworks of two joint research projects between Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, from one side, and from the other side: Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences and Institute of Slavic Studies, Polish Academy of Sciences, coordinate by authors of this paper.

20

Neologisms in bilingual digital dictionaries (on the example of Bulgarian-Polish dictionary)

51%

Dimitrova L., Koseska-Toszewa V., Satoła-Staśkowiak J.

Cognitive Studies

|

2012

|

issue 12

EN

Neologisms in bilingual digital dictionaries (on the example of Bulgarian-Polish dictionary)The paper discusses the presentation of neologisms in the recent version of the Bulgarian-Polish digital dictionary. We also continue the discussion of important problems related to the classifiers of the verbs as headwords of the digital dictionary entries. We analyze some examples from ongoing experimental version of the Bulgarian-Polish digital dictionary.

Refine search results

Gramatyka konfrontatywna języków polskiego i bułgarskiego

Introduction

Definiteness-indefiniteness category and logical quantification

Introduction

Introduction

About Certain Semantic Annotation in Parallel Corpora

Introduction

From the Editorial Committee

Form, its meaning, and dictionary entries

Wielotomowa gramatyka konfrontatywna języka bułgarskiego i polskiego

Multilingualism and Dictionaries

On Semantic Annotation in Clarin-PL Parallel Corpora

About meanings as a rule not included in dictionaries

Bulgarian-Polish parallel digital corpus and quantification of time

Dialog between a Lexicographer and a Translator

Bulgarian-Polish Language Resources (Current State and Future Development)

Constructing catalogue of temporal situations

Języki słowiańskie i litewski w korpusach równoległych Clarin-PL

Web presentation of bilingual corpora (Slovak-Bulgarian and Bulgarian-Polish)

Neologisms in bilingual digital dictionaries (on the example of Bulgarian-Polish dictionary)