Results found: 3


Search results

Search:
in the keywords:  fonotaktika
EN
Phonotactic probability refers to the frequency with which phonological segments and sequences of phonological segments occur in words in a given language (Vitevitch – Luce, 2004). It has been shown that phonotactic probabilities of words are important in language processing and language acquisition (Jusczyk et al., 1994; Mattys – Jusczyk, 2001; Pitt – McQueen, 1998). For example, words with high phonotactic probability are processed faster by native speakers in same-different tasks (Luce – Large, 2001), and pseudowords with high phonotactic probability are judged as more word-like by adults (Vitevitch et al., 1997). In this paper we present a phonotactic calculator for Czech implemented as a Python script. The script relies on frequency data from three freely available corpora of Czech: SYN2015 and SYN2020, corpora of written Czech (Křen et al., 2015; 2020), and ORAL v1, a corpus of spoken Czech (Kopřivová et al., 2017). The steps of the calculation mirror those developed by Vitevitch and Luce (2004) for English, and the script can provide phonotactic (and additionally orthotactic) probability for any Czech word or pseudoword. The script can be downloaded at .
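The calculation described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' script. It assumes the commonly described Vitevitch and Luce approach, in which each segment (or biphone) in a given position is scored as the summed log10 frequency of the words containing it in that position, divided by the summed log10 frequency of all words that have any segment (or biphone) in that position. The lexicon `TOY_LEXICON`, its counts, and the function names `build_tables` and `phonotactic_probability` are hypothetical and do not come from SYN2015, SYN2020, or ORAL v1.

```python
import math
from collections import defaultdict

# Hypothetical toy lexicon: transcription (one character per segment) -> frequency.
# The counts are invented for illustration only.
TOY_LEXICON = {"pes": 1000, "les": 100, "pas": 10, "sad": 100}


def build_tables(lexicon):
    """Sum log10 word frequencies per (position, segment) and (position, biphone)."""
    seg, seg_total = defaultdict(float), defaultdict(float)
    bi, bi_total = defaultdict(float), defaultdict(float)
    for word, freq in lexicon.items():
        weight = math.log10(freq)  # a word with frequency 1 contributes 0
        for i, s in enumerate(word):
            seg[(i, s)] += weight
            seg_total[i] += weight
        for i in range(len(word) - 1):
            bi[(i, word[i:i + 2])] += weight
            bi_total[i] += weight
    return seg, seg_total, bi, bi_total


def _ratio(numerator, denominator):
    """Safe division: unattested positions yield probability 0."""
    return numerator / denominator if denominator else 0.0


def phonotactic_probability(word, tables):
    """Return positional segment and biphone probabilities plus their sums."""
    seg, seg_total, bi, bi_total = tables
    segment_probs = [
        _ratio(seg.get((i, s), 0.0), seg_total.get(i, 0.0))
        for i, s in enumerate(word)
    ]
    biphone_probs = [
        _ratio(bi.get((i, word[i:i + 2]), 0.0), bi_total.get(i, 0.0))
        for i in range(len(word) - 1)
    ]
    return {
        "segment_probs": segment_probs,
        "biphone_probs": biphone_probs,
        "segment_sum": sum(segment_probs),
        "biphone_sum": sum(biphone_probs),
    }


tables = build_tables(TOY_LEXICON)
print(phonotactic_probability("pes", tables))  # attested word
print(phonotactic_probability("las", tables))  # pseudoword
```

Because the tables are keyed by position, the same segment can receive different probabilities word-initially and word-finally. An orthotactic variant would follow the same logic with letters in place of phonological segments.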
EN
This study aims to provide and analyze a representative list of Czech initial syllable onsets and final codas along with their frequencies of occurrence in running text (token frequencies) and in the vocabulary of unique word forms extracted from it (type frequencies). The frequency data are important because many experiments have demonstrated that phonotactics is not categorical, but rather gradient in nature. Importantly, the study analyzes and compares both spoken and written texts, using the Czech National Corpus, and the two modalities are hypothesized to yield different outcomes. All words in the sample were transcribed phonemically and analyzed. A general preference was found for phonotactic structures that are simple in the context of the attested inventory, and the two corpora differed most in the repertoire of complex onsets/codas (some sequences being unique to one modality) as well as in their respective frequencies. The results are discussed in relation to previous studies of Czech phonotactics, and evaluated with respect to implications for phonological theory, focusing on spoken/written and type/token comparisons.
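The distinction between token and type frequencies of word-initial onsets and word-final codas can be sketched in Python as follows. This is an illustrative sketch only, not the study's procedure: the token list is invented, the vowel set in `VOWELS` is a simplified assumption, and the helper names are hypothetical. The sketch also ignores syllabic liquids and diphthongs, which a real analysis of Czech would need to handle.

```python
from collections import Counter

# Simplified vowel set (assumption); one character stands for one phoneme.
VOWELS = set("aeiouáéíóú")


def initial_onset(word):
    """Consonants before the first vowel, or None if the word has no vowel."""
    for i, segment in enumerate(word):
        if segment in VOWELS:
            return word[:i]
    return None


def final_coda(word):
    """Consonants after the last vowel, or None if the word has no vowel."""
    for i in range(len(word) - 1, -1, -1):
        if word[i] in VOWELS:
            return word[i + 1:]
    return None


def edge_frequencies(tokens, extract):
    """Count a word-edge cluster over running text (tokens) and unique forms (types)."""
    token_counts = Counter()
    type_counts = Counter()
    for word in tokens:
        cluster = extract(word)
        if cluster is not None:
            token_counts[cluster] += 1
    for word in set(tokens):
        cluster = extract(word)
        if cluster is not None:
            type_counts[cluster] += 1
    return token_counts, type_counts


# Invented running text, already "transcribed" for the purposes of the sketch.
tokens = ["strom", "strom", "stan", "pes", "pes", "most"]
onset_tokens, onset_types = edge_frequencies(tokens, initial_onset)
coda_tokens, coda_types = edge_frequencies(tokens, final_coda)
print(onset_tokens, onset_types)
print(coda_tokens, coda_types)
```

In this toy sample the onset `str` has a token frequency of 2 but a type frequency of 1, which is the kind of divergence the type/token comparison is meant to expose.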
EN
Analyzing a corpus of over 36,000 words of foreign origin and 33,000 native words, the paper presents a detailed account and comparison of the distribution of consonant-vowel combinations in Czech. The combinations are discussed by vowel class. Some combinations were found to be typical of foreign-origin words and others typical of native words. In some cases, this typicality is predictable from the phonemes in the combination (i.e. from phonemes that are themselves typical of one type of vocabulary), but for many combination types it is not. The paper concludes with a discussion of the factors that may influence the distribution of consonant-vowel combinations, including sound changes, phoneme frequency, morphological processes, and universal preferences in the combinability of consonants with vowels. Special attention is paid to loanwords from English.
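One way to compare the distribution of consonant-vowel combinations across two word lists is sketched below in Python. The sketch is an assumption about how such a comparison could be set up, not the paper's method: the two word lists are invented, letters stand in for phonemes, the vowel set is simplified, and the function names are hypothetical.

```python
from collections import Counter

# Simplified vowel set (assumption); one character stands for one phoneme.
VOWELS = set("aeiouáéíóú")


def cv_combinations(word):
    """Return every adjacent consonant + vowel pair in the word."""
    return [
        word[i:i + 2]
        for i in range(len(word) - 1)
        if word[i] not in VOWELS and word[i + 1] in VOWELS
    ]


def cv_shares(words):
    """Relative frequency of each consonant-vowel combination in a word list."""
    counts = Counter(cv for word in words for cv in cv_combinations(word))
    total = sum(counts.values())
    return {cv: n / total for cv, n in counts.items()} if total else {}


# Invented mini word lists standing in for the native and foreign-origin samples.
native_words = ["ruka", "noha", "voda"]
foreign_words = ["foto", "kino", "guma"]

native = cv_shares(native_words)
foreign = cv_shares(foreign_words)

# List each combination with its share in both lists (0.0 when unattested).
for cv in sorted(set(native) | set(foreign)):
    print(cv, round(native.get(cv, 0.0), 3), round(foreign.get(cv, 0.0), 3))
```

Combinations with a share of zero in one list and a non-zero share in the other are candidates for being typical of one type of vocabulary, although with real data a frequency threshold or a statistical test would be needed before drawing that conclusion.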