Wprowadzenie do metod statystycznych w tłumaczeniu automatycznym

Junczys-Dowmunt, Marcin

doi:10.14746/il.2008.16.5

Article details

Journal

Investigationes Linguisticae

2008 | 16 | 44-66

Article title

Wprowadzenie do metod statystycznych w tłumaczeniu automatycznym

Authors

Marcin Junczys-Dowmunt

Content

Full texts:

Download

Title variants

EN

An introduction to statistical methods in machine translation

Languages of publication

PL

Abstracts

PL

The intention of this article is to provide a concise introduction to the basic mathematical concepts of statistical translation models as they were introduced by Brown et al. (1993) in their groundbreaking work The Mathematics of Statistical Machine Translation: Parameter Estimation. We concentrate on a simplified description of the first two translation models known as IBM Model 1 and 2. It is one major aim of this work to serve as tutoring material for students of computational linguistics, mathematics or computer science and therefore a lot of comments, additional examples and step-by-step explanations are given, augmenting the original formula by Brown et al. (1993). For both discussed models the calculations for a small parallel corpus are described in detail.

EN

The intention of this article is to provide a concise introduction to the basic mathematical concepts of statistical translation models as they were introduced by Brown et al. (1993) in their groundbreaking work The Mathematics of Statistical Machine Translation: Parameter Estimation. We concentrate on a simplified description of the first two translation models known as IBM Model 1 and 2. It is one major aim of this work to serve as tutoring material for students of computational linguistics, mathematics or computer science and therefore a lot of comments, additional examples and step-by-step explanations are given, augmenting the original formula by Brown et al. (1993). For both discussed models the calculations for a small parallel corpus are described in detail.

Keywords

Publisher

Uniwersytet im. Adama Mickiewicza w Poznaniu

Journal

Investigationes Linguisticae

Year

2008

Volume

16

Pages

44-66

Physical description

Dates

published

2008-12-15

Contributors

author

Marcin Junczys-Dowmunt

Instytut Językoznawstwa, Uniwersytet im. Adama Mickiewicza al. Niepodległości 4, 61-874 Poznań

References

Al-Onaizan, Y., J. Curin, M. Jahr, K. Knight, J. Lafferty, I. Melamed, F. Och, D. Purdy, N. Smith i D. Yarowsky (1999). Statistical machine translation. Rap. tech., JHU workshop.
Brown, P. F., V. J. Della Pietra, S. A. Della Pietra i R. L. Mercer (1993). The mathematics of statistical machine translation: parameter estimation. Comput. Linguist., 19(2):263-311.
Dempster, A. P.,N. M. Laird i D. B. Rubin (1977). Maximum Likelihood from Incomplete Data via the EM Algorithm.
Journal of the Royal Statistical Society. Series B (Methodological), 39(1): 1-38.
Jurafsky, D. i J. H. Martin (2000). Speech andLanguage Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition (International Edition). Prentice Hall.
Kay, M. i M. Roscheisen (1993). Text-trańslatioń aligńmeńt. Comput. Linguist., 19(1): 121-142.
Knight, K. (1999). A Statistical MT Tutorial Workbook. Niepublikowane.
Manning, C. D. i H. Schutze (1999). Foundations o f Statistical Natural Language Processing. The MIT Press.
Melamed, I. D. (2000). Models of translational eąuwalence among words. Comput. Linguist., 26(2):221-249.
Och, F. J. i H. Ney (2003). A systematic comparison of various statistical aligńmeńt models. Comput. Linguist., 29(1): 19-51.
Somers, H. (2001). Bilingual parallel corpora and Language Engineering.

Document Type

Publication order reference

Identifiers

DOI

10.14746/il.2008.16.5

YADDA identifier

bwmeta1.element.ojs-doi-10_14746_il_2008_16_5

Article details

Journal

Investigationes Linguisticae

Article title

Wprowadzenie do metod statystycznych w tłumaczeniu automatycznym

Authors

Content

Title variants

Languages of publication

Abstracts

Keywords

Publisher

Journal

Year

Volume

Pages

Physical description

Dates

Contributors

References

Document Type

Publication order reference

Identifiers

YADDA identifier