EN
The article introduces methods of estimating the number of duplications in large databases. The analysis uses simulation methods. The simulation analyses and compares three different types of theoretical model, based on research into the actual database of a company.