The article introduces methods of estimating the number of duplications in large databases. The analysis uses simulation methods. The simulation analyses and compares three different types of theoretical model, based on research into the actual database of a company.
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.