The paper presents a system for automatic content extraction from mammogram reports written in Polish. The system combines general information extraction (IE) techniques with external post-processing aimed at structuring the results. The paper characterizes this specific type of text, describes the results obtained, and briefly analyzes the advantages and disadvantages of shallow text processing.