Full-text resources of CEJSH and other databases are now available in the new Library of Science.
Visit https://bibliotekanauki.pl

PL EN


2010 | 22(35) | 141-157

Article title

INFORMATION EXTRACTION FROM WEB PAGES FOR THE NEEDS OF EXPERT FINDING

Title variants

Languages of publication

EN

Abstracts

EN
This paper describes a mechanism for the extraction of relevant information about people from Polish portals for professionals. The method of information extraction is based on hierarchical execution of XPath commands and regular expressions depending on the structure of processed documents. The extraction component EXT is a part of the eXtraSpec system, which task is to support Human Resources departments of Polish companies during recruitment and team building. EXT is able to deal with several sources of information and with user profiles that are acquired from professionals' portals. In this article we also discuss the advantages of the chosen extraction method in the context of the goals of the whole eXtraSpec system and we show the directions of future research.

Publisher

Year

Issue

Pages

141-157

Physical description

Document type

ARTICLE

Contributors

author
  • Tomasz Kaczmarek, Poznan University of Economics, Faculty of Informatics and Electronic Economy, Department of Information Systems, Poznan, Poland

References

Document Type

Publication order reference

Identifiers

CEJSH db identifier
11PLAAAA101629

YADDA identifier

bwmeta1.element.043771e7-f4b0-39d4-8503-ef19fcceb9e7
JavaScript is turned off in your web browser. Turn it on to take full advantage of this site, then refresh the page.