Detail publikace

Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases

OTRUSINA, L. SMRŽ, P.

Originální název

Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases

Typ

článek ve sborníku mimo WoS a Scopus

Jazyk

angličtina

Originální abstrakt

This paper deals with semantic enrichment of textual resources by means of automatically generated named entity recognizers-linkers and advanced indexing and searching mechanisms that can be integrated into various information retrieval and information extraction systems. It introduces a new system transforming Wikipedia and other available sources into task-specific knowledge bases and employs contextual information to build state-of-the-art entity disambiguation components. Although some components are language-dependent (for example, that responsible for the morphology analysis or the semantic role identification), they can be easily replaced by existing tools providing specific functions. As a case study, we demonstrate an instantiation of the system for the task of semantic annotation of Czech bibliographic databases in the context of the CPK project. We particularly stress the role of problem-specific knowledge sources that can be easily integrated into our system and play a key role in the success of the tool in real applications.

Autoři

OTRUSINA, L.; SMRŽ, P.

Vydáno

18. 12. 2017

Místo

Kolkata

Strany od

523

Strany do

532

Strany počet

10

URL

BibTex

@inproceedings{BUT168464,
  author="Lubomír {Otrusina} and Pavel {Smrž}",
  title="Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases",
  booktitle="Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017)",
  year="2017",
  pages="523--532",
  address="Kolkata",
  url="http://www.aclweb.org/anthology/W/W17/W17-7563"
}