Publication detail
BURGET, R.
Original title
HTML Document Analysis for Information Extraction
Type
conference proceedings paper not indexed in WoS or Scopus
Language
English
Original abstract
Today's World Wide Web contains a vast amount of information stored in HTML documents. However, the HTML language primarily describes the look of the documents and does not contain facilities for describing the structure of the contained data. In this paper, we propose a model of a Web site that describes the logical structure of the contained data. Furthermore, we propose methods for creating such a model by analyzing the look and the structure of HTML documents.
Keywords
HTML Analysis, Information Extraction
Authors
Published
25 April 2002
Publisher
Faculty of Information Technology BUT
Place
Brno
ISBN
80-214-2116-9
Book
Proceedings of 8th EEICT conference
Pages from
426
Pages to
430
Number of pages
5
BibTeX
@inproceedings{BUT10014,
  author="Radek {Burget}",
  title="HTML Document Analysis for Information Extraction",
  booktitle="Proceedings of 8th EEICT conference",
  year="2002",
  pages="426--430",
  publisher="Faculty of Information Technology BUT",
  address="Brno",
  isbn="80-214-2116-9"
}