Přístupnostní navigace
E-application
Search Search Close
Publication detail
ZELENÝ, J. BURGET, R.
Original Title
Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation
Type
article in a collection out of WoS and Scopus
Language
English
Original Abstract
In our previous work we have designed a method for fast and precise Web page segmentation. In this paper we propose a complementary algorithm and data structures that extend the original design. The extension is focused on isomorphic mapping between two DOM trees. Our main objective is to improve robustness of our original solution. We successfully design and implement a solution that is more robust while keeping the efficiency of the original simple one. To prove qualities of our new design we also offer an experimental evaluation of the new implementation.
Keywords
vision-based page segmentation, cache, template detection, cluster-based page segmentation, DOM, tree mapping
Authors
ZELENÝ, J.; BURGET, R.
RIV year
2013
Released
5. 11. 2013
Publisher
The University of Technology Košice
Location
Spišská Nová Ves
ISBN
978-80-8143-127-2
Book
Proceedings of the Twelfth International Conference on Informatics INFORMATICS'2013
Pages from
256
Pages to
261
Pages count
6
URL
https://www.fit.vut.cz/research/publication/10414/
BibTex
@inproceedings{BUT103543, author="Jan {Zelený} and Radek {Burget}", title="Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation", booktitle="Proceedings of the Twelfth International Conference on Informatics INFORMATICS'2013", year="2013", pages="256--261", publisher="The University of Technology Košice", address="Spišská Nová Ves", isbn="978-80-8143-127-2", url="https://www.fit.vut.cz/research/publication/10414/" }
Documents
jzeleny.pdf