Přístupnostní navigace
E-přihláška
Vyhledávání Vyhledat Zavřít
Detail produktu
BURGET, R. MILIČKA, M.
Typ produktu
software
Abstrakt
FitLayout is an extensible web page segmentation framework written in Java. It defines a generic Java API for representing a rendered web page and its division to visual areas and it provides a base for implementing page segmentation algorithms with a common application interface. As a sample segmentation method, it implements a previously published segmentation algorithm based on recursive visual area merging and separator detection. The framework includes tools for post-processing the segmentation result by different text or visual classification methods. Finally, it also provides tools for controlling the segmentation process and examining the segmentation results through a graphical user interface. The segmentation result may be stored as RDF data for later analysis.
Klíčová slova
web page segmentation, document analysis, text classification, web page rendering
Datum vzniku
3. 12. 2014
Umístění
http://www.fit.vutbr.cz/~burgetr/FITLayout/
Možnosti využití
K využití výsledku jiným subjektem je vždy nutné nabytí licence
Licenční poplatek
Poskytovatel licence na výsledek nepožaduje licenční poplatek
www