Product detail

proof_platform: Platform for automated analysis and archiving of data from the web

KOCMAN, T.; POLČÁK, L.

Product type

software

Abstract

This platform scrapes web page content and stores it in an offline persistent database. The web crawl is driven by user-supplied regular expressions that may represent, for example, torrent file names, Bitcoin wallets, or keywords. The collected data may be used by law enforcement and other entities, for example to search for information about a specific product. Archived data are stored in a database and remain available for later use, unaffected by subsequent changes to the content on the web server.
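The matching and archiving idea described in the abstract can be illustrated with a minimal sketch. It is not taken from the proof_platform source code: the pattern names, the regular expressions, the SQLite schema, and the functions `find_matches` and `archive` are all assumptions made for illustration. The Bitcoin expression is a simplified one covering only legacy Base58 addresses and performs no checksum validation.

```python
import re
import sqlite3
from datetime import datetime, timezone

# Hypothetical user-supplied patterns; names and expressions are illustrative only.
PATTERNS = {
    # Simplified legacy Base58 address shape, no checksum validation.
    "bitcoin_address": re.compile(r"\b[13][a-km-zA-HJ-NP-Z1-9]{25,34}\b"),
    "torrent_file": re.compile(r"\b[\w.-]+\.torrent\b"),
    "keyword": re.compile(r"\bfirmware\b", re.IGNORECASE),
}


def find_matches(text):
    """Return (pattern_name, matched_text) pairs found in the page text."""
    return [
        (name, match.group(0))
        for name, regex in PATTERNS.items()
        for match in regex.finditer(text)
    ]


def archive(conn, url, text):
    """Store a timestamped snapshot of the page and its matches.

    Returns the number of matches recorded for this snapshot.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS snapshot ("
        "id INTEGER PRIMARY KEY, url TEXT, fetched_at TEXT, content TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS hit ("
        "snapshot_id INTEGER, pattern TEXT, value TEXT)"
    )
    cursor = conn.execute(
        "INSERT INTO snapshot (url, fetched_at, content) VALUES (?, ?, ?)",
        (url, datetime.now(timezone.utc).isoformat(), text),
    )
    matches = find_matches(text)
    conn.executemany(
        "INSERT INTO hit (snapshot_id, pattern, value) VALUES (?, ?, ?)",
        [(cursor.lastrowid, name, value) for name, value in matches],
    )
    conn.commit()
    return len(matches)
```

Because each call stores the page text as it was at fetch time, later changes on the web server do not alter what has already been archived. In this sketch, `archive(sqlite3.connect("archive.db"), url, page_text)` would be called once per fetched page; fetching the page itself is left out.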

Keywords

Web crawling, web scraping.

Create date

25. 4. 2019

Location

https://gitlab.com/tomaskocman/proof_platform

Possibilities of use

Use of the result by another entity always requires acquiring a licence

Licence fee

The licence provider does not require a licence fee for the result
