Přístupnostní navigace
E-přihláška
Vyhledávání Vyhledat Zavřít
Detail publikace
VESELÝ, V.
Originální název
Workshop on Evidence Collection and Analysis of Webpages
Typ
audiovizuální tvorba
Jazyk
angličtina
Originální abstrakt
Cybercrimes such as ransomware, cyberbullying, scam, illicit darknet activities, inappropriate sexual content distribution, or even phishing have a very unstable nature when it comes to the collection of evidence. Webpages related to these crimes are usually available only for a couple of days, sometimes even hours. The workshop presents methods on how to effectively download, decode, parse and archive such webpages. It focuses on a safe and auditable collection of valuable (meta)data that can be later used as proof. The presentation outlines the theory behind modern web design (HTML, CSS, Java/TypeScript), well-known libraries for scraping, and decoding (e.g., Scrapy, Selenium) and current challenges (such as single-page applications, access to dynamic content, execution of JavaScript). The session includes demonstrations of the collection process and existing tools. Participants will receive our open-source tool, which easily archives a given URL content together with a basic set of metadata.
Klíčová slova
webscrabing, HTTP, HTTPS, decoding
Autoři
Vydáno
5. 12. 2019
Místo
Kuala Lumpur
Strany počet
58
URL
https://www.fit.vut.cz/research/publication/12148/
BibTex
@misc{BUT162297, author="Vladimír {Veselý}", title="Workshop on Evidence Collection and Analysis of Webpages", year="2019", pages="58", address="Kuala Lumpur", url="https://www.fit.vut.cz/research/publication/12148/", note="presentation" }