Product detail

Tool for Distributed Extraction of Timestamped Events from Files

RYCHLÝ, M.; BURGET, R.

Product type

software

Abstract

A tool for distributed extraction of timestamps from various files, using extractors adapted from the Plaso engine to run on Apache Spark infrastructure. The files to be processed are uploaded to the HDFS distributed file system, and the extraction process is controlled by a Web service via its REST API. The tool is able to efficiently utilise large distributed clusters.

Keywords

files, events, timestamps, extraction, distributed system

Create date

20. 12. 2019

Location

https://github.com/nesfit/pyspark-plaso

Possibilities of use

In some cases, the result may be used by another entity without acquiring a licence

Licence fee

The licence provider does not require a licence fee for the result
