PySpark Plaso  Release 2019
A tool for distributed extraction of timestamps from various files using extractors adapted from the Plaso engine to Apache Spark.
lib Directory Reference

Files

file  __init__.py
 
file  custom_pip_package_dir.py
 
file  hdfs.py
 
file  pyarrow_hdfs.py
 
file  pyspark_hdfs.py