PySpark Plaso
Release 2019
A tool for distributed extraction of timestamps from various files using extractors adapted from the Plaso engine to Apache Spark.
|
Public Member Functions | |
def | __init__ (self, hdfs_base_uri) |
def | ls (self, hdfs_path="") |
def | rm (self, hdfs_path="") |
def | get (self, hdfs_path="") |
def | get_zip (self, hdfs_path="") |
def | put (self, request, hdfs_path) |
def | put_form (self, request, hdfs_path) |
def | zip_put (self, request, hdfs_path="") |
def | zip_put_form (self, request, hdfs_path="") |
![]() | |
def | make_hdfs_uri (self, hdfs_path) |
def | strip_hdfs_uri (self, hdfs_path) |
Public Attributes | |
hdfs | |
![]() | |
hdfs_base_uri | |
Controller for HDFS-related management operations.
def plaso.tarzan.app.controllers.filemancontroller.FileManController.__init__ | ( | self, | |
hdfs_base_uri | |||
) |
Create a new controller that is able to store and utilize HDFS URI in HDFS queries and operations. :param hdfs_base_uri: the base HDFS URI to store
Reimplemented from plaso.tarzan.app.controllers.controller.Controller.
def plaso.tarzan.app.controllers.filemancontroller.FileManController.get | ( | self, | |
hdfs_path = "" |
|||
) |
Get a Flask Response to download a file in a given HDFS path. :param hdfs_path: the HDFS path of the file to download :return: the Flask Response of the download
def plaso.tarzan.app.controllers.filemancontroller.FileManController.get_zip | ( | self, | |
hdfs_path = "" |
|||
) |
Get a Flask Response to download a ZIP archive of files and directories in a given HDFS path. :param hdfs_path: the HDFS path which include into the ZIP archive :return: the Flask Response to download the ZIP file
def plaso.tarzan.app.controllers.filemancontroller.FileManController.ls | ( | self, | |
hdfs_path = "" |
|||
) |
Get a Flask Response listing all files and directories (the dirs are suffixed by "/") that are in a given HDFS path. :param hdfs_path: the HDFS path to search for the files and directories :return: the Flask Response of the list of files and directories in the path
def plaso.tarzan.app.controllers.filemancontroller.FileManController.put | ( | self, | |
request, | |||
hdfs_path | |||
) |
Copy the content uploaded by a PUT request in the REST API to a file in a given HDFS path. :param request: the REST API PUT request :param hdfs_path: the HDFS path where to upload :return: the Flask Response confirming the upload
def plaso.tarzan.app.controllers.filemancontroller.FileManController.put_form | ( | self, | |
request, | |||
hdfs_path | |||
) |
Copy the content uploaded by a HTML FORM to a file in a given HDFS path. :param request: the request which contains the FORM upload :param hdfs_path: the HDFS path where to upload :return: the Flask Response confirming the upload
def plaso.tarzan.app.controllers.filemancontroller.FileManController.rm | ( | self, | |
hdfs_path = "" |
|||
) |
Get a Flask Response confirming a recursive removal of files and directories in a given HDFS path. :param hdfs_path: the HDFS path which should be removed including to remove its files and directories :return: the Flask Response of the confirmation of the remove
def plaso.tarzan.app.controllers.filemancontroller.FileManController.zip_put | ( | self, | |
request, | |||
hdfs_path = "" |
|||
) |
Extract the content of a ZIP archive uploaded by a PUT request in the REST API into a given HDFS path. :param request: the REST API PUT request :param hdfs_path: the HDFS path where to extract the ZIP archive :return: the Flask Response confirming the upload and extraction
def plaso.tarzan.app.controllers.filemancontroller.FileManController.zip_put_form | ( | self, | |
request, | |||
hdfs_path = "" |
|||
) |
Extract the content of a ZIP archive uploaded by a HTML FORM into a given HDFS path. :param request: the request which contains the FORM upload :param hdfs_path: the HDFS path where to extract the ZIP archive :return: the Flask Response confirming the upload and extraction
plaso.tarzan.app.controllers.filemancontroller.FileManController.hdfs |