groupby_logger

Module that defines the Logger class

class tasrif.processing_pipeline.observers.groupby_logger.GroupbyLogger(groupby_args, method='')

GroupbyLogger class to log a dataframe after grouping

__init__(groupby_args, method='')

The constructor of the GroupbyLogger class will provide options to configure the operation using keyword arguments. The logging is invoked via the observe method and the data to be logged is passed to the observe method.

Parameters
  • groupby_args (String or list) – Arguments to pandas pd.groupby function

  • method (String) – Logging method to log the dataframe Options: “head”, “tail”, “info”, “first”, “last”

observe(operator, *data_frames)

Function that performs checks on operator and data frame before observation This observation is only performed on non-infrastructure operators

Parameters
  • operator (ProcessingOperator) – Processing operator which is observed

  • *data_frames (list of pd.DataFrame) – Variable number of pandas dataframes to be observed