pivot_reset_columns_operator

Pivots a dataframe and realigns its columns to remove any multi-indices.

class tasrif.processing_pipeline.pandas.pivot_reset_columns_operator.PivotResetColumnsOperator(level, **kwargs)

Pivots a dataframe and realigns its columns to remove any multi-indices.

__init__(level, **kwargs)

Creates a new instance of PivotResetColumnsOperator

Examples

>>> import pandas as pd
>>> from tasrif.processing_pipeline.pandas import PivotResetColumnsOperator
>>>
>>> df = pd.DataFrame([
...     [1, "2020-05-01 00:00:00", 1],
...     [1, "2020-05-01 01:00:00", 1],
...     [1, "2020-05-01 03:00:00", 2],
...     [2, "2020-05-02 00:00:00", 1],
...     [1, "2020-05-02 00:00:00", 2],
...     [2, "2020-05-02 01:00:00", 1]],
...     columns=['logId', 'timestamp', 'sleep_level'])
>>>
>>> df['timestamp'] = pd.to_datetime(df['timestamp'])
>>>
>>> op = PivotResetColumnsOperator(level=0, index='timestamp', columns='logId', values='sleep_level')
>>> op.process(df)[0]
.   timestamp               1       2
0       2020-05-01 00:00:00         1.0     NaN
1       2020-05-01 01:00:00         1.0     NaN
2       2020-05-01 03:00:00         2.0     NaN
3       2020-05-02 00:00:00         2.0     1.0
4       2020-05-02 01:00:00         NaN     1.0
Parameters
  • level (int or str) – Either the integer position or the name of the level to reset the columns to.

  • **kwargs – Arguments to pandas pivot function