dfply.summary_functions

Module Contents

dfply.summary_functions.mean(series)

Returns the mean of a series.

Args:
series (pandas.Series): column to summarize.

dfply.summary_functions.first(series, order_by=None)

Returns the first value of a series.

Args:
series (pandas.Series): column to summarize.
Kwargs:
order_by: a pandas.Series or list of series (can be symbolic) to order
the input series by before summarization.

dfply.summary_functions.last(series, order_by=None)

Returns the last value of a series.

Args:
series (pandas.Series): column to summarize.
Kwargs:
order_by: a pandas.Series or list of series (can be symbolic) to order
the input series by before summarization.

dfply.summary_functions.nth(series, n, order_by=None)

Returns the nth value of a series.

Args:
series (pandas.Series): column to summarize.
n (integer): position of desired value. Returns NaN if out of range.
Kwargs:
order_by: a pandas.Series or list of series (can be symbolic) to order
the input series by before summarization.
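
The order_by behaviour shared by first, last, and nth can be approximated in plain pandas. The sketch below is not dfply's implementation: the helper name nth_sketch is hypothetical, and it assumes that n is a zero-based position and that ordering means sorting the summarized series by the values of a second series with the same index.

```python
import numpy as np
import pandas as pd


def nth_sketch(series, n, order_by=None):
    """Hypothetical stand-in for nth: reorder, then pick position n."""
    if order_by is not None:
        # Sort the summarized column by the values of the ordering column.
        order = order_by.sort_values().index
        series = series.loc[order]
    try:
        return series.iloc[n]
    except IndexError:
        # Out-of-range positions yield NaN, as the Args note describes.
        return np.nan


price = pd.Series([30, 10, 20])
carat = pd.Series([0.9, 0.5, 0.7])

first_by_carat = nth_sketch(price, 0, order_by=carat)   # price at the smallest carat
last_by_carat = nth_sketch(price, -1, order_by=carat)   # price at the largest carat
missing = nth_sketch(price, 10)                         # position out of range
```

Under these assumptions, first and last are the special cases n=0 and n=-1 of the same reorder-then-index step.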
dfply.summary_functions.n(series)

Returns the length of a series.

Args:
series (pandas.Series): column to summarize.

dfply.summary_functions.n_distinct(series)

Returns the number of distinct values in a series.

Args:
series (pandas.Series): column to summarize.
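
The difference between n and n_distinct can be illustrated with the pandas equivalents of their descriptions. This is a sketch of the two quantities, not of dfply's code; the variable names are illustrative.

```python
import pandas as pd

cut = pd.Series(["Ideal", "Good", "Ideal", "Premium", "Good"])

# Length of the series, matching the description of n.
length = len(cut)

# Number of distinct values, matching the description of n_distinct.
distinct = cut.nunique()
```

Note that pandas' nunique skips missing values by default. Whether n_distinct counts them is not stated on this page, so check this against your installed version if the column contains NaN.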
dfply.summary_functions.IQR(series)

Returns the inter-quartile range (IQR) of a series.

The IQR is defined as the 75th percentile minus the 25th percentile of the values.

Args:
series (pandas.Series): column to summarize.
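
The definition above can be reproduced directly with pandas quantiles. This is a minimal sketch assuming pandas' default linear interpolation between data points; the page does not say which interpolation IQR uses.

```python
import pandas as pd

values = pd.Series([1, 2, 3, 4, 5, 6, 7, 8, 9])

# 0.75 quantile minus 0.25 quantile of the same series.
q75 = values.quantile(0.75)
q25 = values.quantile(0.25)
iqr = q75 - q25
```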
dfply.summary_functions.colmin(series)

Returns the minimum value of a series.

Args:
series (pandas.Series): column to summarize.

dfply.summary_functions.colmax(series)

Returns the maximum value of a series.

Args:
series (pandas.Series): column to summarize.

dfply.summary_functions.median(series)

Returns the median value of a series.

Args:
series (pandas.Series): column to summarize.

dfply.summary_functions.var(series)

Returns the variance of values in a series.

Args:
series (pandas.Series): column to summarize.

dfply.summary_functions.sd(series)

Returns the standard deviation of values in a series.

Args:
series (pandas.Series): column to summarize.
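
This page does not say whether var and sd use the sample (n - 1) or population (n) denominator. The sketch below shows both conventions with plain pandas, whose default is the sample form. Treat the assumption that dfply follows the pandas default as unverified.

```python
import pandas as pd

x = pd.Series([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])

# pandas default: ddof=1, i.e. divide the squared deviations by n - 1.
sample_var = x.var()
sample_sd = x.std()

# Population form: ddof=0, i.e. divide by n.
population_var = x.var(ddof=0)
population_sd = x.std(ddof=0)
```

In either convention the standard deviation is the square root of the corresponding variance.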