dfply.summary_functions
Module Contents
-
dfply.summary_functions.mean(series)¶ Returns the mean of a series.
- Args:
- series (pandas.Series): column to summarize.
-
dfply.summary_functions.first(series, order_by=None)¶ Returns the first value of a series.
- Args:
- series (pandas.Series): column to summarize.
- Kwargs:
- order_by: a pandas.Series or list of series (can be symbolic) by which to order the input series before summarization.
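
The effect of order_by can be approximated in plain pandas. The snippet below is a minimal sketch and not dfply's implementation: `first_ordered` is a hypothetical helper, it accepts only a single ordering series (no lists, no symbolic arguments), and the sample values are made up for illustration.

```python
import pandas as pd


def first_ordered(series, order_by=None):
    """Hypothetical helper: first value of `series`, optionally after
    reordering it by the values of a single `order_by` series."""
    if order_by is not None:
        # Work with positions so that mismatched index labels do not matter.
        keys = pd.Series(order_by).reset_index(drop=True)
        # A stable sort keeps the original order among tied keys.
        positions = keys.sort_values(kind="mergesort").index.to_numpy()
        series = series.iloc[positions]
    return series.iloc[0]


# Illustrative data only.
prices = pd.Series([326, 327, 334, 335])
carats = pd.Series([0.23, 0.21, 0.29, 0.31])

first_as_stored = first_ordered(prices)
first_by_carat = first_ordered(prices, order_by=carats)
```

Without order_by the helper returns the value in the first row as stored; with order_by it returns the value from the row with the smallest ordering key.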
-
dfply.summary_functions.last(series, order_by=None)¶ Returns the last value of a series.
- Args:
- series (pandas.Series): column to summarize.
- Kwargs:
- order_by: a pandas.Series or list of series (can be symbolic) by which to order the input series before summarization.
-
dfply.summary_functions.nth(series, n, order_by=None)¶ Returns the nth value of a series.
- Args:
- series (pandas.Series): column to summarize.
- n (integer): position of desired value. Returns NaN if out of range.
- Kwargs:
- order_by: a pandas.Series or list of series (can be symbolic) by which to order the input series before summarization.
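
The out-of-range behaviour described for n can be sketched with pandas alone. This is an illustration under assumptions, not the library's code: `nth_or_nan` is a hypothetical helper, and it assumes positions are zero-based in the way pandas `iloc` counts them, which the text above does not state.

```python
import math

import pandas as pd


def nth_or_nan(series, n):
    """Hypothetical helper: value at position `n` (assumed zero-based),
    or NaN when `n` lies outside the series."""
    try:
        return series.iloc[n]
    except IndexError:
        # pandas raises IndexError for an out-of-bounds positional lookup.
        return float("nan")


values = pd.Series([10, 20, 30])

second = nth_or_nan(values, 1)   # position 1 is the second element
missing = nth_or_nan(values, 5)  # past the end of the series
```

Here `second` is the element at position 1, and `missing` is NaN, which can be checked with `math.isnan(missing)`.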
-
dfply.summary_functions.n(series)¶ Returns the length of a series.
- Args:
- series (pandas.Series): column to summarize.
-
dfply.summary_functions.n_distinct(series)¶ Returns the number of distinct values in a series.
- Args:
- series (pandas.Series): column to summarize.
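
The difference between the length of a series and its number of distinct values can be illustrated in plain pandas. This is a sketch with made-up data. How missing values are counted is not specified above, so the example avoids them; note that `Series.nunique()` excludes NaN by default.

```python
import pandas as pd

# Illustrative data only: four rows, three different labels.
cuts = pd.Series(["Ideal", "Premium", "Ideal", "Good"])

row_count = len(cuts)            # counts every row, duplicates included
distinct_count = cuts.nunique()  # counts each label once
```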
-
dfply.summary_functions.IQR(series)¶ Returns the inter-quartile range (IQR) of a series.
The IQR is defined as the 75th percentile minus the 25th percentile.
- Args:
- series (pandas.Series): column to summarize.
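
The definition can be worked through with pandas `Series.quantile`. This is a minimal sketch assuming pandas' default linear interpolation between data points; other interpolation rules can give slightly different quartiles.

```python
import pandas as pd

# Illustrative data only.
values = pd.Series([1, 2, 3, 4, 5, 6, 7, 8, 9])

q1 = values.quantile(0.25)  # 25th percentile
q3 = values.quantile(0.75)  # 75th percentile
iqr = q3 - q1               # spread of the middle half of the data
```

For these nine values the quartiles fall exactly on data points (3 and 7), so the IQR is 4.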
-
dfply.summary_functions.colmin(series)¶ Returns the minimum value of a series.
- Args:
- series (pandas.Series): column to summarize.
-
dfply.summary_functions.colmax(series)¶ Returns the maximum value of a series.
- Args:
- series (pandas.Series): column to summarize.
-
dfply.summary_functions.median(series)¶ Returns the median value of a series.
- Args:
- series (pandas.Series): column to summarize.
-
dfply.summary_functions.var(series)¶ Returns the variance of values in a series.
- Args:
- series (pandas.Series): column to summarize.
-
dfply.summary_functions.sd(series)¶ Returns the standard deviation of values in a series.
- Args:
- series (pandas.Series): column to summarize.
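
The entries for var and sd do not say whether the sample or the population formula is used. The sketch below shows both in plain pandas so the difference is visible. It assumes, without confirmation from the text above, that pandas defaults apply, in which case `Series.var()` and `Series.std()` use the sample formula (`ddof=1`).

```python
import pandas as pd

# Illustrative data only: mean is 5, squared deviations sum to 32.
values = pd.Series([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])

sample_var = values.var()            # divides by n - 1 (pandas default)
sample_sd = values.std()             # square root of the sample variance
population_var = values.var(ddof=0)  # divides by n
population_sd = values.std(ddof=0)
```

With eight values the sample variance is 32/7 and the population variance is 32/8, so the choice of formula matters most for short series.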