data_describe.core.summary

data_summary(data, as_percentage: bool = False, auto_float: bool = True, compute_backend=None)

Summary statistics and data description.

mode1(x)

Mode (counts only) by Warren Weckesser.

class data_describe.core.summary.SummaryWidget(input_data=None, info_data=None, summary_data=None, as_percentage: Optional[bool] = False, auto_float: Optional[bool] = True, **kwargs)

Bases: data_describe._widget.BaseWidget

Container for data summary.

This class (object) is returned from the data_summary function. The attributes documented below can be accessed or extracted.

input_data

The input data.

info_data

Information about the data shape and size.

summary_data

The summary statistics.

Type

DataFrame

as_percentage

If True, display counts as percentage over total

Type

bool

auto_float

If True, apply formatting to float values

Type

bool

show(self, viz_backend=None, as_percentage: Optional[bool] = None, auto_float: Optional[bool] = None, **kwargs)

The default display for this output.

Displays the summary information.

Parameters
  • viz_backend – The visualization backend.

  • as_percentage (bool) – If True, display counts as percentage over total

  • auto_float (bool) – If True, apply formatting to float values

  • **kwargs – Keyword arguments.

Returns

The correlation matrix plot.

data_describe.core.summary.data_summary(data, as_percentage: bool = False, auto_float: bool = True, compute_backend=None)

Summary statistics and data description.

Parameters
  • data – The dataframe

  • as_percentage (bool) – If True, display counts as percentage over total

  • auto_float (bool) – If True, apply formatting to float values

  • compute_backend – The compute backend.

Returns

The dataframe with metrics in rows

data_describe.core.summary.mode1(x)

Mode (counts only) by Warren Weckesser.

https://stackoverflow.com/questions/46365859/what-is-the-fastest-way-to-get-the-mode-of-a-numpy-array

Parameters

x – Input array