data_describe.misc.load_data

load_data(filepath, all_folders=False, **kwargs)

Creates a pandas data frame from a file path.

download_gcs_file(filepath, bucket=None, prefix=None, **kwargs)

Downloads files from Google Cloud Storage.

data_describe.misc.load_data.load_data(filepath, all_folders=False, **kwargs)

Creates a pandas data frame from a file path.

Parameters
  • filepath – The file path. Either a local file path or a Google Cloud Storage URI

  • all_folders – If True, searches for text files in nested folders. If False, looks for text files in the current folder

  • **kwargs – Keyword arguments to pass to the reader

Raises
  • ImportError – gcsfs not installed.

  • FileNotFoundError – File doesn’t exist.

Returns

A pandas data frame
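
The difference between the two all_folders settings can be illustrated with a small standard-library sketch. This is not the library's implementation: find_text_files is a hypothetical helper, and the assumption that "text files" means files ending in .txt is made here only for illustration.

```python
import tempfile
from pathlib import Path


def find_text_files(folder, all_folders=False):
    """Hypothetical helper (not part of data_describe): list .txt files.

    Assumes "text files" means files with a .txt extension.
    """
    root = Path(folder)
    # rglob descends into nested folders; glob stays in the given folder.
    matches = root.rglob("*.txt") if all_folders else root.glob("*.txt")
    return sorted(p.relative_to(root).as_posix() for p in matches if p.is_file())


# Build a throwaway folder with one top-level and one nested text file.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "sub").mkdir()
    (Path(tmp) / "a.txt").write_text("top level")
    (Path(tmp) / "sub" / "b.txt").write_text("nested")

    top_only = find_text_files(tmp)                      # all_folders=False
    nested_too = find_text_files(tmp, all_folders=True)  # all_folders=True
```

With all_folders=False only a.txt is found; with all_folders=True the nested sub/b.txt is found as well.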

data_describe.misc.load_data.download_gcs_file(filepath, bucket=None, prefix=None, **kwargs)

Downloads files from Google Cloud Storage.

Parameters
  • filepath – The file path

  • bucket – The Google Cloud Storage bucket that the file belongs to

  • prefix – Passed to list_blobs to limit the results to objects that have the specified prefix

  • **kwargs – Keyword arguments to pass to list_blobs

Returns

The shape file

Return type

shapefile_dir
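
This entry does not say how bucket and prefix are determined when they are left as None. As a hedged sketch, a caller who wants to pass them explicitly could split a Google Cloud Storage URI as follows. split_gcs_uri is a hypothetical helper and the URI is a made-up example; neither comes from data_describe.

```python
from urllib.parse import urlparse


def split_gcs_uri(uri):
    """Hypothetical helper (not part of data_describe).

    Splits a URI of the form gs://<bucket>/<prefix> into (bucket, prefix).
    """
    parsed = urlparse(uri)
    if parsed.scheme != "gs" or not parsed.netloc:
        raise ValueError(f"Not a Google Cloud Storage URI: {uri!r}")
    # The bucket is the network location; the prefix is the path
    # without its leading slash.
    return parsed.netloc, parsed.path.lstrip("/")


# Made-up URI used only for illustration.
bucket, prefix = split_gcs_uri("gs://example-bucket/reports/2020/")
```

The resulting values could then be supplied as the bucket and prefix arguments described above.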