Base

The base module contains the main abstractions used throughout the package.

class base.TypeInformation[source]

For a dataset, provides information on columns types, how they’re used, and any other potential identifiers.

TypeInformation is generated within infer.infer_types(), where small samples of each column are evaluated in a custom framework to understand what kind of data type the model is. The user may override data types, but it is recommended to do so within a JSON-AI config file.

Parameters:
  • dtypes – For each column’s name, the associated data type inferred.

  • additional_info – Any possible sub-categories or additional descriptive information.

  • identifiers – Columns within the dataset highly suspected of being identifiers or IDs. These do not contain informatic value, therefore will be ignored in subsequent training/analysis procedures unless manually indicated.