pgmpy.datasets.list_datasets#

pgmpy.datasets.list_datasets(**filter_tags) list[str][source]#

Returns a list of all available datasets, optionally filtered by a query string.

Parameters:
**filter_tagsoptional arguments

If specified, returns only datasets matching the provided tag filters. Any dataset tag can be used as a filter. Available tags:

  • n_variables

  • n_samples

  • has_ground_truth

  • has_expert_knowledge

  • has_missing_data

  • is_simulated

  • is_interventional

  • is_discrete

  • is_continuous

  • is_mixed

  • is_ordinal

Returns:
list of str

A sorted list of available dataset names.

Examples

>>> from pgmpy.datasets import list_datasets
>>> list_datasets()
['abalone_continuous', 'abalone_mixed', ..., 'sachs_continuous', ...]
>>> list_datasets(is_discrete=True, has_ground_truth=True)
['sachs_discrete']