dctools.data.transforms.get_dataset_transform

dctools.data.transforms.get_dataset_transform(alias, metadata, dataset_processor, transform_name=None, config=None)

Retrieves the transformation pipeline for a dataset, selected by the dataset's alias and, optionally, an explicit transform name.

Parameters:
  • alias (str) – The alias of the dataset.

  • metadata (Dict[str, Any]) – Dictionary of dataset metadata, such as variable-renaming dictionaries and the variables to keep.

  • dataset_processor (DatasetProcessor) – The processor for distributed/lazy operations.

  • transform_name (str, optional) – The name of the transformation pipeline to build. If None, the default pipeline for the alias is inferred.

  • config (Dict[str, Any], optional) – Additional configuration, such as paths, precision, and ranges. Defaults to None.

Returns:

The configured transformation pipeline.

Return type:

CustomTransforms
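
The fallback described for transform_name (an explicit name is used as given, otherwise a default is chosen from the alias) can be illustrated with a small standalone sketch. It does not reproduce the dctools implementation: the registry DEFAULT_TRANSFORM_BY_ALIAS, the helper resolve_transform_name, the aliases, the pipeline names, and the error raised for an unknown alias are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Optional

# Hypothetical registry mapping a dataset alias to its default pipeline name.
# Neither the aliases nor the pipeline names come from dctools.
DEFAULT_TRANSFORM_BY_ALIAS: Dict[str, str] = {
    "example_gridded": "standardize",
    "example_tracks": "standardize_and_subset",
}


def resolve_transform_name(alias: str, transform_name: Optional[str] = None) -> str:
    """Return the explicit transform name if given, else the alias default."""
    if transform_name is not None:
        # An explicit name always takes precedence over the alias default.
        return transform_name
    if alias not in DEFAULT_TRANSFORM_BY_ALIAS:
        # Assumed behavior for this sketch: fail loudly on an unknown alias.
        raise ValueError(f"No default transform registered for alias {alias!r}")
    return DEFAULT_TRANSFORM_BY_ALIAS[alias]


if __name__ == "__main__":
    print(resolve_transform_name("example_gridded"))
    print(resolve_transform_name("example_gridded", "custom_pipeline"))
```

In this sketch, passing only the alias selects the registered default, while passing a second argument overrides it. How get_dataset_transform itself handles an unknown alias is not stated here, so the ValueError above is an assumption.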