anndict.ensure_label_consistency_main

anndict.ensure_label_consistency_main#

anndict.ensure_label_consistency_main(df, simplification_level='unified, typo-fixed')[source]#

Function to ensure label consistency across multiple columns in a DataFrame by mapping labels to a unified and simplified set.

Parameters:
df DataFrame

a DataFrame containing categorical columns across which to unify category labels (so that all columns share the same set of labels).

simplification_level str (default: 'unified, typo-fixed')

Qualitative direction about how to process the labels.

Return type:

tuple[DataFrame, dict]

Returns:

class:DataFrame containing the columns in df, now with labels shared across all columns class:dict containing the full mapping of original labels to the new, shared set of labels