validation
Data validation helpers shared across pipeline stages.
Functions:
| Name | Description |
|---|---|
| `validate_groupby_column` | Validate that the configured group-by column exists and has no missing values. |
| `validate_orderby_column` | Validate that the configured order-by column exists. |
validate_groupby_column(data, group_by)
Validate that the configured group-by column exists and has no missing values.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `data` | `DataFrame \| Collection[str]` | A DataFrame or collection of column names to validate against. | required |
| `group_by` | `str \| None` | Name of the configured grouping column. | required |
Raises:
| Type | Description |
|---|---|
| `ParameterError` | If |
| `DataError` | If |
Source code in src/nemo_safe_synthesizer/data_processing/validation.py
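The checks described for `validate_groupby_column` can be sketched as follows. This is an illustrative re-implementation, not the library's source: the exception classes are hypothetical stand-ins, the function name `sketch_validate_groupby_column` is made up, and three behaviors are assumptions inferred from the summary above: `None` skips validation, `ParameterError` signals a missing column, and `DataError` signals missing values.

```python
class ParameterError(ValueError):
    """Hypothetical stand-in for the library's ParameterError."""


class DataError(ValueError):
    """Hypothetical stand-in for the library's DataError."""


def sketch_validate_groupby_column(data, group_by):
    """Illustrative sketch only; the real function may behave differently."""
    # Assumption: None means no grouping is configured, so there is nothing to check.
    if group_by is None:
        return

    # A DataFrame exposes its column names via .columns; otherwise
    # treat `data` itself as the collection of column names.
    columns = list(getattr(data, "columns", data))
    if group_by not in columns:
        raise ParameterError(f"Group-by column {group_by!r} not found in data.")

    # Missing values can only be checked when actual data (not just names) is given.
    if hasattr(data, "columns") and data[group_by].isna().any():
        raise DataError(f"Group-by column {group_by!r} contains missing values.")
```

When only column names are passed, the sketch can verify existence but not completeness, which is presumably why the real function accepts both input types.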
validate_orderby_column(data, order_by)
Validate that the configured order-by column exists.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `data` | `DataFrame \| Collection[str]` | A DataFrame or collection of column names to validate against. | required |
| `order_by` | `str \| None` | Name of the configured ordering column. | required |
Raises:
| Type | Description |
|---|---|
| `ParameterError` | If |
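A minimal sketch of the existence check described for `validate_orderby_column`, under stated assumptions: `ParameterError` here is a hypothetical stand-in class, the name `sketch_validate_orderby_column` is invented for illustration, and treating `None` as "no ordering configured" is a guess based on the `str | None` type.

```python
class ParameterError(ValueError):
    """Hypothetical stand-in for the library's ParameterError."""


def sketch_validate_orderby_column(data, order_by):
    """Illustrative sketch only; the real function may behave differently."""
    # Assumption: None means no ordering is configured, so there is nothing to check.
    if order_by is None:
        return

    # Accept either a DataFrame-like object (has .columns) or plain column names.
    columns = list(getattr(data, "columns", data))
    if order_by not in columns:
        raise ParameterError(
            f"Order-by column {order_by!r} not found; available columns: {columns}"
        )


# Column names alone are enough for this check, so it can run before data is loaded.
sketch_validate_orderby_column(["user_id", "timestamp"], "timestamp")
```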