sqs_score
Classes:
| Name | Description |
|---|---|
| SQSScore | Synthetic Quality Score -- weighted aggregate of quality sub-metrics. |
SQSScore
pydantic-model
Bases: CompositeScore
Synthetic Quality Score -- weighted aggregate of quality sub-metrics.
Combines column distribution stability, correlation stability, deep structure stability, text semantic similarity, and text structure similarity into a single 0--10 score weighted by the number of tabular vs. text columns.
Fields:
- score (EvaluationScore)
- name (str)
from_components(components, name='Synthetic Quality Score')
staticmethod
Compute the SQS from a list of quality sub-metric components.
Source code in src/nemo_safe_synthesizer/evaluation/components/sqs_score.py
get_overall_synthetic_data_quality_score(field_correlation_stability, principal_component_stability, field_distribution_stability, text_semantic_similarity, text_structure_similarity, tabular_cols, text_cols)
staticmethod
Compute the overall SQS from individual sub-metric scores.
The tabular SQS is a weighted combination of correlation, distribution, and PCA stability. The text SQS blends semantic and structural similarity. The final score weights tabular and text SQS by their respective column counts.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| field_correlation_stability | float \| None | Correlation stability sub-score (0--10). | required |
| principal_component_stability | float \| None | PCA stability sub-score (0--10). | required |
| field_distribution_stability | float \| None | Distribution stability sub-score (0--10). | required |
| text_semantic_similarity | float \| None | Semantic similarity sub-score (0--10). | required |
| text_structure_similarity | float \| None | Structural similarity sub-score (0--10). | required |
| tabular_cols | int | Number of tabular columns in the dataset. | required |
| text_cols | int | Number of text columns in the dataset. | required |
Returns:
| Type | Description |
|---|---|
| EvaluationScore | A finalized |
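
The weighting described for get_overall_synthetic_data_quality_score can be sketched roughly as follows. This is an illustration only, not the library's implementation: the function name `sketch_overall_sqs`, the helper `weighted_mean`, the equal sub-metric weights, and the choice to skip `None` sub-scores are all assumptions, and the sketch returns a plain float rather than an EvaluationScore.

```python
from typing import Iterable, Optional, Tuple


def weighted_mean(pairs: Iterable[Tuple[Optional[float], float]]) -> Optional[float]:
    """Weighted mean over (value, weight) pairs.

    Pairs whose value is None or whose weight is not positive are skipped
    (an assumption made for this sketch). Returns None if nothing is usable.
    """
    usable = [(v, w) for v, w in pairs if v is not None and w > 0]
    total_weight = sum(w for _, w in usable)
    if total_weight == 0:
        return None
    return sum(v * w for v, w in usable) / total_weight


def sketch_overall_sqs(
    field_correlation_stability: Optional[float],
    principal_component_stability: Optional[float],
    field_distribution_stability: Optional[float],
    text_semantic_similarity: Optional[float],
    text_structure_similarity: Optional[float],
    tabular_cols: int,
    text_cols: int,
) -> Optional[float]:
    """Hypothetical stand-in for the overall SQS computation (0--10 scale)."""
    # Tabular SQS: combination of correlation, distribution, and PCA stability.
    # Equal weights are assumed here; the real weights are not documented above.
    tabular_sqs = weighted_mean([
        (field_correlation_stability, 1.0),
        (field_distribution_stability, 1.0),
        (principal_component_stability, 1.0),
    ])
    # Text SQS: blend of semantic and structural similarity (equal weights assumed).
    text_sqs = weighted_mean([
        (text_semantic_similarity, 1.0),
        (text_structure_similarity, 1.0),
    ])
    # Final score: tabular and text SQS weighted by their column counts.
    return weighted_mean([
        (tabular_sqs, float(tabular_cols)),
        (text_sqs, float(text_cols)),
    ])


# Three tabular columns scoring 10 and one text column scoring 0.
print(sketch_overall_sqs(10.0, 10.0, 10.0, 0.0, 0.0, tabular_cols=3, text_cols=1))  # 7.5
```

Under these assumptions, a dataset with no text columns reduces to the tabular SQS alone, and a dataset with no usable sub-scores yields `None` rather than a number.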