Effect Sizes¶
Effect sizes quantify the magnitude of differences in standardized units, making comparisons meaningful across different scales.
Cohen's d¶
Feature Name: cohens_d
where the pooled standard deviation is: $$ s_{\text{pooled}} = \sqrt{\frac{(n_1-1)s_1^2 + (n_2-1)s_2^2}{n_1 + n_2 - 2}} $$
Interpretation (Cohen's Conventions)¶
| Value | Interpretation |
|---|---|
| |d| ≈ 0.2 | Small effect |
| |d| ≈ 0.5 | Medium effect |
| |d| ≈ 0.8 | Large effect |
| |d| > 1.0 | Very large effect |
Properties¶
- Range: (-∞, +∞)
- Independent of sample size (unlike p-values)
Why Useful
Cohen's d provides a standardized measure of how "different" the post-break segment is from the pre-break segment. It won't show artificial significance with large samples.
Used by: All 25 detectors
Glass's Delta¶
Feature Name: glass_delta
Key Difference from Cohen's d¶
Uses only the pre-break standard deviation as denominator. Appropriate when:
- The break is expected to change variance
- Pre-break represents "normal" regime
Why Useful
Captures how many pre-break standard deviations the mean has shifted — relevant when the "before" period represents baseline behavior.
Used by: mlp_ensemble_deep_features, meta_stacking_7models
Hedges' g¶
Feature Name: hedges_g
Why Use Hedges' g?¶
- Bias-corrected version of Cohen's d
- Cohen's d slightly overestimates effect in small samples
- More accurate for small samples (n < 20)
Used by: meta_stacking_7models
Cliff's Delta¶
Feature Name: cliffs_delta
Interpretation¶
| Value | Interpretation |
|---|---|
| δ = 0 | Groups overlap completely |
| δ = 1 | All values in group 2 exceed group 1 |
| δ = -1 | All values in group 1 exceed group 2 |
Effect Size Thresholds¶
| Range | Interpretation |
|---|---|
| |δ| < 0.147 | Negligible |
| |δ| < 0.33 | Small |
| |δ| < 0.474 | Medium |
| |δ| ≥ 0.474 | Large |
Why Useful
Non-parametric effect size — makes no assumptions about distribution shape. Robust to outliers.
Used by: meta_stacking_7models