Statistical Models

Pure statistical approaches without machine learning.

Overview

Model               ROC AUC   F1 Score   Train Time
Hypothesis Testing  0.5394    0.4167     0 s
Bayesian BOCPD      0.5005    0.0625     183 s
welch_ttest         0.4634    0.0000     0 s

Key Advantage: No Training

Training-Free

These models don't require training data. They use statistical theory to detect breaks directly.
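A minimal sketch of what "training-free" means in practice: given a candidate break point, a single statistical test decides whether the two sides of the series differ. The function name and the known split point are assumptions for illustration, not part of the benchmarked models.

```python
# Training-free break check on a 1-D series, assuming the candidate
# break point is already known. No model fitting or labeled data needed.
import numpy as np
from scipy import stats

def break_pvalue(series, split):
    """p-value for a mean shift between series[:split] and series[split:]."""
    left, right = series[:split], series[split:]
    # Welch's t-test: H0 = same mean; a small p-value suggests a break.
    _, p = stats.ttest_ind(left, right, equal_var=False)
    return p

rng = np.random.default_rng(0)
no_break = rng.normal(0.0, 1.0, 200)
with_break = np.concatenate([rng.normal(0.0, 1.0, 100),
                             rng.normal(1.5, 1.0, 100)])

print(break_pvalue(no_break, 100))
print(break_pvalue(with_break, 100))
```

Because the decision comes straight from statistical theory, deployment is immediate: no train/validation split, no model artifacts to load.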

Use Cases

When to Use Statistical Models

  • Baseline comparison — Benchmark for ML models
  • Interpretability required — p-values and test statistics are understandable
  • No training data — Deploy immediately without labeled examples
  • Real-time streaming — Low latency, no model loading

When to Avoid

  • Higher AUC needed — ML models achieve substantially higher AUC and robust scores
  • Complex patterns — Statistical tests assume specific distributions

Theory Behind the Tests

Hypothesis Testing Framework

\[ H_0: \text{No structural break (same distribution)} \]
\[ H_1: \text{Structural break exists (different distributions)} \]

Key Test Statistics

Test                 What It Measures
Welch's t-test       Mean difference significance
Kolmogorov-Smirnov   Maximum CDF difference
Mann-Whitney U       Rank-based comparison
Levene's test        Variance equality
CUSUM                Cumulative deviation from the mean

Key Findings

Statistical Models as Baseline

hypothesis_testing_pure serves as a baseline showing the gap between pure statistics and ML approaches.

Tree-based models achieved higher AUC (0.7423 vs 0.5394) and robust scores than statistical methods.