Score Validation
We test the current scoring model against a mixed validation set of benchmark archetypes and a smaller number of measured/monitored homes. This page reports where the model aligns and where assumptions still dominate.
Sample size
10
MAE (pts)
5.7
RMSE (pts)
7.0
Correlation (r)
0.99
Confidence distribution
High10
Medium0
Low0
Measured homes: 4 / Benchmark homes: 6
Legend
High / Medium / Low are derived from current confidence metadata.
If the wrapper exposes numeric confidence, buckets are 80+ / 60–79 / <60.
Inferences flagged most often
All benchmark homes have full EPC data — no inferences required.
Avg inferred fields per home: 0.0 (0 = all fields provided)
Filter set:
Filtered MAE 5.7 · RMSE 7.0
Victorian Terrace (Pre-1930)
benchmarkExpected
16
Computed
18
Δ
2
Confidence: 100 · completeness — · ±—
Typical EPC D/E, legacy boiler
1950s Semi-Detached
benchmarkExpected
20
Computed
24
Δ
4
Confidence: 100 · completeness — · ±—
Typical EPC D, cavity unfilled
1960s Detached (Partial Retrofit)
benchmarkExpected
29
Computed
34
Δ
5
Confidence: 100 · completeness — · ±—
Partial cavity fill, condensing combi
1980s Semi
benchmarkExpected
28
Computed
35
Δ
7
Confidence: 100 · completeness — · ±—
Filled cavities, condensing boiler
2000s New Build
benchmarkExpected
33
Computed
41
Δ
8
Confidence: 100 · completeness — · ±—
Part L era fabric + condensing combi
ASHP + MVHR Retrofit
measuredExpected
63
Computed
63
Δ
0
Confidence: 100 · completeness — · ±—
Monitored smart-meter retrofit case
Deep Retrofit Terraced + Solar
measuredExpected
58
Computed
53
Δ
5
Confidence: 100 · completeness — · ±—
Post-retrofit meter data, UK social housing pilot
Passivhaus Retrofit Benchmark
measuredExpected
90
Computed
77
Δ
13
Confidence: 100 · completeness — · ±—
Passivhaus-certified retrofit, <15 kWh/m²/yr space heat
Flat with Electric Resistance Heating
benchmarkExpected
24
Computed
25
Δ
1
Confidence: 100 · completeness — · ±—
Urban flat baseline with high electricity demand
Modern Net-Zero New Build
measuredExpected
86
Computed
74
Δ
12
Confidence: 100 · completeness — · ±—
Operational net-zero home with monitored exports