Given What You Received — The Outcome Differential in London's Small-Sites Planning

The hardest boroughs split on delivery

The boroughs with the largest negative differentials do not share a single delivery record. Havering (−21.4pp differential, 61% of required delivery), Barking & Dagenham (−16.4pp, 66%) and Newham (−14.6pp, 61%) pair large negative differentials with delivery well below the level required of them. Croydon and Waltham Forest sit at comparable levels of small-sites restriction (−21.8pp and −15.3pp) yet clear their delivery test comfortably (Croydon at 160% of required, Waltham Forest at 119%) by delivering through strategic allocations rather than small sites. The same small-sites posture can sit beside opposite delivery records, so the label "hard" covers more than one situation.

The permissive end splits too

The same division repeats at the top of the differential. Hammersmith & Fulham (+16.3pp, 143%) and Bexley (+18.9pp, 106%) combine above-expectation small-sites approval with delivery above requirement. Richmond (+21.8pp, 60%), Kensington & Chelsea (+15.1pp, 63%) and Camden (+12.2pp, 53%) approve small sites more readily than their caseload predicts yet still deliver far below requirement. Permission at the front of the pipeline does not guarantee homes at the end of it: build-out lag, scheme viability, land supply and a borough's small absolute size all sit between a consent and a completion. A permissive planning department is not, on its own, a delivery engine.

Reading the four quadrants. The matrix sorts boroughs into four cells. Restrictive and under-delivering (upper-left; 11 boroughs): the largest negative differentials together with a delivery shortfall, and no sign the restriction corresponds to homes secured elsewhere. Restrictive and productive (upper-right; 6): hard on small sites while delivering through larger ones — the posture a borough can most readily defend. Permissive and under-delivering (lower-left; 10): generous on approval but low on delivery, which points the supply question away from the planning department. Permissive and productive (lower-right; 5). Borough membership is in the appendix table.
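The sorting rule can be sketched in a few lines of Python, using only the ten boroughs whose figures are quoted above. The cut-offs (a differential of zero and delivery at 100% of requirement) are assumptions made for illustration, since the essay does not state the exact thresholds, and the function and variable names are hypothetical.

```python
# Differential (pp) and delivery as % of requirement, as quoted in the text above.
BOROUGHS = {
    "Havering": (-21.4, 61),
    "Barking & Dagenham": (-16.4, 66),
    "Newham": (-14.6, 61),
    "Croydon": (-21.8, 160),
    "Waltham Forest": (-15.3, 119),
    "Hammersmith & Fulham": (16.3, 143),
    "Bexley": (18.9, 106),
    "Richmond": (21.8, 60),
    "Kensington & Chelsea": (15.1, 63),
    "Camden": (12.2, 53),
}

def quadrant(differential_pp, delivery_pct, delivery_cutoff=100.0):
    """Assumed rule: sign of the differential, delivery against a 100% cut-off."""
    posture = "Permissive" if differential_pp >= 0 else "Restrictive"
    record = "productive" if delivery_pct >= delivery_cutoff else "under-delivering"
    return f"{posture} and {record}"

for name, (diff, delivery) in BOROUGHS.items():
    print(f"{name:22s} {diff:+6.1f}pp {delivery:4d}%  {quadrant(diff, delivery)}")
```

Under those assumed cut-offs, Croydon and Havering share a posture but land in different cells, which is the split the section describes.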

How to read this for each borough. The matrix is a framing device. A local authority can ask which quadrant it occupies, and whether its small-sites posture helps explain its delivery record or its delivery is coming from somewhere its small-sites posture has little to do with. A developer can ask whether a borough combines an attractive risk-adjusted permission environment at the front of the pipeline with credible delivery prospects at the end of it and, if not, which of the two is the binding constraint on a scheme like theirs. The same cell can arise from different mechanisms, so the quadrant points each reader to the follow-up question worth their time rather than dictating an answer. The implications sections (§8 for authorities, §9 for developers) take those two questions in turn.

What the matrix can and cannot show. Its delivery axis is all-sites while its permissiveness axis is small-sites, so a "restrictive and productive" borough is delivering through larger sites. The matrix locates that delivery elsewhere rather than vindicating its small-sites posture. The HDT figures span roughly 2019–2022 and the differential spans 2022–2026; delivery lags the decisions that produce it, so the two axes are not contemporaneous and a borough's position can move. Borough-level small-sites completions are not published consistently or collated centrally by the GLA. Individual borough monitoring does report them (Russell Curtis cites Croydon small-site net completions rising from 770 in 2012/13–2016/17 to 1,965 in 2017/18–2021/22, with Barnet next at 710), but there is no clean, comparable London-wide series, and that absence is itself an accountability gap. Quadrant placement is descriptive: delivery is shaped by viability, land supply, build-out rates and borough size as much as by planning posture.

What better data would change. Were borough-level small-sites completions ever published on a consistent, London-wide basis, the delivery axis could be rebuilt on a like-for-like, small-sites basis, and the matrix would move. The restrictive and productive quadrant would likely thin, as boroughs that clear their all-sites test through strategic allocations (Croydon, Waltham Forest) lose the credit that currently lifts them. The restrictive and under-delivering reading of the affordable outer boroughs would, if anything, sharpen. And the permissive and under-delivering quadrant would finally become testable, turning on whether permissive small-sites approval actually converts into small-sites completions. The matrix's qualitative message, that permissiveness and delivery are distinct, survives that data swap; the specific quadrant placements, especially for the all-sites beneficiaries, may not.

Can the delivery axis be made small-sites-specific?

The obvious objection to the matrix is that its delivery axis is all-sites while its permissiveness axis is small-sites. Can the Housing Delivery Test be disaggregated to fix that? On the available data, no. The HDT is a single net-completions total per borough, with no site-size dimension to split; apportioning it by each borough’s small-sites target share is circular, returning the overall HDT ratio for the small-sites slice and adding nothing; and a genuine scheme-level split would need completions records carrying unit counts or site areas — the London Development Database, which undercounts small schemes, or individual borough monitoring, which is not compiled on a comparable London-wide basis. There is currently no direct route from the published HDT to a small-sites completions figure, though we are looking into proxies to approximate this missing data.
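The circularity of the apportionment route is easiest to see written out. The symbols are introduced here for illustration and are not the essay's own notation: C is a borough's net completions, T its total requirement, and s the small-sites share of that requirement.

```latex
\hat{C}_{\mathrm{ss}} = s\,C, \qquad T_{\mathrm{ss}} = s\,T
\quad\Longrightarrow\quad
\frac{\hat{C}_{\mathrm{ss}}}{T_{\mathrm{ss}}} = \frac{s\,C}{s\,T} = \frac{C}{T}
```

The share s cancels, so the apportioned small-sites ratio is the overall HDT ratio under another name.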

What the data does support is a reliance weighting, shown as the SS % of target column in the appendix table. The London Plan sets each borough a small-sites target (sites below 0.25 hectares) and that target is a very different share of the whole from one borough to the next: from about 12% of Barking & Dagenham’s requirement to 84% of Richmond’s, median roughly 30%. The same HDT result therefore means different things in different boroughs. Richmond fails its delivery test while approving small sites readily and depending on them for 84% of its target, so its shortfall is almost entirely a small-sites build-out problem, not a consenting one. Barking & Dagenham, Greenwich and Newham also fail, but lean on small sites for an eighth to a sixth of their targets, meaning that their shortfall sits with larger sites, and their restrictive small-sites posture, genuine as it is, is not the main driver of the underperformance. The reliance share does not measure small-sites delivery; it tells you how much of a borough’s delivery record a small-sites reading can claim.

A £60k preparation spend is in the order of 1% of a Westminster scheme's residual, roughly 11% of a Havering scheme's and over 20% of a Croydon one. On simple ROI grounds, applicants in more affordable boroughs have reason to underinvest in preparation relative to those in expensive ones. They are no less capable; the system simply makes preparation pay less there. This is the "regressive cost asymmetry" reading: a mechanism in which the visible cross-sectional pattern is produced upstream of any decision, by a filter that screens applicants on their ability to absorb preparation costs as a fraction of residual.
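The arithmetic behind those percentages can be run backwards to show how large a gap in residual value they imply. The sketch below assumes a flat £60k spend and the quoted shares (1%, 11%, and 20% as a floor for Croydon's "over 20%"); the implied residuals are back-of-envelope derivations from those figures, not values reported in the dataset.

```python
PREP_COST = 60_000  # GBP, the preparation spend quoted in the text

# Share of residual absorbed by that spend, as quoted. Croydon is "over 20%",
# so 0.20 is a lower bound on the share (and gives an upper bound on the residual).
QUOTED_SHARE = {"Westminster": 0.01, "Havering": 0.11, "Croydon": 0.20}

def implied_residual(cost, share):
    """Residual value that makes `cost` equal to `share` of it."""
    return cost / share

for borough, share in QUOTED_SHARE.items():
    residual = implied_residual(PREP_COST, share)
    print(f"{borough:12s} share {share:4.0%}  implied residual ~£{residual:,.0f}")

# Ratio of the burden at the affordable end to the burden at the expensive end.
gradient = QUOTED_SHARE["Croydon"] / QUOTED_SHARE["Westminster"]
print(f"Burden gradient, Croydon vs Westminster: about {gradient:.0f}x")
```

The same fixed cost is a rounding error against a residual in the millions and a material fraction of one in the low hundreds of thousands, which is all the ROI argument needs.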

Two pieces of internal evidence are consistent with this mechanism operating in the visible window. First, the rising share of "insufficient information" and "insufficient detail" refusals in affordable boroughs (see §6) is the direct fingerprint of under-prepared applications. Second, the withdrawal-rate asymmetry (also §6) is consistent with expensive-borough applicants having consultant intelligence that affordable-borough applicants lack. Neither is a clean test of the mechanism in isolation, but both are consistent with it.

Why neither pure mechanism exhausts the cross-section

First, the Bexley–Havering pair sits within the same applicant-pool quality distribution at identical price points and produces a 40pp gap. The agent-quality reading is weak here: Bexley sustains higher outcomes on observably similar inputs.

Second, the Croydon political-revocation case is the hardest piece of evidence to reconcile with a pure applicant-quality interpretation. Croydon's applicant pool and the agents serving it did not change on 25 July 2022. The same applicants, prosecuted by the same agents, facing the same residual economics on the same sites, were approved at 56.3% pre-revocation and at 32.8% post-revocation.

The most defensible reading is therefore that the cross-section is generated by multiple mechanisms in unknown proportions. A substantial share of the affordability correlation is likely the regressive economics of preparation and the agent-quality concentration described above. A further share is likely downstream borough decision posture. The relative weight of the two mechanisms is not recoverable from this dataset alone; it remains a hypothesis warranting further work rather than a finding this essay establishes.

A within-agent test

The confound noted above, that the same price-approval correlation is produced by both the decision-culture and the applicant-quality mechanisms, can be partially tested by holding agent identity constant across boroughs. If the differential were entirely an artefact of agent sorting (better agents concentrating in permissive, expensive boroughs), then fixing the agent and varying the borough should shrink the estimated borough effect towards zero. Conversely, if the differential survives once agent identity is controlled for, decision-culture is doing independent work.

We identified 29 named planning and design agents with ≥5 applications across ≥2 boroughs in the LSSPD (199 applications; 13 boroughs). On this sub-sample we ran two logistic regressions: one with agent fixed effects, one without, on identical observations and covariates (site type, conservation, PTAL band, unit count).

The results are unambiguous on rank-ordering but mixed on magnitude. Borough rankings are perfectly stable: Spearman ρ = 1.0 between the with-FE and without-FE borough coefficients; the same boroughs come out hardest and easiest regardless of whether agent identity is controlled. Individual coefficients shift (H&F, Tower Hamlets, and Southwark all have larger raw differentials than within-agent differentials, consistent with better agents concentrating there), but the rank ordering does not. A non-parametric check reinforces this reading: 79% of qualifying multi-borough agents show >10pp different approval rates across the boroughs they themselves work in. An agent achieving 65% approval in one borough achieves around 45% working with a similar brief across the boundary.
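How coefficients can move while the ordering holds is easiest to see on a toy example. The values below are hypothetical, chosen only to illustrate the pattern, and are not the estimated coefficients; the hand-rolled Spearman function assumes no tied values.

```python
def ranks(values):
    """Rank positions (1 = smallest); assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result

def spearman_rho(x, y):
    """Spearman rank correlation for untied data: 1 - 6*sum(d^2) / (n(n^2-1))."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))

# HYPOTHETICAL borough coefficients (log-odds), for illustration only.
without_fe = [-0.90, -0.40, 0.10, 0.55, 1.20]
with_fe = [-0.70, -0.35, 0.05, 0.30, 0.80]  # magnitudes shrink, order unchanged

rho = spearman_rho(without_fe, with_fe)
largest_shift = max(abs(a - b) for a, b in zip(without_fe, with_fe))
print(f"Spearman rho = {rho:.2f}; largest coefficient shift = {largest_shift:.2f}")
```

A rank correlation of 1.0 therefore says nothing about whether magnitudes changed, which is why the text reports the two separately.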

Two caveats bound the interpretation. First, the sub-sample is thin: 29 agents, 199 applications, 13 boroughs. The key hard boroughs (Havering, Croydon, Waltham Forest, Newham) have no qualifying multi-borough agents and are therefore absent from the within-agent test. Second, 63% of the actor dataset has no resolved agent identity; the selected named-agent sub-sample over-represents the sophisticated end of the market, which is the very segment the agent-quality reading most concerns.

Within those bounds, this is evidence. The borough ranking is not an artefact of which agents happen to work there. The same agent, working the same types of scheme, achieves systematically different outcomes across boroughs. That is directional evidence for decision-culture and against a pure applicant-quality reading. The essay's characterisation of "multiple mechanisms in unknown proportions" remains correct; these results shift the weighting of the available evidence toward the decision-culture mechanism.

The inverted agent premium. A related finding from the same actor dataset needs care. Named professional agents (planning firms, architecture practices) approve at roughly 50%, while applications with no named agent approve at 61%. The gap is a selection artefact: named agents disproportionately take on the harder schemes — contested backland, conservation areas, refusal-risk sites in the difficult outer boroughs — so the 11pp shortfall reflects scheme difficulty rather than agent underperformance. The correlation between the agent premium and the borough differential is r = −0.59 (p = 0.04): the premium is most negative in the most restrictive boroughs. This fits the applicant-quality mechanism that sophisticated agents concentrate in difficult boroughs, work uphill against structural restrictiveness, and still lose more often than unrepresented applicants prosecuting easier schemes. This warns against reading raw agent-approval rates as a quality signal.

Regressive outcome, not regressive intent. We use "regressive" throughout in the technical economic sense, describing the distributional shape of an outcome rather than the intent of any borough or official. A borough responding to local preferences expressed through its democratic processes can produce a city-wide regressive outcome without anyone acting in bad faith: the diagnostic question is about effects rather than motives. We do not claim, and the data does not support, that any borough is deliberately producing a regressive outcome.

How the refusal-reason composition varies

Across the London-wide dataset, design (DES) is the primary ground in 35.5% of first refusal reasons, the single largest category but not a majority. (A reclassification of the reason text in June 2026 corrected an earlier keyword coding that had over-counted design at 54.8% by routing ambiguous reasons to it.) Design genuinely dominates in the wealthier inner boroughs (Camden 58%, Kensington & Chelsea 59%), with Havering at 46%, but far less elsewhere: Croydon, once read as 78% design, runs at 36% once parking and policy reasons are separated out; Richmond runs at 31%; and Newham, which refuses far more on policy and space standards, at just 16%. Design is the most discretionary category in the policy toolkit, the most subjective, and the hardest to adjudicate on appeal; the categories the earlier coding buried are transport/parking and policy.

§8. Implications for Local Authorities and for central government

This section is addressed to two readers in turn: first the authority reading its own number, then the government deciding what, if anything, to do about the pattern. The order is deliberate. The differential is built from each borough's own decisions, and the first claim on its meaning belongs to the borough.

8.1 Three readings of a negative differential

A borough sitting below expectation has at least three accounts available to it. They are not mutually exclusive, and the data in this essay cannot apportion them for any individual Authority. Each carries a different implication, but each is testable against evidence the borough already holds.

The first is deliberate, plan-led restraint. A borough whose adopted plan expresses a settled, democratically endorsed position on dispersed intensification and which applies that position consistently will sit below the London average by choice. That is not the development management function failing but rather the function doing what the plan instructs. The test of this reading is coherence, and §7 sets it out: refusals that cite the plan's policies accurately, fall consistently on the typologies the plan constrains, and are broadly upheld by Inspectors on appeal. A borough that passes that test holds a Category 1 position: a lawful, defensible differential whose proper forum is plan examination, not the development management statistics.

The second is caseload quality on the dimensions the model cannot see. The differential controls for four observable characteristics; it is blind to preparation. A borough receiving systematically less-prepared applications (thinner supporting information, less pre-application engagement, less experienced agents) will sit below expectation without deciding any harder than its peers. The test of this reading is in the refusal record: a rising share of insufficient-information refusals, validation failures, and schemes lost on matters a competent submission would have resolved before it arrived.

The third is capacity. A development management service under sustained resourcing pressure can drift below expectation without any policy choice at all: less pre-application capacity to repair weak schemes before submission, less officer time to negotiate amendments rather than refuse, more decisions taken on the papers as received. The test here is procedural: determination times, amendment rates, the balance of negotiated approvals against first-pass refusals.

An Authority that can say which of these accounts, in what proportion, explains its own number has already met the standard §11 proposes (explained variation) and met it on its own terms.

8.2 What the differential offers a borough

Used internally, the differential is management information of a kind no published statistic currently provides: a mix-adjusted account of the authority's own decision record, benchmarked against what its caseload would have produced under London-average behaviour. Four uses follow directly. In the Authority Monitoring Report, it gives the service a like-for-like measure to sit alongside the raw approval rate and a defensible answer to the member, journalist or Inspector who quotes the raw rate without the caseload behind it. In local plan evidence, the typology-level patterns beneath the headline show where the adopted plan is being applied as intended and where outcomes have drifted from the position the plan expresses, providing either evidence of consistency for examination or early warning of a gap between the plan as written and the plan as applied. In member briefings, the three readings above give officers a vocabulary for the borough's position that neither concedes failure nor refuses the question. And at appeal, an Authority whose refusals are consistent with its differential, its plan and its Inspectorate record can evidence a coherent decision posture rather than defending each case as though it stood alone.

8.3 The finding that protects the restrictive boroughs

The most protective finding in this dataset belongs to the hardest boroughs: their refusals are upheld on appeal. Across the matched appeal cohort (§9.6), the most restrictive boroughs are overturned at roughly sixteen per cent of decided appeals, which is below the permissive boroughs' twenty-four, and below the national minor-residential rate. Whatever else a persistent negative differential means, in these boroughs it does not mean capricious decision-making that the Inspectorate corrects on review; it is consistent with a coherent position, consistently applied, that survives independent scrutiny. An Authority confident in the first reading above will find its strongest evidence here, and is entitled to use it. The usual caveat travels with the number: only around six per cent of refusals are appealed, so the sample is selected by applicant judgement. But the cases that reach the Inspectorate are those applicants judged most vulnerable to challenge, which makes the upheld share harder, not easier, to dismiss.

8.4 The burden runs both ways

The cost asymmetry this essay describes is usually read from the applicant's side: preparation pays least where values are lowest, so the least-prepared applications concentrate in the most affordable boroughs. Read from the Authority's side of the counter, the same mechanism is a workload statement. The development management teams serving the most affordable boroughs are processing a rising share of applications that arrive unready; the insufficient-information trend of §6 is its visible edge. That work carries officer hours, a negotiation burden and appeal exposure, typically on thinner resourcing than their inner-London counterparts. An Authority in this position is not the author of the asymmetry but one of the parties carrying its cost. A policy response that treats the asymmetry solely as an applicant grievance has misread where the burden falls — which is why the proportionality test proposed at 8.11 is as much a protection for the boroughs asked to administer each new requirement as for the applicants asked to meet it.

8.5 Access to the underlying evidence

The number in this essay should not be the last word on any authority. Any London borough wishing to interrogate its own differential may request its case-level extract and the full methodology annexe, without charge. The differential is offered as the beginning of an account each authority can give in its own terms, on its own evidence, not as a verdict delivered from outside it.

The remainder of this section turns from the authority reading its own number to central government and the GLA deciding what, if anything, the pattern asks of them. For that reader, the differential works as a diagnostic rather than an accusation, and the policy frame it points to is a transparency regime rather than a central-override one.

8.6 Treat the outcome differential as a published transparency instrument

Calculate the differential annually, publish it, and read it alongside local plans. Its function is as a standing question put to each authority: your caseload predicted X; you produced Y; account for the difference. The point of publication is not to shame boroughs but to give the democratic process the information it needs to mediate between local preference and strategic policy.

To make that concrete: publish the differential each year alongside the borough's small-sites monitoring, and ask each authority to attach a one-page note answering three questions. (1) Which quadrant of the delivery matrix are we in — what is our small-sites posture, and what is our delivery record? (2) What explains our position — caseload, viability, land supply, decision posture, or some mix? (3) What, if anything, are we doing to change it? This is deliberately not a performance league table, and it should not be administered as one: a borough in the "restrictive and productive" quadrant may give a complete and creditable answer. It is a standing prompt for democratic accountability, so that a borough's own voters can see the trade-off it is making between local character and city-wide supply, and judge it on terms the borough itself has set out.

8.7 Use the Mayor's s.24(7) conformity opinion as the principal lever

The Mayor's call-in and direction-to-refuse powers, expanded by Potential Strategic Importance (PSI) Category 3J, cover 50+ home schemes; the 50-home floor deliberately excludes the 1–9 unit tier this essay tracks. The s.24(7) conformity opinion is the appropriate instrument where a borough's adopted plan has materially diverged from London Plan Policies H2 and D3. It is not, on this evidence, an instrument that should be used as a routine override of borough discretion in the absence of policy divergence.

8.8 Let appeal outcomes carry their full evidential weight

If a meaningful share of the differential is officer or member risk-aversion, the cost of unjustified refusal must rise on appeal. The practical reform is a transparency measure rather than a legislative one. Government should require boroughs to publish, alongside their annual monitoring reports, the proportion of their small-site refusals overturned at appeal and the costs awarded against them.

8.9 Reframe affordability targets around the supply geography

This is a policy judgement, not a finding the data dictates. If raising housing supply in the most affordable boroughs is a policy objective, as capital-wide affordability ambitions imply, then persistent negative differentials in exactly those boroughs deserve particular scrutiny: on this evidence, the places where homes are most affordable to deliver are among those where they are hardest to consent relative to caseload. Where a borough's local plan has explicitly chosen that outcome, the mediation runs through s.24(7); where it has not, the gap is at least a question the borough should be asked to answer. The data establishes the gap; whether it should concern us is a judgement about what the planning system is for.

8.10 Learn from the affordable-and-permissive boroughs

Bexley, Sutton, and Enfield produce constructive small-sites outcomes at the most affordable end of the market. The Bexley–Havering forty-point gap at identical price points is the single most actionable data point. It should be read as evidence that the cross-section is not destiny, rather than as grounds for criticising Havering.

8.11 Act on the regressive cost asymmetry

Using Land Registry transaction prices, the preparation cost burden on a 5-unit scheme already varies roughly twenty-fold across London, from ~1% of residual in K&C and Westminster to ~21% in Croydon. It is now a measured gradient rather than a mechanism still to be tested. Government should commission a cumulative-cost impact assessment of small-site validation requirements, expressed as a percentage of expected residual value across borough price quartiles, and publish it. The regressive outcome of additional validation requirements would then be established.

The same logic should run forward as well as back. Every new validation mandate on small sites (biodiversity net gain is the live example) should pass a small-sites proportionality test before adoption: model the preparation cost the mandate imposes as a percentage of expected residual value across borough price quartiles, and check whether it screens out a materially larger share of would-be applicants at the affordable end than at the expensive end. A requirement that costs a Westminster scheme a deductible afternoon and a Havering scheme a fifth of its residual is not neutral, however neutral it looks on the face of the regulation. The test does not pre-empt the policy; it makes the distributional shape visible before, rather than after, it is imposed.
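One way the cost side of such a test might be operationalised is sketched below. Everything numeric in it is hypothetical: the mandate cost, the quartile residuals and the 5x tolerance are placeholders chosen for illustration. A real assessment would also need to model how many applicants the cost screens out, which this sketch does not attempt.

```python
def cost_share_by_quartile(mandate_cost, residual_by_quartile):
    """Mandate cost as a share of expected residual value in each price quartile."""
    return {q: mandate_cost / residual for q, residual in residual_by_quartile.items()}

def passes_proportionality(shares, max_ratio):
    """True if the heaviest burden is no more than `max_ratio` times the lightest."""
    return max(shares.values()) / min(shares.values()) <= max_ratio

# HYPOTHETICAL inputs, for illustration only (GBP).
MANDATE_COST = 8_000
RESIDUALS = {
    "Q1 (most affordable)": 300_000,
    "Q2": 600_000,
    "Q3": 1_500_000,
    "Q4 (most expensive)": 6_000_000,
}

shares = cost_share_by_quartile(MANDATE_COST, RESIDUALS)
for quartile, share in shares.items():
    print(f"{quartile:22s} {share:.2%} of residual")
print("passes at 5x tolerance:", passes_proportionality(shares, max_ratio=5))
```

Where the tolerance should sit is a policy choice; the point of the sketch is only that the gradient can be computed before a mandate is adopted.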

Quadrant (examples)	What it tells you	The question to put to the site
Permissive and productive (Bexley, H&F, Wandsworth, Westminster, Sutton)	Front-of-pipeline odds and delivery both sit above what the caseload mix predicts: the cleanest risk-adjusted environment on this evidence.	The binding constraint is land price and competition, not consentability. You are not the only party who can read this; assume the environment is already in the price.
Permissive and under-delivering (Richmond, K&C, Camden, Southwark, Hackney, Islington, Ealing, Enfield, Merton, Haringey)	Permission is easier than the borough average implies, but the matrix locates the supply constraint after consent, not at the committee.	Stress-test the appraisal and the delivery programme hardest (viability, build-out, finance period), not the approval odds. Ask what kills schemes here once they are consented.
Restrictive and productive (Croydon, Waltham Forest, Brent, Barnet, Hounslow, Harrow)	Delivery runs through strategic allocations; small sites face a harder committee and may contribute little to the borough's headline delivery numbers.	Does the scheme align with the local plan the borough is actually delivering against? Here the local plan is the brief, and policy alignment carries more weight than design ambition.
Restrictive and under-delivering (Havering, Newham, B&D, Lewisham, Redbridge, Kingston, Bromley, Tower Hamlets, Lambeth, Greenwich, Hillingdon)	Lowest approval relative to mix and a delivery shortfall, with no sign the restriction is buying homes elsewhere; on this evidence, the most demanding of the four quadrants for a small-site applicant.	Is this scheme genuinely well-prepared and plan-compliant, with a read on how the borough has decided comparable schemes, and a fallback strategy? And is the risk reflected in the land price? This is the affordable end of the market, where preparation discipline plausibly carries the most weight.

Appendix

Methodology, Literature, and Limitations

A.0 Where this essay sits in the literature

This essay does not aim to displace the academic planning-discretion literature; it aims to add a borough-level descriptive ranking with verifiable methodology. Principal antecedents: Hilber & Vermeulen (2016, Economic Journal); Cheshire & Hilber (2008, Economic Journal); Bramley (1989 et seq, Environment and Planning A); Adams & Watkins (2014, Town Planning Review); Burgess, Monk & Whitehead (2011, Town Planning Review); Lord, Mell & Henneberry (2015); Murdoch & Abram (2002); Hilber & Lyytikäinen (2017, Journal of Public Economics). This essay is a policy essay informed by that literature, not a peer-reviewed contribution to it.

A.1 Data and cohort

LSSPD master.pkl build of 4 June 2026: 15,533 applications across 33 boroughs, 11,689 decided. City of London (n=7) excluded; effective cohort 11,689 decided across 32 boroughs. Withdrawals 915 of 15,533 records (5.9%) excluded from rate calculations; the asymmetric withdrawal pattern (12.2% expensive vs 6.9% affordable) makes the published cross-sectional differential a lower bound on the true outcome differential. Croydon pre-revocation cohort from GLA Planning London Datahub, 2,087 decided. Cross-source filter-harmonisation diagnostics within 3% on cohort size and 0.4pp on rate.

A.2 The outcome differential

Logistic regression on site type, PTAL band, conservation status, unit count, fit on the pooled 32-borough cohort with no borough fixed effects. The differential is the borough's observed approval rate minus the mean of predicted probabilities across its caseload. Two specifications (the headline + a borough-fixed-effects version on n ≈ 6,650 with complete covariates); differentials correlate at r = 0.97. The known limitation: the four covariates cannot exhaust unobserved scheme quality. The residual sweeps up unobserved quality alongside borough behaviour. The argument is therefore not "the differential measures behaviour" but "the differential is informative about outcome divergence under controls."
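The definition reduces to a short calculation once the pooled model has produced a predicted approval probability for each application. The sketch below assumes those probabilities are already available; the six-application caseload is hypothetical and the function name is illustrative.

```python
def outcome_differential(approved, predicted):
    """Observed approval rate minus mean predicted probability, in percentage points."""
    if len(approved) != len(predicted) or not approved:
        raise ValueError("need one prediction per decided application")
    observed = sum(approved) / len(approved)
    expected = sum(predicted) / len(predicted)
    return 100 * (observed - expected)

# HYPOTHETICAL caseload: 1 = approved, 0 = refused, with pooled-model predictions.
approved = [1, 0, 0, 1, 0, 0]
predicted = [0.62, 0.55, 0.48, 0.70, 0.51, 0.44]

print(f"{outcome_differential(approved, predicted):+.1f}pp")
```

A negative value means the borough approved less often than the pooled model expected for that mix of schemes; it does not by itself say why.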

A.3 Robustness and inference

SEs for the four hardest boroughs: Croydon ±1.6, Havering ±2.3, Lambeth ±2.9, Barking & Dagenham ±4.0. Clustering caveat: SEs are not adjusted for clustering at borough × year, case officer, or repeat-agent level. Officer or agent clustering would have a larger effect but is not estimable without case-officer coverage across all boroughs. Treat reported SEs as a lower bound on true uncertainty. Croydon pre-vs-post: χ²(1) ≈ 135.8; 95% CI on difference [+19.7pp, +27.3pp]; the p-value assuming independent decisions is overwhelmingly significant, but that independence assumption overstates the effective sample size given clustering.

A.4 The affordability relationship

Pearson r ≈ +0.5, n=30 (Fisher-z transformation): a moderate relationship. It describes outcome shape and does not identify any particular mechanism.
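For a sense of how wide the uncertainty around that coefficient is, the Fisher z-transformation gives an approximate 95% interval. The sketch takes r = 0.5 and n = 30 from the line above and assumes the boroughs can be treated as independent observations, which is an assumption of the illustration rather than a claim of the essay.

```python
import math

def fisher_ci(r, n, z_crit=1.96):
    """Approximate confidence interval for a Pearson r via the Fisher z-transformation."""
    z = math.atanh(r)
    se = 1 / math.sqrt(n - 3)
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)

low, high = fisher_ci(0.5, 30)
print(f"r = 0.50, n = 30: 95% CI roughly [{low:.2f}, {high:.2f}]")
```

With only 30 points the interval is wide, which fits the description of the relationship as moderate rather than tight.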

A.5 Refusal-reason composition

DES (design) = 35.5% of first refusal reasons (1,416 of 3,991), after a reclassification of the reason text (June 2026) that corrected an earlier keyword coding which over-counted design at 54.8% by routing ambiguous reasons to it. Per-borough: Camden 58%, K&C 59%, Havering 46%, Croydon 36%, Richmond 31% — with Newham at just 16%. Design is the single largest primary reason but not a majority; transport/parking and policy are materially larger than the original codes showed.

A.6 The dynamic findings

Wealth–approval Pearson r: 0.66 (2023) → 0.63 (2024) → 0.55 (2025). Hard–soft spread: 55pp → 43pp. London-wide approval: 52% → 61%, with 2026 partial reversion to 57%. INF/INS share of refusals: affordable 4.4% → 5.7%; expensive 0.0% → 2.2% (refused base). Withdrawal asymmetry: 12.2% (expensive) vs 6.9% (affordable) over 2023–2024.

A.7 The Croydon shift

Pre-revocation: n=2,087, 56.3% approval. Post-revocation: n=878, 32.8%. Difference 23.5pp, a period-average contrast, not a discontinuity estimate. Approval trajectory: 69.7% (2019) → 55.7% (2020) → 47.8% (2021) → 52.5% (Jan–May 2022) → 44.2% (Jun–Jul 2022 pre-revocation) → 32.8% (post-period average) → 24.6% (2023) → 41.7% (2025). Outer-London-ex-Croydon synthetic benchmark: 51.0% (2023), 54.1% (2024), 57.8% (2025). Croydon sits 18–25pp below peers across the post-revocation window.

Pre-SPD context (Russell Curtis, RCKa, "Come Back to Croydon", russellcurtis.com, 31 May 2026, drawing on Croydon's monitoring): small-site approvals 307 homes (2015) → 555 (2016) → over 1,200 (2017 peak; ~400 approved in Q4 2017 alone, before the SPD was drafted) → 1,005 (2018) → 898 (2019). Small-site net completions rose from 770 (2012/13–2016/17) to 1,965 (2017/18–2021/22), Barnet next at 710. The approval surge thus predates the SPD (adopted April 2019) by roughly two years, which is why this essay treats the design guide and its 2022 revocation as expressions of Croydon's political culture rather than causes of the swing.

A.8 Principal limitations

The differential is a residual, not a measurement of behaviour. The four covariates cannot exhaust unobserved scheme quality.
Agent quality and repeat-player effects. The single largest unobserved confound. Agent-quality concentration produces the same correlation between price and outcome as the regressive cost-asymmetry mechanism, so the two are observationally equivalent in the cross-section. The within-agent regression (A.9) is the first direct sub-sample test: borough rankings are stable at Spearman ρ = 1.0 with and without agent FEs on the 29-agent, 199-application sub-sample. This weighs toward decision-culture without closing the question; the sub-sample is thin and excludes the hardest boroughs (Newham, Havering, Croydon).
Allocation-stage selection. Boroughs route cases by ward, site type, complexity and seniority, so any within-borough variation in outcomes is consistent with discretion but also with routing that itself encodes selection on the variables that predict outcomes.
Asymmetric withdrawal filtration. 5.6% of all applications are excluded from rate calculations; expensive boroughs withdraw at nearly twice the rate of affordable boroughs (12.2% vs 6.9% over 2023–2024, A.6). Filtration is asymmetric in the direction that makes the published cross-sectional differential a lower bound on the true outcome differential.
The Croydon evidence is suggestive, not formally identified. Pre-period non-stationary; synthetic benchmark is a peer-group comparison, not a synthetic-control estimator in the Abadie sense.
Compliance-cost figures. Calibrated against 174,049 Land Registry transactions (2024–25): preparation cost as a percentage of the 5-unit residual ranges from ~1% (K&C/Westminster) to ~21% (Croydon), r = −0.47 with the differential. The gradient is measured; causal attribution is not established.
Damaged fields excluded: committee-route flag (effectively zero); approved-unit counts (high-outlier contamination); determination time (statutory-clock artifact, 22.7% at exactly day 56).
Temporal validity. Differentials are averages over 2022–2026. Three live instruments most likely to shift the 2026–2027 cross-section: draft new London Plan (expected summer 2026), December 2024 NPPF revision, PSI Category 3J (live 11 May 2026, 50-home floor).
Small-sites definition: unit count, not site area. The LSSPD tracks the 1–9 unit tier; the London Plan's small-site definition is spatial (sites below 0.25 hectares). Physically small plots proposing ten or more homes — e.g. the December 2017 Purley / Russell Hill approval, three houses replaced by thirty flats at 364 hr/ha — fall outside our filter, so the cross-section under-captures small-plot demolish-and-rebuild. Russell Curtis (RCKa) raises the same definitional gap against Centre for Cities / GLA analyses that counted only minor (sub-10-home) applications.
The Croydon regime predated its formal instrument. Curtis ("Come Back to Croydon", russellcurtis.com, 31 May 2026) shows Croydon's small-site approvals peaked in 2017, before the SPD was drafted, attributing the permissive era to political culture from the Labour administration's 2014 re-election rather than to the design guide. We therefore read the SPD and its revocation as expressions of Croydon's political culture, not causes, and present the 23.5pp pre/post fall as a period-average contrast rather than a discontinuity estimate.
What this essay does not attempt. Formal identification requires a synthetic-control or difference-in-differences (DiD) design; clustered SEs on the cross-section; and an explicit identification strategy for the cost-asymmetry mechanism. The agent fixed-effects analysis (A.9) is a partial step.

A.9 Additional robustness analyses

Land Registry hedonic. 174,049 London transactions (2024–25), HM Land Registry Price Paid Data. Borough Gross Development Value (GDV) proxy = median flat price (new-build where n≥20; all flats otherwise). 5-unit residual: GDV − build cost (£2,500/sqm × 50sqm) − 20% margin − £15k/unit fees. Prep-cost share vs differential: r = −0.47; GDV vs differential: r = +0.48.
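The residual arithmetic can be sketched as follows, using the stated build cost, margin and fees together with the £60k preparation cost given in A.11. Two things are assumptions here: that the 20% margin is taken on GDV, and that the preparation-cost share is the preparation cost divided by the whole-scheme residual. The £250,000 and £1,000,000 GDV inputs are hypothetical, not borough values from the dataset.

```python
def prep_cost_share(
    gdv_per_unit,
    units=5,
    build_cost_per_sqm=2_500,
    unit_area_sqm=50,
    margin_rate=0.20,
    fees_per_unit=15_000,
    prep_cost=60_000,
):
    """Preparation cost as a share of the scheme residual for a small scheme."""
    build_cost = build_cost_per_sqm * unit_area_sqm
    margin = margin_rate * gdv_per_unit  # ASSUMPTION: margin is charged on GDV
    residual_per_unit = gdv_per_unit - build_cost - margin - fees_per_unit
    scheme_residual = units * residual_per_unit
    if scheme_residual <= 0:
        raise ValueError("scheme residual is not positive; share is undefined")
    return prep_cost / scheme_residual


# Hypothetical GDV values, chosen only to show the direction of the gradient.
low_value_share = prep_cost_share(250_000)
high_value_share = prep_cost_share(1_000_000)
```

Because the preparation cost is fixed while the residual scales with price, the share falls steeply as GDV rises, which is the regressive pattern the gradient describes.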

Energy Performance Certificate (EPC) scheme-quality proxy. 78,086 new-build EPC registrations matched to 31 boroughs via outward postcode. Hard boroughs (diff < −8pp): median 70sqm/94% flats; permissive (>+8pp): 67sqm/96% flats. Floor-area–differential r = −0.14; flat-share r = −0.09. Observable scheme quality does not vary systematically with difficulty.

Agent fixed effects: within-agent cross-borough regression. Sub-sample: 3,408 decided applications in small_site_with_actors_v2.pkl with agent data; 29 named agents meet the thresholds of ≥2 boroughs and ≥5 applications (199 applications, 13 boroughs). Two logistic regressions are fitted on identical observations. Model A (with agent FEs): is_approved ~ C(Borough) + C(agent) + C(site_type) + C(cons_category) + C(PTAL_band) + proposed_units_n. Model B (without agent FEs): the same covariates minus the agent dummies. Reference borough: Croydon. Results: Spearman ρ = 1.0 between the two borough-coefficient vectors, so the rank ordering is perfectly stable. Pseudo-R² rises from 0.170 (Model B) to 0.390 (Model A): agents explain substantial variance, but this does not change which boroughs rank harder or easier. Borough coefficient magnitudes do shift: H&F, Tower Hamlets and Southwark have systematically larger raw than within-agent coefficients (confound estimates −1.8, −2.1, −1.2 log-odds respectively), consistent with better agents concentrating in those boroughs; Sutton and Lewisham show a positive confound (weaker agents working there). Non-parametric check: 79% of the 29 qualifying agents show >10pp cross-borough approval-rate variation within their own portfolios. Key caveat: 63% of actor-dataset rows have no resolved agent identity, and the hardest boroughs (Newham, Havering, Croydon) are absent from the within-agent test. Inverted agent premium: applications with a named agent are approved at 50.0% vs 61.3% for no-agent applications, a selection-bias artefact (named agents prosecute harder schemes in harder boroughs); the correlation between agent-premium magnitude and borough differential is r = −0.59 (p = 0.04, n=14 boroughs with data).
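The rank-stability check compares the borough coefficients from the two models by rank, not by magnitude. A minimal sketch of that comparison, assuming the coefficients have already been extracted into dictionaries keyed by borough, is below. The borough labels and coefficient values are hypothetical placeholders, and the Spearman implementation (Pearson correlation on average ranks) is a standard textbook form, not necessarily the routine the analysis used.

```python
import math
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return ranks


def pearson(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rho: Pearson correlation of the rank-transformed inputs."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical borough coefficients (log-odds relative to a reference borough).
model_b = {"Borough A": 0.9, "Borough B": 1.6, "Borough C": 0.4, "Borough D": 2.3}
model_a = {"Borough A": 0.5, "Borough B": 0.7, "Borough C": 0.1, "Borough D": 1.1}

boroughs = sorted(model_b)
rho = spearman([model_b[b] for b in boroughs], [model_a[b] for b in boroughs])
```

In this toy example every coefficient shrinks once agent effects are added, yet ρ stays at 1.0 because the ordering is unchanged. That is the distinction the text draws between shifting magnitudes and a stable ranking.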

Appeal overturn. 458 decided appeals (96 allowed) matched from LSSPD. London overturn 21.0%. Hard boroughs ~16% vs permissive ~24% (borough means, ≥5-appeal boroughs); r = +0.40. ~6% of refusals appealed; per-borough n small.

National PS2 replication. MHCLG PS2 open data (downloaded June 2026, 2023–25, LPAs with ≥200 decisions, n=284). LA prices: LR PPD 2024–25. National r(log price, grant rate) = −0.30; London-only r = +0.67; non-London r = −0.21. London gradient does not replicate nationally.

PLD backfill (2019–2022). GLA PLD API, 43,667 decided residential apps, 32 boroughs, 16 quarterly pulls. Borough-year panel merged with LSSPD: extended_borough_year_panel.csv. Pre/post borough correlation r = +0.607. Note: broader filter vs LSSPD — level comparisons cautioned; trajectory/rank analysis robust.

A.10 The delivery matrix (§3)

The second axis pairs each borough's outcome differential (small-sites approvals) with its Housing Delivery Test result (homes delivered as a percentage of homes required, all sites; MHCLG 2023 measurement, covering roughly 2019–2022). Bubble size in the chart is the London Plan 2021 small-sites annual target (Table 4.2). Borough-level small-sites completions are not published — the GLA reports completions by borough or by site size but never crosses them, and the London Development Database undercounts small schemes — so the delivery axis is necessarily all-sites. The HDT consequence column is the statutory result: Pass, Buffer (20% buffer applied), Action plan, or Presumption (the tilted balance of NPPF para 11 engaged). Quadrant thresholds: differential = 0 and delivery = 100%. The SS % of target column is the small-sites annual target as a share of the borough's total annual housing requirement (London Plan Table 4.2 over the MHCLG requirement) — a site-area-based proxy (the small-sites target is the sub-0.25 ha component of the requirement) for how much of each borough’s delivery record a small-sites reading can honestly claim. It is not a measure of small-sites completions, which are not published on a comparable basis.

Borough	Differential (pp)	HDT (% of required)	HDT consequence	Small-sites target (homes/yr)	SS % of target	Quadrant
Croydon	-21.8	160%	Pass	641	43%	Restrictive and productive
Havering	-21.4	61%	Presumption	314	23%	Restrictive and under-delivering
Lambeth	-16.8	74%	Presumption	400	34%	Restrictive and under-delivering
Waltham Forest	-15.3	119%	Pass	359	32%	Restrictive and productive
Newham	-14.6	61%	Presumption	380	16%	Restrictive and under-delivering
Hounslow	-13.1	108%	Pass	280	19%	Restrictive and productive
Kingston	-10.5	48%	Presumption	225	26%	Restrictive and under-delivering
Tower Hamlets	-8.8	92%	Action plan	528	15%	Restrictive and under-delivering
Barnet	-8.7	104%	Pass	434	21%	Restrictive and productive
Brent	-7.6	131%	Pass	433	21%	Restrictive and productive
Bromley	-6.4	58%	Presumption	379	66%	Restrictive and under-delivering
Harrow	-3.2	101%	Pass	375	53%	Restrictive and productive
Lewisham	-1.9	32%	Presumption	379	26%	Restrictive and under-delivering
Hillingdon	-1.7	91%	Action plan	295	31%	Restrictive and under-delivering
Redbridge	-0.5	39%	Presumption	368	37%	Restrictive and under-delivering
Greenwich	-0.4	48%	Presumption	301	12%	Restrictive and under-delivering
Wandsworth	+4.0	112%	Pass	414	24%	Permissive and productive
Hackney	+7.1	88%	Action plan	658	56%	Permissive and under-delivering
Enfield	+7.3	76%	Buffer	353	32%	Permissive and under-delivering
Merton	+8.7	89%	Action plan	261	32%	Permissive and under-delivering
Islington	+11.3	83%	Buffer	484	70%	Permissive and under-delivering
Camden	+12.2	53%	Presumption	328	34%	Permissive and under-delivering
Haringey	+13.8	99%	Pass	260	19%	Permissive and under-delivering
Westminster	+13.9	129%	Pass	504	58%	Permissive and productive
Sutton	+15.0	114%	Pass	268	64%	Permissive and productive
Ealing	+15.5	84%	Buffer	424	22%	Permissive and under-delivering
Southwark	+15.5	82%	Buffer	601	29%	Permissive and under-delivering
Bexley	+18.9	106%	Pass	305	50%	Permissive and productive
Richmond	+21.8	60%	Presumption	234	84%	Permissive and under-delivering

Tallies: Restrictive and under-delivering 11; Restrictive and productive 6; Permissive and under-delivering 10; Permissive and productive 5. Differential source: LSSPD build 4 June 2026 (corrected PTAL). HDT source: MHCLG Housing Delivery Test 2023 measurement. Small-sites targets: London Plan 2021 Table 4.2.
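The quadrant assignment follows mechanically from the two thresholds stated above (differential = 0, delivery = 100%). The sketch below applies them to four rows copied from the table. How a borough sitting exactly on a threshold is treated is an assumption: here a differential of exactly 0 counts as restrictive and delivery of exactly 100% counts as productive.

```python
from collections import Counter


def quadrant(differential_pp, hdt_percent):
    """Assign a borough to a delivery-matrix quadrant from its two axis values."""
    # ASSUMPTION: boundary cases (0pp, 100%) are resolved as noted in the lead-in.
    posture = "Permissive" if differential_pp > 0 else "Restrictive"
    delivery = "productive" if hdt_percent >= 100 else "under-delivering"
    return f"{posture} and {delivery}"


# (differential in pp, HDT % of required) for four rows of the table above.
rows = {
    "Croydon": (-21.8, 160),
    "Havering": (-21.4, 61),
    "Haringey": (13.8, 99),
    "Bexley": (18.9, 106),
}
assigned = {borough: quadrant(d, h) for borough, (d, h) in rows.items()}
tallies = Counter(assigned.values())
```

Haringey shows why the delivery threshold matters: at 99% it passes the Housing Delivery Test but still falls on the under-delivering side of the 100% line.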

A.11 Notes and sources

Figures drawn from the LSSPD and the GLA/PLD cohorts are sourced in the chart footers and methodology notes above. The notes below cover figures drawn from outside that dataset.

SME builders' share of new homes (~40% in the 1980s to ~10–12% today): Home Builders Federation, Reversing the Decline of Small Housebuilders (2017), and Federation of Master Builders house-builder surveys.
Sir Oliver Letwin, Independent Review of Build Out: Final Report (MHCLG, October 2018).
The London Plan 2021 (Greater London Authority), Policy H2 and Table 4.2 — ten-year small-sites target of 119,250 net dwellings, ~23% of the capital's total housing target.
Planning Inspectorate, appeals statistics: allowed rate for minor residential appeals (England), approximately 24%.
Greenaway-McGrevy & Phillips, "The impact of upzoning on housing construction in Auckland", Journal of Urban Economics (2023). The estimated 14–35% rent effect is contested — see Coleman / Motu Economic and Public Policy Research (2024).
Minneapolis 2040 outcomes: city building-permit data (2020–2023) and analysis by the Pew Charitable Trusts (2024).
California Department of Housing and Community Development, ADU permit data (2017–2022); SB9 uptake reported by the Terner Center for Housing Innovation, UC Berkeley (2023).
Housing starts: Japan, Ministry of Land, Infrastructure, Transport and Tourism (~940,000/yr); UK, MHCLG / ONS net additional dwellings (~194,000/yr).
Residual-model assumptions for the preparation-cost premium (5-unit scheme: GDV − build cost at £2,500/sqm × 50 sqm − 20% developer margin − £15k/unit fees; £60k assumed preparation cost) are set out in A.9. GDV proxy from 174,049 Land Registry Price Paid transactions, 2024–25.
MHCLG live planning-application statistics (PS2 open data), 2023–25, LPAs with ≥200 decisions; LA-level prices from Land Registry Price Paid Data, 2024–25.

Contents

Executive summary

What the evidence supports and what it does not

§1. The finding

Reading the outcome differential

§2. The ranking

The five hardest boroughs

The five most permissive boroughs

§3. The delivery matrix

The measure we wanted and the measure we can get

The hardest boroughs split on delivery

The permissive end splits also

Can the delivery axis be made small-sites-specific?

§4. The affordability problem

The Bexley–Havering pair: a counterexample

The affordable-and-permissive trio

§5. What the differential is (and is not) evidence of

The agent-quality and repeat-player confound

The regressive economics of preparation

Why neither pure mechanism exhausts the cross-section

A within-agent test

How the refusal-reason composition varies

§6. Two forces, one outcome

The dynamic trend

Year by year

The candidate structural mechanism in operation

What this means for the policy frame

§7. The decisive test: local plans against the London Plan

7.1 What the London Plan actually says

7.2 Two categories of difficult borough

A note on local democratic legitimacy

7.3 Croydon: a closely-watched political shift

The political timeline

The before-and-after numbers

7.4 Havering and Newham: the policy and the politics

7.5 Westminster and Richmond: high approval as a post-filter outcome

§8. Implications for Local Authorities and for central government

8.1 Three readings of a negative differential

8.2 What the differential offers a borough

8.3 The finding that protects the restrictive boroughs

8.4 The burden runs both ways

8.5 Access to the underlying evidence

8.6 Treat the outcome differential as a published transparency instrument

8.7 Use the Mayor's s.24(7) conformity opinion as the principal lever

8.8 Let appeal outcomes carry their full evidential weight

8.9 Reframe affordability targets around the supply geography

8.10 Learn from the affordable-and-permissive boroughs

8.11 Act on the regressive cost asymmetry

§9. Implications for developers

9.1 The borough is an acquisition input

9.2 Read your borough's quadrant before fixing the brief

9.3 Do not read the ranking as a static verdict

9.4 In hard boroughs, the local plan is the brief

9.5 The gradient is not fixed

9.6 Treat the appeal as a narrow instrument

9.7 Understand how the borough decides

§10. What this means for housebuilding

§11. What level of variation is acceptable?

§12. The question the differential cannot answer