Ridge ASCM — Augmented Synthetic Control (Ben-Michael, Feller & Rothstein 2021)#

Estimator:: Vanilla Synthetic Control (VanillaSC) — the ridge-augmentation layer (mlsynth.utils.bilevel.ridge_augment.ridge_augment_weights()).
Source:: Ben-Michael, Feller & Rothstein (2021), “The Augmented Synthetic Control Method,” JASA 116(536); reference implementation: the augsynth R package (ebenmichael/augsynth).
Replication type:: Cross-validation — mlsynth matched value-for-value to augsynth on its canonical Kansas study — and Path B — the paper’s Section-7 coverage / bias-reduction simulation.
Status:: Fully verified — empirical ladder and simulation reproduced.

Validation strategy#

The Augmented SCM is a bias-correction layer, so it is validated against the authors’ own R package, augsynth, on its flagship empirical example: the effect of Kansas’s 2012 tax cuts on quarterly log GDP per capita. augsynth walks up a ladder of estimators of increasing de-biasing — plain SCM, ridge ASCM, ridge ASCM with auxiliary covariates (balanced directly), and the residualized covariate variant — and the measured effect grows while the pre-treatment imbalance falls. We reproduce that ladder cell by cell, then reproduce the paper’s Section-7 Monte Carlo (Path B).

Cross-validation — the Kansas ladder#

The treated unit is Kansas (FIPS 20); treatment begins in 2012 Q2, leaving \(T_0 = 89\) pre-period quarters and \(J = 49\) donor states. The covariate model is augsynth’s documented spec,

covsyn <- augsynth(lngdpcapita ~ treated | lngdpcapita + log(revstatecapita) +
                     log(revlocalcapita) + log(avgwklywagecapita) +
                     estabscapita + emplvlcapita,
                   fips, year_qtr, kansas, progfunc = "ridge", scm = TRUE)

with each covariate transformed per row and aggregated to one pre-period mean per unit.

How that aggregation treats missing values decides the covariate cells, so it is worth being explicit. The two revenue series are reported annually and so are absent from 56 of the 89 pre-treatment quarters. augsynth’s extract_covariates passes na.action = NULL to model.frame, keeping every row, and then averages each covariate on its own with mean(x, na.rm = TRUE) — missing values are omitted per covariate, not per period. Averaging instead over the quarters in which every covariate is reported discards those 56 quarters from all six series rather than from the two that are sparse, which moves the covariate ASCM’s ATT from \(-0.0609\) to \(-0.0663\). mlsynth follows the per-covariate rule.

The whole ladder is reproduced through mlsynth’s public API – Augmented SCM is a mode of VanillaSC (augment="ridge"). Covariates are passed by column name; the user applies augsynth’s per-row log transforms to the DataFrame first (mlsynth’s covariate convention). residualize=True selects the residualized variant:

import numpy as np, pandas as pd
from mlsynth import VanillaSC

df = pd.read_csv("basedata/kansas_ascm.csv")          # long fips x quarter panel
for c in ("revstatecapita", "revlocalcapita", "avgwklywagecapita"):
    df[c] = np.log(df[c])                             # augsynth's log transforms
covs = ["lngdpcapita", "revstatecapita", "revlocalcapita",
        "avgwklywagecapita", "estabscapita", "emplvlcapita"]
base = dict(df=df, outcome="lngdpcapita", treat="treated",
            unitid="fips", time="year_qtr")

att = lambda cfg: VanillaSC({**base, **cfg}).fit().effects.att
att({})                                               # classic SCM    -0.029435
att({"augment": "ridge"})                             # ridge ASCM     -0.040063
att({"augment": "ridge", "covariates": covs})         # covariate ASCM -0.060937
att({"augment": "ridge", "covariates": covs,
     "residualize": True})                            # residualized   -0.056377

The reproduced ladder (mlsynth vs augsynth):

Specification	ATT (mlsynth)	Pre-fit L2	augsynth (ATT / L2)	Agreement
Classic SCM	-0.029435	0.082555	-0.029435 / 0.082555	exact
Ridge ASCM	-0.040063	0.061515	-0.040063 / 0.061515	exact
Covariate ASCM	-0.060937	0.053855	-0.060937 / 0.053855	exact
Residualized	-0.056377	0.060838	-0.052773 / 0.057637	0.004

Three of the four cells reproduce the package to six decimals, and the ladder is monotone in \(|\text{ATT}|\) (the un-augmented SCM is the conservative end). The joint-null conformal \(p\)-value for ridge ASCM (\(0.071\)) is also reproduced to Monte-Carlo precision.

A note on the residualized penalty — the one cell that is not exact, and deliberately so. After residualizing out \(K\) covariates the residual Gram is rank-deficient (\(T_0\) rows, rank \(\le J - K\)), so a cross-validation on the residuals is ill-posed and drifts to the grid floor. mlsynth tunes the penalty on the outcome scale instead — where augsynth’s residual CV lands anyway. The instability is visible in the package’s own output: its live value here is \(-0.0528\) / \(0.0576\) while the published vignette table reports \(-0.055\) / \(0.067\). mlsynth’s \(-0.0564\) / \(0.0608\) sits between them and is stable across reruns, which is the property worth having when the reference itself is not reproducible.

Path B — coverage and bias reduction (Section 7)#

Four data-generating processes are calibrated to the Kansas panel — a 3-factor interactive-fixed-effects model (calibrated exactly as gsynth/fect’s interFE does it with no covariates: two-way demean, then a rank-3 SVD of the residual), the same model at \(4\times\) noise, additive two-way fixed effects, and a fitted AR(3). Treatment is assigned to an extreme unit, so plain SCM struggles and the augmentation matters. Across all four DGPs ridge ASCM reduces \(|\text{bias}|\) versus plain SCM and gives near-nominal coverage (\(\approx 0.90\)–\(0.96\)), with the gain limited under high noise — the paper’s thesis.

Durable cases & tests#

ascm_kansas — the four-spec Kansas ladder cross-validated against augsynth (benchmarks/cases/ascm_kansas.py).
augsynth_calibrated — the Section-7 coverage / bias-reduction simulation (benchmarks/cases/augsynth_calibrated.py).
Regression tests: mlsynth/tests/test_bilevel_ridge.py (test_augsynth_kansas_replication, test_augsynth_kansas_conformal_pvalue, test_augsynth_kansas_covariate_ladder, test_kansas_covariate_lambda_max_matches_augsynth, and TestCovariateAggregation, which pins the per-covariate NA rule described above and how far the per-period rule would move it); mlsynth/tests/test_vanillasc_ascm.py (test_augsynth_kansas_ladder_public_api for the same four cells through the public API, and TestCovariateMeansOmitMissingPerColumn for the aggregation itself).

Ridge ASCM — Augmented Synthetic Control (Ben-Michael, Feller & Rothstein 2021)

Contents

Ridge ASCM — Augmented Synthetic Control (Ben-Michael, Feller & Rothstein 2021)#

Validation strategy#

Cross-validation — the Kansas ladder#

Path B — coverage and bias reduction (Section 7)#

Durable cases & tests#