NEWS

rdhte 0.2.0

First systematic audit pass following the 0.1.0 pre-release. Fourteen HIGH-severity and seven MED-severity bugs were fixed across the R and Stata code, plus a new data = argument and a multi-script verification harness. A 2026-05-01 Windows verification pass uncovered three further HIGH-severity bugs (Stata weights not forwarded to bandwidth selection; R rdhte_lincom silently applied multcomp's single-step multiplicity adjustment; harness orchestrator regex/CSV-staging gaps) which are included below.

A subsequent expert-review pass on the Stata code on 2026-05-01 PM uncovered six further HIGH-severity issues, surfaced one previously silent latent inference bug in the no-covs_hte path, and bumped the Stata floor from 14 to 16 (frames). All are included in this version.

Stata 16+ now required (was Stata 14+). Driven by the migration from preserve to frames for the in-program working sample.

New features

data = argument added to rdhte() and rdbwhte(). When supplied, y, x, covs.hte, covs.eff, weights, cluster, and subset may be given as bare variable names referring to columns of data. covs.hte additionally accepts a one-sided formula (e.g. "~ z1 + z2") whose variables are looked up in data first. Mirrors the rdrobust 4.0.0 pattern. Implemented via the internal .rdhte_resolve_data() NSE helper in R/helpers.R.

Bug fixes (R)

Undefined W in no-covs.hte + covs.eff branch. The internal regression formula referenced W even though no heterogeneity variable was supplied. rdhte() now uses Y ~ T*Xp + covs (and Xq for the q-fit). Previously crashed at lm().
Character / formula-string covs.hte path. Three separate bugs were fixed: a missing brace pair that left fml.txt undefined for ~-prefixed input; the formula's environment now uses parent.frame() so caller-frame variables resolve; and the data.frame(covs.hte = model.matrix(...)) step was rebuilt so the downstream grep("^covs.hte", ...) finds the correct columns (previously resulted in "subscript out of bounds" deeper in lm()). The illustration's covs.hte = "w_left*w_strength" example was silently affected by these and now produces correct output.
Bandwidth typo is.null(h.r) & is.null(h.r) (twice the same argument) is now is.null(h.l) & is.null(h.r).
Matrix-as-vector indexing h.lev[l] in three locations across rdhte() and rdbwhte(): the right-side effective-sample-size count was using the left bandwidth whenever bwselect = "msetwo" or per-group h.l/h.r were supplied. Now uses h.lev[l, 1] and h.lev[l, 2].
Nh.lev always 0 in continuous-covs.hte auto-bw branch because h[1]/h[2] was NULL. Now uses h.lev[1,1]/h.lev[1,2].
out$h shape inconsistency between branches (vector in no-covs.hte, matrix in covs.hte). All branches now return a consistent n.lev × 2 matrix.
Slot name typo W.names. = W.names (trailing dot) so summary.rdhte() could not find x$W.names for the continuous case. Renamed to W.names.
Bandwidth-vector validation check length(h.l) > 1 & length(h.r) always evaluated TRUE (bare length(h.r) coerces to TRUE when positive). Now length(h.l) > 1 | length(h.r) > 1 correctly errors on mismatched lengths.
do.call-unsafe NSE. deparse(substitute(covs.hte)) returned multi-line output when covs.hte was passed as a value rather than a symbol (e.g. via do.call), causing paste() recycling to inflate W.names to length-126 and the names(tau.hat.se) <- W.names assignment to error. The deparse output is now collapsed to one line and length-capped (>100 chars falls back to the placeholder "covs.hte").
N.lev is now consistently c(N_l, N_r) across all branches; the categorical-covs.hte branch previously stored table(W.covs), which made summary.rdhte() show wrong "Number of obs" for any group count other than two.
subset = accepts both logical and integer indices (matches rdrobust 4.0.0); previously only logical worked.
rdbwhte() gained weights = and subset = arguments to match rdhte()'s feature surface, plus the same string-formula covs.hte handling.
weights = is now forwarded to all internal rdbwselect() calls (previously dropped silently in three locations).
rdbwhte() no-covs.hte branch's covs.cont <- FALSE flipped to TRUE so the print path matches rdhte().
rdbwhte()'s continuous-covs.hte branch now returns Nh as a 1 x 2 matrix (consistent with the factor case) rather than a length-2 vector.
rdhte_lincom(): validate level is in [1, 100) so the common fraction-vs-percentage mistake (e.g. level = 0.95) errors clearly rather than silently producing a tiny CI.
rdhte_lincom(): column renamed t_stat -> z_stat. The values were already z-statistics (multcomp's glht uses a Gaussian approximation by default); the column label is now correct.
rdhte_lincom(): removed dead ses variable and a commented-out se column.
rdhte_lincom() p-values were silently multiplicity-adjusted. summary.glht() defaults to test = adjusted("single-step") whenever the user passes more than one hypothesis, so the p_value column in the individual data frame was an FWER-adjusted p-value rather than the per-hypothesis two-sided z-test the docstring promises (and that Stata's lincom reports). For >1 hypothesis the adjusted values were ~2x the raw values; the single-hypothesis case was already correct. Now passes test = adjusted(type = "none"). Point estimates and CI bounds are unchanged.
checkup_rdhte_basic.R was emitting r$N[1]/r$N[2] (full-sample left/right counts, post-na.omit) for the N_l/N_r cross-language diff fields, which mismatched Stata's e(tau_N) (effective N within bandwidth). Now emits r$Nh[1,1]/r$Nh[1,2] so both sides compare the same quantity.
Nh shape inconsistency in the continuous-covs.hte branch. At rdhte.R:509 the continuous (no W.lev) path returned Nh as a length-2 vector while every other branch returned a 1 x 2 (or n.lev x 2) matrix. Downstream code that indexed Nh[i, j] uniformly -- including the Akhtari/Moreira/Trucco replication script -- broke on the continuous path. Now matches the rest of the branches with matrix(c(Nh.lev.l, Nh.lev.r), 1, 2). Values are unchanged; positional access (Nh[1], Nh[2]) still works.

Bug fixes (Stata)

rdhte y x, vce(hc1) errored at regress. Stata's regress only accepts vce(robust), not vce(hc1). The vce string is now translated to vce(robust) for the internal regress calls (display still shows "HC1").
rdbwhte ... vce(cluster c) produced a different bandwidth from the auto-BW inside rdhte ... vce(cluster c) because rdbwhte was forwarding the user's raw vce string (and forcing vce_select = "hc0" internally) while rdhte built a proper vce_rdbw local. Both now use the same vce_rdbw construction.
rdbwhte ereturned e(cmd) = "rdhte"; now correctly "rdbwhte".
CI bounds at rdhte.ado:712 no longer hard-code 1.96; now use invnormal(1 - (1 - level/100)/2) so level() is honored.
Auto-i. prefix on a 0/1 variable now warns when the variable is constant (all 0 or all 1) and the heterogeneity analysis collapses to an overall RD ATE. Mirrored across rdhte.ado and rdbwhte.ado.
rdhte_lincom: lincom is now wrapped in capture so a malformed hypothesis errors with a clear message instead of silently picking up stale r() values from the previous estimation.
rdhte_lincom accepts a new level() option (default 95) -- the R version already had it; the Stata version was using the global set level only.
weights was not forwarded to bandwidth selection. rdhte.ado's two internal rdbwselect calls (the joint-bandwidth path and the per-level factor path) did not pass the user's weights() option, so rdhte ..., weights(w) bwselect(...) silently chose a different bandwidth from the R-side equivalent. Same shape of bug as the vce_rdbw cluster fix in this release. Now forwards weights() to every internal rdbwselect invocation.
rdbwhte.ado was missing the weights() option entirely. The R-side rdbwhte() already accepted weights =; the Stata syntax line did not declare weights(string) at all, and the three internal rdbwselect calls (joint-bandwidth, no-covs_hte, per-factor-level) did not forward it either. Both gaps fixed.
checkup_rdhte_stress.do had five if _rc == 0 { di "..."; local ++PASS } lines that triggered Stata's r(198) program error: code follows on the same line as open brace under batch mode. All five expanded to multi-line. The same script's NA-handling probe referenced e(N_h_l) + e(N_h_r) (neither saved by rdhte.ado); now asserts only _rc == 0. Note: rdhte.ado's e(N) is the pre-NA-drop input row count (line 158), not the post-drop count R's sum(r$N) reports -- a documented LOW-severity asymmetry left for a follow-up.

Verification harness

Eight new top-level scripts under the project root provide a verification gate analogous to rdrobust's:

checkup_rdhte_lm.R -- 40 R-only sanity checks. Compares each rdhte result to a manual reconstruction from coef.full / vcov.full.
checkup_rdhte_basic.{R,do} -- 14 specs of cross-language baseline parity (factor / continuous / cluster / weights / kernel / bwselect / p-q / manual h / bw.joint).
checkup_rdhte_extended.{R,do} -- 9 specs combining 3+ non-default options each, with per-(label, level) emission.
checkup_rdhte_bw.{R,do} -- 14 specs of standalone-rdbwhte parity, including the cluster path that the H10/H11/H12 fixes target directly.
checkup_rdhte_lincom.{R,do} -- 7 hypotheses across 3 specs comparing R's multcomp::glht to Stata's lincom.
checkup_rdhte_vs_rdrobust.R -- 11 R-only checks confirming the illustration's claim that rdhte reproduces rdrobust under matched settings (h fixed, rho=1, vce="hc3"). Matches to ~1e-15 on tau.
checkup_rdhte_stress.R -- 11 R-only edge cases.
checkup_rdhte_stress.do -- 9 Stata-portable stress probes.
checkup_rdhte_strformula.R -- 6 R-only checks for the string-formula covs.hte path with caller-frame variables.

run_rdhte_checks.py orchestrates all of the above end-to-end and reports per-spec |R-Stata|. Marker on success: RUN_RDHTE_CHECKS_OK.

Two orchestrator-level bugs found and fixed during the 2026-05-01 Windows verification pass:

The key=value parsing regex was (\w+)=(-?[0-9.eE+\-]+), which did not tolerate the whitespace that Stata's %15.10f display format inserts between = and the number. Every Stata float field (tau_cl=..., h_l=..., etc.) silently parsed as nothing -- only the unpadded integer fields (N_l=240) got through. The orchestrator was therefore comparing only N_l/N_r, which under the old full-sample-vs-effective N semantic produced ~700-unit "diffs" that masked the real silent-OK pattern. Fixed by adding \s* after =.
Each Stata batch is run from a C:\Temp\rdhte_stata_* scratch dir, but the .do files reference _rdhte_data.csv by relative path. run_STATA() now stages _rdhte_*.csv from the project root into the scratch dir before invoking Stata.

Stata expert audit (2026-05-01 PM)

A second pass on the Stata code, severities HIGH / MEDIUM / LOW. All HIGH fixed; most MEDIUM applied; LOW partially done.

HIGH

Latent missing-e(V) bug in the no-covs_hte path (surfaced by H4). rdhte y x (no heterogeneity) had been silently returning an e(V) matrix populated with .s -- the no-covs branch never wrote tau_V[1,1]. The cap ereturn post b V masked the resulting r(504) matrix has missing values, leaving downstream test/lincom calls with broken inference. Now correct (matrix tau_V[1,1] = se^2); the cap is gone.
rdrobust version-check rewritten. The previous code read line 5 of c(sysdir_plus)/r/rdrobust.ado with file open and tokenized to extract the version. Replaced with findfile rdrobust.ado (works for any adopath location: PERSONAL, SITE, PLUS) plus a regex scan of the first *!...vN.M.K line. Robust to future rdrobust file-layout changes.
vce() parser validation restored. vce(robust foo), vce(hc1 var), vce(cluster a b), etc., previously had their validation commented out (and silently mis-parsed). Now error with r(198) and a clear message; the legal 2-token forms are exactly vce(cluster|hc2|hc3 var).
Tempname migration of all working matrices. All 14 internal matrices in rdhte.ado (tau_h, tau_hat, tau_bc, tau_se, tau_V, tau_t, tau_pv, tau_N, tau_h_l/h_r, tau_ci_l/r, CV, plus the b/V ereturn pair) and 2 in rdbwhte.ado (tau_h, tau_N) are now tempname'd. They no longer leak into the caller's namespace and cannot clobber a user matrix of the same name.
rdhte_countvars no longer double-counts. The if-elseif chain wasn't mutually exclusive: any pure factor#factor term (e.g. i.W#i.Z) incremented BOTH n_interaction_factor AND n_interaction_continuous. Restructured so each term routes to exactly one bucket. Affected the downstream quad flag and the BW-routing decision for higher-order factor interactions.
rdhte_fvexpand declared rclass. Previously the program had no class declaration and callers relied on the inner fvexpand call's r() values surviving through program return -- accidental, undocumented, and brittle to future Stata. Now rclass, captures r(varlist) / r(fvops) inside nobreak before any reset, and explicitly returns them.

MEDIUM

preserve -> frames (M1). rdhte.ado and rdbwhte.ado now do all work in an isolated temporary frame: frame put * if touse, into(workframe); cwf workframe; ...; cwf orig_frame; frame drop workframe. Avoids the O(N) preserve disk round-trip and only the touse=1 subset is copied (typically a fraction of the user's dataset). The user's data and frame state are never mutated, even on error. Drives the Stata 16+ floor bump (frames were introduced in Stata 16).
e(N_h_l) and e(N_h_r) added (M2). Per-side bandwidth-effective sample sizes (summed across factor levels). Mirrors rdrobust's convention. The deeper post-NA-drop semantic for e(N) across covs_hte / weights / cluster missingness is documented as a deferred follow-up.
qui count consolidation (M3). Three qui count passes collapsed to two (left + right; total via addition). One fewer O(N) pass per call.
version 14.0 -> 16.0 (M8). Required by the frames refactor. Bumped across all 5 ados; rdhte_countvars.ado also gained the missing version directive (it had none before).

LOW

Dead / commented-out code removed (L1). ~10 commented blocks across rdhte.ado (check-it-exists scaffolding for value labels, dead tau_V covariance attempts, redundant local k = i inside nested forvalues, etc.) and rdbwhte.ado (orphan */, value-label scaffolding).
e(outcomevar) removed (L2). Was advertised in the help file but the 2026-04-27 harmonization audit had already removed it; line 855 had silently reintroduced it. Users get e(depvar) per Stata convention.
rdhte.sthlp Stored results section refreshed (L5). Removed e(outcomevar); added e(N_h_l) and e(N_h_r); e(N) description now correctly says "sample size after listwise deletion on (y, x)" instead of "original number of observations".
Continuous-W discretization warning (L6). When the user passes covs_hte(W) (no i. prefix) and W has fewer than 10 unique values, rdhte now emits a note: recommending i.W for subgroup-CATE estimation rather than the (probably unintended) slope-coefficient interpretation.

MEDIUM items deferred (not in this release)

M4 (length(subinstr(...)) hash-counting refactor): arithmetic is correct, just opaque.
M6 (display-formatting magic numbers): pure cleanup, high regression risk on output formatting.
M7 (DRY the 6 BW-selection branches): ~150 lines of near-duplicate code in rdhte.ado; should pull into a subprogram.
e(N) post-NA-drop for covs_hte / weights / cluster: needs careful covs_hte string parsing to markout factor variables.

rdhte 0.1.0 (2025-07-18)

Initial release.