Kato Advocates Riesz Regression Guided by Neyman Score

According to the arXiv paper arXiv:2605.06386 by Masahiro Kato, the author argues that in debiased machine learning (DML) balancing functions should be derived from the Neyman orthogonal score rather than chosen solely as functions of observed covariates. The paper states that covariate balancing is appropriate when the regression error entering the orthogonal score can be represented by covariate-only functions, which makes it a natural finite-dimensional approximation for targets such as ATT counterfactual means. For ATE estimation under treatment effect heterogeneity, the paper reports the score error typically contains treatment-specific components, and that balancing common covariate functions can leave those components unbalanced. The author therefore advocates regressor balancing, implemented via Riesz regression with basis functions of the full regressor, as the general balancing principle for DML (arXiv:2605.06386, submitted 7 May 2026).
What happened
According to the arXiv submission arXiv:2605.06386 (submitted 7 May 2026), Masahiro Kato presents a position paper arguing that balancing functions in debiased machine learning should be derived from the Neyman orthogonal score rather than selected only as functions of covariates. The paper frames covariate balancing as valid when the regression error entering the orthogonal score is representable by covariate-only functions, and it cites ATT targets as a setting where covariate balancing is a natural finite-dimensional approximation (arXiv:2605.06386). The paper further states that for ATE estimation under treatment-effect heterogeneity the score error generally includes treatment-specific components because the outcome regression depends on the full regressor, and that balancing shared covariate functions can leave those components unbalanced (arXiv:2605.06386).
Technical details
Per the paper, the recommended remedy is regressor balancing, operationalized through Riesz regression using basis functions of the full regressor. The author positions Riesz regression as a way to approximate the regression error relevant to the orthogonal score (arXiv:2605.06386). The paper emphasizes that this is not a rejection of covariate balancing; rather, covariate balancing is described as the special-case appropriate when the score-relevant error is covariate-only (arXiv:2605.06386).
Editorial analysis - technical context
Researchers building debiased estimators routinely rely on the Neyman orthogonal score because its orthogonality reduces sensitivity to first-stage nuisance estimation error. Industry-pattern observations: methodology papers that tie finite-dimensional balancing choices to the score often prompt reexamination of which basis functions practitioners include in balancing routines and how those bases interact with treatment heterogeneity.
Context and significance
Editorial analysis: For the causal-inference and DML communities, framing covariate balancing as a special case guided by the orthogonal score clarifies when simple covariate-based balancing is sufficient and when richer regressor bases are required. This distinction matters for empirical strategy selection, robustness checks, and simulation studies comparing ATE versus ATT targets.
What to watch
Observers will look for follow-up work that:
- •provides empirical comparisons of covariate versus regressor balancing across heterogeneous-treatment settings
- •releases replication code that implements Riesz regression bases in common DML toolkits
- •connects theoretical conditions in the paper to finite-sample performance in applied datasets
Scoring Rationale
This is a methodological position paper clarifying a theoretical principle relevant to debiased machine learning and causal estimation. It is notable for researchers and practitioners deciding balancing bases, but it is not an empirical or software breakthrough, so the impact is moderate.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems
