Three Essays in Robust Causal Inference


Book Description

Economics research often addresses questions with an implicit or explicit policy goal. When such a goal involves an active intervention, such as the assignment of a particular treatment variable to participants, the analysis of its effects requires the tools of causal inference. In such settings, the opportunity to use experimental or observational data to tease out policy parameters of interest requires a combination of statistical and causal assumptions. In reduced form work, where an explicit economic theory is not laid out to allow identification of policy parameters from data, the investigation of the causal assumptions becomes a critical exercise for the credibility of the results. Many robustness exercises evaluate the effect that relaxing and/or modifying assumptions produces on the results of the study. The scope of these exercises is very broad, reflecting the need to tailor specific robustness exercises to whichever assumptions are most likely to be violated in a given domain. This dissertation is a collection of three essays on robust causal inference that share a unifying theme: preserving the nonparametric nature of the robustness exercise. This aspect has both a theoretical and practical relevance. First, causal assumptions are usually nonparametric: robustness exercises that restrict to parametric cases might lead to misleading insights. Further, economics research has started to incorporate more flexible nonparametric and semi-parametric techniques which may call for robustness exercises that are readily applicable to these approaches. Because robustness exercises are context specific, each of these essays addresses a separate aspect of it. Chapter 1 investigates how changes in the distribution of covariates may invalidate given experimental results, with implications for evidence based policy-making. It proposes an explicit metric of robustness that measures the distance of the closest distribution of covariates for which experimental results are violated. Chapter 2 analyses the practice of robustness checks as a way to validate a researcher's identification strategy. It details out the limitations of these exercises in detecting failure of identification and proposes a non-parametric robustness test that bypasses functional form assumptions. Finally, Chapter 3 focuses on the robustness of Marginal Treatment Effect identification when the instrumental variables fail to incentivize treatment for a subset of the population. It provides two alternative identification results which can be relevant in practice.




Three Essays on Causal Inference


Book Description

This thesis describes three research projects in causal inference, all related to the problem of contrasting the average counterfactual outcomes on two sides of a binary decision. In the first project, we discuss estimation of the average causal effect in a randomized control trial. Here, we find that statisticians find themselves in a kind of statistical paradise: a simple model-based procedure delivers correct confidence intervals even if the experimental participants are not randomly sampled and mis-specified models are used. In the second project, we consider the problem of testing for a treatment effect using observational data with no hidden confounders. Conceptually, this is no different from a rather complicated RCT, and one might expect that a return to statistical paradise is possible. Unfortunately, this is not the case: we show that even intuitively reasonable uses of correct models may still yield misleading conclusions. The final project looks at observational data with unobserved confounding and gives methods for computing bounds on average causal effects. Here, we discover some never-before-seen robustness properties unique to the partially-identified setting.




Essays on Causal Inference and Econometrics


Book Description

This dissertation is a collection of three essays on the econometric analysis of causal inference methods. Chapter 1 examines the identification and estimation of the structural function in fuzzy RD designs with a continuous treatment variable. We show that the nonlinear and nonseparable structural function can be nonparametrically identified at the RD cutoff under shape restrictions, including monotonicity and smoothness conditions. Based on the nonparametric identification equation, we propose a three-step semiparametric estimation procedure and establish the asymptotic normality of the estimator. The semiparametric estimator achieves the same convergence rate as in the case of a binary treatment variable. As an application of the method, we estimate the causal effect of sleep time on health status by using the discontinuity in natural light timing at time zone boundaries. Chapter 2 examines the local linear regression (LLR) estimate of the conditional distribution function F(y|x). We derive three uniform convergence results: the uniform bias expansion, the uniform convergence rate, and the uniform asymptotic linear representation. The uniformity in the above results is with respect to both x and y and therefore has not previously been addressed in the literature on local polynomial regression. Such uniform convergence results are especially useful when the conditional distribution estimator is the first stage of a semiparametric estimator. Chapter 3 studies the estimation of causal parameters in the generalized local average treatment effect model, a generalization of the classical LATE model encompassing multi-valued treatment and instrument. We derive the efficient influence function (EIF) and the semiparametric efficiency bound for two types of parameters: local average structural function (LASF) and local average structural function for the treated (LASF-T). The moment condition generated by the EIF satisfies two robustness properties: double robustness and Neyman orthogonality. Based on the robust moment condition, we propose the double/debiased machine learning (DML) estimators for LASF and LASF-T. We also propose null-restricted inference methods that are robust against weak identification issues. As an empirical application, we study the effects across different sources of health insurance by applying the developed methods to the Oregon Health Insurance Experiment.




Three Essays in Causal Inference


Book Description

This thesis is a collection of three essays on causal inference. Chapter 1 considers the problem of constructing confidence intervals or bands for the quantiles of treatment effects under settings where point identification is impossible. I show that under settings where selection is only on observables bounds for the entire quantile function can nonetheless be estimated, and this enables the estimation of confidence bands. I also extend these results to instrumental variable settings. Computational complexity analysis demonstrates that the methodology I propse is computationally attractive. Chapters 2 and 3 consider extending the synthetic control approach of Abadie, Diamond, and Haimueller (2010) to two different settings where individual-level data is available. In Chapter 2 I consider estimating average treatment effects by constructing for every subject in the treatment group a synthetic twin composed of individuals in the control group. I show that the resulting estimator is unbiased when selection is dependent only on observables. I also show that matching estimators and OLS estimators can be viewed as special cases of synthetic control estimators. Furthermore, I demonstrate that the estimator is highly scalable computationally. In Chapter 3, I consider settings where either panel data or repeated cross-sectional data is available. I show that the synthetic control estimator in this setting can yield asymptotically valid standard errors when aggregation is done from individual-level data, unlike the original work of Abadie, Diamond, and Hainmueller (2010). To demonstrate asymptotic properties, two types of asymptotic analysis are carried out: one appropriate when the number of observations at each point in time in each subpopulation tends to infinity, and one suitable for stationary aggregate data and in which the number of pre-intervention periods gets large.




Three Essays on Causal Inference in Comparative Political Behavior


Book Description

This dissertation contains three independent essays, each applying statistical methods for causal inference in observational studies to central topics in comparative political behavior.




Three Essays on Causal Inference for Observational Studies


Book Description

Finally, the third paper in this thesis addresses the question of unintended consequences in school segregation due to the introduction of a targeted voucher scheme. I use a difference-in-difference approach, in combination with matching on time-stable covariates, to estimate the effect that the 2008 Chilean voucher policy had on both average students' household income and academic performance at the school level. Results show that even though the policy had a positive effect on schools' standardized test scores, closing the gap between schools that subscribed to the policy compared to those that did not, there was also an increase in the differences between socioeconomic characteristics at the school level, such as average household income.




Essays in Econometrics


Book Description

In this dissertation, I propose novel approaches to causal inference in the settings characterized by an explicit clustering structure. I study different aspects of this problem, considering settings with few large clusters as well as with many small clusters. The dissertation consists of two essays. The first essay proposes a new model for causal inference in the settings with few large clusters and cluster-level treatment assignment. The second essay studies causal inference questions in the settings with many clusters of moderate size and individual-level treatment assignment. In the first essay, I construct a nonlinear model for causal inference in the empirical settings where researchers observe individual-level data for few large clusters over at least two time periods. It allows for identification (sometimes partial) of the counterfactual distribution, in particular, identifying average treatment effects and quantile treatment effects. The model is flexible enough to handle multiple outcome variables, multidimensional heterogeneity, and multiple clusters. It applies to the settings where the new policy is introduced in some of the clusters, and a researcher additionally has information about the pretreatment periods. I argue that in such environments we need to deal with two different sources of bias: selection and technological. In my model, I employ standard methods of causal inference to address the selection problem and use pretreatment information to eliminate the technological bias. In case of one-dimensional heterogeneity, identification is achieved under natural monotonicity assumptions. The situation is considerably more complicated in case of multidimensional heterogeneity where I propose three different approaches to identification using results from transportation theory. The second essay is co-authored with Guido Imbens. We develop a new estimator for the average treatment effect in the observational studies with unobserved cluster-level heterogeneity. We show that under particular assumptions on the sampling scheme the unobserved confounders can be integrated out conditioning on the empirical distribution of covariates and policy variable within the cluster. To make this result practical we impose a particular exponential family structure that implies that a low-dimensional sufficient statistic can summarize the empirical distribution. Then we use modern causal inference methods to construct a novel doubly robust estimator. The proposed estimator uses the estimated propensity score to adjust the familiar fixed effect estimator.







Essays on Causal Inference in Randomized Experiments


Book Description

This dissertation explores methodological topics in the analysis of randomized experiments, with a focus on weakening the assumptions of conventional models. Chapter 1 gives an overview of the dissertation, emphasizing connections with other areas of statistics (such as survey sampling) and other fields (such as econometrics and psychometrics). Chapter 2 reexamines Freedman's critique of ordinary least squares regression adjustment in randomized experiments. Using Neyman's model for randomization inference, Freedman argued that adjustment can lead to worsened asymptotic precision, invalid measures of precision, and small-sample bias. This chapter shows that in sufficiently large samples, those problems are minor or easily fixed. OLS adjustment cannot hurt asymptotic precision when a full set of treatment-covariate interactions is included. Asymptotically valid confidence intervals can be constructed with the Huber-White sandwich standard error estimator. Checks on the asymptotic approximations are illustrated with data from a randomized evaluation of strategies to improve college students' achievement. The strongest reasons to support Freedman's preference for unadjusted estimates are transparency and the dangers of specification search. Chapter 3 extends the discussion and analysis of the small-sample bias of OLS adjustment. The leading term in the bias of adjustment for multiple covariates is derived and can be estimated empirically, as was done in Chapter 2 for the single-covariate case. Possible implications for choosing a regression specification are discussed. Chapter 4 explores and modifies an approach suggested by Rosenbaum for analysis of treatment effects when the outcome is censored by death. The chapter is motivated by a randomized trial that studied the effects of an intensive care unit staffing intervention on length of stay in the ICU. The proposed approach estimates effects on the distribution of a composite outcome measure based on ICU mortality and survivors' length of stay, addressing concerns about selection bias by comparing the entire treatment group with the entire control group. Strengths and weaknesses of possible primary significance tests (including the Wilcoxon-Mann-Whitney rank sum test and a heteroskedasticity-robust variant due to Brunner and Munzel) are discussed and illustrated.




Transparent and Robust Causal Inferences in the Social and Health Sciences


Book Description

The past few decades have witnessed rapid and unprecedented theoretical progress on the science of causal inference. Most of this progress, however, relies on strong, exact assumptions, such as the absence of unobserved confounders, or the absence of certain direct effects. Unfortunately, more often than not these assumptions are not only untestable, but also very hard to defend in practice. This dissertation develops new theory, methods, and software for drawing causal inferences under more realistic settings. These tools allow applied scientists to both examine the sensitivity of their causal inferences to violations of their underlying assumptions, and also to draw robust (albeit also more modest) conclusions from settings in which traditional methods fail. Specifically, our contributions are the following: (i) novel powerful, yet simple, suite of sensitivity analysis tools for popular methods, such as confounding adjustment and instrumental variables, that can be immediately put to use to improve the robustness and transparency of current applied research; (ii) the first formal, systematic approach to sensitivity analysis for arbitrary linear structural causal models; and, (iii) novel (partial) identification results that marry two apparently disparate areas of causal inference research---the generalization of causal effects and the identification of "causes of effects." These methods are illustrated with examples from the social and health sciences.