1 Introduction to causal inference

Published

Last modified: 2026-01-27: 8:19:13 (AM)

1.1 Introduction

Hernán and Robins (2020) (page vii):

Unfortunately, the scientific literature is plagued by studies in which the causal question is not explicitly stated and the investigators’ unverifiable assumptions are not declared. This casual attitude towards causal inference has led to a great deal of confusion.

Rigorously defining cause and effect is difficult. Fortunately, many humans have strong intuitions about these concepts. We will make cursory attempts at definitions for the basic terms, and leave the finer points to philosophers.

1.2 Individual causal effects

Definition 1.1 (Action, intervention, exposure, policy, treatment) An action (also called an intervention, exposure, policy, or treatment) is a choice that we consider making.

Definition 1.2 (Random variable) A random variable is a variable that may have different values for different individuals and/or for different actions/exposures that an individual might experience. ¹

Definition 1.3 (potential outcome, consequence) A potential outcome of action \(a\) (also called a consequence of \(a\)) is the value of a random variable \(Y\) that would occur if we were to take action \(a\). The potential outcome of action \(a\) on random variable \(Y\) is often denoted \(Y(a)\), \(Y^a\), or \(Y^{A=a}\). We will use notation \(Y(a)\).

Definition 1.4 (Factual outcome, observed outcome) A factual outcome (or observed outcome) is the potential outcome corresponding to an action that was actually taken.

If we consider taking action \(a\) or an alternative \(a'\), and we actually take action \(A=a\), then \(Y(a)\) is the factual outcome.

Definition 1.5 (Counterfactual outcome) A counterfactual outcome is a potential outcome corresponding to an action that was not actually taken.

If we consider taking action \(a\) or an alternative \(a'\), and we actually take action \(A=a\), then \(Y(a')\) is a counterfactual outcome.

There might be more than one counterfactual outcome, depending on how many action options were considered, but there can only ever be one factual outcome per random variable.

Definition 1.6 (Cause) Action \(a\) causes outcome \(y\) (or is a cause of \(y\)) if:

outcome \(y\) would occur if we were to take action \(a\)
and
outcome \(y\) would not occur if we did not take action \(a\).

In other words, if:

\(Y(a) = y\)
and
\(\exists a' \neq a: Y(a') \neq y\)

Definition 1.7 (effect) The effect of action \(a\) on outcome \(Y\), relative to a given alternative action \(a'\), is the contrast in potential outcomes, \(Y(a)\) versus \(Y(a')\).

Definition 1.8 (Consistency) Consistency is the assumption that if we observe an action \(a\), then the observed outcome \(Y\) is equal to the “factual potential outcome” corresponding to action \(a\); in other words, if \(A=a\), then \(Y(a) = Y\).

Definition 1.9 (Exchangeability) Subpopulations defined by exposure \(X\) are exchangeable with respect to a potential outcome \(Y(x)\) if the distribution of \(Y(x)\) does not depend on the observed exposure \(X\):

\[Y(x) \perp\!\!\!\perp X\]

Theorem 1.1 If subpopulations defined by values of exposure \(X\) are exchangeable with respect to potential outcome \(Y(x)\), then the expected value of \(Y(x)\) does not depend on the observed value of \(X\):

\[\text{E}{\left[Y(x) | X = x'\right]} = \text{E}{\left[Y(x) | X = x\right]}\]

Definition 1.10 (Conditional exchangeability) Subpopulations defined by exposure \(X\) are exchangeable with respect to a potential outcome \(Y(x)\) if the distribution of \(Y(x)\) does not depend on the observed exposure \(X\), conditional on covariate(s) \(Z\):

\[Y(x) \perp\!\!\!\perp X | \tilde{Z}\]

Theorem 1.2 If subpopulations defined by values of exposure \(X\) are conditionally exchangeable with respect to potential outcome \(Y(x)\) given covariate \(\tilde{Z}\), then the expected value of \(Y(x)\) does not depend on the observed value of \(X\):

\[\text{E}{\left[Y(x) | X = x', \tilde{Z}=\tilde{z}\right]} = \text{E}{\left[Y(x) | X = x, \tilde{Z}=\tilde{z}\right]}\]

Definition 1.11 (Fundamental problem of causal inference) The fundamental problem of causal inference is that only one potential outcome (the factual outcome) can be observed per person (or per sampling unit, more generally) (Holland 1986). The other, counterfactual outcomes, are all missing data, and thus, the causal effects are all missing data as well.

adapted from Hernán and Robins (2020) p3↩︎