Propensity Score Analysis (PSA)

# Propensity Score Analysis (PSA) – Day 1
### Chaochen Wang (CWAN) <br> Thomas Laurent (TLAU)
### 2019-12-13 17:00~18:00 <span class="citation">@CSS</span>

---

- Revision about causal inference framework and terminologies.

- Introduction of Propensity Score Analysis (PSA).

- Understand the **pre-assumptions** that required for PSA.

- How to test for unobserved confounders? <br>(by TLAU)

- Discuss the ways of using Propensity Scores.

???

---
class: middle, center, inverse

## Some background before PSA

---
class: middle

## What do we mean by <br>"causal inference"? (1)

- Most data analysis in medical research (or in other area) has a central aim - to learn about **cause-effect relationships**.

- Does the treatment work? 
    - How harmful is the exposure? 
    - How effective would the policy be and why?
    
    
---
class: middle

## What do we mean by <br>"causal inference"? (2)

- Randomised studies are **randomised** precisely to make causal inference more reliable.

- When randomisation is not feasible, we still wish to make inferences about effects of causes.

- However, most medical studies can only be interpreted as *"associations"* rather than causal effect.

---
class: middle

## Example

Anaemic patients undergoing hip replacement operation might benefit from receiving an **intravenous iron supplement** prior to surgery.

- Data collected in one hospital on all anaemic patients undergoing a hip replacement operation between 2009 and 2014.

- `$X$`:  **intravenous iron supplement**, yes or no
- `$Y$`: 90 days survival after operation, alive or dead

---
class: middle

### Other data collected in the example:

- Age, gender, co-morbidities (CVD, diabetes, renal disease);

- Severity of anaemia;

- Operation types;

- Whether or not transfusion needed during operation;

- Length of stay in hospital;

---
class: middle

## Traditional Approach - crude

- We start by looking at a 2 `$\times$` 2 table:

<table class="table table-striped table-hover table-condensed" style="width: auto !important; margin-left: auto; margin-right: auto;">
 <thead>
<tr>
<th style="border-bottom:hidden" colspan="2"></th>
<th style="border-bottom:hidden; padding-bottom:0; padding-left:3px;padding-right:3px;text-align: center; " colspan="2"><div style="border-bottom: 1px solid #ddd; padding-bottom: 5px; ">Death90</div></th>
</tr>
  <tr>
   <th style="text-align:center;">  </th>
   <th style="text-align:center;">  </th>
   <th style="text-align:center;"> 0 </th>
   <th style="text-align:center;"> 1 </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:center;font-weight: bold;vertical-align: middle !important;" rowspan="2"> FeIV </td>
   <td style="text-align:center;font-weight: bold;"> 0 </td>
   <td style="text-align:center;"> 9206 </td>
   <td style="text-align:center;"> 376 </td>
  </tr>
  <tr>
   
   <td style="text-align:center;font-weight: bold;"> 1 </td>
   <td style="text-align:center;"> 7365 </td>
   <td style="text-align:center;"> 312 </td>
  </tr>
</tbody>
</table>

- Estimated log odds ratio `$= \log(\frac{312\times9206}{376\times7365}) = 0.04, 95\% \text{CI: } -0.12, 0.19$`

---
class: middle
## Traditional Approach - crude

- What are we estimating here? The **estimand**<sup>1</sup> is

$$
`\begin{aligned}
\log\text{OR}_{Y|X} & = \log{\frac{\text{Pr}(Y = 1 | X = 1)}{1- \text{Pr}(Y = 1 | X = 1)}} \\
                    & \;\;\;\;\;\;\;\;\; - \log{\frac{\text{Pr}(Y = 1 | X = 0)}{1- \text{Pr}(Y = 1 | X = 0)}}
\end{aligned}`
$$

.footnote[
[1] **Estimand** is defined as the value that we we would like to know, such as "the mean BMI of the population in Japan". This is an unknown but fixed quantity.
]

---
class: middle

## Confounding

- We cannot give `$\log\text{OR}_{Y|X}$` a causal interpretation because there is **confounding**. 
- It is possible that patients provided with intravenous iron were already worse off, masking a truely effect of the exposure. 
- So the estimand changed into a **conditional logOR**:

$$
`\begin{aligned}
\log\text{OR}_{Y|X} & = \log{\frac{\text{Pr}(Y = 1 | X = 1, \textbf{C})}{1- \text{Pr}(Y = 1 | X = 1, \textbf{C})}} \\
                    & \;\;\;\;\;\;\;\;\; - \log{\frac{\text{Pr}(Y = 1 | X = 0, \textbf{C})}{1- \text{Pr}(Y = 1 | X = 0, \textbf{C})}}
\end{aligned}`
$$

---
class: middle

## Conditioning on covariates

- So we have some statistical variable selection procedure. 
- Transfusion, and length of stay in hospital are on the causal pathway from `$X$` to `$Y$`, so they were excluded from adjustment.
- After adjustment of the other covariates, we have an estimated conditional logOR of <br> -0.24 (95% CI -0.41, -0.07)
- WE may interpret this logOR as evidence that patients treated with iron supplement before hip replacement surgery had lower odds of dying within 90 days of operation.

---
class: middle, center

## But, the conditional logOR still cannot answer our question:

--
## What is the survival probability if,

--
## **contrary to reality**,

--
## every patient were treated with intravenous iron?

---
class: middle

## Causal languages

1. We ask the question: how `$Y$` would react if we could change `$X$` and assign it, **contrary to how it was actually assigned.** <br> -- potential outcomes <br>(Neyman, 1923; Rubin, D. 1974)

2. `$Y(x)$` is defined as the value that `$Y$` would take if `$X$` were (hypothetically) set to value of `$x$`

3. Causal effect is expressed as comparisons between `$Y(x)$`: <br> `$E\{Y(1)\} - E\{Y(0)\}$`

---
class: middle

## Potential Causal Estimands - binary outcome

- Marginal causal risk difference:

$$
\text{Pr}[Y(1) = 1] - \text{Pr}[Y(0) = 1]
$$

- Conditional causal log odds ratio:

$$
\log\frac{\text{Pr}(Y(1) = 1, \textbf{C})}{1- \text{Pr}(Y(1) = 1,  \textbf{C})} - \log\frac{\text{Pr}(Y(0) = 1, \textbf{C})}{1- \text{Pr}(Y(0) = 1,  \textbf{C})}
$$

---
class: middle

## Potential Causal Estimands - cont. outcome

- Marginal causal mean difference:

`$$E\{Y(1) - Y(0)\}$$`
- Conditional causal mean difference:

`$$E\{Y(1) - Y(0)|\mathbf{C}\}$$`

They are called the **Average Causal/Treatment Effect (ACE or ATE)**.

---
class: middle

## Assumption No 1: **No Interference**

- Potential value of `$Y_i$`, does not depend on `$X_j$`:

- `$i, j$`, are individual indices `$\approx$` independent

- we assume that a (hypothetical) exposure for individual `$j$`, does not change the outcome of individual `$i$`

- An example of violation is in the study of vaccines, where vaccinating individual `$j$` may affect the disease status of individual `$i$`.

---
class: middle

## Assumption No 2: **Consistency**

$$
X_i = x \Rightarrow Y_i = Y_i(x)
$$

- For individual `$i$`, who actually (in the real world) received exposure `$x$`, their observed outcome is the same as in the hypothetical world that this individual received the same exposure `$x$`.

---
class: middle

## Assuption No 3: <br>**Conditional Exchangeability**

- This means we are so *arrogant* and 100% sure that there is **no other unmeasured confounding**.

`$$Y(x)\perp \!\!\! \perp X| \textbf{C}, \forall x$$`

- `$\perp \!\!\! \perp$` means conditional independence; 
    - `$A\perp \!\!\! \perp B | C$` means "A is conditionally independent of B given C"
    
- `$\forall$` means "for all", here means `$x = 0, 1$`

???

Conditional on `$\mathbf{C}$`, the actual exposure level `$X$` is independent of the level of the potential outcomes.

---
class: middle

## Identification (1):

Suppose we are interested in a conditional causal mean difference.

`$$E\{Y(1) - Y(0) | \mathbf{C} = \mathbf{c}\}$$`

Given `$\mathbf{C}$`, where `$\mathbf{C}$` also is a set of covariates given which we believe **conditional exchangaeability** to be plausible -- no other unmeasured confoundings.

---
class: middle

## Identification (2):

`$$\begin{aligned}E\{Y(1) - Y(0) | \mathbf{C} = \mathbf{c}\} & = E\{Y(1)| \mathbf{C} = \mathbf{c}\} \\
&\;\;\;\;\;- E\{Y(0)| \mathbf{C} = \mathbf{c}\} \\
& =  E\{Y(1)| \color{red}{X = 1}, \mathbf{C} = \mathbf{c}\} \\
&\;\;\;\;\;- E\{Y(0)| \color{red}{X = 0}, \mathbf{C} = \mathbf{c}\}\\
& (\text{conditional exchangeability}) \\ 
& =  E\{\color{red}{Y}| X = 1, \mathbf{C} = \mathbf{c}\} \\
&\;\;\;\;\;- E\{\color{red}{Y}| X = 0, \mathbf{C} = \mathbf{c}\} \\
& (\text{by consistency}) \\ 
\end{aligned}$$`

???

These steps are extremely important, 
under the assumptions of conditional exchangeability, consistency, we have rewritten our causal estimand -- unobservable potential outcomes -- into the observed data.

---
class: middle

## Assumptions linked between <br> causal estimand and observed data

These steps are extremely important, 
under the assumptions of conditional exchangeability, consistency, we have rewritten our causal estimand

-- unobservable potential outcomes --

into the observed data.

---
class: middle

## Estimation using linear regression

Suppose

`$$E\{Y|x = 1, \mathbf{C} = \mathbf{c}\} - E\{Y|x = 0, \mathbf{C} = \mathbf{c}\}$$`

is the same for every `$\mathbf{c}$` (covariate), then if we fit a linear regression model:

`$$E(Y|X=x, \mathbf{C} = \mathbf{c}) = \alpha + \color{red}{\beta}x + \gamma^T\mathbf{c}$$`

--
then the coefficient `$\color{red}{\beta}$` of `$X$` can be interpreted as the conditional causal mean difference, as long as the model is correctly specified.

---
class: middle

### Example: <br> maternal smoking and birth weight (1)

- Data is from Cattaneo <sup>1</sup> on singleton babies born in Pennsylvania between 1989 and 1991.

- Outcome: birth weight, in grams

- Exposure: whether or not mothers smoked during the pregnancy

- n = 4642

Efficient semiparametric estimation of multi-valued treatment effects under ignorability

<em>Journal of Econometrics, </em>

<em>Elsevier, </em>
<b>2010</b><i>, 155</i>, 138-154

<p></p></font></body></html>
]

---
class: middle
### Example: <br> maternal smoking and birth weight (2)

We think there are only 3 confounders:

- maternal age, 
- whether the baby is the first child,
- first clinical visit at which trimester.

---
class: middle

### One possible linear regression model is:

```r
cattaneo2 <- haven::read_dta("data/cattaneo2.dta")
Cat_mod <- lm(bweight ~ as.factor(mbsmoke) + mage + 
                as.factor(fbaby) + as.factor(prenatal), 
              data = cattaneo2)
broom::tidy(Cat_mod, conf.int = TRUE) %>% 
  knitr::kable(.)
```

<table>
 <thead>
  <tr>
   <th style="text-align:left;"> term </th>
   <th style="text-align:right;"> estimate </th>
   <th style="text-align:right;"> std.error </th>
   <th style="text-align:right;"> statistic </th>
   <th style="text-align:right;"> p.value </th>
   <th style="text-align:right;"> conf.low </th>
   <th style="text-align:right;"> conf.high </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> (Intercept) </td>
   <td style="text-align:right;"> 2735.8442 </td>
   <td style="text-align:right;"> 78.2260 </td>
   <td style="text-align:right;"> 34.9736 </td>
   <td style="text-align:right;"> 0.00000 </td>
   <td style="text-align:right;"> 2582.4841 </td>
   <td style="text-align:right;"> 2889.2043 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> as.factor(mbsmoke)1 </td>
   <td style="text-align:right;"> -252.2599 </td>
   <td style="text-align:right;"> 21.5677 </td>
   <td style="text-align:right;"> -11.6962 </td>
   <td style="text-align:right;"> 0.00000 </td>
   <td style="text-align:right;"> -294.5428 </td>
   <td style="text-align:right;"> -209.9770 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> mage </td>
   <td style="text-align:right;"> 5.2681 </td>
   <td style="text-align:right;"> 1.6146 </td>
   <td style="text-align:right;"> 3.2627 </td>
   <td style="text-align:right;"> 0.00111 </td>
   <td style="text-align:right;"> 2.1027 </td>
   <td style="text-align:right;"> 8.4336 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> as.factor(fbaby)1 </td>
   <td style="text-align:right;"> -59.9184 </td>
   <td style="text-align:right;"> 17.7004 </td>
   <td style="text-align:right;"> -3.3851 </td>
   <td style="text-align:right;"> 0.00072 </td>
   <td style="text-align:right;"> -94.6196 </td>
   <td style="text-align:right;"> -25.2172 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> as.factor(prenatal)1 </td>
   <td style="text-align:right;"> 578.8464 </td>
   <td style="text-align:right;"> 68.5078 </td>
   <td style="text-align:right;"> 8.4494 </td>
   <td style="text-align:right;"> 0.00000 </td>
   <td style="text-align:right;"> 444.5386 </td>
   <td style="text-align:right;"> 713.1542 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> as.factor(prenatal)2 </td>
   <td style="text-align:right;"> 534.2280 </td>
   <td style="text-align:right;"> 70.6032 </td>
   <td style="text-align:right;"> 7.5666 </td>
   <td style="text-align:right;"> 0.00000 </td>
   <td style="text-align:right;"> 395.8122 </td>
   <td style="text-align:right;"> 672.6439 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> as.factor(prenatal)3 </td>
   <td style="text-align:right;"> 458.5222 </td>
   <td style="text-align:right;"> 80.9992 </td>
   <td style="text-align:right;"> 5.6608 </td>
   <td style="text-align:right;"> 0.00000 </td>
   <td style="text-align:right;"> 299.7252 </td>
   <td style="text-align:right;"> 617.3191 </td>
  </tr>
</tbody>
</table>
]

---
class: middle
### Example: <br> maternal smoking and birth weight (3)

- Under assumptions of No Interference (NI),  Consistency (C), Conditional Exchangeability (CE) <br> and, **the model is correctly specified**

- The estimate `$\beta = -252.2599$` can be given a **causal intepretation**: the expected difference in birth weight, conditional on three confounders, comparing a hypothetical situation in which: <br> <br> **all mothers smoke** versus a different hypothetical situation in which **no mothers smoke**

???

Under the assumptions, for a mother of a given age, first baby status, and timing of first clinical visit, we expect that the baby would be on average 252.3 grams lighter if the mother were hypothetically forced/randomised to smoking, versus if she were hypothetically prevented from smoking.

---
class: middle

Under the assumptions, for a mother of a given age, first baby status, and timing of first clinical visit, we expect that the baby would be on average 252.3 grams lighter if the mother were **hypothetically forced/randomised to smoking**, versus if she were **hypothetically prevented from smoking**.

---
class: middle

## Propensity score analysis (PSA)

- PSA make assumptions under the framework of causal inference: <br>No Interference (NI),  Consistency (C), Conditional Exchangeability (CE)

--
- Plus, the (unknown) propensity score is appropriately modelled using the data.

--
- Instead of modelling `$E(Y|X, \mathbf{C})$`, we specify the form of `$E(X| \mathbf{C})$`

---
class: middle

## Propensity score (1)

- The propensity score `$p(\mathbf{C})$`, is the conditional probability that `$X = 1$` given `$\mathbf{C}$`

$$
p(\mathbf{C}) = p(X = 1 | \mathbf{C})
$$

- A **scalar**, irrespective of the dimension of `$\mathbf{C}$`

- Rosenbaum and Rubin <sup>1</sup> showed that if conditional exchangeability holds given `$\mathbf{C}$`, then it also holds given `$p(\mathbf{C})$`:

`$$Y(x)\perp \!\!\! \perp X| \textbf{C}, \forall x
\Rightarrow Y(x)\perp \!\!\! \perp X| \color{red}{p(\textbf{C})}, \forall x$$`

.tiny[
[1] <html><head></head><body><font face="sans-serif"> Rosenbaum, P. R. &amp; Rubin, D. B. The central role of the propensity score in observational studies for causal effects
 <em>Biometrika, </em>
 <em>Oxford University Press, </em>
<b>1983</b><i>, 70</i>, 41-55

<p></p></font></body></html>
]

---
class: middle

## Propensity score (2)

- In practice,  `$p(\mathbf{C})$` must be estimated by fiting a logistic regression of `$X$` on `$\mathbf{C}$`;

- The **predicted values** from the logistic model are the individual propensity scores.

- The validity of methods based on propensity scores relies on **correctly modelling** `$E(X| \mathbf{C})$`

---
class: middle

## Propensity score (3)

- If an exposed and unexposed person have the same value of `$p(\mathbf{C})$`, say 0.25, it means they were equally likely to have received the exposure.

- This is **similar as** in a randomised trial, an exposed and unexposed subject with the same `$p(\mathbf{C})$` are exchangeable. Unless:

- Important confounders were not included in `$p(\mathbf{C})$`; 
  - `$p(\mathbf{C})$` was incorrectly modelled.

---
class: middle

## Example: RFA dataset (1)

--
- 3351 patients with metastatic lung cancer were given either standard surgery (n = 1848) or radiofrequency ablation (RFA) (n = 1703) to remove metastatic lung cancer nodules.

--
- 3-year progression-free survival is higher (79.2% versus 67.9%) for those who received RFA.

--
- But confounding is a concern: larger modules cannot be removed by RFA, so at least some of the apparent protective effect of RFA is likely due to RFA given to patients **with already better prognosis**.

---
class: middle

## Example: RFA dataset (2)

- Potential confounders: age, gender, hospital (1,2,3,4), smoking (non, ex, current), nodules numbers, no. other metastatic sites, duration of disease, diameter of largest module, location of primary cancer (bladder, breast, bowel, gullet, kidney, skin, stomach....), modules can be reached easily (easy, moderate, difficult)

---
class: middle

## Example: RFA dataset (3)

- Propensity score model:

.small[
$$
`\begin{aligned}
\text{logit}\{ \text{Pr(RFA}|\mathbf{C} )\}  = & \beta_0 + \beta_1\text{age} + \beta_2 \text{gender}+ \beta_3I(\text{hospital = 2}) \\
& +\beta_4I(\text{hospital =3}) + \beta_5I(\text{hospital = 4}) \\
& + \beta_6I(\text{smoke = 2} ) + \beta_7I(\text{smoke = 3}) + \beta_8\text{nodules}\\
&  + \beta_9\text{mets} + \beta_{10}\text{duration}+ \cdots  + \beta_{20}I(\text{primary = 9})\\
&  + \beta_{21}I(\text{position = 2}) + \beta_{22}I(\text{position = 3})
\end{aligned}`
$$
]

- `$p(\mathbf{C})$` is estimated from this `$\uparrow$` model for every patient.

---
class: middle

### The PS model estimates:

.med[
<div style="border: 1px solid #ddd; padding: 0px; overflow-y: scroll; height:400px; overflow-x: scroll; width:780px; "><table class="table table-striped" style="margin-left: auto; margin-right: auto;">
 <thead>
  <tr>
   <th style="text-align:left;position: sticky; top:0; background-color: #FFFFFF;"> term </th>
   <th style="text-align:right;position: sticky; top:0; background-color: #FFFFFF;"> estimate </th>
   <th style="text-align:right;position: sticky; top:0; background-color: #FFFFFF;"> std.error </th>
   <th style="text-align:right;position: sticky; top:0; background-color: #FFFFFF;"> statistic </th>
   <th style="text-align:right;position: sticky; top:0; background-color: #FFFFFF;"> p.value </th>
   <th style="text-align:right;position: sticky; top:0; background-color: #FFFFFF;"> conf.low </th>
   <th style="text-align:right;position: sticky; top:0; background-color: #FFFFFF;"> conf.high </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> (Intercept) </td>
   <td style="text-align:right;"> 2.95799 </td>
   <td style="text-align:right;"> 0.61771 </td>
   <td style="text-align:right;"> 4.78866 </td>
   <td style="text-align:right;"> 0.00000 </td>
   <td style="text-align:right;"> 1.74981 </td>
   <td style="text-align:right;"> 4.17252 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> age </td>
   <td style="text-align:right;"> -0.00044 </td>
   <td style="text-align:right;"> 0.00993 </td>
   <td style="text-align:right;"> -0.04434 </td>
   <td style="text-align:right;"> 0.96463 </td>
   <td style="text-align:right;"> -0.01993 </td>
   <td style="text-align:right;"> 0.01902 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> gender1 </td>
   <td style="text-align:right;"> 0.10435 </td>
   <td style="text-align:right;"> 0.15103 </td>
   <td style="text-align:right;"> 0.69093 </td>
   <td style="text-align:right;"> 0.48961 </td>
   <td style="text-align:right;"> -0.19160 </td>
   <td style="text-align:right;"> 0.40063 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> smoke1 </td>
   <td style="text-align:right;"> 0.11862 </td>
   <td style="text-align:right;"> 0.15107 </td>
   <td style="text-align:right;"> 0.78518 </td>
   <td style="text-align:right;"> 0.43235 </td>
   <td style="text-align:right;"> -0.17753 </td>
   <td style="text-align:right;"> 0.41508 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> smoke2 </td>
   <td style="text-align:right;"> 0.09342 </td>
   <td style="text-align:right;"> 0.10381 </td>
   <td style="text-align:right;"> 0.89997 </td>
   <td style="text-align:right;"> 0.36814 </td>
   <td style="text-align:right;"> -0.11000 </td>
   <td style="text-align:right;"> 0.29704 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: white !important;background-color: #D7261E !important;"> hospital2 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -0.23564 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.14244 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -1.65436 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.09805 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -0.51512 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.04341 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: white !important;background-color: #D7261E !important;"> hospital3 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.30046 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.13851 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 2.16923 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.03007 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.02923 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.57236 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: white !important;background-color: #D7261E !important;"> hospital4 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -0.11175 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.19237 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -0.58092 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.56129 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -0.48898 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.26526 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> nodules </td>
   <td style="text-align:right;"> -0.02988 </td>
   <td style="text-align:right;"> 0.01821 </td>
   <td style="text-align:right;"> -1.64105 </td>
   <td style="text-align:right;"> 0.10079 </td>
   <td style="text-align:right;"> -0.06594 </td>
   <td style="text-align:right;"> 0.00550 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> mets </td>
   <td style="text-align:right;"> 0.06514 </td>
   <td style="text-align:right;"> 0.04672 </td>
   <td style="text-align:right;"> 1.39423 </td>
   <td style="text-align:right;"> 0.16325 </td>
   <td style="text-align:right;"> -0.02641 </td>
   <td style="text-align:right;"> 0.15680 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> duration </td>
   <td style="text-align:right;"> 0.00261 </td>
   <td style="text-align:right;"> 0.00489 </td>
   <td style="text-align:right;"> 0.53354 </td>
   <td style="text-align:right;"> 0.59366 </td>
   <td style="text-align:right;"> -0.00698 </td>
   <td style="text-align:right;"> 0.01218 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: white !important;background-color: #D7261E !important;"> maxdia </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -2.26045 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.08774 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -25.76373 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.00000 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -2.43490 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> -2.09089 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> primary2 </td>
   <td style="text-align:right;"> 0.09928 </td>
   <td style="text-align:right;"> 0.29102 </td>
   <td style="text-align:right;"> 0.34114 </td>
   <td style="text-align:right;"> 0.73300 </td>
   <td style="text-align:right;"> -0.46959 </td>
   <td style="text-align:right;"> 0.67330 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> primary3 </td>
   <td style="text-align:right;"> 0.14593 </td>
   <td style="text-align:right;"> 0.29460 </td>
   <td style="text-align:right;"> 0.49536 </td>
   <td style="text-align:right;"> 0.62035 </td>
   <td style="text-align:right;"> -0.42937 </td>
   <td style="text-align:right;"> 0.72768 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> primary4 </td>
   <td style="text-align:right;"> -0.00213 </td>
   <td style="text-align:right;"> 0.38021 </td>
   <td style="text-align:right;"> -0.00559 </td>
   <td style="text-align:right;"> 0.99554 </td>
   <td style="text-align:right;"> -0.74649 </td>
   <td style="text-align:right;"> 0.74543 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> primary5 </td>
   <td style="text-align:right;"> -0.22135 </td>
   <td style="text-align:right;"> 0.37933 </td>
   <td style="text-align:right;"> -0.58353 </td>
   <td style="text-align:right;"> 0.55953 </td>
   <td style="text-align:right;"> -0.96499 </td>
   <td style="text-align:right;"> 0.52355 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> primary6 </td>
   <td style="text-align:right;"> 0.04505 </td>
   <td style="text-align:right;"> 0.34609 </td>
   <td style="text-align:right;"> 0.13016 </td>
   <td style="text-align:right;"> 0.89644 </td>
   <td style="text-align:right;"> -0.63128 </td>
   <td style="text-align:right;"> 0.72712 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: white !important;background-color: #D7261E !important;"> primary7 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 1.57439 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.64793 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 2.42989 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.01510 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.36040 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 2.92946 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> primary8 </td>
   <td style="text-align:right;"> 0.21261 </td>
   <td style="text-align:right;"> 0.38273 </td>
   <td style="text-align:right;"> 0.55552 </td>
   <td style="text-align:right;"> 0.57854 </td>
   <td style="text-align:right;"> -0.53651 </td>
   <td style="text-align:right;"> 0.96525 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> primary9 </td>
   <td style="text-align:right;"> 0.33223 </td>
   <td style="text-align:right;"> 0.40566 </td>
   <td style="text-align:right;"> 0.81899 </td>
   <td style="text-align:right;"> 0.41279 </td>
   <td style="text-align:right;"> -0.46107 </td>
   <td style="text-align:right;"> 1.13046 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: white !important;background-color: #D7261E !important;"> position2 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.94096 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.10080 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 9.33491 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.00000 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.74445 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 1.13969 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: white !important;background-color: #D7261E !important;"> position3 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 1.22135 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.11678 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 10.45840 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.00000 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 0.99365 </td>
   <td style="text-align:right;font-weight: bold;color: white !important;background-color: #D7261E !important;"> 1.45155 </td>
  </tr>
</tbody>
</table></div>
]

???

As expected, hospital, diameter of largest nodule, and position of the nodule are strong predictors of the treatment.

---
class: inverse
background-image: url("./fig/scoreoverlap.png")
background-position: 50% 50%
background-size: contain

---
class: middle

## How do we use the propensity scores?

- If controlling `$\mathbf{C}$` is deemed sufficient, then controlling for `$p(\mathbf{C})$` is also sufficient.

- `$p(\mathbf{C})$` is a scalar, so controlling for it is much easier than controlling for `$\mathbf{C}$`.

- Ways of using the propensity scores include: 
  - stratification;
  - adjustment;
  - matching;
  - re-weighting (inverse-weighting).

---
class: middle

### Stratification (1)

If we subclassifying the patients according to quartiles of `$\hat{p}(\mathbf{C})$`:

.small[
| PS subclasses | Treatment | N of patients | Prob of death or <br>disease progression (%) | Risk difference (%) |
|:-------------:|:---------:|:-------------:|:--------------------------------------------:|:-------------------:|
|       1       |  Standard |      788      |                     37.9                     |         13.1        |
|               |    RFA    |      100      |                     51.0                     |                     |
|       2       |  Standard |      547      |                     30.7                     |         6.0         |
|               |    RFA    |      341      |                     36.7                     |                     |
|       3       |  Standard |      344      |                     25.6                     |         -4.8        |
|               |    RFA    |      544      |                     20.8                     |                     |
|       4       |  Standard |      169      |                     22.5                     |        -13.4        |
|               |    RFA    |      718      |                      9.1                     |                     |
]

---
class: middle

### Stratification (2)

- Our suspicion of confounding is seen clearly, the probability of death or disease progression is higher in the lower categories of the propensity score.

- There is effect modification by the propensity score. RFA is better than standard surgery only in the upper two propensity score subclasses.

- The doctors are making choices for their patients. RFA are more likely to be given to patients who can benefit from the treatment.

---
class: middle

### Estimating  `$E\{Y(1) - Y(0)\}$` <br> (average causal/treatment effect, ACE)

The ACE can be estimated as the simple (unweighted) average of the 4 stratum-specific effects:

$$
\widehat{\text{ACE}} = \frac{13.1 + 6.0 - 4.8 - 13.4}{4} = 0.2 \%
$$

---
class: middle

### Estimating `$E\{Y(1) - Y(0) | X = 1\}$` <br> (average treatment effect on the **treated**, ATT)

.between[
| PS subclasses | Treatment | N of patients | Prob of death or <br>disease progression (%) | Risk difference (%) |
|:-------------:|:---------:|:-------------:|:--------------------------------------------:|:-------------------:|
|       1       |  Standard |      788      |                     37.9                     |         13.1        |
|               |    RFA    |      **100**      |                     51.0                     |                     |
|       2       |  Standard |      547      |                     30.7                     |         6.0         |
|               |    RFA    |      **341**      |                     36.7                     |                     |
|       3       |  Standard |      344      |                     25.6                     |         -4.8        |
|               |    RFA    |      **544**      |                     20.8                     |                     |
|       4       |  Standard |      169      |                     22.5                     |        -13.4        |
|               |    RFA    |      **718**      |                      9.1                     |                     |
]

.small[
$$
\widehat{\text{ATT}} = \frac{-13.1 \times 100 + 6.0 \times 341 - 4.8 \times 544 - 13.4\times718}{100 + 341 + 544 + 718} = -5.2\%
$$
]

???

The results shows that RFA is beneficial to those who tend to get the treatment (consistent with the effect modification)

---
class: middle, center

# Hold on,

--
# Before we move on to other methods of using the PS.

--
# We should bear in mind that we assumed the model for PS generation was correctly chosen.

---
class: middle, center

## Other than that,

--
## we also assumed that **all confounders** were included

--
## plus, some people may believe including non-confounders but predictive of the X is beneficial for precision.

---
class: middle

## Finally,

--
## How do we know if there is any other un-observed confounders?

---
class: middle
## Check for un-observed confounders (1)

- Researchers should routinely test for residual confounding or **endogeneity** (even after PS matching). <br>
This can be done using residuals of the PS model.

.med[
$$
`\begin{aligned}
\text{logit}\{ \text{Pr(RFA}|\mathbf{C}) \} & =  \beta_0 + \beta_1\text{age} + \dots + \color{red}{\epsilon}
\end{aligned}`
$$
]

---
class: middle
## Check for un-observed confounders (2)

- If the outcome equation is as follows (linear model here for illustration), where `$\nu$` is the residuals.

.med[
$$
`\begin{aligned}
Y & =  \alpha_0 + \alpha_1\text{age} + \dots + \color{red}{\nu}
\end{aligned}`
$$
]

- The following assumption should hold

`$$\small{cor(\text{logit}\{ \text{Pr(RFA}|\mathbf{C}) \},\nu)=cor(\epsilon,\nu)=0}$$`

---

## Hausman test (Endogeneity test)

- Null hypothesis is as follows:

`$$\small{cor(\text{logit}\{ \text{Pr(RFA}|\mathbf{C} )\},\nu)=cor(\epsilon,\nu)=0}$$`

- Equivalent to test `$\color{blue}{\delta}=0$` in the following equation (using usual tests):

$$
`\begin{aligned}
Y =  \alpha_0 + \alpha_1\text{age} + ...+\color{blue}{\delta} \epsilon +\color{red}{\nu}
\end{aligned}`
$$

---
class: middle

## Alternative #1: (1)<br> Differences-in-Differences methods

Other than fitting models to calculate propensity scores, we may do the following:

- Non-parametric alternative

- Require 2 periods in a same patient:

1.  A baseline period in the same period when neither the intervention nor the comparator are observed
2.  A period the intervention or the comparator is attributed

---
class: middle

## Alternative #1: (2)

- The outcome equations for the respective periods are:

$$
`\begin{aligned}
Y_{i1} & =  B_0 + B_1 X_{i1}+\mathbf{B_2 \lambda_{i}} +\epsilon_{i1} \\
Y_{i2} & = B_0 + B_1 X_{i2}+\mathbf{B_2 \lambda_{i}} + \color{red}{B_3 T_i}+\epsilon_{i2}
\end{aligned}`
$$

- The effect for each patient is estimated as follows:

`$$Y_{i2}-Y_{i1} = B_1(X_{i2}-X_{i1})+B_3T_i+(\epsilon_{i2}-\epsilon_{i1})$$`

---
class: middle

### Pros

- remove unobserved fixed effects

- instrumental variables are not required

- possibility to combine DID with PS matching (**Heckman et al.**)

### Cons

- DID estimator interpretation is not clear in the presence of within-treatment heterogeneity (differential effect in subgroups of non-observed characteristics)

---
class: middle

## Alternative #2: Endogenous switching regression (Econometric approach)

- Use in practice when heterogeneity is observed

- Compared to propensity score model, the selection process is modeled and counterfactuals are considered to estimate the effects

- ATE (average treatment effect), ATT, **ATU** (Average treatment effect on untreated population), heterogeneity for each outcome

---
class: middle

## Definitions of the different effects

- Average treatment among the untreated

- Effect of heterogeneity in the treated

- Effect of heterogeneity in the untreated

---
class: middle

### General model framework (1)

- The selection process is based on the estimation of utility:

`$A_i^{*}=Z_i\alpha+\eta_i$` **(1)**, with `$Z_i$` as observed variables (at least one instrument is required)

---
class: middle

### General model framework (2)

- In a second step, a switching regression model **(2)** is estimated for each subpopulation `$A_i$`:

$$
`\begin{aligned}
Y_{1i} & =  X_{1i}\beta_1+\epsilon_{1i} \\
Y_{0i} & =  X_{0i}\beta_0+\epsilon_{0i}
\end{aligned}`
$$

- Correlations between residuals in **(1)** and **(2)** are nonzero (presence of heterogeneity).

$$
Cor(\eta,\epsilon) \neq 0
$$

---
class: middle

### Pros

- Assessment of ATU and BH effects: policy/intervention on untreated and heterogeneity might be important in decision-making

- Control for both the structural relationships of the covariates with outcomes, and the sample composition of both groups

### Cons

- Difficult to find a good instrument in practice (variable included in the selection process not related to the outcome directly)

---
class: middle, center

# The end for the first day

### Slide address: <br>[https://wangcc.me/PSA-CSS](https://wangcc.me/PSA-CSS)

### Slide for Day 2 (practical): <br>[http://wangcc.me/PSA-CSS-Day2/](http://wangcc.me/PSA-CSS-Day2/)