This notebook demonstrates the problem with uplift curves when the true treatment effect is unknown, and how to address it by using TMLE estimates as a proxy for the true treatment effect.
%reload_ext autoreload
%autoreload 2
%matplotlib inline
import logging
from matplotlib import pyplot as plt
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split, KFold
import sys
import warnings
warnings.simplefilter("ignore", UserWarning)
from lightgbm import LGBMRegressor
sys.path.append('../')
import causalml
from causalml.dataset import synthetic_data
from causalml.inference.meta import BaseXRegressor, TMLELearner
from causalml.metrics.visualize import plot
from causalml.propensity import calibrate
print(causalml.__version__)
0.4.1
logger = logging.getLogger('causalml')
logger.setLevel(logging.DEBUG)
plt.style.use('fivethirtyeight')
# Generate synthetic data using mode 1; sigma=5. adds substantial outcome noise
# relative to the treatment effect
y, X, treatment, tau, b, e = synthetic_data(mode=1, n=1000000, p=10, sigma=5.)
(X_train, X_test,
 y_train, y_test,
 e_train, e_test,
 treatment_train, treatment_test,
 tau_train, tau_test,
 b_train, b_test) = train_test_split(X, y, e, treatment, tau, b,
                                     test_size=0.5, random_state=42)
# X Learner
# Train an X-learner with LightGBM base learners and predict CATE on the test set
learner_x = BaseXRegressor(learner=LGBMRegressor())
learner_x.fit(X=X_train, treatment=treatment_train, y=y_train)
cate_x_test = learner_x.predict(X=X_test, p=e_test, treatment=treatment_test).flatten()
alpha = 0.2
bins = 30
plt.figure(figsize=(12, 8))
plt.hist(cate_x_test, alpha=alpha, bins=bins, label='X Learner')
plt.hist(tau_test, alpha=alpha, bins=bins, label='Actual')
plt.title('Distribution of CATE Predictions by X-Learner and Actual')
plt.xlabel('Individual Treatment Effect (ITE/CATE)')
plt.ylabel('# of Samples')
_ = plt.legend()
# 'Actual' uses the true treatment effect as if it were a model's CATE estimate
df = pd.DataFrame({'y': y_test, 'w': treatment_test, 'tau': tau_test,
                   'X-Learner': cate_x_test, 'Actual': tau_test})
If the true treatment effect is known, as in simulations, the uplift curve of a model is the cumulative sum of the true treatment effect, sorted by the model's CATE estimate.
In the figure below, the uplift curve of the X-learner shows a positive lift close to the optimal lift achieved by the ground truth.
plot(df, outcome_col='y', treatment_col='w', treatment_effect_col='tau')
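For reference, the computation behind this curve can be sketched in a few lines: sort units by the model's CATE estimate in descending order and accumulate the true treatment effect. This is a minimal sketch of the standard definition, with a hypothetical helper name; it is not causalml's actual implementation, which also adds the random baseline shown in the plot.
def uplift_curve_known_tau(cate, tau):
    """Cumulative true treatment effect, sorted by the model's CATE estimate."""
    order = np.argsort(-cate)        # highest-scored units first
    return np.cumsum(tau[order])     # cumulative sum of the true effect

lift_known = uplift_curve_known_tau(df['X-Learner'].values, df['tau'].values)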
If the true treatment effect is unknown, as in practice, the uplift curve of a model instead uses the cumulative difference in mean outcomes between the treatment and control groups, sorted by the model's CATE estimate.
In the figure below, the uplift curves of both the X-learner and the ground truth incorrectly show no lift, because the outcome noise (sigma=5.) dominates the treatment effect in the observed outcomes.
plot(df.drop('tau', axis=1), outcome_col='y', treatment_col='w')
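For reference, the observed-outcome version can be sketched as follows: walk down the CATE ranking and, at each cutoff, estimate the lift as the difference in mean outcomes between the treated and control units seen so far, scaled by the number of units. Again, this is a minimal sketch of the common definition with a hypothetical helper name, not causalml's actual implementation.
def uplift_curve_observed(cate, y, w, n_points=100):
    """Estimated cumulative lift using observed outcomes only."""
    order = np.argsort(-cate)
    y_sorted, w_sorted = y[order], w[order] == 1
    lift = []
    for k in np.linspace(1, len(y), n_points).astype(int):
        y_top, w_top = y_sorted[:k], w_sorted[:k]
        if w_top.all() or not w_top.any():
            lift.append(np.nan)      # need both treated and control units
            continue
        # mean outcome difference among the top-k, scaled to k units
        lift.append((y_top[w_top].mean() - y_top[~w_top].mean()) * k)
    return np.array(lift)

lift_obs = uplift_curve_observed(df['X-Learner'].values, df['y'].values, df['w'].values)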
n_fold = 5
n_segment = 5
kf = KFold(n_splits=n_fold)

# Estimate the overall ATE on the test set with TMLE, using cross-fitted
# outcome models and calibrated propensity scores
tmle = TMLELearner(learner=LGBMRegressor(num_leaves=64, learning_rate=.05, n_estimators=300),
                   cv=kf, calibrate_propensity=True)
ate_all, ate_all_lb, ate_all_ub = tmle.estimate_ate(X=X_test, p=e_test, treatment=treatment_test, y=y_test)
ate_all, ate_all_lb, ate_all_ub
I1111 14:36:47.735486 4599510464 tmle.py:124] Estimating ATE for group 1.
I1111 14:36:47.737114 4599510464 tmle.py:129] Calibrating propensity scores.
I1111 14:36:53.161776 4599510464 tmle.py:136] Training an outcome model for CV #1
I1111 14:36:55.997822 4599510464 tmle.py:136] Training an outcome model for CV #2
I1111 14:36:58.315356 4599510464 tmle.py:136] Training an outcome model for CV #3
I1111 14:37:00.735098 4599510464 tmle.py:136] Training an outcome model for CV #4
I1111 14:37:03.180316 4599510464 tmle.py:136] Training an outcome model for CV #5
I1111 14:37:05.726341 4599510464 tmle.py:149] Training the TMLE learner.
(array([0.48793413]), array([0.44969964]), array([0.52616862]))
# Segment-wise ATEs, with segments defined by the quintiles of the true treatment effect
tmle = TMLELearner(learner=LGBMRegressor(num_leaves=64, learning_rate=.05, n_estimators=300),
                   cv=kf, calibrate_propensity=True)
ate_actual, ate_actual_lb, ate_actual_ub = tmle.estimate_ate(X=X_test, p=e_test, treatment=treatment_test, y=y_test, segment=pd.qcut(tau_test, n_segment, labels=False))
ate_actual, ate_actual_lb, ate_actual_ub
I1111 14:37:06.182612 4599510464 tmle.py:124] Estimating ATE for group 1.
I1111 14:37:06.183984 4599510464 tmle.py:129] Calibrating propensity scores.
I1111 14:37:11.492189 4599510464 tmle.py:136] Training an outcome model for CV #1
I1111 14:37:13.826500 4599510464 tmle.py:136] Training an outcome model for CV #2
I1111 14:37:16.256622 4599510464 tmle.py:136] Training an outcome model for CV #3
I1111 14:37:18.831965 4599510464 tmle.py:136] Training an outcome model for CV #4
I1111 14:37:21.290154 4599510464 tmle.py:136] Training an outcome model for CV #5
I1111 14:37:23.885609 4599510464 tmle.py:161] Training the TMLE learner for segment 0.
I1111 14:37:23.973082 4599510464 tmle.py:161] Training the TMLE learner for segment 1.
I1111 14:37:24.060353 4599510464 tmle.py:161] Training the TMLE learner for segment 2.
I1111 14:37:24.140305 4599510464 tmle.py:161] Training the TMLE learner for segment 3.
I1111 14:37:24.226202 4599510464 tmle.py:161] Training the TMLE learner for segment 4.
(array([[0.21359494, 0.37680156, 0.47705638, 0.54578996, 0.83998392]]), array([[0.1172803 , 0.30158022, 0.40692295, 0.45495861, 0.74524376]]), array([[0.30990959, 0.4520229 , 0.54718981, 0.63662131, 0.93472409]]))
# Segment-wise ATEs, with segments defined by the quintiles of the X-learner's CATE estimates
tmle = TMLELearner(learner=LGBMRegressor(num_leaves=64, learning_rate=.05, n_estimators=300),
                   cv=kf, calibrate_propensity=True)
ate_xlearner, ate_xlearner_lb, ate_xlearner_ub = tmle.estimate_ate(X=X_test, p=e_test, treatment=treatment_test, y=y_test, segment=pd.qcut(cate_x_test, n_segment, labels=False))
ate_xlearner, ate_xlearner_lb, ate_xlearner_ub
I1111 14:37:24.454401 4599510464 tmle.py:124] Estimating ATE for group 1.
I1111 14:37:24.456296 4599510464 tmle.py:129] Calibrating propensity scores.
I1111 14:37:29.791435 4599510464 tmle.py:136] Training an outcome model for CV #1
I1111 14:37:32.308604 4599510464 tmle.py:136] Training an outcome model for CV #2
I1111 14:37:35.668303 4599510464 tmle.py:136] Training an outcome model for CV #3
I1111 14:37:38.855995 4599510464 tmle.py:136] Training an outcome model for CV #4
I1111 14:37:41.786494 4599510464 tmle.py:136] Training an outcome model for CV #5
I1111 14:37:45.212327 4599510464 tmle.py:161] Training the TMLE learner for segment 0.
I1111 14:37:45.310906 4599510464 tmle.py:161] Training the TMLE learner for segment 1.
I1111 14:37:45.402297 4599510464 tmle.py:161] Training the TMLE learner for segment 2.
I1111 14:37:45.473446 4599510464 tmle.py:161] Training the TMLE learner for segment 3.
I1111 14:37:45.560100 4599510464 tmle.py:161] Training the TMLE learner for segment 4.
(array([[0.28326254, 0.41720096, 0.52000234, 0.5567797 , 0.67356684]]), array([[0.19190312, 0.33921252, 0.44215021, 0.46706366, 0.58111721]]), array([[0.37462195, 0.49518941, 0.59785446, 0.64649575, 0.76601646]]))
By using the TMLE estimates as a proxy for the ground truth, the uplift curves of the X-learner and the ground truth, constructed below from the segment-wise ATEs, become close to the original curves computed with the true treatment effect.
# Build cumulative lift curves from the segment-wise TMLE ATEs, walking from
# the highest-ranked segment down; each segment covers 1/n_segment of the
# population, and both curves end at the overall ATE
lift_actual = [0.] * (n_segment + 1)
lift_actual[n_segment] = ate_all[0]
lift_xlearner = [0.] * (n_segment + 1)
lift_xlearner[n_segment] = ate_all[0]
for i in range(1, n_segment):
    lift_actual[i] = ate_actual[0][n_segment - i] / n_segment + lift_actual[i - 1]
    lift_xlearner[i] = ate_xlearner[0][n_segment - i] / n_segment + lift_xlearner[i - 1]
pd.DataFrame({'Population': np.linspace(0, 1, n_segment + 1),
              'Actual': lift_actual,
              'X-Learner': lift_xlearner,
              'Random': np.linspace(0, 1, n_segment + 1) * lift_actual[-1]}).plot(
    x='Population', y=['X-Learner', 'Actual', 'Random'], figsize=(8, 8))
_ = plt.ylabel('Lift')