Implementation in `trialr`

Statistically, the authors assume that in a trial of \(k\) disease subtypes, the treatment effects are drawn from the same common normal distribution

\(\rho_1, ..., \rho_k \sim N(\mu, \sigma^2)\)

As is the convention in BUGS, the authors define normal distributions by a precision parameter \(\tau\) as opposed to the standard deviation parameter \(\sigma\) used here. We have re-specified the model to comply with the Stan convention of using standard deviation. The authors use a normal hyperprior on \(\mu\), and a gamma hyperprior on \(\tau\), equivalent to an inverse gamma hyperprior on \(\sigma^2\).

We observe \(x_i\) responses in \(n_i\) patients in disease sub-type \(i\). The rate of response in subtype \(i\) is modelled as \(p_i = \text{logit}^{-1}(\rho_i)\). Each \(x_i\) is assumed to be binomially distributed with success parameter \(p_i\). In Stan, that relationship is described as x ~ binomial_logit(n, rho);

The treatment is judged to be worthy of further investigation in cohort \(i\) if

\(\text{Pr}\left\{ p_i > \theta | \mathcal{D} \right\} > q\)

where \(\theta\) is the minimum required response rate, and \(q\) is the required certainty to approve. In their provided example, Thall et al. (2003) use \(\theta = 0.3\).

Example

library(trialr)

We reproduce Thall etl al.’s example. We have outcomes in 10 disease subgroups. The number of responses is stored in group_responses and the number of patients in group_sizes. There have been 3 / 7 responses in subgroup 4, for example, but 0 / 2 responses in subgroup 2, and zero patients treated at all in subgroups 1, 6 and 10.

fit <- stan_hierarchical_response_thall(
  group_responses = c(0, 0, 1, 3, 5, 0, 1, 2, 0, 0), 
  group_sizes = c(0, 2 ,1, 7, 5, 0, 2, 3, 1, 0), 
  mu_mean = -1.3863,
  mu_sd = sqrt(1 / 0.1),
  tau_alpha = 2,
  tau_beta = 20)

fit

## Inference for Stan model: ThallHierarchicalBinary.
## 4 chains, each with iter=2000; warmup=1000; thin=1; 
## post-warmup draws per chain=1000, total post-warmup draws=4000.
## 
##                     mean se_mean    sd   2.5%    25%    50%    75%  97.5%
## mu                 -0.08    0.03  1.41  -2.93  -0.93  -0.05   0.83   2.61
## sigma2             12.33    0.34 10.55   3.31   6.25   9.55  14.59  37.93
## rho[1]             -0.04    0.07  3.82  -8.04  -2.32   0.11   2.28   7.33
## rho[2]             -3.03    0.06  2.60  -9.61  -4.26  -2.58  -1.29   0.72
## rho[3]              2.39    0.06  2.79  -1.99   0.54   1.98   3.80   9.07
## rho[4]             -0.31    0.01  0.81  -1.97  -0.83  -0.30   0.24   1.25
## rho[5]              3.70    0.06  2.30   0.51   2.07   3.28   4.85   9.42
## rho[6]             -0.09    0.06  3.74  -7.66  -2.30  -0.02   2.20   7.19
## rho[7]              0.00    0.02  1.54  -3.15  -0.95  -0.01   0.96   3.04
## rho[8]              0.76    0.02  1.29  -1.64  -0.11   0.70   1.56   3.51
## rho[9]             -2.42    0.06  2.76  -8.84  -3.87  -2.00  -0.55   1.87
## rho[10]            -0.07    0.06  3.80  -7.56  -2.30  -0.02   2.13   7.63
## sigma               3.31    0.04  1.16   1.82   2.50   3.09   3.82   6.16
## prob_response[1]    0.51    0.01  0.38   0.00   0.09   0.53   0.91   1.00
## prob_response[2]    0.15    0.00  0.19   0.00   0.01   0.07   0.22   0.67
## prob_response[3]    0.77    0.00  0.26   0.12   0.63   0.88   0.98   1.00
## prob_response[4]    0.43    0.00  0.17   0.12   0.30   0.43   0.56   0.78
## prob_response[5]    0.92    0.00  0.10   0.63   0.89   0.96   0.99   1.00
## prob_response[6]    0.49    0.01  0.38   0.00   0.09   0.49   0.90   1.00
## prob_response[7]    0.50    0.00  0.27   0.04   0.28   0.50   0.72   0.95
## prob_response[8]    0.64    0.00  0.23   0.16   0.47   0.67   0.83   0.97
## prob_response[9]    0.23    0.00  0.26   0.00   0.02   0.12   0.37   0.87
## prob_response[10]   0.49    0.01  0.38   0.00   0.09   0.49   0.89   1.00
## lp__              -34.37    0.12  3.62 -42.63 -36.50 -33.86 -31.71 -28.86
##                   n_eff Rhat
## mu                 2538    1
## sigma2              971    1
## rho[1]             3077    1
## rho[2]             1788    1
## rho[3]             2209    1
## rho[4]             3399    1
## rho[5]             1617    1
## rho[6]             3345    1
## rho[7]             4007    1
## rho[8]             3978    1
## rho[9]             2194    1
## rho[10]            3930    1
## sigma               930    1
## prob_response[1]   3395    1
## prob_response[2]   4261    1
## prob_response[3]   4504    1
## prob_response[4]   3663    1
## prob_response[5]   3523    1
## prob_response[6]   3829    1
## prob_response[7]   4393    1
## prob_response[8]   4278    1
## prob_response[9]   3590    1
## prob_response[10]  4379    1
## lp__                840    1
## 
## Samples were drawn using NUTS(diag_e) at Mon Apr  6 09:45:57 2020.
## For each parameter, n_eff is a crude measure of effective sample size,
## and Rhat is the potential scale reduction factor on split chains (at 
## convergence, Rhat=1).

The probability of response in each subgroup is calculated under prob_response. We can use the generic summary function from rstan to get a nice summary:

knitr::kable(rstan::summary(fit, par = 'prob_response')$summary, digits = 3)

	mean	se_mean	sd	2.5%	25%	50%	75%	97.5%	n_eff	Rhat
prob_response[1]	0.506	0.007	0.382	0.000	0.090	0.528	0.908	0.999	3395.262	1.001
prob_response[2]	0.150	0.003	0.187	0.000	0.014	0.071	0.216	0.673	4260.520	1.000
prob_response[3]	0.771	0.004	0.260	0.120	0.632	0.878	0.978	1.000	4504.057	1.000
prob_response[4]	0.434	0.003	0.173	0.122	0.305	0.426	0.560	0.777	3663.027	1.000
prob_response[5]	0.922	0.002	0.102	0.626	0.888	0.964	0.992	1.000	3523.139	1.000
prob_response[6]	0.495	0.006	0.379	0.000	0.091	0.495	0.900	0.999	3829.440	1.001
prob_response[7]	0.500	0.004	0.269	0.041	0.279	0.498	0.724	0.954	4393.447	1.000
prob_response[8]	0.638	0.003	0.228	0.163	0.472	0.669	0.827	0.971	4278.160	1.001
prob_response[9]	0.226	0.004	0.255	0.000	0.020	0.120	0.365	0.866	3589.985	1.002
prob_response[10]	0.495	0.006	0.379	0.001	0.091	0.495	0.894	1.000	4379.447	1.000

Let us say that we are willing to approve the treatment for further study in a subgroup if we have at least \(q = 70\)% certainty that the probability of efficacy exceeds the target response rate of 30%.

colMeans(as.data.frame(fit, pars = 'prob_response') > 0.3)

##  prob_response[1]  prob_response[2]  prob_response[3]  prob_response[4] 
##           0.60950           0.17900           0.91675           0.75700 
##  prob_response[5]  prob_response[6]  prob_response[7]  prob_response[8] 
##           0.99975           0.59700           0.72650           0.90025 
##  prob_response[9] prob_response[10] 
##           0.29700           0.60125

On that basis, at this interim stage, we would be eager to approve the treatment in subgroups 3, 5 and 8. Subgroups 4 and 7 are close to the boundary.

Some distribution plots of the probabilities of efficacy in some subgroups may be informative.

library(ggplot2)
library(rstan)
library(dplyr)

plot(fit, pars = 'prob_response') + 
  geom_vline(xintercept = 0.3, col = 'orange', linetype = 'dashed') +
  labs(title = 'Partially-pooled analysis of response rate in 10 sarcoma subtypes')

Prob(Response | D) in subgroup 3

We see that the inferred efficacy in subgroup 5 is very high. In contrast, efficacy is subject to more uncertainty in subgroup 4, but the majority of the mass clearly lies to the right of 30%.

Hierarchical Bayesian model for binary responses

Thall et al. (2003) described a method for analysing treatment effects of a common intervention in several sub-types of a single disease. The treatment effects are assumed to be different but exchangeable and correlated. Observing good efficacy in one cohort, for example, increases one’s expectations of efficacy in other cohorts.

They demonstrate the hierarchical model in a trial with binary response outcomes and in another with time-to-event outcomes. This vignette describes the sarcoma example with binary response outcomes. The authors provide WinBUGS code in the appendix of their paper (Thall et al. 2003). We port their model to Stan and illustrate usage with the example given in their paper.

Hierarchical Bayesian Model for Binary Responses

Kristian Brock

2020-04-06

Hierarchical Bayesian model for binary responses

Implementation in `trialr`

Example

trialr

References

Hierarchical Bayesian Model for Binary Responses

Kristian Brock

2020-04-06

Hierarchical Bayesian model for binary responses

Implementation in trialr

Example

trialr

References

Implementation in `trialr`