This article by Grieve (2007) was published in Pharmaceutical Statistics. He started off by describing his personal journey into the world of pharmaceutical industry. He first joined Ciba-Geigy as a medical statistician and had an interest in developing the idea of Bayesian in the pharmaceutical R&D.
There is a section in this article, where the author described the basic of Bayesian idea, called “Bayesics“. One must be familiar with the terms such as: probability model for the data, prior probability, posterior probability, predictive distribution for future data, marginal posterior density, bivariate marginal density, posterior moments and highest posterior density.There are many areas of clinical trials where Bayesian has come into the picture. One of them is bioequivalence.
The aim of bioequivalence study is to determine whether two formulations of a drug are equivalent. Westlake (1972) claimed that hypothesis testing is not relevant. His claim was supported by Metzler (1972). Westlake argued that clinical pharmacologists work with ratios rather than difference and so confidence interval is more appropriate in that sense. Bayesian approach started to develop in the bioequivalence area in 1979 by three independent groups: Merck, Sharp and Dohme, Ciba-Geigy (US) and Ciba-Geigy (Swiss). In 1983, the ratio of the area under the curve (AUC) was commonly accepted as 0.8 and 1.2 and that 95% confidence interval is required for that ratio. Bayesian approach advocates the use of credible interval or posterior probability.
In the situation where a small single bioequivalence study is being conducted, proving that two drugs are similar in efficacy would be difficult. So, a 2-stage Bayesian approach is adopted. The first stage is called the screening stage and the sample size chosen here is small. Sample size in the second stage is chosen based on the result of the screening stage. This is an early example of adaptive design and making use of predictive distribution.
The standard approach to bioequivalence today is by conducting two “one-sided t-test”and so simultaneous rejection will indicate that two treatments are equivalent. Bauer and Bauer (1994) raised a question where the means of those two could possibly be close but the variances on the other hand, are large. They proposed testing an extra two “one-sided F test”; so there will be four null hypotheses altogether to be rejected to be able to claim bioequivalence. In the Bayesian perspective, this can be done by determining the joint posterior of say, the difference in means and the ratio of variances.
Grieve did not review the implementation of Bayesian method in non-inferiority trial. The non-inferiority term was first introduced by Blackwelder (1982). However, he quoted a statement from Susan Ellenberg: “area of non-inferiority is one that really lends itself to the Bayesian approach”, which pretty much summarized up Grieve’s idea.
In the late 1970s, the crossover design received many attention. The basic design is by having 2 treatments with 2 periods. The FDA however, said that it is not a good design when “unequivocal evidence” of a treatment effect was required. In the 1980s, Grieve started to work on Bayesian approach in the crossover design. Bayes factor is used to determine whether carryover effect should be included or not. In this section, Behrens-Fisher distribution came up as well. The problem of Behrens-Fisher is about interval estimation and hypothesis testing betwen the means of two normally distributed population, when the population variances are assumed unequal and come from independent samples.
In the early to mid 1980s, analytic approximation to posterior quantities of interest were being studied. There are four approaches to analytic approximation:
Specific expansion for ratios of integral to determine posterior moments and predictive distributions: This method requires calculation up to third order derivatives of log likelihood function, which is cumbersome. It also provides no correction to variance and covariance.
Laplace approximation to integrals: The advantage of this compared to the first one, is it only evaluates up to second derivative only. It is also possible to find marginal distribution by integrating over a subset of the parameters in the numerator.
Importance sampling: The idea is to choose a density of the parameter that approximates the probability model of the data. The efficiency depends on how well the approximation is.
Iterated Gaussian quadrature: Basic assumption of this method is posterior can be approximated by the multiplication of normal density and low order polynomial. In practice, we would not know the posterior mean and the covariance matrix and so Gaussian quadrature will update the estimates till it converges.
All the approaches described above behave well in small-dimensional problems; that is up to 6-10 parameters. If the parameters increased, the best way is to use Gibbs sampling or MCMC.
In the early 1980s, time was spent to studying the analysis of animal toxicology. The objective of this study is to estimate a dose-response curve and to determine LD50, a dose that kills 50% of the animals exposed to it in a fixed time interval. For this kind of design, logit and probit model are used and parameters of the model are estimated using the maximum likelihood estimator, weighted least squares and Fieller’s theorem. Fieller’s theorem is used to find the confidence interval for the log of LD50. Technical issues are noted to arise when using Fieller’s method: the interval consists of whole real line and as a result, little is learnt about LD50. This is said due to a small number of experimental units being studied. Grieve hoped that Bayesian could circumvent this issue.
Systematic investigation regarding the use of Bayesian in the drug development was carried out in 1982. The work was published in Racine et al. (1986). The obstacles identified in incorporating Bayesian approach are related to numerical implementation, utilizing priors in regulated environment and conservatory attitute among pharmaceutical statisticians and respective colleagues. Compared to the same paper read by John Lewis in 1983, Racine et al. (1986) was received in defeaning silence by the Royal Statistical Society.
The MCMC revolution in the 1990s enable multi-parameter problems to be tackled without having to simplify the models or the assumptions. The general form of MCMC is based on Metropolis-Hastings algorithm which is a special case of Gibbs sampler. The idea of Gibbs sampler is like this: say we have k parameters of interest and we want to sample from the posterior density. Starting values for the parameters are given and sampling is done in sequence for a large number of iterations, say 10 000. In practice, the first say, 1000 iterations are to be discarded, also called as “burn-in”. For a more complex problems the number of discarded iterations could be more than 1000. The mean of values from the sample is the estimate of the posterior mean. The same goes for the posterior variance. The kernel estimate is used to construct the graph of a posterior.
Other than the issue of efficacy and safety, one question that need to be answered is whether the drug can be manufactured consistently in terms of content, uniformity and level of contaminants. So, batches of drugs have to be analyzed and Bayesian approach can be implemented here. Historical data will be taken into consideration and a prior is formed. For example, one could assume that proportion of defectives in a particular batch has a beta distribution. The number to be sampled from the batch will be based on these calculation.
Adaptive dose-finding studies make an appearance in the 21st century, though the idea has already been discussed in 1933 by William Thompson. William described this design as “play-the-winner” design. There are two classes of design: one is to determine maximum tolerated dose in oncology studies and the other is to learn about dose-response curve when it can be non-monotonic.
Pharmacoeconomics is another area where concern arises due to escalating health-care costs. So, demonstration value for money is required and the method is by evaluating the cost data. From the Bayesian point of view, probability statements can be made about populations incremental mean values. John Stevens and Tony O’ Hagan are among the famous statisticians working in this area.
To conclude, Bayesian approach enable direct probability statements about parameters to be made and easier to be interpreted. Full accounting of uncertainty can be seen for example in using Bayes factor in the crossover design. Other than estimating and testing hypothesis, predicting is also of interest. Grieve suggested using predictive ideas. Predictive powers on the other hand is when it is conditioned on the observed data. Grieve quoted a statement from Stephen Senn: “nowhere will Bayesian statistics to be carried out with greater discipline than in the pharmaceutical industry”. Grieve predicted that Bayesian will be heading next to drug safety, playing the role in not just the development but the post-marketing as well.