This research is carried out within the framework of MATHEON, supported by the Einstein Foundation Berlin.
Introduction
In biotechnology, systems biology, and chemical engineering one is faced with large systems of ordinary differential equations (ODEs) that are used to describe the kinetics of the reaction network of interest. These models contain a large number of unknown parameters that need to be inferred from experimental data. The associated parameter identification problem is an inverse problem in which knowledge about the uncertainty of the estimated parameters is of utmost importance, e.g., for designing further experiments. Classical parameter identification offers no means to quantify such uncertainties and, in addition, faces the problem of non-identifiable parameters, especially in realistic biological models. Therefore, Bayesian approaches, which have become feasible thanks to the computational power of modern computers, have experienced a major revival in the 21st century. A major disadvantage of these methods, however, is the requirement of prior knowledge about the distribution of the parameters, which has led to considerable criticism concerning the objectivity of Bayesian inference.
Classical estimators (ML, Gauß-Newton):
+ are computationally inexpensive
- usually cannot identify all parameters
- lack means to quantify uncertainty
- can result in ill-posed problems
- do not incorporate prior knowledge about parameters

Bayesian inference:
- is computationally more expensive
+ provides distributions for all parameters
+ can quantify uncertainty
+ leads to well-posed problems [Stuart 2010]
+ incorporates prior knowledge
- requires prior knowledge even when it is not accessible
Fortunately, there are other methods, called empirical Bayes methods, that can tackle this problem by recovering the prior distribution from the data in certain cases, namely if the data contains measurements for several individuals (patients). Note that in such a case the two approaches above would treat each patient separately when computing patient-specific parametrizations, making no use of the fact that data is available for several patients. Empirical Bayes methods incorporate the data of all patients to first estimate the prior distribution of the parameters and then use standard Bayesian inference for patient-specific parameter estimation.
We aim to implement such methods for large systems-biological models such as the human menstrual cycle. Our workflow has the following structure (a minimal sketch of steps (1) and (3) follows the list):
(1) Compute the likelihood functions from the data of each patient.
(2) Construct a prior distribution as described by the algorithm below.
(3) Use Bayesian inference to compute the posteriors for each patient from the prior and the individual likelihoods.
(4) Make patient-specific predictions, e.g., success rates of certain treatments.
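The following sketch illustrates steps (1) and (3) for a single patient on a one-dimensional parameter grid. The toy ODE, the Gaussian noise model, and all names (decay_model, sigma_noise, ...) are illustrative assumptions and not the actual menstrual cycle model, which is far higher-dimensional.

```python
import numpy as np
from scipy.integrate import solve_ivp

t_obs = np.array([0.0, 1.0, 2.0, 4.0, 8.0])      # measurement times
y_obs = np.array([1.0, 0.62, 0.40, 0.15, 0.02])  # one patient's data
sigma_noise = 0.05                               # assumed measurement noise

def decay_model(theta):
    """Simulate a toy one-parameter kinetic model y' = -theta * y."""
    sol = solve_ivp(lambda t, y: -theta * y, (0.0, 8.0), [1.0], t_eval=t_obs)
    return sol.y[0]

theta_grid = np.linspace(0.01, 2.0, 400)         # shared parameter grid
dtheta = theta_grid[1] - theta_grid[0]

# Step (1): (log-)likelihood of each grid point for this patient.
def log_likelihood(theta):
    resid = y_obs - decay_model(theta)
    return -0.5 * np.sum((resid / sigma_noise) ** 2)

log_lik = np.array([log_likelihood(th) for th in theta_grid])

# Step (3): Bayes' rule on the grid, posterior proportional to likelihood * prior.
prior = np.ones_like(theta_grid)                  # wide, non-informative start
posterior = np.exp(log_lik - log_lik.max()) * prior
posterior /= posterior.sum() * dtheta             # normalize to a density
```

In the full workflow, the prior used in step (3) would not be the flat placeholder above but the output of the prior estimation algorithm described next.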
Algorithm for the Prior Estimation
The estimation of the prior is realized via an iterative algorithm. Starting with a "wide, non-informative" prior p0, we repeat the following steps for n = 0, 1, ...:
(1) Compute the posterior distribution for each patient (standard Bayesian inference) with respect to the current prior pn.
(2) Choose the updated prior pn+1 as the (pointwise) mean of these posteriors.
This algorithm can be proven to maximize the marginal likelihood. However, for a finite amount of data it starts to produce spurious peaks at certain parametrizations, which can also be shown theoretically. We are currently working on removing these peaks by regularization techniques. A grid-based sketch of the iteration is given below.
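This is a minimal sketch of the prior-update iteration on a shared parameter grid, assuming each patient's likelihood has already been tabulated on that grid (e.g., as in the sketch above). The function name, array layout, and fixed iteration count are illustrative assumptions.

```python
import numpy as np

def estimate_prior(likelihoods, theta_grid, n_iter=50):
    """likelihoods: array of shape (n_patients, n_grid) with L_i(theta)."""
    dtheta = theta_grid[1] - theta_grid[0]
    prior = np.ones_like(theta_grid)
    prior /= prior.sum() * dtheta                 # wide, non-informative p0
    for _ in range(n_iter):
        # Step (1): patient-specific posteriors under the current prior pn.
        posteriors = likelihoods * prior          # broadcast over patients
        posteriors /= posteriors.sum(axis=1, keepdims=True) * dtheta
        # Step (2): new prior pn+1 = pointwise mean of the posteriors.
        prior = posteriors.mean(axis=0)
    return prior
```

In this form each update is a fixed-point step: the prior is replaced by the average of the patient-specific posteriors it induces, which is what drives the marginal likelihood upward.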
The application to the human menstrual cycle (modeled by a 33-dimensional ODE system with 114 unknown parameters) is shown below. Note that the algorithm is applied in the high-dimensional parameter space, while the results show the distribution of only one parameter, with all others integrated out.