Procedure: Predict functions of simulator outputs using multiple independent emulators

Description and Background

Where separate, independent emulators have been built for different simulator outputs, there is often interest in predicting some function(s) of those outputs. The procedures here are for making such predictions. We assume that, whatever method was used to build each emulator the corresponding toolkit thread also describes how to make predictions for that emulator alone.

The individual emulators may be Gaussian process (GP) or Bayes linear (BL) emulators, although some of the specific procedures given here will only be applicable to GP emulators.

Inputs

  • Emulators for \(r\) simulator outputs \(f_u(x)\), \(u=1,2,\ldots,r\)
  • \(r'\) prediction functions \(f_w^*(x)=g_w\{f_1 (x),\ldots,f_r(x)\}\), \(w=1,2,\ldots,r'\)
  • A single point \(x^\prime\) or a set of points \(x^\prime_1, x^\prime_2,\ldots,x^\prime_{n^\prime}\) at which predictions are required for the prediction function(s)

Outputs

  • Predictions in the form of statistical quantities, such as expected values, variances and covariances or a sample of values from the (joint) predictive distribution of the prediction function(s) at the required prediction points

Procedures

The simplest case is when the prediction functions are linear in the outputs. Then

\[f_w^*(x)=a_w + f(x)^T b_w,\]

where \(a_w\) is a known constant, \(f(x)\) is the vector of \(r\) outputs \(f_1(x),\ldots,f_r(x)\) and \(b_w\) is a known \(r\times 1\) vector. It is straightforward to derive means, variances and covariances for the functions \(f_w^*(x)\) at the prediction point(s) when the prediction functions are linear. For nonlinear functions, the procedure is to draw a large sample from the predictive distribution and to compute means, variances and covariances from this sample (but note that this is only possible for GP emulators).

Predictive means

Suppose that we have linear prediction functions. Let the \(n'\times 1\) predictive mean vector for the \(u\)-th emulator at the \(n'\) prediction points be \(m_u\). (If we only wish to predict a single point, then this is a scalar.) Let the \(n'\times r\) matrix \(M\) have columns \(m_1,\ldots,m_r\). Then the predictive mean (vector) of \(f_w^*(x)\) at the \(n'\) prediction points is \(a_w 1_{n'} + M\,b_w\), where \(1_{n'}\) denotes a vector of \(n'\) ones, so that \(a_w 1_{n'}\) is a \(n'\times 1\) vector with all elements equal to \(a_w\).

Predictive variances and covariances

Suppose that we have linear prediction functions. Let the \(n'\times n'\) predictive variance matrix for the \(\strut u\)-th emulator at the \(n'\) prediction points be \(V_u\). (If we only wish to predict a single point, then this is a scalar.) Let \(b_{uw}\) be the \(\strut u\)-th element of \(b_w\). Then the variance matrix for \(f_w^*(x)\) at the \(n'\) prediction points is

\[\sum_{u=1}^r b_{uw}^2 V_u,\]

and the covariance matrix between \(f_w^*(x)\) and \(f_{w'}^*(x)\) at those points is

\[\sum_{u=1}^r b_{uw} b_{uw'} V_u.\]

Sample of predictions

Suppose we wish to draw a sample of \(N\) values from the (joint) predictive distribution of the prediction functions at the input \(x^\prime\), or at the points \(x^\prime_1, x^\prime_2,\ldots,x^\prime_{n^\prime}\). For GP emulators, such samples can be drawn from the predictive distributions of the individual outputs. Let \(f_u^{(I)}(x'_t)\) be the \(I\)-th sampled value of \(f_u(x)\) at \(t\)-th prediction point.

Then the \(I\)-th sampled value, \(I=1,2,\ldots,N\), of \(f^*_w(x'_t)\) is \(g_w\{f_1^{(I)}(x'_t),\ldots,f_r^{(I)}(x'_t)\}\).