Skip to main content

Advertisement

Log in

Multivariate probabilistic projections using imperfect climate models part I: outline of methodology

  • Published:
Climate Dynamics Aims and scope Submit manuscript

Abstract

We demonstrate a method for making probabilistic projections of climate change at global and regional scales, using examples consisting of the equilibrium response to doubled CO2 concentrations of global annual mean temperature and regional climate changes in summer and winter temperature and precipitation over Northern Europe and England-Wales. This method combines information from a perturbed physics ensemble, a set of international climate models, and observations. Our approach is based on a multivariate Bayesian framework which enables the prediction of a joint probability distribution for several variables constrained by more than one observational metric. This is important if different sets of impacts scientists are to use these probabilistic projections to make coherent forecasts for the impacts of climate change, by inputting several uncertain climate variables into their impacts models. Unlike a single metric, multiple metrics reduce the risk of rewarding a model variant which scores well due to a fortuitous compensation of errors rather than because it is providing a realistic simulation of the observed quantity. We provide some physical interpretation of how the key metrics constrain our probabilistic projections. The method also has a quantity, called discrepancy, which represents the degree of imperfection in the climate model i.e. it measures the extent to which missing processes, choices of parameterisation schemes and approximations in the climate model affect our ability to use outputs from climate models to make inferences about the real system. Other studies have, sometimes without realising it, treated the climate model as if it had no model error. We show that omission of discrepancy increases the risk of making over-confident predictions. Discrepancy also provides a transparent way of incorporating improvements in subsequent generations of climate models into probabilistic assessments. The set of international climate models is used to derive some numbers for the discrepancy term for the perturbed physics ensemble, and associated caveats with doing this are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  • Adler RF, Huffman GJ, Chang A, Ferraro R, Xie P, Janowiak J, Rudolf B, Schneider U, Curtis S, Bolvin D, Gruber A, Susskind J, Arkin P (2003) The version 2 global precipitation climatology project (GPCP) monthly precipitation analysis (1979–present). J Hydrometeorol 4:1147–1167

    Article  Google Scholar 

  • Allan RJ, Ansell TJ (2006) A new globally complete monthly historical gridded mean sea level pressure data set (HadSLP2): 1850–2003. J Clim 19:5816–5842

    Article  Google Scholar 

  • Allen MR, Tett SFB (1999) Checking for model consistency in optimal finger printing. Clim Dyn 15:419–434

    Article  Google Scholar 

  • Allen MR, Stott PA, Mitchell JFB, Schnur R, Delworth TL (2000) Quantifying the uncertainty in forecasts of anthropogenic climate change. Nature 407:617–620

    Article  Google Scholar 

  • Aumann HH, Chahine MT, Gautier C, Goldberg MD, Kalnay E, McMillin LM, Revercomb H, Rosenkranz PW, Smith WL, Staelin DH, Strow LL, Susskind J (2003) AIRS/AMSU/HSB on the aqua mission: design, science objectives, data products, and processing systems. IEEE Trans Geosci Remote Sens 41:253–264

    Article  Google Scholar 

  • Barnett DN, Brown SJ, Murphy JM, Sexton DMH, Webb MJ (2006) Quantifying uncertainty in changes in extreme event frequency in response to doubled CO2 using a large ensemble of GCM simulations. Clim Dyn 26:489–511

    Article  Google Scholar 

  • Berry DI, Kent EC (2009) A new air-sea interaction gridded dataset from ICOADS with uncertainty estimates. Bull Am Meteorol Soc 90:645–656

    Article  Google Scholar 

  • Bony S, Dufresne JL (2005) Marine boundary layer clouds at the heart of cloud feedback uncertainties in climate models. Geophys Res Lett 32:L20806

    Article  Google Scholar 

  • Box GE (1980) Sampling and Bayes’ inference in scientific modelling and robustness. J R Stat Soc 143:383–430

    Article  Google Scholar 

  • Brohan P, Kennedy J, Harris I, Tett SFB, Jones PD (2006) Uncertainty estimates in regional and global observed temperature changes: a new dataset from 1850. J Geophys Res 111:D12106. doi:10.1029/2005JD006548

    Article  Google Scholar 

  • Collins M, Booth BBB, Bhaskaran B, Harris G, Murphy JM, Sexton DMH, Webb MJ (2010) A comparison of perturbed physics and multi-model ensembles: model errors, feedbacks and forcings. Clim Dyn doi:10.1007/s00382-010-0808-0

  • da Silva AM, Young CC, Levitus S (1994) Atlas of surface marine data, vol 1: algorithms and procedures. Tech Rep 6, U.S. Department of Commerce, NOAA, NESDIS

  • Draper NR, Smith H (1998) Applied regression analysis. Wiley Series in Probability and Statisics, 3rd edn. Wiley, New York

    Google Scholar 

  • Evans M, Swartz T (2000) Approximating integrals via Monte Carlo and deterministic methods, vol 20 of Oxford Statistical Science Series, 1st edn. Oxford University Press, Oxford

    Google Scholar 

  • Frame DJ, Booth BBB, Kettleborough JA, Stainforth DA, Gregory JM, Collins M, Allen MR (2005) Constraining climate forecasts: the role of prior assumptions. Geophys Res Lett 32:L09702. doi:10.1029/2004GL022241

    Article  Google Scholar 

  • Furrer R, Sain SR, Nychka D, Meehl GA (2007) Multivariate Bayesian analysis of atmosphere-ocean general circulation models. Environ Ecol Stat 13:249–266

    Article  Google Scholar 

  • Giorgi F, Francisco R (2000) Uncertainties in regional climate change predictions. A regional analysis of ensemble simulations with the HadCM2 GCM. Clim Dyn 16:169–182

    Article  Google Scholar 

  • Giorgi F, Mearns L (2003) Probability of regional climate change calculated using the reliability ensemble average (REA) method. Geophys Res Lett 30:1629–1632

    Article  Google Scholar 

  • Goldstein M, Rougier J (2004) Probabilistic formulations for transferring inferences from mathematical models to physical systems. SIAM J Sci Comput 26:467–487

    Article  Google Scholar 

  • Goldstein M, Rougier J (2006) Bayes linear calibrated prediction for complex systems. J Am Stat Assoc 101:1132–1143. doi:10.1198/016214506000000203

    Article  Google Scholar 

  • Greene A, Goddard L, Lall U (2006) Probabilistic multimodel regional temperature change projections. J Clim 19:4326–4343

    Article  Google Scholar 

  • Harris GR, Sexton DMH, Booth BBB, Collins M, Murphy JM, Webb MJ (2006) Frequency distributions of transient regional climate change from perturbed physics ensembles of general circulation model simulations. Clim Dyn 27:357–375. doi:10.1007/s00382-006-0142-8

    Article  Google Scholar 

  • Harrison EP, Minnis P, Barkstrom BR, Ramanathan V, Cess RD, Gibson GG (1990) Seasonal variation of cloud radiative forcing derived from the Earth Radiation Budget Experiment. J Geophys Res 95:18687–18703

    Article  Google Scholar 

  • Josey SA, Kent EC, Oakley D, Taylor PK (1996) A new global air-sea heat and momentum climatology. Int WOCE Newsl 24:3–5

    Google Scholar 

  • Joshi MM, Webb MJ, Maycock AC, Collins M (2010) Stratospheric water vapour and high climate sensitivity in a version of HadSM3 climate model. Atmos Chem Phys Discuss 10:7161–7167

    Article  Google Scholar 

  • Jun M, Knutti R, Nychka D (2008) Spatial analysis to quantify numerical model bias and dependence: how many climate models are there? J Am Stat Assoc 103:934–947

    Article  Google Scholar 

  • Kennedy M, O’Hagan A (2001) Bayesian calibration of computer models. J R Stat Soc B:425–464

    Article  Google Scholar 

  • Knutti R (2010) The end of model democracy? Clim Change doi:10.10007/s10584-010-9800-2

  • Knutti R, Meehl GA, Allen MR, Stainforth DA (2006) Constraining climate sensitivity from the seasonal cycle in surface temperature. J Clim 19:4224–4233

    Article  Google Scholar 

  • Lopez A, Tebaldi C, New M, Stainforth DA, Allen MR, Kettleborough JA (2006) Two approaches to quantify uncertainty in global temperature changes under different forcing scenarios. J Clim 19:4785–4796. doi:10.1175/JCLI3895.1

    Google Scholar 

  • McAvaney BJ, Le Treut H (2003) The cloud feedback intercomparison project: (CFMIP). CLIVAR exchanges—supplementary contributions

  • Meehl GA, Stocker TF, Collins WD, Friedlingstein P, Gaye AT, Gregory JM, Kitoh A, Knutti R, Murphy JM, Noda A, Raper SCB, Watterson IG, Weaver AJ, Zhao Z (2007) Global climate projections. In: Solomon S, Qin D, Manning M, Chen Z, Marquis M, Averyt KB, Tignor M, Miller HL (eds) Climate change 2007: the physical science basis. Contribution of Working Group I to the 4th Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press

  • Morse A, Prentice C, Carter T (2009) ENSEMBLES: climate change and its impacts: summary of research and results from the ENSEMBLES project. Met Office, Hadley Centre for Climate Prediction and Research, FitzRoy Road, Exeter, chapter 9 Assessments of climate change impacts, pp 107–129

  • Murphy JM, Sexton DMH, Barnett DN, Jones GS, Webb MJ, Collins M, Stainforth DA (2004) Quantification of modelling uncertainties in a large ensemble of climate change simulations. Nature 430:768–772

    Article  Google Scholar 

  • Murphy J, Booth B, Collins M, Harris G, Sexton D, Webb M (2007) A methodology for probabilistic predictions of regional climate change from perturbed physics ensembles. Phil Trans R Soc Lond 365:2133

    Article  Google Scholar 

  • Murphy JM, Sexton DMH, Jenkins GJ, Booth BBB, Brown CC, Clark RT, Collins M, Harris GR, Kendon EJ, Betts RA, Brown SJ, Humphrey KA, McCarthy MP, McDonald RE, Stephens A, Wallace C, Warren R, Wilby R, Wood RA (2009) UK climate projections science report: climate change projections. Met Office Hadley Centre, Exeter

  • New M, Lopez A, Dessai S, Wilby R (2007) Challenges in using probabilistic climate change information for impact assessments: an example from the water sector. Phil Trans R Soc Lond 365:2117–2132

    Article  Google Scholar 

  • Nounou MN, Bakshi BR, Goel PK, Shen XT (2002) Process modeling by Bayesian latent variable regression. Ambio 48:1775–1793

    Google Scholar 

  • O’Hagan A, Forster J (2004) Bayesian inference, vol 2b of Kendall’s Advanced Theory of Statistics, 2nd edn. Edward Arnold, London

    Google Scholar 

  • Piani C, Frame DJ, Stainforth DA, Allen MR (2005) Constraints on climate change from a multi-thousand member ensemble of simulations. Geophys Res Lett 32:L23825. doi:10.1029/2005GL024452

  • Pierce DW, Barnett TP, Santer BD, Gleckler PJ (2009) Selecting global climate models for regional climate change studies. Proc Natl Acad Sci USA 106:8441–8446

    Google Scholar 

  • Pope VD, Gallani ML, Rowntree PR, Stratton RA (2000) The impact of new physical parametrizations in the Hadley Centre climate model–HadAM3. Clim Dyn 16:123–146

    Article  Google Scholar 

  • Press WH, Teukolsky SA, Vetterling WT, Flannery BP (1992) Numerical recipes in Fortran: the art of scientific computing, 2nd edn. Cambridge University Press, Cambridge

    Google Scholar 

  • Randall DA, Wood RA, Bony S, Colman R, Fichefet T, Fyfe J, Kattsov V, Pitman A, Shukla J, Srinivasan J, Stouffer RJ, Sumi A, Taylor KE (2007) Climate models and their evaluation Climate Change 2007. In: Solomon S, Qin D, Manning M, Chen Z, Marquis M, Averyt KB, Tignor M, Miller HL (eds) The physical science basis. Contribution of Working Group I to the 4th Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press

  • Rayner NA, Parker DE, Horton EB, Folland CK, Alexander LV, Rowell DP, Kent EC, Kaplan A (2003) Global analyses of SST, sea ice and night marine air temperature since the late nineteenth century. J Geophys Res 108:4407. doi:10.1029/2002JD002670

    Google Scholar 

  • Rayner N, Brohan P, Parker DE, Folland CK, Kennedy J, Vanicek M, Ansell T, Tett SFB (2006) Improved analyses of changes and uncertainties in sea surface temperature measured in situ since the mid-nineteenth century. J Clim 19:446–469

    Article  Google Scholar 

  • Reichler T, Kim J (2008) How well do coupled models simulate today’s climate? Bull Am Meteorol Soc 89:303–311. doi:10.1175/BAMS-89-3-303

    Article  Google Scholar 

  • Robert CP, Casella G (2004) Monte Carlo statistical methods, 2nd edn. Springer, New York

    Google Scholar 

  • Rodwell MJ, Palmer TN (2007) Using numerical weather prediction to assess climate models. Q J R Meteorol Soc 133:129–146. doi:10.1002/qj.23

    Article  Google Scholar 

  • Rossow WB, Zhang YC (1995) Calculation of surface and top of atmosphere radiative fluxes from physical quantities based on ISCCP data sets. 2, validation and first results. J Geophys Res 100:1167–1197

    Article  Google Scholar 

  • Rougier J (2007) Probabilistic inference for future climate using an ensemble of climate model evaluations. Clim Change 81:247–264

    Article  Google Scholar 

  • Rougier J, Sexton DMH (2007) Inference in ensemble experiments. Phil Trans R Soc Lond 365:2133–2143. doi:10.1098/rsta.2007.2071

    Article  Google Scholar 

  • Rougier J, Sexton DMH, Murphy JM, Stainforth DA (2009) Analyzing the climate sensitivity of the HadSM3 climate model using ensembles from different but related experiments. J Clim 22:1327–1353

    Article  Google Scholar 

  • Sanderson BM (2011) A multi-model study of parametric uncertainty in response to rising greenhouse gases concentrations. J Clim 24:1362–1377 (submitted)

    Google Scholar 

  • Sanderson BM, Piani C, Ingram WJ, Stone DA, Allen MR (2008) Towards constraining climate sensitivity by linear analysis of feedback patterns in thousands of perturbed-physics GCM simulations. Clim Dyn 30:175–190. doi:10.1007/s00382-007-0280-7

    Article  Google Scholar 

  • Stainforth DA, Aina T, Christensen C, Collins M, Frame DJ, Kettleborough JA, Knight S, Martin A, Murphy J, Piani C, Sexton D, Smith LA, Spicer RA, Thorpe AJ, Allen MR (2005) Uncertainty in predictions of the climate response to rising levels of greenhouse gases. Nature 433:403–406

    Article  Google Scholar 

  • Stott PA, Kettleborough JA (2002) Origins and estimates of uncertainty in predictions of twenty first century temperature rise. Nature 416:723–726

    Article  Google Scholar 

  • Stott PA, Mitchell JFB, Allen MR, Delworth TL, Gregory JM, Meehl GA, Santer BD (2006) Observational constraints on past attributable warming and predictions of future global warming. J Clim 19:3055–3069

    Article  Google Scholar 

  • Tebaldi C, Lobell DB (2008) Towards probabilistic projections of climate change impacts on global crop yields. Geophys Res Lett 35:L08705. doi:10.1029/2008GL033423

  • Tebaldi C, Sanso B (2009) Joint projections of temperature and precipitation change from multiple climate models: a hierarchical bayesian approach. J R Stat Soc 172:83–106

    Article  Google Scholar 

  • Tebaldi C, Mearns LO, Nychka D, Smith RW (2004) Regional probabilities of precipitation change: a Bayesian analysis of multimodel simulations. Geophys Res Lett 31:L24213. doi:10.1029/2004GL021276

  • Uppala SM, Kållberg PW, Simmons AJ, Andrae U, daCosta Bechtold V, Fiorino M, Gibson JK, Haseler J, Hernandez A, Kelly GA, Li X, Onogi K, Saarinen S, Sokka N, Allan RP, Andersson E, Arpe K, Balmaseda MA, Beljaars ACM, van de Berg L, Bidlot J, Bormann N, Caires S, Chevallier F, Dethof A, Dragosavac M, Fisher M, Fuentes M, Hagemann S, Hólm E, Hoskins BJ, Isaksen L, Janssen PAEM, Jenne R, McNally AP, Mahfouf J-F, Morcrette J-J, Rayner NA, Saunders RW, Simon P, Sterl A, Trenberth KE, Untch A, Vasiljevic D, Viterbo P, Woollen J (2005) The ERA-40 re-analysis. Q J R Meteorol Soc 131:2961–3012. doi:10.1256/qj.04.176

    Article  Google Scholar 

  • Watterson IG (2008) Calculation of probability density functions for temperature and precipitation change under global warming. J Geophys Res 116:D07101. doi:10.1029/2007JD009254

  • Webb MJ, Senior CA, Sexton DMH, Ingram WJ, Williams KD, Ringer MA, McAvaney BJ, Colman R, Soden BJ, Gudgel R, Knutson T, Emori S, Ogura T, Tsushima Y, Andronova NG, Li B, Musat I, Bony S, Taylor KE (2006) On the contribution of local feedback mechanisms to the range of climate sensitivity in two GCM ensembles. Clim Dyn 27:17–38. doi:10.1007/s00382-006-0111-2

    Article  Google Scholar 

  • Wielicki B, Raj M, Sau S (1996) Clouds and the Earth’s Radiant Energy System (CERES): an earth observing system experiment. Bull Am Meteorol Soc 77:853–868

    Google Scholar 

  • Williams KD, Webb MJ (2009) A quantitative performance assessment of cloud regimes in climate models. Clim Dyn 33:141–157. doi:10.1007/s00382-008-0443-1

    Article  Google Scholar 

  • Williams KD, Ringer MA, Senior CA, Webb MJ, McAvaney BJ, Andronova N, Bony S, Dufresne J-L, Emori S, Gudgel R, Knutson T, Li B, Lo K, Musat I, Wegner J, Slingo A, Mitchell JFB (2006) Evaluation of a component of the cloud response to climate change in an intercomparison of climate models. Clim Dyn 26:145–165. doi:10.1007/s00382-005-0067-7

    Article  Google Scholar 

  • Wylie DP, Menzel WP (1999) Eight years of high cloud statistics using HIRS. J Clim 12(1):170–184

    Article  Google Scholar 

  • Xie P, Arkin PA (1996) Analyses of global monthly precipitation using gauge observations, satelite estimates and numerical model predections. J Clim 9:840–858

    Article  Google Scholar 

  • Yokohata T, Webb MJ, Collins M, Williams KD, Yoshimori M, Hargreaves JD, Annan JD (2010) Structural similarities and differences in climate responses to CO2 increase between two perturbed physics ensembles. J Clim 23:1392–1410. doi:10.1175/2009JCLI2917.1

    Article  Google Scholar 

Download references

Acknowledgments

We would like to thank Dr Jonathan Rougier, University of Bristol, for many useful discussions about the probabilistic framework that we adopted and for providing the equations for updating the joint probability distributions. We thank Glen Harris and Bruce Wielicki for useful discussions on discrepancy and the use of multimodel ensembles, and Doug McNeall for discussions on the diagnostic checks on the prior distribution. We would also like to thank the reviewers for useful comments on how to improve the paper. This work was supported by the Met Office Hadley Centre Climate Programme—DECC/Defra (GA01101). We acknowledge the international modeling groups for providing their data for analysis, the Program for Climate Model Diagnosis and Intercomparison (PCMDI) for collecting and archiving the model data, the JSC/CLIVAR Working Group on Coupled Modelling (GCM) and their Coupled Model Intercomparison Project (CMIP) and Climate Simulation Panel for organizing the model data analysis activity, and the IPCC WG1 TSU for technical support. The IPCC Data Archive at Lawrence Livermore National Laboratory is supported by the Office of Science, US Department of Energy.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David M. H. Sexton.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 PDF (1675 KB)

Appendices

Appendix 1: Making eigenvectors of multiple variables

First, we decide which model variables e.g. cloud amount, 1.5 m temperature to include. We use N var  = 48 variables (4 seasons  ×  12 variables described in Table 1 of Sect. 2). Then, we construct an N e  × N o data matrix \(D^{\prime}\), where N e is the number of members in the PPE i.e. 280 here, and N o is the number of observable quantities over all locations and variable types. For each ensemble member, we concatenate all the grid-point/zonally-averaged data where observations exist for all the model variables under consideration into one long vector, length N o . Then we centre each observed quantity by removing its ensemble mean d i where \(i=1,\ldots,N_o\). Before we reduce dimensionality we must non-dimensionalise the data in D. In a similar way to Piani et al. (2005), we do this for each model variable by first calculating a global measure of scale, which is the root mean square of the standard deviation at each grid point in a set trimmed by only using those between the 1st and 99th percentile; the trimming stops grid-points of high variance across the ensemble e.g. near sea-ice margins, dominating the eigenvector analysis. Then for each variable, we rescale its values in each grid-box by that variable’s global scaling factor, ξ j where \(j=1, \ldots, N_{var}\). Therefore for each climate variable, we non-dimensionalise the data whilst retaining the relative geographical variations in variance. Finally each observable is area-weighted by the square root of the grid-box/zonal area, w i ; multi-level fields are divided by the number of pressure levels so that one multi-level field has the same overall weight as a single level horizontal field. The columns for each member, D (k) where \(k=1,\ldots,N_e\) are combined to form a matrix D which has a column of length N o and a row of length N e . That is,

$$ D = (D^{\prime} - 1 d^T) \xi^{-1} W^{-1}, $$
(14)

where d is a vector, length N o of the ensemble mean d i , ξ is a diagonal matrix containing the appropriate global scaling factor for each element of d, and W is a diagonal matrix of the square root of the area weights, a i .

SVD calculates the left vectors, U, eigenvalues, \(\Uplambda\) and right vectors, V so that

$$ D = U \Uplambda V^T. $$
(15)

This implies that the covariance of the ensemble data, \(\frac{1}{N_e-1}D^T D\), has eigenvectors V and eigenvalues \(\frac{1}{N_e-1} \Uplambda_i^2\). Also, the matrix \(A = U \Uplambda = DV\) is now the set of amplitudes, A ki of how the kth ensemble member D (k) projects onto the ith eigenvector V (i).

Similarly, we non-dimensionalise the observations and project them onto the set of eigenvectors, so that we have the N e -length vector of amplitudes for each eigenvector, \(o^T=(o^{{\prime} T}-d^T) \xi^{-1} W^{-1} V\) where \(o^{\prime}\) are the corresponding observed values for the data used to make \(D^{\prime}\). The same procedure has to be applied to the multimodel data used to specify the discrepancy.

To display the eigenvectors in Figs. 2 and 3 we show patterns of variation about the mean if the amplitude was set to one standard deviation of the amplitudes for the ith eigenvector, which is \(\left[\frac{1}{N_e-1} \Uplambda_i^2\right]^{\frac{1}{2}}\). The area-weighting and normalising has to be reversed so in Figs. 2 and 3 we show subsets of the vector \(\left[\frac{1}{N_e-1}\right]^{\frac{1}{2}}\Uplambda_i V_{(i)} W \xi\).

Appendix 2: Emulators

2.1 2.1 Building a univariate emulator

An emulator is used to predict model output for any combination of parameter values based on the ensemble of model evaluations. In this section, we discuss the method to emulate historical climate and the equilibrium response to doubled CO2 and so we use the ensemble of HadSM3 simulations that sample atmospheric parameter space, χ, which had ensemble size, N e  = 280. We chose a relatively simple method for automatically constructing these emulators using standard regression tools and a simple diagnostic which measures the normality of the residuals.

We build an emulator for each model variable, Y, using classical linear regression to relate Y to functions of the atmospheric parameter values. A stepwise algorithm was used to remove redundant regressors. The general regression model for our emulator is

$$ {\fancyscript{B}}(Y ; s, \theta_1, \theta_2) = \sum_{k=1}^p \gamma_k g_k(l(X)) + \eta. $$
(16)

where p is the number of regressors and η is the regression error. To help satisfy the requirement that our emulators are Gaussian, we use the Box–Cox transformation of a variable y (Draper and Smith 1998)

$$ {\fancyscript{B}}(Y ; s, \theta_1, \theta_2) = \left\{ { \begin{array}{l} { log(s Y-\theta_2), \quad \theta_1=0} \\ { (s Y-\theta_2)^{\theta_1}, \quad \theta_1 \not= 0,} \end{array} } \right. $$
(17)

parameterised by s which can be ±1, θ1, and θ2 where sY − θ2 > 0.

We define the full set of regressors to consider. In our study, these are functions \(g_i(\cdot)\) of l(X) where l(X) = X for all parameters, or l(X) is a mixture of X for discrete parameters and log(X) for continuous ones (see below for how l(X) is selected). Here, we allow \(g_i(\cdot)\) to be quadratics in the continuous parameters and two-way interactions between the continuous parameters. We use three-way interactions between RH crit (the threshold of relative humidity for cloud formation), V f1 (ice fallout speed), C wland (cloud droplet to rain conversion threshold), and C t (cloud droplet to rain conversion rate), ENT (entrainment rate coefficient), and EACFBL (cloud fraction at gridbox saturation), as these have been found by Rougier et al. (2009) to be important in emulating the climate sensitivity across the climateprediction.net ensemble (Stainforth et al. 2005).

The stepwise algorithm used here does not allow lower order terms for variables to be dropped if higher order terms are present which include those variables; this makes the statistical model robust to changes of scale (Draper and Smith 1998). We also ensure that not too many regressors are retained by preventing the residual variance going below the level of internal variability (see Sect. 2.4). The stepwise method is used on all four combinations of s =  ±1 and l(X) that are considered. As there is an emulator for each historic and future climate variable that we need to predict, and there are a large number of these, an automatic method is required to decide which combination of transformations \({\fancyscript{B}(Y)}\) and l(X) is best. We use a diagnostic which measures the degree of normality of the residuals as the p value from the Kolmogorov–Smirnov test that the residuals are Gaussian. For those statistical models with a p value >0.2, we chose the one with the lowest residual sum of squares in the units of Y. If all statistical models fail the test that the residuals are Gaussian, we pick the best one in terms of residual sum of squares but flag the emulator as a potentially bad one.

It is important to not make the number of functions, \(g_i(\cdot)\) too large as this can lead to over-fitting. Rougier et al. (2009) show diagnostics that can be used to check the emulator. We used their leave-one-out cross-validation test where each ensemble member is omitted and the other runs are used to re-estimate the regression parameters in the emulator and then predict the omitted run. Supplementary Information Figs. 1 and 2 show the results for historical and future variables respectively, and show that across all variables between 12 and 20 actual values fall outside the 95% credible intervals predicted by the emulator. We would expect by chance, from an ensemble of size 280, to have between 8 and 20 values fall outside so the emulators seem satisfactory. We also used quantile-quantile plots to test that the residuals are consistent with the Gaussian assumption (see Supplementary Information Figs. 3 and 4). The leave-one-out tests are not independent as the predictions are correlated across the ensemble, so Rougier et al. (2009) outline a sterner multivariate test where a randomly selected third of the ensemble are left out and the emulator is constructed from the remaining 187 members. Supplementary Information Figs. 5 and 6 shows the results from the leave-93-out cross-validation tests. The emulators passed both these tests. Finally, we checked that the emulator for the future climate variables does not predict a large fraction of the Monte Carlo sample lie outside the range of values sampled by the ensemble.

2.2 2.2 Estimating covariance matrix for multivariate emulator

In Eq. 10 we have to estimate the covariance matrix \(\Upsigma^{em}(x)\) that depends on the point in parameter space, x. There is no established way of estimating the covariance matrix for a multivariate emulator where each emulator differs in its set of regressors. We apply standard mathematical techniques to estimate the covariance between the emulators of the ith and jth climate variables at parameter point xCov i (x),μ j (x)) where μ i (x) = ∑ p k=1 γ ik g ik (l(x)) or in matrix notation, μ i (x) = G i γ i where G i is a matrix where each row is g i (l(X)) and X are the parameter sampled values for the model parameters. First we need to know the covariance matrix of the errors, Var(η). The covariance between residuals in the ith emulator, η i , and the jth emulator, η j , can be estimated as

$$ \frac{1}{[(N_e-q_i)(N_e-q_j)]^{\frac{1}{2}}} \eta_i \cdot \eta_j $$
(18)

where q i is the number of regression parameters in the ith emulator.

For the ith emulator with regressors G i we have

$$ G_i\hat{\gamma_i} - G_i\gamma_i = G_i(G_i^{T} G_i)^{-1} G_i^{T} Y - G_i\gamma_i \\ $$
(19)
$$ = G_i(G_i^{T} G_i)^{-1} G_i^{T} (\eta_i + G_i\gamma_i)- G_i\gamma_i \\ $$
(20)
$$ = G_i(G_i^{T} G_i)^{-1} G_i^{T} \eta_i\\ $$
(21)

so that, since G i has full common rank,

$$ \hat{\gamma_i} - \gamma_i = (G_i^{T} G_i)^{-1} G_i^{T} \eta_i \\ $$
(22)
$$ = H_i \eta_i. \\ $$
(23)

Then we have

$$ Cov(\gamma_i \gamma_j) = {\rm Var}(\eta)_{ij} H_i H_j^{T} $$
(24)

and it follows that

$$ \hbox{Cov}(\mu_i(x), \mu_j(x)) = \hbox{Var}(\eta)_{ij} \left(1 + [H_i^{T} g_i(x)]^{T} [H_j^{T} g_j(x)] \right). $$
(25)

where the first component comes from the error variance and the second component comes from the regressors of the two emulators.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sexton, D.M.H., Murphy, J.M., Collins, M. et al. Multivariate probabilistic projections using imperfect climate models part I: outline of methodology. Clim Dyn 38, 2513–2542 (2012). https://doi.org/10.1007/s00382-011-1208-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00382-011-1208-9

Keywords

Navigation