Mortality analysed by propensity score matching: an application to national neonatal audit

Main Article Content

Longford, N.T.


Mortality, a key outcome variable in many population studies and studies of healthcare and its interventions, is commonly analysed by regression of the survival status on a set of relevant background variables. We describe an alternative based on the potential outcomes framework, in which we ask how a particular group of subjects, or a population, whose outcomes were realised in one condition, would have fared had they been treated or cared for in different circumstances. The method is applied to neonatal mortality in the operational delivery networks in England and Wales. The performance of a network is assessed by the difference of the mortality rates of the network and of a matched set of babies drawn from the entire domain of the study. The outlier status of a network is established by a decision-theoretical approach.

Article Details

How to Cite
Longford, N. T. (2023). Mortality analysed by propensity score matching: an application to national neonatal audit. Brazilian Journal of Biometrics, 41(4), 311–331.


Alexandrescu, R., Bottle, A., Jarman, B., and Aylin, P. Classifying hospitals as mortality outliers: logistic versus hierarchical logistic models. Journal of Medical Systems 38, (29) (2014). PMID 24711175.

Austin, P. C. An introduction to propensity score methods for reducing the effects of

confounding in observational studies. Multivariate Behavioral Research 46, 399–424 (2011).

Austin, P. C., and Fine, J. P. Propensity-score matching with competing risks in survival analysis. Statistics in Medicine 38, 751–777 (2019).

Claeskens, G., and Hjort, N. L. Model Selection and Model Averaging. (Cambridge, MA: Cambridge University Press, 2008).

Helenius, K., Longford, N., Lehtonen, L., Modi, N., and Gale, C. Association of early postnatal transfer and birth outside a tertiary hospital with mortality and severe brain injury in extremely preterm infants: observational cohort study with propensity score matching. British Medical Journal 367, l5678 (2019).

Gale, C., Longford, N.T., Jeyakumaran, D., Ougham, K., Battersby, C., Ohja, S., and Dorling, J. Feeding during neonatal therapeutic hypothermia, assessed using routinely collected National Neonatal Research Database: a retrospective population-based cohort study. Lancet Child & Adolescent Health 5, 408–416 (2021).

Imbens, G. W., and Rubin D. B. Causal Inference for Statistics, Social, and Biomedical Sciences. An Introduction. (Cambridge, MA: Cambridge University Press, 2015).

Kristoffersen, D. V., Helgeland, J., Clench-Aas, J., Laake, P., and Veierød, B. Observed to expected or logistic regression to identify hospitals with high or low 30-day mortality? PLoS ONE 13, e0195248. (2018).

Lindley, D. V. Making Decisions. (Chichester: Wiley,1985).

Longford, N. T. Statistical Decision Theory. (Heidelberg: Springer-Verlag,2013).

Longford, N. T. Estimation under model uncertainty. Statistica Sinica 27, 859–877 (2017).

Longford, N. T. Decision theory for comparing institutions. Statistics in Medicine 37, 437–456 (2018).

Longford, N. T. Performance assessment as an application of causal inference. Journal of the Royal Statistical Society Series A 183, 1363–1385 (2020).

Longford, N. T. Statistics for Making Decisions. (Boca Raton, FL:Taylor and Francis/CRC,2021).

Love, T. Graphical Display of Covariate Balance. Unpublished manuscript (2004).; retrieved on 28th January 2022.

Office for National Statistics. (2021).; retrieved on 5th April 2021.

Pimentel, S.D. Large, sparse optimal matching with R package rcbalance. Observational Studies 2, 4–23 (2016).

Rosenbaum, P. R. Observation and Experiment. An Introduction to Causal Inference. (Cambridge, MA: Cambridge University Press,2017).

Rosenbaum, P. R., and Rubin, D. B. The central role of propensity score in observational studies for causal effects. Biometrika 70, 41–55 (1983).

Rubin, D. B. Discussion of ‘Randomization analysis of experimental data in the Fisher randomization test’ by D. Basu. Journal of the American Statistical Association 75, 591–593 (1980).

Rubin, D. B. For objective causal inference, design trumps analysis. Annals of Applied Statistics 2, 808–840 (2008).

Silber, J. H., Rosenbaum, P. R., Ross, R. N., Ludwig, J. M., Wang, W., Niknam, B. A., Hill, A. S., Even-Shoshan, O., Kelz, R. R., and Fleisher, L.A. Indirect standardization matching: Assessing specific advantage and risk synergy. BMC Health Services Research 51, 2330–2357 (2016).

Spiegelhalter, D. J. Funnel plots for comparing institutional performance. Statistics in Medicine 24, 1185–1202 (2005).

Stuart, E. A. Matching methods for causal inference: A review and a look forward. Statistical Science 25, 1–21 (2010).

Yu, R., Silber, J.H., and Rosenbaum, P.R. Matching methods for observational studies derived from large administrative databases. Statistical Science 35, 338–355 (2020).

Zubizarreta, J.R. Using mixed integer programming for matching in an observational study of kidney failure after surgery. Journal of American Statistical Association 107, 1360–1371 (2012).

Zubizarreta, J.R. Stable weights that balance covariates for estimation with incomplete outcome data. Journal of the American Statistical Association 110, 910–922 (2015).