Nonparametric predictive inference with parametric copulas for combining bivariate diagnostic tests

Noryanti Muhammad; Tahani Coolen-Maturi; Frank P.A. Coolen

doi:10.19139/soic.v6i3.579

Noryanti Muhammad Faculty of Industrial Sciences and Technology, Universiti Malaysia Pahang, Malaysia.
Tahani Coolen-Maturi Durham University Business School, Durham University, UK.
Frank P.A. Coolen Department of Mathematical Sciences, Durham University, UK.

DOI: https://doi.org/10.19139/soic.v6i3.579

Keywords: Bivariate diagnostic tests, copulas, diagnostic accuracy, lower and upper probabilities, nonparametric predictive inference, ROC curve.

Abstract

Measuring the accuracy of diagnostic tests is crucial in many application areas including medicine, machine learning and credit scoring. The receiver operating characteristic (ROC) curve is a useful tool to assess the ability of a diagnostic test to discriminate among two classes or groups. In practice, multiple diagnostic tests or biomarkers may be combined to improve diagnostic accuracy, e.g. by maximizing the area under the ROC curve. In this paper we present Nonparametric Predictive Inference (NPI) for best linear combination of two biomarkers, where the dependence of the two biomarkers is modelled using parametric copulas. NPI is a frequentist statistical method that is explicitly aimed at using few modelling assumptions, enabled through the use of lower and upper probabilities to quantify uncertainty. The combination of NPI for the individual biomarkers, combined with a basic parametric copula to take dependence into account, has good robustness properties and leads to quite straightforward computation. We briefly comment on the results of a simulation study to investigate the performance of the proposed method in comparison to the empirical method. An example with data from the literature is provided to illustrate the proposed method, and related research problems are briefly discussed.

Author Biographies

Noryanti Muhammad, Faculty of Industrial Sciences and Technology, Universiti Malaysia Pahang, Malaysia.

Faculty of Industrial Sciences and Technology, Universiti Malaysia Pahang, Malaysia.

Tahani Coolen-Maturi, Durham University Business School, Durham University, UK.

Durham University Business School, Durham University, UK.

Frank P.A. Coolen, Department of Mathematical Sciences, Durham University, UK.

Department of Mathematical Sciences, Durham University, UK.

References

T. Augustin and F.P.A. Coolen. Nonparametric predictive inference and interval probability. Journal of Statistical Planning and Inference, vol. 124, pp. 251–272, 2004.

T. Augustin, F.P.A. Coolen, G. de Cooman and M.C.M. Troffaes (Eds). Introduction to Imprecise Probabilities. Chichester: Wiley, 2014.

A. Bansal and M. Sullivan Pepe. When does combining markers improve classification performance and what are implications for practice? Statistics in Medicine, vol. 32, pp. 1877–1892, 2013.

F.P.A. Coolen. On nonparametric predictive inference and objective Bayesianism. Journal of Logic, Language and Information, vol. 15, pp. 21–47, 2006.

T. Coolen-Maturi. Three-group ROC predictive analysis for ordinal outcomes. Communications in Statistics - Theory and Methods, vol. 46, pp. 9476-9493, 2017.

T. Coolen-Maturi, F.P.A. Coolen and N. Muhammad. Predictive inference for bivariate data: combining nonparametric predictive inference for marginals with an estimated copula. Journal of Statistical Theory and Practice, vol. 10, pp. 515–538, 2016.

T. Coolen-Maturi, P. Coolen-Schrijner and F.P.A. Coolen. Nonparametric predictive inference for binary diagnostic tests. Journal of Statistical Theory and Practice, vol. 6, pp. 665–680, 2012.

T. Coolen-Maturi, P. Coolen-Schrijner and F.P.A. Coolen. Nonparametric predictive inference for diagnostic accuracy. Journal of Statistical Planning and Inference, vol. 142, pp. 1141–1150, 2012.

T. Coolen-Maturi, F.F. Elkhafifi and F.P.A. Coolen. Three-group ROC analysis: A nonparametric predictive approach. Computational Statistics & Data Analysis, vol. 78, pp. 69–81, 2014.

B. De Finetti. Theory of Probability. London: Wiley, 1974.

F.F. Elkhafifi and F.P.A. Coolen. Nonparametric predictive inference for accuracy of ordinal diagnostic tests. Journal of Statistical Theory and Practice, vol. 6, pp. 681–697, 2012.

B.M. Hill. Posterior distribution of percentiles: Bayes’ theorem for sampling from a population. Journal of the American Statistical Association, vol. 63, pp. 677–691, 1968.

L. Kang, A. Liu and L. Tian. Linear combination methods to improve diagnostic/prognostic accuracy on future observations. Statistical Methods in Medical Research, vol. 25, pp. 1359–1380, 2016.

J.F. Lawless and M. Fredette. Frequentist prediction intervals and predictive distributions. Biometrika, vol. 92, pp. 529–542, 2005.

C. Liu, A. Liu and S. Halabi. A min–max combination of biomarkers to improve diagnostic accuracy. Statistics in Medicine, vol. 30, pp. 2005–2014, 2011.

N. Muhammad. Predictive Inference with Copulas for Bivariate Data. PhD thesis, Durham University, UK (available from www.npistatistics.com), 2016.

J.Q. Su and J.S. Liu. Linear combinations of multiple diagnostic markers. Journal of the American Statistical Association, vol. 88, pp. 1350–1355, 1993.

M. Sullivan Pepe. The Statistical Evaluation of Medical Tests for Classification and Prediction. Oxford: Oxford University Press, 2003.

M. Sullivan Pepe and M.L. Thompson. Combining diagnostic test results to increase accuracy. Biostatistics, vol. 1, pp. 123–140, 2000.

S. Wieand, M.H. Gail, B.R. James and K.L. James. A family of nonparametric statistics for comparing diagnostic markers with paired or unpaired data. Biometrika, vol. 76, pp. 585–592, 1989.