You can edit almost every page by Creating an account. Otherwise, see the FAQ.

ISUP (Inter Subject Unified Performance)

From EverybodyWiki Bios & Wiki

Analytical performance in blood glucose monitoring (BGM) - New single figure characterization ‘ISUP’ to complement the currently used ‘MARD’[edit]

Several approaches to BGM exists. Some 30 years ago, home-test kits were introduced, and over the last 5 years the continuous glucose monitoring (CGM) systems, e.g. from Dexcom, Abbott and Medtronic, have entered mainstream use, where statistical and graphical representations have been developed to summarize the analytical quality with regard to their primary purpose; aiding the diabetic patient in controlling his/her blood glucose levels.

Since the introduction of CGMs, mean absolute relative difference (MARD) has become the most popular figure of merit for performance assessment. In practice, this figure is supplemented by scatter plots in a so-called consensus error grid,[1] where measured values are plotted against reference values (measured by a more accurate reference method) and certain zones are indicated to identify the most critical points in relation to therapeutic decisions. The consensus error grid has been used in the examples below.

It is broadly accepted that the sole use of MARD for assessing the accuracy of glucose monitors may lead to flawed conclusions[2] and, for this reason, it has been suggested to supplement the MARD with reliability metrics, like the precision absolute relative difference (PARD)[3] and MARD reliability index[4]. As an alternative evaluation of sensor performance, a new figure of merit has been proposed, the so-called inter-subject unified performance (ISUP) parameter, which compounds statistical performance and aspects of the consensus error grid plot.

Inter-subject unified performance (ISUP)[edit]

The ISUP value is a composite measure that evaluates the performance of a regression model. The ISUP is, by definition, a general performance measure that can be utilized to compare the predictive capability of regression models, irrespective of the origin of the input data and the (continuous) quantity to be predicted. Yet, the ISUP has been developed to quantify the analytical performance of glucose monitoring systems that are routinely used by subjects with diabetes. The challenging aspect in quantifying the performance of glucose monitoring systems owes to the fact that subjects with diabetes show very different temporal behavior and range of glucose values. This may turn single-parameter measures, like root-mean-square-error (RMSE), MARD, and slope, to misleading figures of merit, particularly when compared between subjects.

The composite ISUP parameter is evaluated on cross- or independent validated results and is designed to weight the presence of samples in zone A and B of the consensus error grid, together with a low MARD, a high slope and a high correlation coefficient. Furthermore, the ISUP takes into account the difference in the average glucose level between subjects by weighting with the median of the blood glucose concentrations.

It is noted that the usability of a regression model is application specific, but past studies used a threshold on the ISUP of 0.5 to classify good and bad performing regression models.[5]

Formula[edit]

The ISUP of an subject specific regression model is calculated with the following formula:

where A and B are percent samples lying in zone A and B of a consensus error grid, MARD is the mean absolute relative difference between estimated (yp) and reference (y) glucose values, a is the slope of the least-squares fitted linear polynomial, R2 is the correlation coefficient between yp and y, and ymedian  is the median of the reference glucose values.

MARD is calculated with the following formula:

where n is the number of paired glucose measurements.

Example[edit]

In the figures below, the following three scenarios are presented:

  • Scenario A: Subject with well-regulated diabetes. Glucose level from ~4 - ~12 mM.
  • Scenario B: Subject with very variating glucose level from ~4 - ~28 mM.
  • Scenario C: Subject with mainly hyperglycemic periods. Glucose level from ~9 - ~30 mM.
Scenario A. Subject with well-regulated diabetes. Glucose level from ~4 - ~12 mM.
Scenario B. Subject with very variating glucose level from ~4 - ~28 mM.
Scenario C. Subject with mainly hyperglycemic periods. Glucose level from ~9 - ~30 mM.

From a visual inspection of the consensus error grids, it seems reasonable to conclude that scenario A and B feature well-performing regression models, while that is not the case for scenario C. With an ISUP threshold of 0.5, we arrive at the same conclusion (see Table 1). Though the conclusions of the three scenarios may seem trivial, the examples illustrate the issue of using single-parameter evaluations, like MARD and samples in zone A and B, that lead to the opposite conclusion. The misleading conclusions are a result of both the MARD and samples in zone A and B being vulnerable to deviations in predictions for low glucose levels, while the opposite is true for high glucose levels.

Table 1. Results
Scenario A B C
ISUP 3.62 6.41 -0.01
MARD (%) 30.67 14.9 23.4
A+B (%) 93 100 96

References[edit]

  1. Parkes JL. Slatin SL. Pardo S. Ginsberg BH. (2000) A New Consensus Error Grid to Evaluated the Clinical Significance of Inaccuracies in the Measurement of Blood Glucose. Diabetes Care 23(8), 1143-1148.
  2. Kirchsteiger H. Heinemann L. Freckmann G. Lodwig V. Schmelzeisen-Redeker G. Schoemaker M. del Re L. (2015) Performance comparison of CGM systems: MARD values are not always a reliable indicator of their accuracy. J. Diabetes Sci. Technol. 9(5), 1030-1040.
  3. Obermaier K. Schmelzeisen-Redeker G. Schoemaker M. Klötzer HM. Kirchsteiger H. Eikmeier H. del Re L. (2013) Performance evaluations of continuous glucose monitoring systems: precision absolute relative deviation is part of the assessment, J. Diabetes Sci. Technol. 7(4): 824-832.
  4. Reiterer F. Polterauer P. Schoemaker M. Schmelzeisen-Redecker G. Freckmann G. Heinemann L. del Re L. (2017) Significance and Reliability of MARD for the Accuracy of CGM Systems, J. Diabetes Sci Technol. 11(1): 59–67.
  5. Lundsgaard-Nielsen SM, Pors A, Banke SO, Henriksen JE, Hepp DK, Weber A (2018) Critical-depth Raman spectroscopy enables home-use non-invasive glucose monitoring. PLoS ONE 13(5): e0197134. https://doi.org/10.1371/journal.pone.0197134

Inter-Subject Unified Performance (ISUP)[edit]


This article "ISUP (Inter Subject Unified Performance)" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:ISUP (Inter Subject Unified Performance). Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.