Enhancing and Evaluating Diagnostic Accuracy
John A. Swets, PhD
David J. Getty, PhD
Ronald M. Pickett, PhD
Carl J. D'Orsi, MD
Steven E. Seltzer, MD
Barbara J. McNeil, MD, PhD
Techniques that may enhance diagnostic accuracy in clinical settings were tested in the context of mammography. Statistical information about the relevant features among those visible in a mammogram, and about their relative importance in the diagnosis of breast cancer, was the basis of two decision aids for radiologists: a checklist that guides the radiologist in assigning a scale value to each significant feature of the images of a particular case, and a computer program that merges those scale values optimally to estimate a probability of malignancy. A test set of approximately 150 proven cases (including normals and benign and malignant lesions) was interpreted by six radiologists, first in their usual manner and later with the decision aids. The enhancing effect of these feature-analytic techniques was analyzed across subsets of cases that were restricted progressively to more and more difficult cases, where difficulty was defined in terms of the radiologists' judgments in the standard reading condition. Accuracy in both standard and enhanced conditions decreased regularly and substantially as case difficulty increased, but differentially, such that the enhancement effect grew regularly and substantially. For the most difficult case sets, the observed increases in accuracy translated into an increase of about 0.15 in sensitivity (true-positive proportion) for a selected specificity (true-negative proportion) of 0.85, or a similar increase in specificity for a selected sensitivity of 0.85. That measured accuracy can depend on case-set difficulty to different degrees for two diagnostic approaches has general implications for evaluation in clinical medicine. Comparative, as well as absolute, assessments of diagnostic performances (for example, of alternative imaging techniques) may be distorted by inadequate treatment of this experimental variable. Subset analysis, as defined and illustrated here, can be useful in alleviating the problem.
Key words: computer-aided diagnosis; expert systems; technology assessment; quality assurance; diagnostic accuracy; ROC analysis; feature analysis; cognitive processes; perception. (Med Decis Making 1991;11:9-18)
Medical Decision Making, Vol. 11, No. 1, 9-17 (1991)
DOI: 10.1177/0272989X9101100102
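
The subset analysis described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not say how difficulty was quantified or how operating points were chosen. The sketch assumes that each case has a rating between 0 and 1 in each reading condition, that difficulty is the distance of the standard-condition rating from the true status, and that the operating point is the lowest threshold reaching at least the selected specificity. The function names `sensitivity_at_specificity` and `hardest_subset`, and all ratings in the demo, are hypothetical.

```python
import math
from typing import List, Sequence


def sensitivity_at_specificity(
    scores: Sequence[float],
    labels: Sequence[int],
    target_specificity: float = 0.85,
) -> float:
    """True-positive proportion at a threshold that gives at least the
    target true-negative proportion. A case is called positive when its
    score is strictly greater than the threshold."""
    negatives = sorted(s for s, y in zip(scores, labels) if y == 0)
    positives = [s for s, y in zip(scores, labels) if y == 1]
    if not negatives or not positives:
        raise ValueError("need at least one positive and one negative case")
    # Number of negative cases that must fall at or below the threshold.
    k = math.ceil(target_specificity * len(negatives))
    threshold = negatives[k - 1] if k >= 1 else float("-inf")
    return sum(s > threshold for s in positives) / len(positives)


def hardest_subset(
    standard_scores: Sequence[float],
    labels: Sequence[int],
    fraction: float,
) -> List[int]:
    """Indices of the most difficult cases. Difficulty is ASSUMED here to be
    |truth - standard-condition rating|; the source only says difficulty was
    defined from the radiologists' judgments in the standard condition."""
    n = len(labels)
    order = sorted(
        range(n),
        key=lambda i: abs(labels[i] - standard_scores[i]),
        reverse=True,
    )
    keep = order[: max(1, round(fraction * n))]
    return sorted(keep)


if __name__ == "__main__":
    # Made-up ratings for illustration only (1 = malignant, 0 = not).
    labels = [1, 1, 1, 1, 0, 0, 0, 0]
    standard = [0.9, 0.6, 0.4, 0.2, 0.1, 0.3, 0.5, 0.8]
    enhanced = [0.9, 0.8, 0.7, 0.6, 0.1, 0.2, 0.3, 0.4]

    # Restrict progressively to harder cases and compare the two conditions.
    for fraction in (1.0, 0.75, 0.5):
        idx = hardest_subset(standard, labels, fraction)
        sub_labels = [labels[i] for i in idx]
        for name, ratings in (("standard", standard), ("enhanced", enhanced)):
            sub = [ratings[i] for i in idx]
            tpp = sensitivity_at_specificity(sub, sub_labels, 0.75)
            print(fraction, name, tpp)
```

Because the subsets are selected using the standard-condition ratings, a comparison on the hardest subsets is conditioned on that choice; the sketch only shows the bookkeeping, not how to interpret the resulting differences.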
