Measuring expertise in identifying interictal epileptiform discharges

Nitish M. Harid; Jin Jing; Jacob Hogan; Fábio A. Nascimento; An Ouyang; Wei-Long Zheng; Wendong Ge; Sahar F. Zafar; Jennifer A. Kim; Alice D. Lam; Aline Herlopian; Douglas Maus; Ioannis Karakis; Marcus Ng; Shenda Hong; Zhu Yu; Peter W. Kaplan; Sydney Cash; Mouhsin Shafi; Gabriel Martz; Jonathan J. Halford; Michael Brandon Westover

doi:10.1684/epd.2021.1409

Epileptic Disorders

Volume 24, issue 3, June 2022

Illustrations

Figure 1.

(A) Performance metrics for sensitivity and specificity of clinical experts (blue), the original eight experts used for the reference standard (black), experienced non-clinical experts (green) and novices (red). (B-D) Calibration curves for experts (B), experienced non-experts (C) and novices (D).

Figure 2.

95% confidence interval (CI) for each of the performance metrics: sensitivity (A), false positive rate (B) and calibration error (C) as a function of the number of questions answered. The black vertical dashed lines show the minimum number of questions required to drive the 95% CI below 0.1, corresponding to 549 (A), 514 (B) and 250 (C).

Figure 3.

The “latent trait” framework for analyzing level of expertise in spike detection: (A) schematic of our framework for measuring a scorer’s level of expertise in recognizing epileptiform discharges; and (B) simulation of the decision process for the ideal observer, expert (including the original eight), experienced non-expert and novice (from top to bottom).

Figure 4.

(A) Estimated internal parameters of each scorer, noise level σ and threshold θ, for experts (blue), the original eight experts used as the reference standard (black), experienced non-experts (green) and novices (red). (B) Updated ROC curves based on the estimated internal parameters (blue: experts; green: experienced non-experts; red: novices).

Authors

Nitish M. Harid ¹ a

Jin Jing ¹ a

Jacob Hogan ¹ a

Fábio A. Nascimento ¹

Marcus Ng ⁴

Zhu Yu ⁶

Jonathan J. Halford ¹⁰

Michael Brandon Westover ¹

1 Department of Neurology, Massachusetts General Hospital, Boston MA, USA

2 Department of Neurology, Yale School of Medicine, New Haven CT, USA

3 Department of Neurology, Emory University School of Medicine, Atlanta GA, USA

4 Department of Neurology, University of Manitoba, Winnipeg, Manitoba, Canada

5 National Institute of Health Data Science, Peking University, Beijing, China

6 Xuanwu Hospital, Capital Medical University, Beijing, China

7 Department of Neurology, Johns Hopkins University School of Medicine, Bayview Medical Center, Baltimore, MD, USA

8 Department of Neurology, Beth Israel Deaconess Medical Center, Boston, MA, USA

9 Department of Neurology, Hartford HealthCare Medical Group at Hartford Hospital, CT, USA

10 Department of Neurology, Medical University of South Carolina, Charleston SC, USA

* Correspondence: Jin Jing

a Authors contributed equally

Keywords: interictal epileptiform discharge, EEG, epilepsy, assessment, expert and non-expert
DOI: 10.1684/epd.2021.1409
Pages: 496-506
Year of publication: 2022

Objective. Interictal epileptiform discharges (IEDs) on EEG are integral to diagnosing epilepsy. However, EEGs are interpreted by readers with and without specialty training, and there is no accepted method to assess skill in interpretation. We aimed to develop a test to quantify IED recognition skills.

Methods. A total of 13,262 candidate IEDs were selected from EEGs and scored by eight fellowship-trained reviewers to establish a gold standard. An online test was developed to assess how well readers with different training levels could distinguish candidate waveforms. Sensitivity, false positive rate and calibration were calculated for each reader. A simple mathematical model was developed to estimate each reader’s skill and threshold in identifying an IED, and to construct receiver operating characteristic (ROC) curves for each reader. We also investigated the number of IEDs needed to measure skill level with acceptable precision.
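As an illustration of the three per-reader metrics named above, the following is a minimal Python sketch, not the authors' implementation. It assumes that each candidate carries the number of reference experts who marked it as an IED, that a candidate counts as a reference positive when at least `min_votes` experts agree, and that calibration error is the signed gap between the reader's "yes" rate and the expert agreement fraction, averaged over vote-count bins. The function name `rater_metrics`, the `min_votes` cut-off and this calibration definition are all assumptions made for the example.

```python
from collections import defaultdict


def rater_metrics(votes, answers, n_experts=8, min_votes=4):
    """Score one rater against a panel of reference experts.

    votes[i]   -- how many of the n_experts marked candidate i as an IED
    answers[i] -- the rater's binary call for candidate i (1 = IED, 0 = not)
    min_votes  -- ASSUMED cut-off for calling a candidate a reference positive
    """
    tp = fp = fn = tn = 0
    by_votes = defaultdict(list)  # rater calls grouped by expert vote count
    for v, a in zip(votes, answers):
        by_votes[v].append(a)
        if v >= min_votes:
            tp += a
            fn += 1 - a
        else:
            fp += a
            tn += 1 - a
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("need at least one reference positive and one negative")
    sensitivity = tp / (tp + fn)
    false_positive_rate = fp / (fp + tn)
    # ASSUMED definition: signed gap between the rater's "yes" rate and the
    # expert agreement fraction, averaged over vote-count bins.
    # Negative = under-calling, positive = over-calling.
    calibration_error = sum(
        sum(calls) / len(calls) - v / n_experts for v, calls in by_votes.items()
    ) / len(by_votes)
    return sensitivity, false_positive_rate, calibration_error
```

Calling `rater_metrics(votes, answers)` with two equal-length sequences returns the three values as a tuple. Changing `min_votes` moves the reference positive cut-off and therefore changes sensitivity and false positive rate, but not the calibration term.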

Results. Twenty-nine raters completed the test: nine experts, seven experienced non-experts and thirteen novices. Median calibration errors for experts, experienced non-experts and novices were -0.056, 0.012 and 0.046; median sensitivities were 0.800, 0.811 and 0.715; and median false positive rates were 0.177, 0.272 and 0.396, respectively. The number of test questions needed to measure these scores with a 95% confidence interval narrower than 0.1 was 549. Our analysis identified that novices had a higher noise level (uncertainty) than experienced non-experts and experts. Using the calculated noise and threshold levels, receiver operating characteristic curves were created, showing increasing median area under the curve from novices (0.735) to experienced non-experts (0.852) and experts (0.891).
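The link between a reader's noise level, threshold and ROC curve can be sketched with a simple signal-detection-style model. This is an illustrative assumption, not the model fitted in the study: the reader is taken to perceive each candidate's reference score plus Gaussian noise with standard deviation `sigma`, and to answer "IED" when the perceived value exceeds `theta`. Sweeping `theta` traces an ROC curve whose area depends only on `sigma`. The function names and the choice of noise scale are hypothetical.

```python
import math


def p_yes(score, sigma, theta):
    """Probability that score + N(0, sigma^2) noise exceeds the threshold theta."""
    z = (score - theta) / sigma
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def roc_auc(scores, labels, sigma, n_thresholds=201):
    """Area under the ROC curve traced by sweeping theta for a fixed noise level."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    # Sweep theta from high to low so the curve runs from about (0, 0) to (1, 1).
    lo, hi = min(scores) - 5 * sigma, max(scores) + 5 * sigma
    points = []
    for k in range(n_thresholds):
        theta = hi - (hi - lo) * k / (n_thresholds - 1)
        tpr = sum(p_yes(s, sigma, theta) for s in pos) / len(pos)
        fpr = sum(p_yes(s, sigma, theta) for s in neg) / len(neg)
        points.append((fpr, tpr))
    # Trapezoidal integration of TPR over FPR.
    return sum(
        (x1 - x0) * (y0 + y1) / 2 for (x0, y0), (x1, y1) in zip(points, points[1:])
    )
```

Under these assumptions a smaller `sigma` gives a larger area under the curve, which mirrors the reported ordering of novices, experienced non-experts and experts, while `theta` only selects the operating point along a given reader's curve.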

Significance. Expert and non-expert readers can be distinguished based on ability to identify IEDs. This type of assessment could also be used to identify and correct differences in thresholds in identifying IEDs.
