xRead - Incorporating Artificial Intelligence into Clinical Practice (March 2026)
Ng et al. BMC Medical Informatics and Decision Making
(2025) 25:236
Page 9 of 24
Key Findings Novel Features
Study noted that indexing errors need improvement, especially those pertaining to word ambiguities and abbreviations;
recommend further refinement of the indexing tools used in ASR systems to mitigate these errors.
The authors recognize the limitations in handling non-English terms; suggest
developing ASR systems with multilingual capabilities, which could increase usability in diverse clinical settings.
IBM via voice with medical vocabulary outperformed all other configurations using naturally speaking.
Index evaluation of Nomindex is com parable to previous evaluations. At
the current stage, the precision (73%) is too low to rely on this indexing but
the recall (90%) is high enough to use it as a machine-aided indexing tool.
AI Transcription
Proficiency (paper
specific outcomes)
Baseline (“at blank”) 8.23 3.59 2.34 6.02 0.36
5.67
Trained (“with learning”) 4.31* 2.67*
1.87*
7.14
0.54
4.21*
Baseline (“at blank”) 3.68 1.13 0.52 2.31 0.43
1.80
Trained (“with learning”) 2.16 1.22
0.66
1.19
0.00
1.71
Metric (F1 score, Precision 0.73
Precision, Recall, WER) Recall 0.90
“IBM via voice” SR tool: - Medical
Vocabulary - General
vocabulary
- Misspelling - Numbers
- Punctuation marks - Total
Gold standard “Naturally speak ing” SR tool: - Medical
Vocabulary - General
vocabulary
- Misspelling - Numbers
- Punctuation marks - Total
Comparator Type Subcategories Performance
Standard
Set of keywords manually ex
tracted from the
initial document.
Table 2 (continued)
Study Reference Happe et al., 2003 [10]
Made with FlippingBook - professional solution for displaying marketing and sales documents online