Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Rietberg, Max, Tigo; Nguyen, Van, Bach; Geerdink, Jeroen; Vijlbrief, Onno; Seifert, Christin

doi:10.3390/diagnostics13071251

Artikel / Aufsatz Mo., 27. März. 2023 CC BY 4.0

Veröffentlicht

Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Rietberg, Max Tigo ; Nguyen, Van Bach; Geerdink, Jeroen ; Vijlbrief, Onno; Seifert, Christin

Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investigate the performance of domain-dependent and general state-of-the-art language models and their alignment with domain expertise. To this end, eXplainable Artificial Intelligence (XAI) techniques are used to acquire insight into the inner workings of the model, which are verified on their trustworthiness. The verified XAI explanations are then compared with explanations from a domain expert, to indirectly determine the reliability of the model. BERTje, a Dutch Bidirectional Encoder Representations from Transformers (BERT) model, outperforms RobBERT and MedRoBERTa.nl in both accuracy and reliability. The latter model (MedRoBERTa.nl) is a domain-specific model, while BERTje is a generic model, showing that domain-specific models are not always superior. Our validation of BERTje in a small prospective study shows promising results for the potential uptake of the model in a practical setting.

Vorschau

Einordnung

Datum der Veröffentlichung:

27.03.2023

URN:

urn:nbn:de:hbz:465-20230809-152639-0

DOI:

10.3390/diagnostics13071251

Sprache:

Englisch

Ressourcentyp:

Text

Schlagwörter:

natural language processing; health informatics; BERT; text classification

Kollektion:

E-Publikationen

Sachgruppen der Deutschen Nationalbibliographie:

610 Medizin, Gesundheit

Einrichtung:

Medizinische Fakultät, Universitätsklinikum Essen, Institut für KI in der Medizin (IKIM)

Förderung:

The publication of this article was supported by the Publication Fund of the University of Duisburg-Essen.

Informationen zur Erstveröffentlichung:

Rietberg, M.T.; Nguyen, V.B.; Geerdink, J.; Vijlbrief, O.; Seifert, C. Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models. Diagnostics 2023, 13, 1251. https://doi.org/10.3390/diagnostics13071251

Published: 27 March 2023

Versionskennzeichen: