Weakly supervised deep learning models for anomaly and change detection in radiology

Di Noto, Tommaso

Abstract

The goal of this PhD thesis was to address some recurrent limitations that are associated with Deep Learning (DL)-based Computer-Aided Detection/Diagnosis (CAD) systems in medical imaging. We focused our analysis on two clinical tasks routinely performed in radiology: the detection of intracra- nial aneurysms on Magnetic Resonance Angiography (MRA) scans and the longitudinal monitoring of high-grade gliomas in T2-weighted MR scans. The first limitation that we tackled was the lack of annotated data; to mitigate this, in both tasks we made use of weak labels to drive the learning process. These are coarse labels that are typically imprecise, but fast/cheap to obtain. For the aneurysm detection task, our weak labels corresponded to oversized annotations which resulted to be 4 times faster to create with respect to their voxel-wise counterparts. In the glioma change detection task, the weak labels were automatically extracted from textual radiology reports with a Natural Language Processing (NLP) framework, and allowed us to increase the amount of labeled data more than 3 fold. A further contribution of this thesis to reduce data scarcity in medical DL applications is the open release of our two in-house datasets. To date, our cohort for aneurysm detection is the largest in the community (N=284, 127 controls, 157 patients with aneurysms), while our cohort for glioma change detection (N=183 patients, 1693 difference maps) is the first longitudinal dataset ever made open access. The second limitation that we addressed was domain shift, which corre- sponds to a change in data distribution between a model’s training data, and the unseen test data it will be fed with at inference time. To alleviate domain shift, we investigated Transfer Learning (TL), and in particular we automated the choice of TL sub-type treating it as a hyperparameter to optimize which avoids empirical and sub-optimal choices that are frequent in similar works. The last limitation of DL-based tools that we tried to mitigate was the lack of interpretability, with DL models often being addressed as incomprehensible “black-box” models: in the aneurysm project, we added prior anatomical knowledge constraining the analysis only to areas of the brain that are plausible for aneurysm occurrence. This aimed at simulating the radiologists’ reading of images and avoiding unrealistic model outputs. Although the use of prior anatomical knowledge is not the most frequently used technique to improve interpretability, it has been shown to help increasing model transparency in related works. Overall, we believe that the combination of our open-source contributions and open-access datasets have the potential to make DL-based CAD tools more reproductble, and bring them closer to clinical application.
--
L’objectif de cette thèse est d’aborder certaines limitations récurrentes associées aux systèmes de diagnostic assisté par ordinateur (CAD) basés sur le Deep Learning (DL). Nous avons concentré notre analyse sur deux tâches couramment réalisées en radiologie : la détection d’anévrismes cérébraux sur des examens d’angiographie par résonance magnétique et le suivi de gliomes de haut grade sur des images par résonance magnétique pondérées T2. La première limitation que nous avons abordée est le manque de données annotées. Pour atténuer ce problème, nous utilisons dans les deux tâches des weak labels : il s’agit d’annotations généralement imprécises, mais rapides et peu coûteuses à obtenir. Pour le projet des anévrismes, nos weak labels correspondent à des annotations grossières qui ont été créées quatre fois plus rapidement que leurs équivalents à l’échelle d’un voxel. Dans le projet des gliomes, les weak labels ont été extraites automatiquement des rapports radiologiques avec un modèle de Natural Language Processing, et nous ont permis de tripler la quantité de données annotées. Une autre contribution majeure de cette thèse pour pallier la pénurie de données repose sur la publication de nos datasets.
à ce jour, notre dataset pour la détection des anévrismes est le plus important de la communauté en terme de nombre de sujets (N=284), tandis que notre dataset pour la détection des changements dans les gliomes (N=183) est le premier dataset longitudinal jamais publié. La deuxième limitation que nous avons abordée est le domain shift, un scénario fréquent en imagerie médicale qui se traduit par un changement dans la distribution des données entre les données d’apprentissage d’un modèle et les données de test non vues qui lui seront fournies au moment de l’inférence. Pour atténuer le domain shift, nous avons étudié le Transfer Learning (TL), et en particulier nous avons automatisé le choix du type de TL en le traitant comme un hyperparamètre à optimiser, évitant ainsi des choix empiriques, fréquents dans des études similaires. La dernière limitation que nous avons abordée est le manque d’interprétabilité des modèles : dans le projet des anévrismes, nous avons inclus des connaissances anatomiques a priori en restreignant l’analyse aux zones du cerveau susceptibles d’être impactées par la survenue d’un anévrisme. Cette démarche vise à simuler la lecture des images par les radiologues et à éviter des résultats irréalistes. Nous sommes convaincus que nos contributions à la science ouverte ont le potentiel de rendre les systèmes de CAD basés sur le DL plus reproductibles et de les rapprocher de l’application clinique.

SERVAL

serveur académique lausannois

Weakly supervised deep learning models for anomaly and change detection in radiology

Details