Deep learning for acute rib fracture detection in CT data: a systematic review and meta-analysis.
Détails
Télécharger: 38323515.pdf (806.44 [Ko])
Etat: Public
Version: Final published version
Licence: CC BY 4.0
Etat: Public
Version: Final published version
Licence: CC BY 4.0
ID Serval
serval:BIB_87520A6256AF
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
Deep learning for acute rib fracture detection in CT data: a systematic review and meta-analysis.
Périodique
The British journal of radiology
ISSN
1748-880X (Electronic)
ISSN-L
0007-1285
Statut éditorial
Publié
Date de publication
28/02/2024
Peer-reviewed
Oui
Volume
97
Numéro
1155
Pages
535-543
Langue
anglais
Notes
Publication types: Meta-Analysis ; Systematic Review ; Journal Article
Publication Status: ppublish
Publication Status: ppublish
Résumé
To review studies on deep learning (DL) models for classification, detection, and segmentation of rib fractures in CT data, to determine their risk of bias (ROB), and to analyse the performance of acute rib fracture detection models.
Research articles written in English were retrieved from PubMed, Embase, and Web of Science in April 2023. A study was only included if a DL model was used to classify, detect, or segment rib fractures, and only if the model was trained with CT data from humans. For the ROB assessment, the Quality Assessment of Diagnostic Accuracy Studies tool was used. The performance of acute rib fracture detection models was meta-analysed with forest plots.
A total of 27 studies were selected. About 75% of the studies have ROB by not reporting the patient selection criteria, including control patients or using 5-mm slice thickness CT scans. The sensitivity, precision, and F1-score of the subgroup of low ROB studies were 89.60% (95%CI, 86.31%-92.90%), 84.89% (95%CI, 81.59%-88.18%), and 86.66% (95%CI, 84.62%-88.71%), respectively. The ROB subgroup differences test for the F1-score led to a p-value below 0.1.
ROB in studies mostly stems from an inappropriate patient and data selection. The studies with low ROB have better F1-score in acute rib fracture detection using DL models.
This systematic review will be a reference to the taxonomy of the current status of rib fracture detection with DL models, and upcoming studies will benefit from our data extraction, our ROB assessment, and our meta-analysis.
Research articles written in English were retrieved from PubMed, Embase, and Web of Science in April 2023. A study was only included if a DL model was used to classify, detect, or segment rib fractures, and only if the model was trained with CT data from humans. For the ROB assessment, the Quality Assessment of Diagnostic Accuracy Studies tool was used. The performance of acute rib fracture detection models was meta-analysed with forest plots.
A total of 27 studies were selected. About 75% of the studies have ROB by not reporting the patient selection criteria, including control patients or using 5-mm slice thickness CT scans. The sensitivity, precision, and F1-score of the subgroup of low ROB studies were 89.60% (95%CI, 86.31%-92.90%), 84.89% (95%CI, 81.59%-88.18%), and 86.66% (95%CI, 84.62%-88.71%), respectively. The ROB subgroup differences test for the F1-score led to a p-value below 0.1.
ROB in studies mostly stems from an inappropriate patient and data selection. The studies with low ROB have better F1-score in acute rib fracture detection using DL models.
This systematic review will be a reference to the taxonomy of the current status of rib fracture detection with DL models, and upcoming studies will benefit from our data extraction, our ROB assessment, and our meta-analysis.
Mots-clé
Humans, Rib Fractures/diagnostic imaging, Deep Learning, Tomography, X-Ray Computed, Retrospective Studies, CT, computed tomography, deep learning, rib fracture
Pubmed
Web of science
Open Access
Oui
Création de la notice
09/02/2024 10:49
Dernière modification de la notice
12/03/2024 7:16