Deep learning for acute rib fracture detection in CT data: a systematic review and meta-analysis.
Details
Serval ID
serval:BIB_87520A6256AF
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Deep learning for acute rib fracture detection in CT data: a systematic review and meta-analysis.
Journal
The British journal of radiology
ISSN
1748-880X (Electronic)
ISSN-L
0007-1285
Publication state
Published
Issued date
28/02/2024
Peer-reviewed
Oui
Volume
97
Number
1155
Pages
535-543
Language
english
Notes
Publication types: Meta-Analysis ; Systematic Review ; Journal Article
Publication Status: ppublish
Publication Status: ppublish
Abstract
To review studies on deep learning (DL) models for classification, detection, and segmentation of rib fractures in CT data, to determine their risk of bias (ROB), and to analyse the performance of acute rib fracture detection models.
Research articles written in English were retrieved from PubMed, Embase, and Web of Science in April 2023. A study was only included if a DL model was used to classify, detect, or segment rib fractures, and only if the model was trained with CT data from humans. For the ROB assessment, the Quality Assessment of Diagnostic Accuracy Studies tool was used. The performance of acute rib fracture detection models was meta-analysed with forest plots.
A total of 27 studies were selected. About 75% of the studies have ROB by not reporting the patient selection criteria, including control patients or using 5-mm slice thickness CT scans. The sensitivity, precision, and F1-score of the subgroup of low ROB studies were 89.60% (95%CI, 86.31%-92.90%), 84.89% (95%CI, 81.59%-88.18%), and 86.66% (95%CI, 84.62%-88.71%), respectively. The ROB subgroup differences test for the F1-score led to a p-value below 0.1.
ROB in studies mostly stems from an inappropriate patient and data selection. The studies with low ROB have better F1-score in acute rib fracture detection using DL models.
This systematic review will be a reference to the taxonomy of the current status of rib fracture detection with DL models, and upcoming studies will benefit from our data extraction, our ROB assessment, and our meta-analysis.
Research articles written in English were retrieved from PubMed, Embase, and Web of Science in April 2023. A study was only included if a DL model was used to classify, detect, or segment rib fractures, and only if the model was trained with CT data from humans. For the ROB assessment, the Quality Assessment of Diagnostic Accuracy Studies tool was used. The performance of acute rib fracture detection models was meta-analysed with forest plots.
A total of 27 studies were selected. About 75% of the studies have ROB by not reporting the patient selection criteria, including control patients or using 5-mm slice thickness CT scans. The sensitivity, precision, and F1-score of the subgroup of low ROB studies were 89.60% (95%CI, 86.31%-92.90%), 84.89% (95%CI, 81.59%-88.18%), and 86.66% (95%CI, 84.62%-88.71%), respectively. The ROB subgroup differences test for the F1-score led to a p-value below 0.1.
ROB in studies mostly stems from an inappropriate patient and data selection. The studies with low ROB have better F1-score in acute rib fracture detection using DL models.
This systematic review will be a reference to the taxonomy of the current status of rib fracture detection with DL models, and upcoming studies will benefit from our data extraction, our ROB assessment, and our meta-analysis.
Keywords
Humans, Rib Fractures/diagnostic imaging, Deep Learning, Tomography, X-Ray Computed, Retrospective Studies, CT, computed tomography, deep learning, rib fracture
Pubmed
Web of science
Open Access
Yes
Create date
09/02/2024 10:49
Last modification date
12/03/2024 7:16