Deep learning for acute rib fracture detection in CT data: a systematic review and meta-analysis.

Lopez-Melia, M.; Magnin, V.; Marchand-Maillet, S.; Grabherr, S.

doi:10.1093/bjr/tqae014

Deep learning for acute rib fracture detection in CT data: a systematic review and meta-analysis.

Details

Download: 38323515.pdf (806.44 [Ko])
State: Public
Version: Final published version
License: CC BY 4.0

Serval ID

serval:BIB_87520A6256AF

Type

Article: article from journal or magazin.

Collection

Publications

Institution

UNIL/CHUV

Title

Deep learning for acute rib fracture detection in CT data: a systematic review and meta-analysis.

Journal

The British journal of radiology

Author(s)

Lopez-Melia M., Magnin V., Marchand-Maillet S., Grabherr S.

ISSN

1748-880X (Electronic)

ISSN-L

0007-1285

Publication state

Published

Issued date

28/02/2024

Peer-reviewed

Oui

Volume

Number

1155

Pages

535-543

Language

english

Notes

Publication types: Meta-Analysis ; Systematic Review ; Journal Article
Publication Status: ppublish

Abstract

To review studies on deep learning (DL) models for classification, detection, and segmentation of rib fractures in CT data, to determine their risk of bias (ROB), and to analyse the performance of acute rib fracture detection models.
Research articles written in English were retrieved from PubMed, Embase, and Web of Science in April 2023. A study was only included if a DL model was used to classify, detect, or segment rib fractures, and only if the model was trained with CT data from humans. For the ROB assessment, the Quality Assessment of Diagnostic Accuracy Studies tool was used. The performance of acute rib fracture detection models was meta-analysed with forest plots.
A total of 27 studies were selected. About 75% of the studies have ROB by not reporting the patient selection criteria, including control patients or using 5-mm slice thickness CT scans. The sensitivity, precision, and F1-score of the subgroup of low ROB studies were 89.60% (95%CI, 86.31%-92.90%), 84.89% (95%CI, 81.59%-88.18%), and 86.66% (95%CI, 84.62%-88.71%), respectively. The ROB subgroup differences test for the F1-score led to a p-value below 0.1.
ROB in studies mostly stems from an inappropriate patient and data selection. The studies with low ROB have better F1-score in acute rib fracture detection using DL models.
This systematic review will be a reference to the taxonomy of the current status of rib fracture detection with DL models, and upcoming studies will benefit from our data extraction, our ROB assessment, and our meta-analysis.

Keywords

Humans, Rib Fractures/diagnostic imaging, Deep Learning, Tomography, X-Ray Computed, Retrospective Studies, CT, computed tomography, deep learning, rib fracture

URN

urn:nbn:ch:serval-BIB_87520A6256AF1

OAI-PMH

oai:serval.unil.ch:BIB_87520A6256AF

DOI

10.1093/bjr/tqae014

Pubmed

38323515

Web of science

001157585900001

Open Access

Yes