Clinical data mining: a review.
Détails
ID Serval
serval:BIB_BA34E9C26657
Type
Article: article d'un périodique ou d'un magazine.
Sous-type
Synthèse (review): revue aussi complète que possible des connaissances sur un sujet, rédigée à partir de l'analyse exhaustive des travaux publiés.
Collection
Publications
Institution
Titre
Clinical data mining: a review.
Périodique
Yearbook of medical informatics
ISSN
2364-0502 (Electronic)
ISSN-L
0943-4747
Statut éditorial
Publié
Date de publication
2009
Peer-reviewed
Oui
Pages
121-133
Langue
anglais
Notes
Publication types: Journal Article ; Research Support, Non-U.S. Gov't ; Review
Publication Status: ppublish
Publication Status: ppublish
Résumé
Clinical data mining is the application of data mining techniques using clinical data. We review the literature in order to provide a general overview by identifying the status-of-practice and the challenges ahead.
The nine data mining steps proposed by Fayyad in 1996 [4] were used as the main themes of the review. MEDLINE was used as primary source and 84 papers were retained based on our inclusion criteria.
Clinical data mining has three objectives: understanding the clinical data, assist healthcare professionals, and develop a data analysis methodology suitable for medical data. Classification is the most frequently used data mining function with a predominance of the implementation of Bayesian classifiers, neural networks, and SVMs (Support Vector Machines). A myriad of quantitative performance measures were proposed with a predominance of accuracy, sensitivity, specificity, and ROC curves. The latter are usually associated with qualitative evaluation.
Clinical data mining respects its commitment to extracting new and previously unknown knowledge from clinical databases. More efforts are still needed to obtain a wider acceptance from the healthcare professionals and for generalization of the knowledge and reproducibility of its extraction process: better description of variables, systematic report of algorithm parameters including the method to obtain them, use of easy-to-understand models and comparisons of the efficiency of clinical data mining with traditional statistical analyses. More and more data will be available for data miners and they have to develop new methodologies and infrastructures to analyze the increasingly complex medical data.
The nine data mining steps proposed by Fayyad in 1996 [4] were used as the main themes of the review. MEDLINE was used as primary source and 84 papers were retained based on our inclusion criteria.
Clinical data mining has three objectives: understanding the clinical data, assist healthcare professionals, and develop a data analysis methodology suitable for medical data. Classification is the most frequently used data mining function with a predominance of the implementation of Bayesian classifiers, neural networks, and SVMs (Support Vector Machines). A myriad of quantitative performance measures were proposed with a predominance of accuracy, sensitivity, specificity, and ROC curves. The latter are usually associated with qualitative evaluation.
Clinical data mining respects its commitment to extracting new and previously unknown knowledge from clinical databases. More efforts are still needed to obtain a wider acceptance from the healthcare professionals and for generalization of the knowledge and reproducibility of its extraction process: better description of variables, systematic report of algorithm parameters including the method to obtain them, use of easy-to-understand models and comparisons of the efficiency of clinical data mining with traditional statistical analyses. More and more data will be available for data miners and they have to develop new methodologies and infrastructures to analyze the increasingly complex medical data.
Mots-clé
Algorithms, Bibliometrics, Clinical Medicine, Data Mining/methods, Data Mining/statistics & numerical data
Pubmed
Création de la notice
29/08/2023 7:45
Dernière modification de la notice
13/10/2023 14:35