Analysis of missense variants in the human genome reveals widespread gene-specific clustering and improves prediction of pathogenicity.

Détails

Ressource 1Télécharger: 35120630.pdf (4311.68 [Ko])
Etat: Public
Version: Final published version
Licence: CC BY-NC-ND 4.0
ID Serval
serval:BIB_00A3C85AEFEA
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
Analysis of missense variants in the human genome reveals widespread gene-specific clustering and improves prediction of pathogenicity.
Périodique
American journal of human genetics
Auteur⸱e⸱s
Quinodoz M., Peter V.G., Cisarova K., Royer-Bertrand B., Stenson P.D., Cooper D.N., Unger S., Superti-Furga A., Rivolta C.
ISSN
1537-6605 (Electronic)
ISSN-L
0002-9297
Statut éditorial
Publié
Date de publication
03/03/2022
Peer-reviewed
Oui
Volume
109
Numéro
3
Pages
457-470
Langue
anglais
Notes
Publication types: Journal Article ; Research Support, Non-U.S. Gov't
Publication Status: ppublish
Résumé
We used a machine learning approach to analyze the within-gene distribution of missense variants observed in hereditary conditions and cancer. When applied to 840 genes from the ClinVar database, this approach detected a significant non-random distribution of pathogenic and benign variants in 387 (46%) and 172 (20%) genes, respectively, revealing that variant clustering is widespread across the human exome. This clustering likely occurs as a consequence of mechanisms shaping pathogenicity at the protein level, as illustrated by the overlap of some clusters with known functional domains. We then took advantage of these findings to develop a pathogenicity predictor, MutScore, that integrates qualitative features of DNA substitutions with the new additional information derived from this positional clustering. Using a random forest approach, MutScore was able to identify pathogenic missense mutations with very high accuracy, outperforming existing predictive tools, especially for variants associated with autosomal-dominant disease and cancer. Thus, the within-gene clustering of pathogenic and benign DNA changes is an important and previously underappreciated feature of the human exome, which can be harnessed to improve the prediction of pathogenicity and disambiguation of DNA variants of uncertain significance.
Mots-clé
Cluster Analysis, Exome/genetics, Genome, Human/genetics, Humans, Mutation, Missense/genetics, Virulence, functional domains, gain-of-function, human genome, loss-of-function, medical genetics, missense, molecular pathology, mutation, pathogenicity prediction, variant clustering
Pubmed
Web of science
Open Access
Oui
Création de la notice
12/02/2022 14:57
Dernière modification de la notice
05/05/2022 6:08
Données d'usage