Remote homolog detection places insect chemoreceptors in a cryptic protein superfamily spanning the tree of life.
Détails
Télécharger: 2023.08.30.555482v1.full.pdf (4191.89 [Ko])
Etat: Public
Version: Author's accepted manuscript
Licence: Non spécifiée
Etat: Public
Version: Author's accepted manuscript
Licence: Non spécifiée
ID Serval
serval:BIB_3B786407249F
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
Remote homolog detection places insect chemoreceptors in a cryptic protein superfamily spanning the tree of life.
Périodique
Current biology
ISSN
1879-0445 (Electronic)
ISSN-L
0960-9822
Statut éditorial
Publié
Date de publication
20/11/2023
Peer-reviewed
Oui
Volume
33
Numéro
22
Pages
5023-5033.e4
Langue
anglais
Notes
Publication types: Journal Article
Publication Status: ppublish
Publication Status: ppublish
Résumé
Many proteins exist in the so-called "twilight zone" of sequence alignment, where low pairwise sequence identity makes it difficult to determine homology and phylogeny. <sup>1</sup> <sup>,</sup> <sup>2</sup> As protein tertiary structure is often more conserved, <sup>3</sup> recent advances in ab initio protein folding have made structure-based identification of putative homologs feasible. <sup>4</sup> <sup>,</sup> <sup>5</sup> <sup>,</sup> <sup>6</sup> We present a pipeline for the identification and characterization of distant homologs and apply it to 7-transmembrane-domain ion channels (7TMICs), a protein group founded by insect odorant and gustatory receptors. Previous sequence and limited structure-based searches identified putatively related proteins, mainly in other animals and plants. <sup>7</sup> <sup>,</sup> <sup>8</sup> <sup>,</sup> <sup>9</sup> <sup>,</sup> <sup>10</sup> However, very few 7TMICs have been identified in non-animal, non-plant taxa. Moreover, these proteins' remarkable sequence dissimilarity made it uncertain whether disparate 7TMIC types (Gr/Or, Grl, GRL, DUF3537, PHTF, and GrlHz) are homologous or convergent, leaving their evolutionary history unresolved. Our pipeline identified thousands of new 7TMICs in archaea, bacteria, and unicellular eukaryotes. Using graph-based analyses and protein language models to extract family-wide signatures, we demonstrate that 7TMICs have structure and sequence similarity, supporting homology. Through sequence- and structure-based phylogenetics, we classify eukaryotic 7TMICs into two families (Class-A and Class-B), which are the result of a gene duplication predating the split(s) leading to Amorphea (animals, fungi, and allies) and Diaphoretickes (plants and allies). Our work reveals 7TMICs as a cryptic superfamily, with origins close to the evolution of cellular life. More generally, this study serves as a methodological proof of principle for the identification of extremely distant protein homologs.
Mots-clé
Humans, Animals, Amino Acid Sequence, Sequence Alignment, Proteins/genetics, Archaea/genetics, Plants/genetics, Phylogeny, Evolution, Molecular, LECA, LUCA, ab initio folding, evolution, homology, sensory receptor, transmembrane protein
Pubmed
Web of science
Création de la notice
30/08/2023 15:02
Dernière modification de la notice
18/07/2024 6:11