Remote homolog detection places insect chemoreceptors in a cryptic protein superfamily spanning the tree of life.
Details
Download: 2023.08.30.555482v1.full.pdf (4191.89 [Ko])
State: Public
Version: Author's accepted manuscript
License: Not specified
State: Public
Version: Author's accepted manuscript
License: Not specified
Serval ID
serval:BIB_3B786407249F
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Remote homolog detection places insect chemoreceptors in a cryptic protein superfamily spanning the tree of life.
Journal
Current biology
ISSN
1879-0445 (Electronic)
ISSN-L
0960-9822
Publication state
Published
Issued date
20/11/2023
Peer-reviewed
Oui
Volume
33
Number
22
Pages
5023-5033.e4
Language
english
Notes
Publication types: Journal Article
Publication Status: ppublish
Publication Status: ppublish
Abstract
Many proteins exist in the so-called "twilight zone" of sequence alignment, where low pairwise sequence identity makes it difficult to determine homology and phylogeny. <sup>1</sup> <sup>,</sup> <sup>2</sup> As protein tertiary structure is often more conserved, <sup>3</sup> recent advances in ab initio protein folding have made structure-based identification of putative homologs feasible. <sup>4</sup> <sup>,</sup> <sup>5</sup> <sup>,</sup> <sup>6</sup> We present a pipeline for the identification and characterization of distant homologs and apply it to 7-transmembrane-domain ion channels (7TMICs), a protein group founded by insect odorant and gustatory receptors. Previous sequence and limited structure-based searches identified putatively related proteins, mainly in other animals and plants. <sup>7</sup> <sup>,</sup> <sup>8</sup> <sup>,</sup> <sup>9</sup> <sup>,</sup> <sup>10</sup> However, very few 7TMICs have been identified in non-animal, non-plant taxa. Moreover, these proteins' remarkable sequence dissimilarity made it uncertain whether disparate 7TMIC types (Gr/Or, Grl, GRL, DUF3537, PHTF, and GrlHz) are homologous or convergent, leaving their evolutionary history unresolved. Our pipeline identified thousands of new 7TMICs in archaea, bacteria, and unicellular eukaryotes. Using graph-based analyses and protein language models to extract family-wide signatures, we demonstrate that 7TMICs have structure and sequence similarity, supporting homology. Through sequence- and structure-based phylogenetics, we classify eukaryotic 7TMICs into two families (Class-A and Class-B), which are the result of a gene duplication predating the split(s) leading to Amorphea (animals, fungi, and allies) and Diaphoretickes (plants and allies). Our work reveals 7TMICs as a cryptic superfamily, with origins close to the evolution of cellular life. More generally, this study serves as a methodological proof of principle for the identification of extremely distant protein homologs.
Keywords
Humans, Animals, Amino Acid Sequence, Sequence Alignment, Proteins/genetics, Archaea/genetics, Plants/genetics, Phylogeny, Evolution, Molecular, LECA, LUCA, ab initio folding, evolution, homology, sensory receptor, transmembrane protein
Pubmed
Web of science
Create date
30/08/2023 15:02
Last modification date
18/07/2024 6:11