CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.

The zinc-dependent metalloproteases with His-Glu-x-x-His (HExxH) active site motif, zincins, are a broad group of proteins involved in many metabolic and regulatory functions, and found in all forms of life. Human genome contains more than 100 genes encoding proteins with known zincin-like domains....

Full description

Bibliographic Details
Main Authors: Anna Lenart, Małgorzata Dudkiewicz, Marcin Grynberg, Krzysztof Pawłowski
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23671590/pdf/?tool=EBI
_version_ 1818573293294190592
author Anna Lenart
Małgorzata Dudkiewicz
Marcin Grynberg
Krzysztof Pawłowski
author_facet Anna Lenart
Małgorzata Dudkiewicz
Marcin Grynberg
Krzysztof Pawłowski
author_sort Anna Lenart
collection DOAJ
description The zinc-dependent metalloproteases with His-Glu-x-x-His (HExxH) active site motif, zincins, are a broad group of proteins involved in many metabolic and regulatory functions, and found in all forms of life. Human genome contains more than 100 genes encoding proteins with known zincin-like domains. A survey of all proteins containing the HExxH motif shows that approximately 52% of HExxH occurrences fall within known protein structural domains (as defined in the Pfam database). Domain families with majority of members possessing a conserved HExxH motif include, not surprisingly, many known and putative metalloproteases. Furthermore, several HExxH-containing protein domains thus identified can be confidently predicted to be putative peptidases of zincin fold. Thus, we predict zincin-like fold for eight uncharacterised Pfam families. Besides the domains with the HExxH motif strictly conserved, and those with sporadic occurrences, intermediate families are identified that contain some members with a conserved HExxH motif, but also many homologues with substitutions at the conserved positions. Such substitutions can be evolutionarily conserved and non-random, yet functional roles of these inactive zincins are not known. The CLCAs are a novel zincin-like protease family with many cases of substituted active sites. We show that this allegedly metazoan family has a number of bacterial and archaeal members. An extremely patchy phylogenetic distribution of CLCAs in prokaryotes and their conserved protein domain composition strongly suggests an evolutionary scenario of horizontal gene transfer (HGT) from multicellular eukaryotes to bacteria, providing an example of eukaryote-derived xenologues in bacterial genomes. Additionally, in a protein family identified here as closely homologous to CLCA, the CLCA_X (CLCA-like) family, a number of proteins is found in phages and plasmids, supporting the HGT scenario.
first_indexed 2024-12-15T00:09:14Z
format Article
id doaj.art-b6ef177ef9014ea3a0ccdd79f204bcff
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-15T00:09:14Z
publishDate 2013-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-b6ef177ef9014ea3a0ccdd79f204bcff2022-12-21T22:42:38ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0185e6227210.1371/journal.pone.0062272CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.Anna LenartMałgorzata DudkiewiczMarcin GrynbergKrzysztof PawłowskiThe zinc-dependent metalloproteases with His-Glu-x-x-His (HExxH) active site motif, zincins, are a broad group of proteins involved in many metabolic and regulatory functions, and found in all forms of life. Human genome contains more than 100 genes encoding proteins with known zincin-like domains. A survey of all proteins containing the HExxH motif shows that approximately 52% of HExxH occurrences fall within known protein structural domains (as defined in the Pfam database). Domain families with majority of members possessing a conserved HExxH motif include, not surprisingly, many known and putative metalloproteases. Furthermore, several HExxH-containing protein domains thus identified can be confidently predicted to be putative peptidases of zincin fold. Thus, we predict zincin-like fold for eight uncharacterised Pfam families. Besides the domains with the HExxH motif strictly conserved, and those with sporadic occurrences, intermediate families are identified that contain some members with a conserved HExxH motif, but also many homologues with substitutions at the conserved positions. Such substitutions can be evolutionarily conserved and non-random, yet functional roles of these inactive zincins are not known. The CLCAs are a novel zincin-like protease family with many cases of substituted active sites. We show that this allegedly metazoan family has a number of bacterial and archaeal members. An extremely patchy phylogenetic distribution of CLCAs in prokaryotes and their conserved protein domain composition strongly suggests an evolutionary scenario of horizontal gene transfer (HGT) from multicellular eukaryotes to bacteria, providing an example of eukaryote-derived xenologues in bacterial genomes. Additionally, in a protein family identified here as closely homologous to CLCA, the CLCA_X (CLCA-like) family, a number of proteins is found in phages and plasmids, supporting the HGT scenario.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23671590/pdf/?tool=EBI
spellingShingle Anna Lenart
Małgorzata Dudkiewicz
Marcin Grynberg
Krzysztof Pawłowski
CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.
PLoS ONE
title CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.
title_full CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.
title_fullStr CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.
title_full_unstemmed CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.
title_short CLCAs - a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites.
title_sort clcas a family of metalloproteases of intriguing phylogenetic distribution and with cases of substituted catalytic sites
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23671590/pdf/?tool=EBI
work_keys_str_mv AT annalenart clcasafamilyofmetalloproteasesofintriguingphylogeneticdistributionandwithcasesofsubstitutedcatalyticsites
AT małgorzatadudkiewicz clcasafamilyofmetalloproteasesofintriguingphylogeneticdistributionandwithcasesofsubstitutedcatalyticsites
AT marcingrynberg clcasafamilyofmetalloproteasesofintriguingphylogeneticdistributionandwithcasesofsubstitutedcatalyticsites
AT krzysztofpawłowski clcasafamilyofmetalloproteasesofintriguingphylogeneticdistributionandwithcasesofsubstitutedcatalyticsites