RefPlantNLR is a comprehensive collection of experimentally validated plant disease resistance proteins from the NLR family.
Reference datasets are critical in computational biology. They help define canonical biological features and are essential for benchmarking studies. Here, we describe a comprehensive reference dataset of experimentally validated plant nucleotide-binding leucine-rich repeat (NLR) immune receptors. Re...
Main Authors: | Jiorgos Kourelis, Toshiyuki Sakai, Hiroaki Adachi, Sophien Kamoun |
---|---|
Format: | Article |
Language: | English |
Published: | Public Library of Science (PLoS), 2021-10-01 |
Series: | PLoS Biology |
Online Access: | https://doi.org/10.1371/journal.pbio.3001124 |
_version_ | 1819101807561932800 |
---|---|
author | Jiorgos Kourelis Toshiyuki Sakai Hiroaki Adachi Sophien Kamoun |
collection | DOAJ |
description | Reference datasets are critical in computational biology. They help define canonical biological features and are essential for benchmarking studies. Here, we describe a comprehensive reference dataset of experimentally validated plant nucleotide-binding leucine-rich repeat (NLR) immune receptors. RefPlantNLR consists of 481 NLRs from 31 genera belonging to 11 orders of flowering plants. This reference dataset has several applications. We used RefPlantNLR to determine the canonical features of functionally validated plant NLRs and to benchmark 5 NLR annotation tools. This revealed that although NLR annotation tools tend to retrieve the majority of NLRs, they frequently produce domain architectures that are inconsistent with the RefPlantNLR annotation. Guided by this analysis, we developed a new pipeline, NLRtracker, which extracts and annotates NLRs from protein or transcript files based on the core features found in the RefPlantNLR dataset. The RefPlantNLR dataset should also prove useful for guiding comparative analyses of NLRs across the wide spectrum of plant diversity and identifying understudied taxa. We hope that the RefPlantNLR resource will contribute to moving the field beyond a uniform view of NLR structure and function. |
first_indexed | 2024-12-22T01:24:32Z |
format | Article |
id | doaj.art-33905e38031a4ff7bf30da3fc029eced |
institution | Directory Open Access Journal |
issn | 1544-9173 1545-7885 |
language | English |
last_indexed | 2024-12-22T01:24:32Z |
publishDate | 2021-10-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS Biology |
spelling | doaj.art-33905e38031a4ff7bf30da3fc029eced; 2022-12-21T18:43:38Z; eng; Public Library of Science (PLoS); PLoS Biology; ISSN 1544-9173, 1545-7885; 2021-10-01; volume 19, issue 10, e3001124; https://doi.org/10.1371/journal.pbio.3001124 |
title | RefPlantNLR is a comprehensive collection of experimentally validated plant disease resistance proteins from the NLR family. |
url | https://doi.org/10.1371/journal.pbio.3001124 |