Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern
Abstract Background Coffea arabica L. is an economically important agricultural crop and the most popular beverage worldwide. As a perennial crop with recalcitrant seed, conservation of the genetic resources of coffee can be achieved through the complementary approach of in-situ and ex-situ field ge...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2022-02-01
|
Series: | BMC Plant Biology |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12870-022-03449-4 |
_version_ | 1818280316683419648 |
---|---|
author | Yeshitila Mekbib Kassahun Tesfaye Xiang Dong Josphat K. Saina Guang-Wan Hu Qing-Feng Wang |
author_facet | Yeshitila Mekbib Kassahun Tesfaye Xiang Dong Josphat K. Saina Guang-Wan Hu Qing-Feng Wang |
author_sort | Yeshitila Mekbib |
collection | DOAJ |
description | Abstract Background Coffea arabica L. is an economically important agricultural crop and the most popular beverage worldwide. As a perennial crop with recalcitrant seed, conservation of the genetic resources of coffee can be achieved through the complementary approach of in-situ and ex-situ field genebank. In Ethiopia, a large collection of C. arabica L. germplasm is preserved in field gene banks. Here, we report the whole-genome resequencing of 90 accessions from Choche germplasm bank representing garden and forest-based coffee production systems using Illumina sequencing technology. Results The genome sequencing generated 6.41 billion paired-end reads, with a mean of 71.19 million reads per sample. More than 93% of the clean reads were mapped onto the C. arabica L. reference genome. A total of 11.08 million variants were identified, among which 9.74 million (87.9%) were SNPs (Single nucleotide polymorphisms) and 1.34 million (12.1%) were InDels. In all accessions, genomic variants were unevenly distributed across the coffee genome. The phylogenetic analysis using the SNP markers displayed distinct groups. Conclusions Resequencing of the coffee accessions has allowed identification of genetic markers, such as SNPs and InDels. The SNPs discovered in this study might contribute to the variation in important pathways of genes for important agronomic traits such as caffeine content, yield, disease, and pest in coffee. Moreover, the genome resequencing data and the genetic markers identified from 90 accessions provide insight into the genetic variation of the coffee germplasm and facilitate a broad range of genetic studies. |
first_indexed | 2024-12-12T23:47:18Z |
format | Article |
id | doaj.art-83657c9e001a45b39cbeac1eb07a8f48 |
institution | Directory Open Access Journal |
issn | 1471-2229 |
language | English |
last_indexed | 2024-12-12T23:47:18Z |
publishDate | 2022-02-01 |
publisher | BMC |
record_format | Article |
series | BMC Plant Biology |
spelling | doaj.art-83657c9e001a45b39cbeac1eb07a8f482022-12-22T00:06:49ZengBMCBMC Plant Biology1471-22292022-02-012211910.1186/s12870-022-03449-4Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical patternYeshitila Mekbib0Kassahun Tesfaye1Xiang Dong2Josphat K. Saina3Guang-Wan Hu4Qing-Feng Wang5CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesDepartment of Microbial, Cellular and Molecular Biology, Addis Ababa UniversityCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesAbstract Background Coffea arabica L. is an economically important agricultural crop and the most popular beverage worldwide. As a perennial crop with recalcitrant seed, conservation of the genetic resources of coffee can be achieved through the complementary approach of in-situ and ex-situ field genebank. In Ethiopia, a large collection of C. arabica L. germplasm is preserved in field gene banks. Here, we report the whole-genome resequencing of 90 accessions from Choche germplasm bank representing garden and forest-based coffee production systems using Illumina sequencing technology. Results The genome sequencing generated 6.41 billion paired-end reads, with a mean of 71.19 million reads per sample. More than 93% of the clean reads were mapped onto the C. arabica L. reference genome. A total of 11.08 million variants were identified, among which 9.74 million (87.9%) were SNPs (Single nucleotide polymorphisms) and 1.34 million (12.1%) were InDels. In all accessions, genomic variants were unevenly distributed across the coffee genome. The phylogenetic analysis using the SNP markers displayed distinct groups. Conclusions Resequencing of the coffee accessions has allowed identification of genetic markers, such as SNPs and InDels. The SNPs discovered in this study might contribute to the variation in important pathways of genes for important agronomic traits such as caffeine content, yield, disease, and pest in coffee. Moreover, the genome resequencing data and the genetic markers identified from 90 accessions provide insight into the genetic variation of the coffee germplasm and facilitate a broad range of genetic studies.https://doi.org/10.1186/s12870-022-03449-4CoffeeGenetic markersPhylogenetic analysisResequencingSingle nucleotide polymorphism |
spellingShingle | Yeshitila Mekbib Kassahun Tesfaye Xiang Dong Josphat K. Saina Guang-Wan Hu Qing-Feng Wang Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern BMC Plant Biology Coffee Genetic markers Phylogenetic analysis Resequencing Single nucleotide polymorphism |
title | Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern |
title_full | Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern |
title_fullStr | Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern |
title_full_unstemmed | Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern |
title_short | Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern |
title_sort | whole genome resequencing of coffea arabica l rubiaceae genotypes identify snp and unravels distinct groups showing a strong geographical pattern |
topic | Coffee Genetic markers Phylogenetic analysis Resequencing Single nucleotide polymorphism |
url | https://doi.org/10.1186/s12870-022-03449-4 |
work_keys_str_mv | AT yeshitilamekbib wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern AT kassahuntesfaye wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern AT xiangdong wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern AT josphatksaina wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern AT guangwanhu wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern AT qingfengwang wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern |