Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern

Abstract Background Coffea arabica L. is an economically important agricultural crop and the most popular beverage worldwide. As a perennial crop with recalcitrant seed, conservation of the genetic resources of coffee can be achieved through the complementary approach of in-situ and ex-situ field ge...

Full description

Bibliographic Details
Main Authors: Yeshitila Mekbib, Kassahun Tesfaye, Xiang Dong, Josphat K. Saina, Guang-Wan Hu, Qing-Feng Wang
Format: Article
Language:English
Published: BMC 2022-02-01
Series:BMC Plant Biology
Subjects:
Online Access:https://doi.org/10.1186/s12870-022-03449-4
_version_ 1818280316683419648
author Yeshitila Mekbib
Kassahun Tesfaye
Xiang Dong
Josphat K. Saina
Guang-Wan Hu
Qing-Feng Wang
author_facet Yeshitila Mekbib
Kassahun Tesfaye
Xiang Dong
Josphat K. Saina
Guang-Wan Hu
Qing-Feng Wang
author_sort Yeshitila Mekbib
collection DOAJ
description Abstract Background Coffea arabica L. is an economically important agricultural crop and the most popular beverage worldwide. As a perennial crop with recalcitrant seed, conservation of the genetic resources of coffee can be achieved through the complementary approach of in-situ and ex-situ field genebank. In Ethiopia, a large collection of C. arabica L. germplasm is preserved in field gene banks. Here, we report the whole-genome resequencing of 90 accessions from Choche germplasm bank representing garden and forest-based coffee production systems using Illumina sequencing technology. Results The genome sequencing generated 6.41 billion paired-end reads, with a mean of 71.19 million reads per sample. More than 93% of the clean reads were mapped onto the C. arabica L. reference genome. A total of 11.08 million variants were identified, among which 9.74 million (87.9%) were SNPs (Single nucleotide polymorphisms) and 1.34 million (12.1%) were InDels. In all accessions, genomic variants were unevenly distributed across the coffee genome. The phylogenetic analysis using the SNP markers displayed distinct groups. Conclusions Resequencing of the coffee accessions has allowed identification of genetic markers, such as SNPs and InDels. The SNPs discovered in this study might contribute to the variation in important pathways of genes for important agronomic traits such as caffeine content, yield, disease, and pest in coffee. Moreover, the genome resequencing data and the genetic markers identified from 90 accessions provide insight into the genetic variation of the coffee germplasm and facilitate a broad range of genetic studies.
first_indexed 2024-12-12T23:47:18Z
format Article
id doaj.art-83657c9e001a45b39cbeac1eb07a8f48
institution Directory Open Access Journal
issn 1471-2229
language English
last_indexed 2024-12-12T23:47:18Z
publishDate 2022-02-01
publisher BMC
record_format Article
series BMC Plant Biology
spelling doaj.art-83657c9e001a45b39cbeac1eb07a8f482022-12-22T00:06:49ZengBMCBMC Plant Biology1471-22292022-02-012211910.1186/s12870-022-03449-4Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical patternYeshitila Mekbib0Kassahun Tesfaye1Xiang Dong2Josphat K. Saina3Guang-Wan Hu4Qing-Feng Wang5CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesDepartment of Microbial, Cellular and Molecular Biology, Addis Ababa UniversityCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesCAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of SciencesAbstract Background Coffea arabica L. is an economically important agricultural crop and the most popular beverage worldwide. As a perennial crop with recalcitrant seed, conservation of the genetic resources of coffee can be achieved through the complementary approach of in-situ and ex-situ field genebank. In Ethiopia, a large collection of C. arabica L. germplasm is preserved in field gene banks. Here, we report the whole-genome resequencing of 90 accessions from Choche germplasm bank representing garden and forest-based coffee production systems using Illumina sequencing technology. Results The genome sequencing generated 6.41 billion paired-end reads, with a mean of 71.19 million reads per sample. More than 93% of the clean reads were mapped onto the C. arabica L. reference genome. A total of 11.08 million variants were identified, among which 9.74 million (87.9%) were SNPs (Single nucleotide polymorphisms) and 1.34 million (12.1%) were InDels. In all accessions, genomic variants were unevenly distributed across the coffee genome. The phylogenetic analysis using the SNP markers displayed distinct groups. Conclusions Resequencing of the coffee accessions has allowed identification of genetic markers, such as SNPs and InDels. The SNPs discovered in this study might contribute to the variation in important pathways of genes for important agronomic traits such as caffeine content, yield, disease, and pest in coffee. Moreover, the genome resequencing data and the genetic markers identified from 90 accessions provide insight into the genetic variation of the coffee germplasm and facilitate a broad range of genetic studies.https://doi.org/10.1186/s12870-022-03449-4CoffeeGenetic markersPhylogenetic analysisResequencingSingle nucleotide polymorphism
spellingShingle Yeshitila Mekbib
Kassahun Tesfaye
Xiang Dong
Josphat K. Saina
Guang-Wan Hu
Qing-Feng Wang
Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern
BMC Plant Biology
Coffee
Genetic markers
Phylogenetic analysis
Resequencing
Single nucleotide polymorphism
title Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern
title_full Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern
title_fullStr Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern
title_full_unstemmed Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern
title_short Whole-genome resequencing of Coffea arabica L. (Rubiaceae) genotypes identify SNP and unravels distinct groups showing a strong geographical pattern
title_sort whole genome resequencing of coffea arabica l rubiaceae genotypes identify snp and unravels distinct groups showing a strong geographical pattern
topic Coffee
Genetic markers
Phylogenetic analysis
Resequencing
Single nucleotide polymorphism
url https://doi.org/10.1186/s12870-022-03449-4
work_keys_str_mv AT yeshitilamekbib wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern
AT kassahuntesfaye wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern
AT xiangdong wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern
AT josphatksaina wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern
AT guangwanhu wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern
AT qingfengwang wholegenomeresequencingofcoffeaarabicalrubiaceaegenotypesidentifysnpandunravelsdistinctgroupsshowingastronggeographicalpattern