Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement

Genomic data in Gossypium provide numerous data resources for the cotton genomics community. However, to fill the gap between genomic analysis and breeding field work, detecting the featured genomic items of a subset cohort is essential for geneticists. We developed FPFinder v1.0 software to identif...

Full description

Bibliographic Details
Main Authors: Shang Liu, Hailiang Cheng, Youping Zhang, Man He, Dongyun Zuo, Qiaolian Wang, Limin Lv, Zhongxv Lin, Guoli Song
Format: Article
Language:English
Published: MDPI AG 2024-03-01
Series:Genes
Subjects:
Online Access:https://www.mdpi.com/2073-4425/15/3/378
_version_ 1797240926795464704
author Shang Liu
Hailiang Cheng
Youping Zhang
Man He
Dongyun Zuo
Qiaolian Wang
Limin Lv
Zhongxv Lin
Guoli Song
author_facet Shang Liu
Hailiang Cheng
Youping Zhang
Man He
Dongyun Zuo
Qiaolian Wang
Limin Lv
Zhongxv Lin
Guoli Song
author_sort Shang Liu
collection DOAJ
description Genomic data in Gossypium provide numerous data resources for the cotton genomics community. However, to fill the gap between genomic analysis and breeding field work, detecting the featured genomic items of a subset cohort is essential for geneticists. We developed FPFinder v1.0 software to identify a subset of the cohort’s fingerprint genomic sites. The FPFinder was developed based on the term frequency–inverse document frequency algorithm. With the short-read sequencing of an elite cotton pedigree, we identified 453 pedigree fingerprint genomic sites and found that these pedigree-featured sites had a role in cotton development. In addition, we applied FPFinder to evaluate the geographical bias of fiber-length-related genomic sites from a modern cotton cohort consisting of 410 accessions. Enriching elite sites in cultivars from the Yangtze River region resulted in the longer fiber length of Yangze River-sourced accessions. Apart from characterizing functional sites, we also identified 12,536 region-specific genomic sites. Combining the transcriptome data of multiple tissues and samples under various abiotic stresses, we found that several region-specific sites contributed to environmental adaptation. In this research, FPFinder revealed the role of the cotton pedigree fingerprint and region-specific sites in cotton development and environmental adaptation, respectively. The FPFinder can be applied broadly in other crops and contribute to genetic breeding in the future.
first_indexed 2024-04-24T18:15:12Z
format Article
id doaj.art-e167b4d8bbfd4c86a372b0bb2a6ffe3f
institution Directory Open Access Journal
issn 2073-4425
language English
last_indexed 2024-04-24T18:15:12Z
publishDate 2024-03-01
publisher MDPI AG
record_format Article
series Genes
spelling doaj.art-e167b4d8bbfd4c86a372b0bb2a6ffe3f2024-03-27T13:43:15ZengMDPI AGGenes2073-44252024-03-0115337810.3390/genes15030378Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding AdvancementShang Liu0Hailiang Cheng1Youping Zhang2Man He3Dongyun Zuo4Qiaolian Wang5Limin Lv6Zhongxv Lin7Guoli Song8National Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaGenomic data in Gossypium provide numerous data resources for the cotton genomics community. However, to fill the gap between genomic analysis and breeding field work, detecting the featured genomic items of a subset cohort is essential for geneticists. We developed FPFinder v1.0 software to identify a subset of the cohort’s fingerprint genomic sites. The FPFinder was developed based on the term frequency–inverse document frequency algorithm. With the short-read sequencing of an elite cotton pedigree, we identified 453 pedigree fingerprint genomic sites and found that these pedigree-featured sites had a role in cotton development. In addition, we applied FPFinder to evaluate the geographical bias of fiber-length-related genomic sites from a modern cotton cohort consisting of 410 accessions. Enriching elite sites in cultivars from the Yangtze River region resulted in the longer fiber length of Yangze River-sourced accessions. Apart from characterizing functional sites, we also identified 12,536 region-specific genomic sites. Combining the transcriptome data of multiple tissues and samples under various abiotic stresses, we found that several region-specific sites contributed to environmental adaptation. In this research, FPFinder revealed the role of the cotton pedigree fingerprint and region-specific sites in cotton development and environmental adaptation, respectively. The FPFinder can be applied broadly in other crops and contribute to genetic breeding in the future.https://www.mdpi.com/2073-4425/15/3/378fingerprintterm frequency–inverse document frequencycotton genomicspedigreegeographical bias
spellingShingle Shang Liu
Hailiang Cheng
Youping Zhang
Man He
Dongyun Zuo
Qiaolian Wang
Limin Lv
Zhongxv Lin
Guoli Song
Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement
Genes
fingerprint
term frequency–inverse document frequency
cotton genomics
pedigree
geographical bias
title Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement
title_full Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement
title_fullStr Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement
title_full_unstemmed Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement
title_short Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement
title_sort fingerprint finder identifying genomic fingerprint sites in cotton cohorts for genetic analysis and breeding advancement
topic fingerprint
term frequency–inverse document frequency
cotton genomics
pedigree
geographical bias
url https://www.mdpi.com/2073-4425/15/3/378
work_keys_str_mv AT shangliu fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT hailiangcheng fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT youpingzhang fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT manhe fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT dongyunzuo fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT qiaolianwang fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT liminlv fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT zhongxvlin fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement
AT guolisong fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement