Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement
Genomic data in Gossypium provide numerous data resources for the cotton genomics community. However, to fill the gap between genomic analysis and breeding field work, detecting the featured genomic items of a subset cohort is essential for geneticists. We developed FPFinder v1.0 software to identif...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-03-01
|
Series: | Genes |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-4425/15/3/378 |
_version_ | 1797240926795464704 |
---|---|
author | Shang Liu Hailiang Cheng Youping Zhang Man He Dongyun Zuo Qiaolian Wang Limin Lv Zhongxv Lin Guoli Song |
author_facet | Shang Liu Hailiang Cheng Youping Zhang Man He Dongyun Zuo Qiaolian Wang Limin Lv Zhongxv Lin Guoli Song |
author_sort | Shang Liu |
collection | DOAJ |
description | Genomic data in Gossypium provide numerous data resources for the cotton genomics community. However, to fill the gap between genomic analysis and breeding field work, detecting the featured genomic items of a subset cohort is essential for geneticists. We developed FPFinder v1.0 software to identify a subset of the cohort’s fingerprint genomic sites. The FPFinder was developed based on the term frequency–inverse document frequency algorithm. With the short-read sequencing of an elite cotton pedigree, we identified 453 pedigree fingerprint genomic sites and found that these pedigree-featured sites had a role in cotton development. In addition, we applied FPFinder to evaluate the geographical bias of fiber-length-related genomic sites from a modern cotton cohort consisting of 410 accessions. Enriching elite sites in cultivars from the Yangtze River region resulted in the longer fiber length of Yangze River-sourced accessions. Apart from characterizing functional sites, we also identified 12,536 region-specific genomic sites. Combining the transcriptome data of multiple tissues and samples under various abiotic stresses, we found that several region-specific sites contributed to environmental adaptation. In this research, FPFinder revealed the role of the cotton pedigree fingerprint and region-specific sites in cotton development and environmental adaptation, respectively. The FPFinder can be applied broadly in other crops and contribute to genetic breeding in the future. |
first_indexed | 2024-04-24T18:15:12Z |
format | Article |
id | doaj.art-e167b4d8bbfd4c86a372b0bb2a6ffe3f |
institution | Directory Open Access Journal |
issn | 2073-4425 |
language | English |
last_indexed | 2024-04-24T18:15:12Z |
publishDate | 2024-03-01 |
publisher | MDPI AG |
record_format | Article |
series | Genes |
spelling | doaj.art-e167b4d8bbfd4c86a372b0bb2a6ffe3f2024-03-27T13:43:15ZengMDPI AGGenes2073-44252024-03-0115337810.3390/genes15030378Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding AdvancementShang Liu0Hailiang Cheng1Youping Zhang2Man He3Dongyun Zuo4Qiaolian Wang5Limin Lv6Zhongxv Lin7Guoli Song8National Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaNational Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, ChinaNational Key Laboratory of Cotton Bio-breeding and Integrated Utilization, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, ChinaGenomic data in Gossypium provide numerous data resources for the cotton genomics community. However, to fill the gap between genomic analysis and breeding field work, detecting the featured genomic items of a subset cohort is essential for geneticists. We developed FPFinder v1.0 software to identify a subset of the cohort’s fingerprint genomic sites. The FPFinder was developed based on the term frequency–inverse document frequency algorithm. With the short-read sequencing of an elite cotton pedigree, we identified 453 pedigree fingerprint genomic sites and found that these pedigree-featured sites had a role in cotton development. In addition, we applied FPFinder to evaluate the geographical bias of fiber-length-related genomic sites from a modern cotton cohort consisting of 410 accessions. Enriching elite sites in cultivars from the Yangtze River region resulted in the longer fiber length of Yangze River-sourced accessions. Apart from characterizing functional sites, we also identified 12,536 region-specific genomic sites. Combining the transcriptome data of multiple tissues and samples under various abiotic stresses, we found that several region-specific sites contributed to environmental adaptation. In this research, FPFinder revealed the role of the cotton pedigree fingerprint and region-specific sites in cotton development and environmental adaptation, respectively. The FPFinder can be applied broadly in other crops and contribute to genetic breeding in the future.https://www.mdpi.com/2073-4425/15/3/378fingerprintterm frequency–inverse document frequencycotton genomicspedigreegeographical bias |
spellingShingle | Shang Liu Hailiang Cheng Youping Zhang Man He Dongyun Zuo Qiaolian Wang Limin Lv Zhongxv Lin Guoli Song Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement Genes fingerprint term frequency–inverse document frequency cotton genomics pedigree geographical bias |
title | Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement |
title_full | Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement |
title_fullStr | Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement |
title_full_unstemmed | Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement |
title_short | Fingerprint Finder: Identifying Genomic Fingerprint Sites in Cotton Cohorts for Genetic Analysis and Breeding Advancement |
title_sort | fingerprint finder identifying genomic fingerprint sites in cotton cohorts for genetic analysis and breeding advancement |
topic | fingerprint term frequency–inverse document frequency cotton genomics pedigree geographical bias |
url | https://www.mdpi.com/2073-4425/15/3/378 |
work_keys_str_mv | AT shangliu fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT hailiangcheng fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT youpingzhang fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT manhe fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT dongyunzuo fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT qiaolianwang fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT liminlv fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT zhongxvlin fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement AT guolisong fingerprintfinderidentifyinggenomicfingerprintsitesincottoncohortsforgeneticanalysisandbreedingadvancement |