Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle

Abstract Background Optimization of an RNA-Sequencing (RNA-Seq) pipeline is critical to maximize power and accuracy to identify genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study used RNA-Se...

Full description

Bibliographic Details
Main Authors: S. Lam, J. Zeidan, F. Miglior, A. Suárez-Vega, I. Gómez-Redondo, P. A. S. Fonseca, L. L. Guan, S. Waters, A. Cánovas
Format: Article
Language:English
Published: BMC 2020-10-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-020-07107-7
_version_ 1819228099463610368
author S. Lam
J. Zeidan
F. Miglior
A. Suárez-Vega
I. Gómez-Redondo
P. A. S. Fonseca
L. L. Guan
S. Waters
A. Cánovas
author_facet S. Lam
J. Zeidan
F. Miglior
A. Suárez-Vega
I. Gómez-Redondo
P. A. S. Fonseca
L. L. Guan
S. Waters
A. Cánovas
author_sort S. Lam
collection DOAJ
description Abstract Background Optimization of an RNA-Sequencing (RNA-Seq) pipeline is critical to maximize power and accuracy to identify genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study used RNA-Seq data (GEO Accession ID: PRJEB7696 and PRJEB15314) from muscle and liver tissue, respectively, from 12 Nellore beef steers selected from 585 steers with residual feed intake measures (RFI; n = 6 low-RFI, n = 6 high-RFI). Three RNA-Seq pipelines were compared including multi-sample calling from i) non-merged samples; ii) merged samples by RFI group, iii) merged samples by RFI and tissue group. The RNA-Seq reads were aligned against the UMD3.1 bovine reference genome (release 94) assembly using STAR aligner. Variants were called using BCFtools and variant effect prediction (VeP) and functional annotation (ToppGene) analyses were performed. Results On average, total reads detected for Approach i) non-merged samples for liver and muscle, were 18,362,086.3 and 35,645,898.7, respectively. For Approach ii), merging samples by RFI group, total reads detected for each merged group was 162,030,705, and for Approach iii), merging samples by RFI group and tissues, was 324,061,410, revealing the highest read depth for Approach iii). Additionally, Approach iii) merging samples by RFI group and tissues, revealed the highest read depth per variant coverage (572.59 ± 3993.11) and encompassed the majority of localized positional genes detected by each approach. This suggests Approach iii) had optimized detection power, read depth, and accuracy of SNP calling, therefore increasing confidence of variant detection and reducing false positive detection. Approach iii) was then used to detect unique SNPs fixed within low- (12,145) and high-RFI (14,663) groups. Functional annotation of SNPs revealed positional candidate genes, for each RFI group (2886 for low-RFI, 3075 for high-RFI), which were significantly (P < 0.05) associated with immune and metabolic pathways. Conclusion The most optimized RNA-Seq pipeline allowed for more accurate identification of SNPs, associated positional candidate genes, and significantly associated metabolic pathways in muscle and liver tissues, providing insight on the underlying genetic architecture of feed efficiency in beef cattle.
first_indexed 2024-12-23T10:51:54Z
format Article
id doaj.art-015f866c95324819954bb954cd48d82a
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-23T10:51:54Z
publishDate 2020-10-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-015f866c95324819954bb954cd48d82a2022-12-21T17:49:52ZengBMCBMC Genomics1471-21642020-10-0121111710.1186/s12864-020-07107-7Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattleS. Lam0J. Zeidan1F. Miglior2A. Suárez-Vega3I. Gómez-Redondo4P. A. S. Fonseca5L. L. Guan6S. Waters7A. Cánovas8Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of GuelphCentre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of GuelphCentre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of GuelphCentre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of GuelphCentre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of GuelphCentre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of GuelphDepartment of Agriculture, Food & Nutritional Science, University of AlbertaTeagasc, Animal & Grassland Research and Innovation CentreCentre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of GuelphAbstract Background Optimization of an RNA-Sequencing (RNA-Seq) pipeline is critical to maximize power and accuracy to identify genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study used RNA-Seq data (GEO Accession ID: PRJEB7696 and PRJEB15314) from muscle and liver tissue, respectively, from 12 Nellore beef steers selected from 585 steers with residual feed intake measures (RFI; n = 6 low-RFI, n = 6 high-RFI). Three RNA-Seq pipelines were compared including multi-sample calling from i) non-merged samples; ii) merged samples by RFI group, iii) merged samples by RFI and tissue group. The RNA-Seq reads were aligned against the UMD3.1 bovine reference genome (release 94) assembly using STAR aligner. Variants were called using BCFtools and variant effect prediction (VeP) and functional annotation (ToppGene) analyses were performed. Results On average, total reads detected for Approach i) non-merged samples for liver and muscle, were 18,362,086.3 and 35,645,898.7, respectively. For Approach ii), merging samples by RFI group, total reads detected for each merged group was 162,030,705, and for Approach iii), merging samples by RFI group and tissues, was 324,061,410, revealing the highest read depth for Approach iii). Additionally, Approach iii) merging samples by RFI group and tissues, revealed the highest read depth per variant coverage (572.59 ± 3993.11) and encompassed the majority of localized positional genes detected by each approach. This suggests Approach iii) had optimized detection power, read depth, and accuracy of SNP calling, therefore increasing confidence of variant detection and reducing false positive detection. Approach iii) was then used to detect unique SNPs fixed within low- (12,145) and high-RFI (14,663) groups. Functional annotation of SNPs revealed positional candidate genes, for each RFI group (2886 for low-RFI, 3075 for high-RFI), which were significantly (P < 0.05) associated with immune and metabolic pathways. Conclusion The most optimized RNA-Seq pipeline allowed for more accurate identification of SNPs, associated positional candidate genes, and significantly associated metabolic pathways in muscle and liver tissues, providing insight on the underlying genetic architecture of feed efficiency in beef cattle.http://link.springer.com/article/10.1186/s12864-020-07107-7Feed efficiencyBovineRNA-SeqSingle nucleotide polymorphisms (SNPs)Transcriptomics
spellingShingle S. Lam
J. Zeidan
F. Miglior
A. Suárez-Vega
I. Gómez-Redondo
P. A. S. Fonseca
L. L. Guan
S. Waters
A. Cánovas
Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle
BMC Genomics
Feed efficiency
Bovine
RNA-Seq
Single nucleotide polymorphisms (SNPs)
Transcriptomics
title Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle
title_full Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle
title_fullStr Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle
title_full_unstemmed Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle
title_short Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle
title_sort development and comparison of rna sequencing pipelines for more accurate snp identification practical example of functional snp detection associated with feed efficiency in nellore beef cattle
topic Feed efficiency
Bovine
RNA-Seq
Single nucleotide polymorphisms (SNPs)
Transcriptomics
url http://link.springer.com/article/10.1186/s12864-020-07107-7
work_keys_str_mv AT slam developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT jzeidan developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT fmiglior developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT asuarezvega developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT igomezredondo developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT pasfonseca developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT llguan developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT swaters developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle
AT acanovas developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle