Multi-omics data integration for the identification of biomarkers for bull fertility.

Bull fertility is an important economic trait, and the use of subfertile semen for artificial insemination decreases the global efficiency of the breeding sector. Although the analysis of semen functional parameters can help to identify infertile bulls, no tools are currently available to enable pre...

Full description

Bibliographic Details
Main Authors: Valentin Costes, Eli Sellem, Sylvain Marthey, Chris Hoze, Aurélie Bonnet, Laurent Schibler, Hélène Kiefer, Florence Jaffrezic
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0298623
_version_ 1827338875989655552
author Valentin Costes
Eli Sellem
Sylvain Marthey
Chris Hoze
Aurélie Bonnet
Laurent Schibler
Hélène Kiefer
Florence Jaffrezic
author_facet Valentin Costes
Eli Sellem
Sylvain Marthey
Chris Hoze
Aurélie Bonnet
Laurent Schibler
Hélène Kiefer
Florence Jaffrezic
author_sort Valentin Costes
collection DOAJ
description Bull fertility is an important economic trait, and the use of subfertile semen for artificial insemination decreases the global efficiency of the breeding sector. Although the analysis of semen functional parameters can help to identify infertile bulls, no tools are currently available to enable precise predictions and prevent the commercialization of subfertile semen. Because male fertility is a multifactorial phenotype that is dependent on genetic, epigenetic, physiological and environmental factors, we hypothesized that an integrative analysis might help to refine our knowledge and understanding of bull fertility. We combined -omics data (genotypes, sperm DNA methylation at CpGs and sperm small non-coding RNAs) and semen parameters measured on a large cohort of 98 Montbéliarde bulls with contrasting fertility levels. Multiple Factor Analysis was conducted to study the links between the datasets and fertility. Four methodologies were then considered to identify the features linked to bull fertility variation: Logistic Lasso, Random Forest, Gradient Boosting and Neural Networks. Finally, the features selected by these methods were annotated in terms of genes, to conduct functional enrichment analyses. The less relevant features in -omics data were filtered out, and MFA was run on the remaining 12,006 features, including the 11 semen parameters and a balanced proportion of each type of-omics data. The results showed that unlike the semen parameters studied the-omics datasets were related to fertility. Biomarkers related to bull fertility were selected using the four methodologies mentioned above. The most contributory CpGs, SNPs and miRNAs targeted genes were all found to be involved in development. Interestingly, fragments derived from ribosomal RNAs were overrepresented among the selected features, suggesting roles in male fertility. These markers could be used in the future to identify subfertile bulls in order to increase the global efficiency of the breeding sector.
first_indexed 2024-03-07T19:40:58Z
format Article
id doaj.art-c758960eca694d42abec6ba486e9a440
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-03-07T19:40:58Z
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-c758960eca694d42abec6ba486e9a4402024-02-29T05:31:54ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-01192e029862310.1371/journal.pone.0298623Multi-omics data integration for the identification of biomarkers for bull fertility.Valentin CostesEli SellemSylvain MartheyChris HozeAurélie BonnetLaurent SchiblerHélène KieferFlorence JaffrezicBull fertility is an important economic trait, and the use of subfertile semen for artificial insemination decreases the global efficiency of the breeding sector. Although the analysis of semen functional parameters can help to identify infertile bulls, no tools are currently available to enable precise predictions and prevent the commercialization of subfertile semen. Because male fertility is a multifactorial phenotype that is dependent on genetic, epigenetic, physiological and environmental factors, we hypothesized that an integrative analysis might help to refine our knowledge and understanding of bull fertility. We combined -omics data (genotypes, sperm DNA methylation at CpGs and sperm small non-coding RNAs) and semen parameters measured on a large cohort of 98 Montbéliarde bulls with contrasting fertility levels. Multiple Factor Analysis was conducted to study the links between the datasets and fertility. Four methodologies were then considered to identify the features linked to bull fertility variation: Logistic Lasso, Random Forest, Gradient Boosting and Neural Networks. Finally, the features selected by these methods were annotated in terms of genes, to conduct functional enrichment analyses. The less relevant features in -omics data were filtered out, and MFA was run on the remaining 12,006 features, including the 11 semen parameters and a balanced proportion of each type of-omics data. The results showed that unlike the semen parameters studied the-omics datasets were related to fertility. Biomarkers related to bull fertility were selected using the four methodologies mentioned above. The most contributory CpGs, SNPs and miRNAs targeted genes were all found to be involved in development. Interestingly, fragments derived from ribosomal RNAs were overrepresented among the selected features, suggesting roles in male fertility. These markers could be used in the future to identify subfertile bulls in order to increase the global efficiency of the breeding sector.https://doi.org/10.1371/journal.pone.0298623
spellingShingle Valentin Costes
Eli Sellem
Sylvain Marthey
Chris Hoze
Aurélie Bonnet
Laurent Schibler
Hélène Kiefer
Florence Jaffrezic
Multi-omics data integration for the identification of biomarkers for bull fertility.
PLoS ONE
title Multi-omics data integration for the identification of biomarkers for bull fertility.
title_full Multi-omics data integration for the identification of biomarkers for bull fertility.
title_fullStr Multi-omics data integration for the identification of biomarkers for bull fertility.
title_full_unstemmed Multi-omics data integration for the identification of biomarkers for bull fertility.
title_short Multi-omics data integration for the identification of biomarkers for bull fertility.
title_sort multi omics data integration for the identification of biomarkers for bull fertility
url https://doi.org/10.1371/journal.pone.0298623
work_keys_str_mv AT valentincostes multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility
AT elisellem multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility
AT sylvainmarthey multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility
AT chrishoze multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility
AT aureliebonnet multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility
AT laurentschibler multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility
AT helenekiefer multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility
AT florencejaffrezic multiomicsdataintegrationfortheidentificationofbiomarkersforbullfertility