Discovering novel reproductive genes in a non-model fly using de novo GridION transcriptomics

Gene discovery has important implications for investigating phenotypic trait evolution, adaptation, and speciation. Male reproductive tissues, such as accessory glands (AGs), are hotspots for recruitment of novel genes that diverge rapidly even among closely related species/populations. These genes...


Bibliographic Details
Main Authors: Mrinalini Walter, Nalini Puniamoorthy
Format: Article
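Language: English
Published: Frontiers Media S.A. 2022-12-01
Series: Frontiers in Genetics
Subjects: gene expression; GridION; Illumina; novel gene; Oxford Nanopore Technologies (ONT); reproduction
Online Access: https://www.frontiersin.org/articles/10.3389/fgene.2022.1003771/full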
author Mrinalini Walter
Nalini Puniamoorthy
collection DOAJ
description Gene discovery has important implications for investigating phenotypic trait evolution, adaptation, and speciation. Male reproductive tissues, such as accessory glands (AGs), are hotspots for recruitment of novel genes that diverge rapidly even among closely related species/populations. These genes synthesize seminal fluid proteins that often affect post-copulatory sexual selection: they can mediate male-male sperm competition and ejaculate-female interactions that modify female remating, and can even influence reproductive incompatibilities among diverging species/populations. Although de novo transcriptomics has facilitated gene discovery in non-model organisms, reproductive gene discovery is still challenging without a reference database because these genes are often novel and bear no homology to known proteins. Here, we use reference-free GridION long-read transcriptomics, from Oxford Nanopore Technologies (ONT), to discover novel AG genes and characterize their expression in the widespread dung fly, Sepsis punctum. Despite stark population differences in male reproductive traits (e.g., body size, testes size, and sperm length) as well as in female remating, the male AG genes of S. punctum and their secretions are still unknown. We implement a de novo ONT transcriptome pipeline incorporating quality-filtering and rigorous error-correction procedures, and we evaluate gene sequence and gene expression results against high-quality Illumina short-read data. We discover highly expressed reproductive genes in AG transcriptomes of S. punctum consisting of 40 high-quality and high-confidence ONT genes that cross-verify against Illumina genes, among which 26 are novel and specific to S. punctum. Novel genes account for an average of 81% of total gene expression and may be functionally relevant in seminal fluid protein production. For instance, 80% of genes encoding secretory proteins account for 74% of total gene expression. In addition, median sequence similarities of ONT nucleotide and protein sequences match within-Illumina sequence similarities. Read-count-based expression quantification in ONT is congruent with Illumina’s Transcripts per Million (TPM), both in overall pattern and within functional categories. Rapid genomic innovation followed by recruitment of de novo genes for high expression in S. punctum AG tissue, a pattern observed in other insects, is a likely mechanism for the evolution of these genes. The study also demonstrates the feasibility of adapting ONT transcriptomics for gene discovery in non-model systems.
first_indexed 2024-04-13T06:10:01Z
format Article
id doaj.art-9fa842560a6349509a5890f73e4bd462
institution Directory Open Access Journal
issn 1664-8021
language English
last_indexed 2024-04-13T06:10:01Z
publishDate 2022-12-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj.art-9fa842560a6349509a5890f73e4bd462; 2022-12-22T02:59:05Z; eng; Frontiers Media S.A.; Frontiers in Genetics; 1664-8021; 2022-12-01; 13; 10.3389/fgene.2022.1003771; 1003771
title Discovering novel reproductive genes in a non-model fly using de novo GridION transcriptomics
topic gene expression
GridION
Illumina
novel gene
Oxford Nanopore Technologies (ONT)
reproduction
url https://www.frontiersin.org/articles/10.3389/fgene.2022.1003771/full