Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomes

SRSF3 (SRp20) is the smallest member of the serine/arginine (SR)-rich protein family. We found the annotated human SRSF3 and mouse Srsf3 RefSeq sequences are much larger than the detected SRSF3/Srsf3 RNA size by Northern blot. Mapping of RNA-seq reads from various human and mouse cell lines to the a...

Full description

Bibliographic Details
Main Authors: Lulu Yu, Vladimir Majerciak, Rong Jia, Zhi-Ming Zheng
Format: Article
Language:English
Published: Elsevier 2023-04-01
Series:Cell Insight
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2772892723000135
_version_ 1811158439189020672
author Lulu Yu
Vladimir Majerciak
Rong Jia
Zhi-Ming Zheng
author_facet Lulu Yu
Vladimir Majerciak
Rong Jia
Zhi-Ming Zheng
author_sort Lulu Yu
collection DOAJ
description SRSF3 (SRp20) is the smallest member of the serine/arginine (SR)-rich protein family. We found the annotated human SRSF3 and mouse Srsf3 RefSeq sequences are much larger than the detected SRSF3/Srsf3 RNA size by Northern blot. Mapping of RNA-seq reads from various human and mouse cell lines to the annotated SRSF3/Srsf3 gene illustrated only a partial coverage of its terminal exon 7. By 5ʹ RACE and 3ʹ RACE, we determined that SRSF3 gene spanning over 8422 bases and Srsf3 gene spanning over 9423 bases. SRSF3/Srsf3 gene has seven exons with exon 7 bearing two alternative polyadenylation signals (PAS). Through alternative PAS selection and exon 4 exclusion/inclusion by alternative RNA splicing, SRSF3/Srsf3 gene expresses four RNA isoforms. The major SRSF3 mRNA isoform with exon 4 exclusion by using a favorable distal PAS to encode a full-length protein is 1411 nt long (not annotated 4228 nt) and the same major mouse Srsf3 mRNA isoform is only 1295 nt (not annotated 2585 nt). The difference from the redefined RNA size of SRSF3/Srsf3 to the corresponding RefSeq sequence is at the 3’ UTR region. Collectively, the redefined SRSF3/Srsf3 gene structure and expression will allow better understanding of SRSF3 functions and its regulations in health and diseases.
first_indexed 2024-04-10T05:23:35Z
format Article
id doaj.art-5ff23ac90a884c0bb8cd11b7d624ec30
institution Directory Open Access Journal
issn 2772-8927
language English
last_indexed 2024-04-10T05:23:35Z
publishDate 2023-04-01
publisher Elsevier
record_format Article
series Cell Insight
spelling doaj.art-5ff23ac90a884c0bb8cd11b7d624ec302023-03-08T04:15:12ZengElsevierCell Insight2772-89272023-04-0122100089Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomesLulu Yu0Vladimir Majerciak1Rong Jia2Zhi-Ming Zheng3Tumor Virus RNA Biology Section, HIV Dynamics and Replication Program, Center for Cancer Research, National Cancer Institute, Frederick, MD, 21702, USATumor Virus RNA Biology Section, HIV Dynamics and Replication Program, Center for Cancer Research, National Cancer Institute, Frederick, MD, 21702, USAThe State Key Laboratory Breeding Base of Basic Science of Stomatology (Hubei-MOST), Key Laboratory of Oral Biomedicine Ministry of Education, School & Hospital of Stomatology, Wuhan University, Wuhan, ChinaTumor Virus RNA Biology Section, HIV Dynamics and Replication Program, Center for Cancer Research, National Cancer Institute, Frederick, MD, 21702, USA; Corresponding author.SRSF3 (SRp20) is the smallest member of the serine/arginine (SR)-rich protein family. We found the annotated human SRSF3 and mouse Srsf3 RefSeq sequences are much larger than the detected SRSF3/Srsf3 RNA size by Northern blot. Mapping of RNA-seq reads from various human and mouse cell lines to the annotated SRSF3/Srsf3 gene illustrated only a partial coverage of its terminal exon 7. By 5ʹ RACE and 3ʹ RACE, we determined that SRSF3 gene spanning over 8422 bases and Srsf3 gene spanning over 9423 bases. SRSF3/Srsf3 gene has seven exons with exon 7 bearing two alternative polyadenylation signals (PAS). Through alternative PAS selection and exon 4 exclusion/inclusion by alternative RNA splicing, SRSF3/Srsf3 gene expresses four RNA isoforms. The major SRSF3 mRNA isoform with exon 4 exclusion by using a favorable distal PAS to encode a full-length protein is 1411 nt long (not annotated 4228 nt) and the same major mouse Srsf3 mRNA isoform is only 1295 nt (not annotated 2585 nt). The difference from the redefined RNA size of SRSF3/Srsf3 to the corresponding RefSeq sequence is at the 3’ UTR region. Collectively, the redefined SRSF3/Srsf3 gene structure and expression will allow better understanding of SRSF3 functions and its regulations in health and diseases.http://www.sciencedirect.com/science/article/pii/S2772892723000135SRSF3Genome structureGene expressionRNA isoforms5ʹ UTR3ʹ UTR
spellingShingle Lulu Yu
Vladimir Majerciak
Rong Jia
Zhi-Ming Zheng
Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomes
Cell Insight
SRSF3
Genome structure
Gene expression
RNA isoforms
5ʹ UTR
3ʹ UTR
title Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomes
title_full Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomes
title_fullStr Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomes
title_full_unstemmed Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomes
title_short Revisiting and corrections to the annotated SRSF3 (SRp20) gene structure and RefSeq sequences from the human and mouse genomes
title_sort revisiting and corrections to the annotated srsf3 srp20 gene structure and refseq sequences from the human and mouse genomes
topic SRSF3
Genome structure
Gene expression
RNA isoforms
5ʹ UTR
3ʹ UTR
url http://www.sciencedirect.com/science/article/pii/S2772892723000135
work_keys_str_mv AT luluyu revisitingandcorrectionstotheannotatedsrsf3srp20genestructureandrefseqsequencesfromthehumanandmousegenomes
AT vladimirmajerciak revisitingandcorrectionstotheannotatedsrsf3srp20genestructureandrefseqsequencesfromthehumanandmousegenomes
AT rongjia revisitingandcorrectionstotheannotatedsrsf3srp20genestructureandrefseqsequencesfromthehumanandmousegenomes
AT zhimingzheng revisitingandcorrectionstotheannotatedsrsf3srp20genestructureandrefseqsequencesfromthehumanandmousegenomes