FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.

Adapter trimming is an essential step for analyzing small RNA sequencing data, where reads are generally longer than target RNAs ranging from 18 to 30 bp. Most adapter trimming tools require adapter information as input. However, adapter information is hard to access, specified incorrectly, or not p...

Full description

Bibliographic Details
Main Authors: Hua-Chang Chen, Jing Wang, Yu Shyr, Qi Liu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS Computational Biology
Online Access:https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011786&type=printable
_version_ 1797322332506685440
author Hua-Chang Chen
Jing Wang
Yu Shyr
Qi Liu
author_facet Hua-Chang Chen
Jing Wang
Yu Shyr
Qi Liu
author_sort Hua-Chang Chen
collection DOAJ
description Adapter trimming is an essential step for analyzing small RNA sequencing data, where reads are generally longer than target RNAs ranging from 18 to 30 bp. Most adapter trimming tools require adapter information as input. However, adapter information is hard to access, specified incorrectly, or not provided with publicly available datasets, hampering their reproducibility and reusability. Manual identification of adapter patterns from raw reads is labor-intensive and error-prone. Moreover, the use of randomized adapters to reduce ligation biases during library preparation makes adapter detection even more challenging. Here, we present FindAdapt, a Python package for fast and accurate detection of adapter patterns without relying on prior information. We demonstrated that FindAdapt was far superior to existing approaches. It identified adapters successfully in 180 simulation datasets with diverse read structures and 3,184 real datasets covering a variety of commercial and customized small RNA library preparation kits. FindAdapt is stand-alone software that can be easily integrated into small RNA sequencing analysis pipelines.
first_indexed 2024-03-08T05:12:44Z
format Article
id doaj.art-03bbb53df7504b1bb6905fb12c260480
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-03-08T05:12:44Z
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-03bbb53df7504b1bb6905fb12c2604802024-02-07T05:30:41ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582024-01-01201e101178610.1371/journal.pcbi.1011786FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.Hua-Chang ChenJing WangYu ShyrQi LiuAdapter trimming is an essential step for analyzing small RNA sequencing data, where reads are generally longer than target RNAs ranging from 18 to 30 bp. Most adapter trimming tools require adapter information as input. However, adapter information is hard to access, specified incorrectly, or not provided with publicly available datasets, hampering their reproducibility and reusability. Manual identification of adapter patterns from raw reads is labor-intensive and error-prone. Moreover, the use of randomized adapters to reduce ligation biases during library preparation makes adapter detection even more challenging. Here, we present FindAdapt, a Python package for fast and accurate detection of adapter patterns without relying on prior information. We demonstrated that FindAdapt was far superior to existing approaches. It identified adapters successfully in 180 simulation datasets with diverse read structures and 3,184 real datasets covering a variety of commercial and customized small RNA library preparation kits. FindAdapt is stand-alone software that can be easily integrated into small RNA sequencing analysis pipelines.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011786&type=printable
spellingShingle Hua-Chang Chen
Jing Wang
Yu Shyr
Qi Liu
FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
PLoS Computational Biology
title FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
title_full FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
title_fullStr FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
title_full_unstemmed FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
title_short FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
title_sort findadapt a python package for fast and accurate adapter detection in small rna sequencing
url https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011786&type=printable
work_keys_str_mv AT huachangchen findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing
AT jingwang findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing
AT yushyr findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing
AT qiliu findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing