FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
Adapter trimming is an essential step for analyzing small RNA sequencing data, where reads are generally longer than target RNAs ranging from 18 to 30 bp. Most adapter trimming tools require adapter information as input. However, adapter information is hard to access, specified incorrectly, or not p...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2024-01-01
|
Series: | PLoS Computational Biology |
Online Access: | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011786&type=printable |
_version_ | 1797322332506685440 |
---|---|
author | Hua-Chang Chen Jing Wang Yu Shyr Qi Liu |
author_facet | Hua-Chang Chen Jing Wang Yu Shyr Qi Liu |
author_sort | Hua-Chang Chen |
collection | DOAJ |
description | Adapter trimming is an essential step for analyzing small RNA sequencing data, where reads are generally longer than target RNAs ranging from 18 to 30 bp. Most adapter trimming tools require adapter information as input. However, adapter information is hard to access, specified incorrectly, or not provided with publicly available datasets, hampering their reproducibility and reusability. Manual identification of adapter patterns from raw reads is labor-intensive and error-prone. Moreover, the use of randomized adapters to reduce ligation biases during library preparation makes adapter detection even more challenging. Here, we present FindAdapt, a Python package for fast and accurate detection of adapter patterns without relying on prior information. We demonstrated that FindAdapt was far superior to existing approaches. It identified adapters successfully in 180 simulation datasets with diverse read structures and 3,184 real datasets covering a variety of commercial and customized small RNA library preparation kits. FindAdapt is stand-alone software that can be easily integrated into small RNA sequencing analysis pipelines. |
first_indexed | 2024-03-08T05:12:44Z |
format | Article |
id | doaj.art-03bbb53df7504b1bb6905fb12c260480 |
institution | Directory Open Access Journal |
issn | 1553-734X 1553-7358 |
language | English |
last_indexed | 2024-03-08T05:12:44Z |
publishDate | 2024-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS Computational Biology |
spelling | doaj.art-03bbb53df7504b1bb6905fb12c2604802024-02-07T05:30:41ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582024-01-01201e101178610.1371/journal.pcbi.1011786FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.Hua-Chang ChenJing WangYu ShyrQi LiuAdapter trimming is an essential step for analyzing small RNA sequencing data, where reads are generally longer than target RNAs ranging from 18 to 30 bp. Most adapter trimming tools require adapter information as input. However, adapter information is hard to access, specified incorrectly, or not provided with publicly available datasets, hampering their reproducibility and reusability. Manual identification of adapter patterns from raw reads is labor-intensive and error-prone. Moreover, the use of randomized adapters to reduce ligation biases during library preparation makes adapter detection even more challenging. Here, we present FindAdapt, a Python package for fast and accurate detection of adapter patterns without relying on prior information. We demonstrated that FindAdapt was far superior to existing approaches. It identified adapters successfully in 180 simulation datasets with diverse read structures and 3,184 real datasets covering a variety of commercial and customized small RNA library preparation kits. FindAdapt is stand-alone software that can be easily integrated into small RNA sequencing analysis pipelines.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011786&type=printable |
spellingShingle | Hua-Chang Chen Jing Wang Yu Shyr Qi Liu FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing. PLoS Computational Biology |
title | FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing. |
title_full | FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing. |
title_fullStr | FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing. |
title_full_unstemmed | FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing. |
title_short | FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing. |
title_sort | findadapt a python package for fast and accurate adapter detection in small rna sequencing |
url | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011786&type=printable |
work_keys_str_mv | AT huachangchen findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing AT jingwang findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing AT yushyr findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing AT qiliu findadaptapythonpackageforfastandaccurateadapterdetectioninsmallrnasequencing |