miRTrace reveals the organismal origins of microRNA sequencing data

Abstract We present here miRTrace, the first algorithm to trace microRNA sequencing data back to their taxonomic origins. This is a challenge with profound implications for forensics, parasitology, food control, and research settings where cross-contamination can compromise results. miRTrace accurat...

Full description

Bibliographic Details
Main Authors: Wenjing Kang, Yrin Eldfjell, Bastian Fromm, Xavier Estivill, Inna Biryukova, Marc R. Friedländer
Format: Article
Language:English
Published: BMC 2018-12-01
Series:Genome Biology
Online Access:http://link.springer.com/article/10.1186/s13059-018-1588-9
Description
Summary:Abstract We present here miRTrace, the first algorithm to trace microRNA sequencing data back to their taxonomic origins. This is a challenge with profound implications for forensics, parasitology, food control, and research settings where cross-contamination can compromise results. miRTrace accurately (> 99%) assigns real and simulated data to 14 important animal and plant groups, sensitively detects parasitic infection in mammals, and discovers the primate origin of single cells. Applying our algorithm to over 700 public datasets, we find evidence that over 7% are cross-contaminated and present a novel solution to clean these computationally, even after sequencing has occurred. miRTrace is freely available at https://github.com/friedlanderlab/mirtrace.
ISSN:1474-760X