Detecting sample swaps in diverse NGS data types using linkage disequilibrium

Parallelized analysis in clinical genomics can lead to sample or data mislabelling, and could have serious downstream consequences. Here the authors present a tool to quantify sample genetic relatedness and detect such mistakes, and apply it to thousands of datasets from the ENCODE consortium.

Bibliographic Details
Main Authors: Nauman Javed, Yossi Farjoun, Tim J. Fennell, Charles B. Epstein, Bradley E. Bernstein, Noam Shoresh
Format: Article
Language:English
Published: Nature Portfolio 2020-07-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-020-17453-5
Description
Summary:Parallelized analysis in clinical genomics can lead to sample or data mislabelling, and could have serious downstream consequences. Here the authors present a tool to quantify sample genetic relatedness and detect such mistakes, and apply it to thousands of datasets from the ENCODE consortium.
ISSN:2041-1723