Systematic tissue annotations of genomics samples by modeling unstructured metadata

The 1+ million publicly-available human –omics samples currently remain acutely underused. Here the authors present an approach combining natural language processing and machine learning to infer the source tissue of public genomics samples based on their plain text descriptions, making these sample...

Full description

Bibliographic Details
Main Authors: Nathaniel T. Hawkins, Marc Maldaver, Anna Yannakopoulos, Lindsay A. Guare, Arjun Krishnan
Format: Article
Language:English
Published: Nature Portfolio 2022-11-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-022-34435-x

Similar Items