Systematic tissue annotations of genomics samples by modeling unstructured metadata
The 1+ million publicly-available human –omics samples currently remain acutely underused. Here the authors present an approach combining natural language processing and machine learning to infer the source tissue of public genomics samples based on their plain text descriptions, making these sample...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2022-11-01
|
Series: | Nature Communications |
Online Access: | https://doi.org/10.1038/s41467-022-34435-x |