Systematic tissue annotations of genomics samples by modeling unstructured metadata
The 1+ million publicly-available human –omics samples currently remain acutely underused. Here the authors present an approach combining natural language processing and machine learning to infer the source tissue of public genomics samples based on their plain text descriptions, making these sample...
Main Authors: | Nathaniel T. Hawkins, Marc Maldaver, Anna Yannakopoulos, Lindsay A. Guare, Arjun Krishnan |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2022-11-01
|
Series: | Nature Communications |
Online Access: | https://doi.org/10.1038/s41467-022-34435-x |
Similar Items
-
Making sense of unstructured biological metadata
by: Sea, Bao Yi
Published: (2024) -
Challenges to sharing sample metadata in computational genomics
by: Nathan C. Sheffield, et al.
Published: (2023-05-01) -
Automated metadata annotation: What is and is not possible with machine learning
by: Mingfang Wu, et al.
Published: (2023-01-01) -
Semantic Metadata Annotation Services in the Biomedical Domain—A Literature Review
by: Julia Sasse, et al.
Published: (2022-01-01) -
A configurable software platform for creating, reviewing and adjudicating annotation of unstructured text.
by: Richard Beare, et al.
Published: (2022-08-01)