Cleaning by clustering: methodology for addressing data quality issues in biomedical metadata
Abstract Background The ability to efficiently search and filter datasets depends on access to high quality metadata. While most biomedical repositories require data submitters to provide a minimal set of metadata, some such as the Gene Expression Omnibus (GEO) allows users to specify additional met...
| Main Authors: | Wei Hu, Amrapali Zaveri, Honglei Qiu, Michel Dumontier |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
BMC
2017-09-01
|
| Series: | BMC Bioinformatics |
| Subjects: | |
| Online Access: | http://link.springer.com/article/10.1186/s12859-017-1832-4 |
Similar Items
-
LabPipe: an extensible bioinformatics toolkit to manage experimental data and metadata
by: Bo Zhao, et al.
Published: (2020-12-01) -
Big Metadata, Smart Metadata, and Metadata Capital: Toward Greater Synergy Between Data Science and Metadata
by: Greenberg Jane
Published: (2017-08-01) -
Semantic Metadata Annotation Services in the Biomedical Domain—A Literature Review
by: Julia Sasse, et al.
Published: (2022-01-01) -
GeoWebCln: An Intensive Cleaning Architecture for Geospatial Metadata
by: Sheoran Savita Kumari, et al.
Published: (2022-03-01) -
Metadata Extraction and Management in Data Lakes With GEMMS
by: Christoph Quix, et al.
Published: (2016-12-01)