Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery
© 2018 IEEE. Employees who spend more time finding relevant data than analyzing it suffer from a data discovery problem. The large volume of data in enterprises and, at times, a lack of knowledge of the schemas aggravate this problem. Similar to how we navigate the Web, we propose to identify se...
Main Authors: | Castro Fernandez, Raul; Mansour, Essam; Qahtan, Abdulhakim A.; Elmagarmid, Ahmed; Ilyas, Ihab; Madden, Samuel; Ouzzani, Mourad; Stonebraker, Michael; Tang, Nan |
---|---|
Other Authors: | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory |
Format: | Article |
Language: | English |
Published: | IEEE, 2021 |
Online Access: | https://hdl.handle.net/1721.1/137849 |
Similar Items
- Building Data Civilizer Pipelines with an Advanced Workflow Engine
  by: Mansour, Essam, et al.
  Published: (2021)
- Building Data Civilizer Pipelines with an Advanced Workflow Engine
  by: Mansour, Essam, et al.
  Published: (2022)
- A Demo of the Data Civilizer System
  by: Castro Fernandez, Raul, et al.
  Published: (2019)
- Pattern functional dependencies for data cleaning
  by: Qahtan, Abdulhakim, et al.
  Published: (2022)
- Pattern functional dependencies for data cleaning
  by: Qahtan, Abdulhakim, et al.
  Published: (2021)