Report from the 5th Workshop on Extremely Large Databases

Bibliographic Details
Main Authors: Jacek Becla, Daniel Liwei Wang, Kian-Tat Lim
Format: Article
Language: English
Published: Ubiquity Press 2012-03-01
Series: Data Science Journal
Online Access: http://datascience.codata.org/articles/47
Description
Summary: The 5th XLDB workshop brought together scientific and industrial users, developers, and researchers of extremely large data. It focused on emerging challenges in the healthcare and genomics communities, spreadsheet-based large-scale analysis, and challenges in applying statistics to large-scale analysis, including machine learning. Major problems discussed were the lack of scalable applications, the lack of expertise in developing solutions, the lack of respect for or attention to big data problems, data volume growth exceeding Moore's Law, poorly scaling algorithms, and poor data quality and integration. More communication between users, developers, and researchers is sorely needed. A variety of future work to help all three groups was discussed, ranging from collecting challenge problems to connecting with particular industrial or academic sectors.
ISSN: 1683-1470