Workflow framework to support data analytics in cloud computing
This paper reports on the development of the Cloud Oriented Data Analytics (CODA) framework which has functions for composing, managing, and processing workflows for data analytics in cloud computing. The framework provides a number of reusable software components for data analytics to users wh...
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Conference Paper |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/96732 http://hdl.handle.net/10220/13100 |
Summary: | This paper reports on the development of the
Cloud Oriented Data Analytics (CODA) framework which has
functions for composing, managing, and processing workflows
for data analytics in cloud computing. The framework provides
a number of reusable software components for data analytics to
users which can be composed as workflows through well-known
workflow composers, e.g., RapidMiner, Taverna, and JOpera.
In particular, workflow scheduling, workflow recommendation,
resource provisioning, resource monitoring, data locality, and
security for the workflow computation are addressed by the
framework. By using the framework, we demonstrate that
workflows can be easily composed and processed in cloud
computing. By coordinating the submitted workflows, we can
obtain a significant improvement in performance. |
---|