Workflow framework to support data analytics in cloud computing

This paper reports on the development of the Cloud Oriented Data Analytics (CODA) framework which has functions for composing, managing, and processing workflows for data analytics in cloud computing. The framework provides a number of reusable software components for data analytics to users wh...

Full description

Bibliographic Details
Main Authors: Chaisiri, Sivadon, Bong, Zoebir, Lee, Chonho, Lee, Bu-Sung, Sessomboon, Punyapat, Saisillapee, Tanakrit, Achalakul, Tiranee
Other Authors: School of Computer Engineering
Format: Conference Paper
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/96732
http://hdl.handle.net/10220/13100
Description
Summary:This paper reports on the development of the Cloud Oriented Data Analytics (CODA) framework which has functions for composing, managing, and processing workflows for data analytics in cloud computing. The framework provides a number of reusable software components for data analytics to users which can be composed as workflows through well-known workflow composers, e.g., RapidMiner, Taverna, and JOpera. In particular, workflow scheduling, workflow recommendation, resource provisioning, resource monitoring, data locality, and security for the workflow computation are addressed by the framework. By using the framework, we demonstrate that workflows can be easily composed and processed in cloud computing. By coordinating the submitted workflows, we can obtain a significant improvement in performance.