Data replication is one of the significant sub-areas of data management in cloud based workflows. Data-intensive workflow applications can gain great benefits from cloud environments and usually need data management strategies to manage large amounts of data. At the same time, multi-cloud environments become more and more popular. We propose a cost-effective and threshold-based data replication strategy with the consideration of both data dependency and data access times for data-intensive workflows in the multi-cloud environment. Finally, the simulation results show that our approach can greatly reduce total cost of data-intensive workflow applications by considering both of data dependency and data access times in multi-cloud environments.
History
Citation
Xie, F., Yan, J. & Shen, J. (2019). A Data Dependency and Access Threshold Based Replication Strategy for Multi-Cloud Workflow Applications. Lecture Notes in Computer Science, 11434 281-293. Hangzhou, China ICSOC 2018: Service-Oriented Computing - ICSOC 2018 Workshops