Online Shuffling with Task Duplication in Cloud

Release Date: 2017-11-24 Author: ZANG Qimeng and GUO Song

[Abstract] Task duplication is widely used to mitigate the impact of stragglers, that is, tasks that run much longer than normal ones. However, when data is pipelined, task duplication generates excessive traffic over the datacenter network. In this paper, we study how to minimize the traffic cost of task replications under data pipelining, and we design a controller that keeps the data generated by the first replication to finish and discards the data generated later by other replications of the same task. Each task replication communicates with the controller whenever it finishes processing a data block, which causes additional network overhead. We therefore try to reduce this overhead and make a trade-off between the delay of data blocks and the network overhead. Finally, extensive simulation results demonstrate that our proposal can minimize network traffic cost under data pipelining.
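The first-finisher rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `DedupController`, the `report` method, and the event data are all hypothetical, and the sketch assumes one control message per finished block.

```python
from dataclasses import dataclass, field


@dataclass
class DedupController:
    """Hypothetical controller: keeps the first copy of each data block."""
    # Maps (task_id, block_id) to the replica whose copy was accepted.
    accepted: dict = field(default_factory=dict)
    # Number of replica-to-controller reports, a proxy for control overhead.
    control_messages: int = 0

    def report(self, task_id, block_id, replica_id):
        """Record a finished block; return True if its data should be forwarded."""
        self.control_messages += 1
        key = (task_id, block_id)
        if key in self.accepted:
            return False  # a faster replica already delivered this block
        self.accepted[key] = replica_id
        return True


controller = DedupController()

# (finish_time, task_id, block_id, replica_id): made-up events for illustration.
events = [
    (1.0, "map-0", 0, "r1"),
    (1.4, "map-0", 0, "r2"),
    (2.1, "map-0", 1, "r2"),
    (2.6, "map-0", 1, "r1"),
]

# Process reports in finish-time order and keep only the first copy of each block.
forwarded = [
    (task, block, replica)
    for _, task, block, replica in sorted(events)
    if controller.report(task, block, replica)
]
```

In this sketch every finished block costs one report to the controller, so `control_messages` grows with the number of replications. Batching several blocks per report would lower that count at the price of delaying each block, which is the kind of delay versus overhead trade-off the abstract refers to.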

[Keywords] cloud computing; big data; shuffling; task duplication; traffic

Related articles

Virtualization Technology in Cloud Computing Based Radio Access Networks: A Primer

Technical Analysis of Network Plug-in Flannel for Containers

Dew Computing and Transition of Internet Computing Paradigms

HCOS: A Unified Model and Architecture for Cloud Operating System

An OS for Internet of Everything: Early Experience from A Smart Home Prototype

A Transparent and User-Centric Approach to Unify Resource Management and Code Scheduling of Local, Edge, and Cloud

Cloud Computing, Fog Computing, and Dew Computing