
handle: 11386/4694807
Abstract This paper presents a new message-passing algorithm, called Do-UM, for distributed cooperative task computing in synchronous settings where processors may crash, and where any multicasts (or broadcasts) performed by crashing processors are unreliable. We specify the algorithm, prove its correctness and analyse its complexity. We show that its worst case available processor steps is S = Θ t + n log n log log n + f ( n − f ) and that the number of messages sent is less than n 2 t + n f 2 , where n is the number of processors, t is the number of tasks to be executed and f is the number of failures. To assess the performance of the algorithm in practical scenarios, we perform an experimental evaluation on a planetary-scale distributed platform. This also allows us to compare our algorithm with the currently best algorithm that is, however, explicitly designed to use reliable multicast; the results suggest that our algorithm does not lose much efficiency in order to cope with unreliable multicast.
Crash faults; Fault-tolerant distributed algorithms; Task computing; Unreliable multicast; Theoretical Computer Science; Software; Hardware and Architecture; Computer Networks and Communications; Artificial Intelligence, Crash faults, Experimental evaluation, Multicasting, Unreliable multicast, Message passing, Distributed platforms, Task computing, Cooperative tasks, Message passing algorithm, Fault-tolerant distributed algorithms, Reliable Multicast, Fault-tolerant
Crash faults; Fault-tolerant distributed algorithms; Task computing; Unreliable multicast; Theoretical Computer Science; Software; Hardware and Architecture; Computer Networks and Communications; Artificial Intelligence, Crash faults, Experimental evaluation, Multicasting, Unreliable multicast, Message passing, Distributed platforms, Task computing, Cooperative tasks, Message passing algorithm, Fault-tolerant distributed algorithms, Reliable Multicast, Fault-tolerant
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
