
doi: 10.1109/tdsc.2008.50
We propose a new framework for the (length,reliability) bicriteria static multiprocessor scheduling problem. Our first criterion remains the schedule's length, crucial to assess the system's real-time property. For our second criterion, we consider the global system failure rate, seen as if the whole system were a single task scheduled onto a single processor, instead of the usual reliability, because it does not depend on the schedule length like the reliability does (due to its computation in the classical exponential distribution model). Therefore, we control better the replication factor of each individual task of the dependency task graph given as a specification, with respect to the desired failure rate. To solve this bicriteria optimization problem, we take the failure rate as a constraint, and we minimize the schedule length. We are thus able to produce, for a given dependency task graph and multiprocessor architecture, a Pareto curve of non-dominated solutions, among which the user can choose the compromise that fits his requirements best. Compared to the other bicriteria (length,reliability) scheduling algorithms found in the literature, the algorithm we present here is the first able to improve significantly the reliability, by several orders of magnitude, making it suitable to safety critical systems.
Testing, availability, 004, 620, and Fault-Tolerance, ACM: B.: Hardware/B.8: PERFORMANCE AND RELIABILITY/B.8.1: Reliability, ACM: C.: Computer Systems Organization/C.4: PERFORMANCE OF SYSTEMS/C.4.1: Fault tolerance, [INFO.INFO-PF]Computer Science [cs]/Performance [cs.PF], [INFO.INFO-PF] Computer Science [cs]/Performance [cs.PF], ACM: C.: Computer Systems Organization/C.3: SPECIAL-PURPOSE AND APPLICATION-BASED SYSTEMS/C.3.2: Real-time and embedded systems, ACM: C.: Computer Systems Organization/C.4: PERFORMANCE OF SYSTEMS/C.4.5: Reliability, and serviceability
Testing, availability, 004, 620, and Fault-Tolerance, ACM: B.: Hardware/B.8: PERFORMANCE AND RELIABILITY/B.8.1: Reliability, ACM: C.: Computer Systems Organization/C.4: PERFORMANCE OF SYSTEMS/C.4.1: Fault tolerance, [INFO.INFO-PF]Computer Science [cs]/Performance [cs.PF], [INFO.INFO-PF] Computer Science [cs]/Performance [cs.PF], ACM: C.: Computer Systems Organization/C.3: SPECIAL-PURPOSE AND APPLICATION-BASED SYSTEMS/C.3.2: Real-time and embedded systems, ACM: C.: Computer Systems Organization/C.4: PERFORMANCE OF SYSTEMS/C.4.5: Reliability, and serviceability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 64 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
