
The existence of computational intensive applications with a high degree of granularity which have to be executed as quickly as possible or sometimes with budget limitations makes the scheduling problem to be very important in today's distributed systems. In the context of the distributed systems being used by a various community of users, the quality and precision of the scheduling must be as close to optimality as possible. We present in this paper a scheduling method based on a genetic algorithm which respects the deadline constraints. Based on this model we developed a scheduler for MapReduce applications which also takes into account the heterogeneity of distributed systems. The developed scheduler was created for Hadoop platform, an already successful framework, and it can be built in conjunction with other existing Hadoop's schedulers like: fair-scheduler or capacity-scheduler. We tested and validated this model by testing its scalability and by comparing it with existing solutions in Hadoop. The novelty of this solution is that compared to other planners in Hadoop it meets the deadline and budget constraints and thus provides trust for the Hadoop platform.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 2 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
