Computer and Modernization

A Resource Scheduling Mechanism of Hadoop YARN

  

  1. (Information Service Laboratory, The 32nd Research Institute of China Electronics Technology Group Corporation, Shanghai 201808, China)
  • Received: 2017-05-31 Online: 2017-11-21 Published: 2017-11-21

Abstract: YARN is the resource management system widely used in Hadoop. It supports MapReduce, Spark, Storm and other computing frameworks, and has become a core component of the big data ecosystem. However, the resource guarantee mechanism in Hadoop YARN's existing resource scheduler, which is based on resource reservation, produces resource fragmentation and therefore wastes resources. To improve the resource utilization and throughput of the cluster, this paper proposes a resource allocation mechanism based on reservation and backfill. The mechanism decides, according to the priority of a job, whether to reserve resources for it, and introduces a backfill strategy that fills the otherwise idle resources with other jobs without affecting the execution of the job holding the reservation. Experiments show that the resource scheduling mechanism based on reservation and backfill can effectively improve the resource utilization and throughput of a Hadoop YARN cluster.
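
The abstract does not give the algorithm in detail, so the following is only a minimal sketch of the general reservation-plus-backfill idea on a single node, not the paper's implementation and not YARN code. The `Job` fields, the `reserve_priority` threshold, and the function `schedule_once` are all hypothetical names introduced for illustration. A job whose priority reaches the threshold and that cannot start now receives a reservation at the earliest time enough resources are released; other jobs may backfill only if they finish before that time or fit into the capacity the reserved job will not need.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    priority: int   # larger means more important (assumed convention)
    demand: int     # resource units, e.g. container slots
    duration: int   # estimated run time in ticks


def schedule_once(free, running, queue, now, reserve_priority=5):
    """One scheduling pass on a single node (illustrative only).

    free    -- resource units idle right now
    running -- list of (finish_time, demand) for executing jobs
    queue   -- pending jobs
    Returns (started_names, reservation); reservation is
    (job_name, start_time) or None.
    """
    started = []
    reservation = None
    spare = 0  # units left over at the reserved start time
    for job in sorted(queue, key=lambda j: -j.priority):
        fits_now = job.demand <= free
        if reservation is None and not fits_now:
            if job.priority >= reserve_priority:
                # Earliest time at which enough units are released.
                avail = free
                for finish, demand in sorted(running):
                    avail += demand
                    if avail >= job.demand:
                        reservation = (job.name, finish)
                        spare = avail - job.demand
                        break
            continue
        if not fits_now:
            continue
        if reservation is not None:
            ends_in_time = now + job.duration <= reservation[1]
            if not ends_in_time:
                if job.demand > spare:
                    continue  # would delay the reserved job
                spare -= job.demand
        free -= job.demand
        running.append((now + job.duration, job.demand))
        started.append(job.name)
    return started, reservation


# Hypothetical scenario: capacity 10, one running job holds 5 units until t=10.
queue = [
    Job("A", priority=9, demand=8, duration=5),   # gets a reservation
    Job("B", priority=1, demand=2, duration=8),   # ends before t=10
    Job("C", priority=1, demand=2, duration=20),  # fits in the spare units
    Job("D", priority=1, demand=1, duration=30),  # would delay A
]
started, reservation = schedule_once(
    free=5, running=[(10, 5)], queue=queue, now=0
)
```

In this made-up scenario, job A is reserved for t=10, jobs B and C are backfilled into the 5 idle units, and job D is held back because it would still occupy a unit that A needs at its reserved start. Without backfill, those 5 units would sit idle until t=10, which is the fragmentation the abstract describes.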

Key words: Hadoop YARN, big data, resource scheduler, reserved backfill
