Computer and Modernization ›› 2013, Vol. 1 ›› Issue (2): 130-133.doi: 10.3969/j.issn.1006-2475.2013.02.032

• 应用与开发 • Previous Articles     Next Articles

Design and Implementation of Task Scheduling Middleware for GPU Cluster

CHEN Chun-lei   

  1. School of Automation, Northwestern Polytechnical University, Xi’an 710072, China
  • Received:2012-12-21 Revised:1900-01-01 Online:2013-02-27 Published:2013-02-27

Abstract: In a GPU cluster, the static task scheduling policy may result in unbalanced allocation of computing resource, because GPUs work as co-processors. A weight-based dynamic scheduling policy is proposed and implemented as a middleware, so that it can be applied to the GPU cluster. Under this policy, local GPUs and remote GPUs are not explicitly distinguished, and no global information is required. Every cluster node decides whether to use local GPUs or remote GPUs, according to weights of GPUs. And these weights are locally maintained by each node, respectively. As a carrier of the policy, the middleware ensures that the policy is transparent to users. It is composed of three parts: API library, resource management daemon, and GPU execution daemon. The policy is validated on a two-node cluster. Experiments show that the weight-based dynamic scheduling policy can achieve a 16% higher GPU utilization rate than the static policy, and a 45% higher GPU utilization rate than another dynamic policy (global-queue-based policy).

Key words: GPU cluster, dynamic task scheduling, middleware