一种面向异构集群的DNN推理任务批调度优化方法及实现

A BATCH SCHEDULING OPTIMIZATION METHOD AND IMPLEMENTATION OF DNN INFERENCE TASKS FOR HETEROGENEOUS CLUSTERS

  • 摘要: 随着人工智能产业的蓬勃发展,大量基于深度神经网络(DNN)的应用每天产生大规模的DNN推理任务,并汇集到大型异构计算集群进行推理。如何设计一种面向异构集群的高效DNN推理任务批调度优化方法及其实现是一个关键问题。针对该问题,提出一种基于策略参数表征机制的调度优化方法。它使用包含参数的策略组合进行调度结果流生成,并不断使用元启发式搜索算法对这些参数进行最优解搜索;基于此优化方法实现了一个适用于异构计算集群推理任务调度的系统。实验结果表明该优化方法相较于传统的调度方法有着显著的性能提升。

     

    Abstract: With the development of artificial intelligence industry, a large number of DNN-based applications generate large-scale DNN inference tasks every day. These tasks are aggregated into large heterogeneous clusters for inference. How to design an efficient batch scheduling optimization method of DNN inference tasks for heterogeneous clusters and its implementation is a key issue. To solve this problem, a scheduling optimization method based on a policy parameter characterization mechanism was proposed. It used a combination of policies with embedded parameters for scheduling results generation and continuously searches for optimal solutions to these parameters using a meta-heuristic algorithm. A scheduling system for inference tasks in heterogeneous computing clusters was implemented based on this optimization method. Experiments show that the proposed optimization method has significant performance improvements over traditional scheduling methods.

     

/

返回文章
返回