Effective utilization of CUDA hyper-Q for improved power and performance efficiency

Ryan S. Luley, Qinru Qiu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Scopus citations

Abstract

High utilization of hardware resources is key to designing performance- and power-optimized GPU applications. The efficiency of applications and kernels that do not fully utilize GPU resources can be improved through concurrent execution with independent kernels and/or applications. Hyper-Q enables multiple CPU threads or processes to launch work on a single GPU simultaneously for increased GPU utilization. However, without careful design, false serialization may occur due to contention for shared hardware resources such as direct memory access (DMA) engines. In this paper, we reveal the impact of such contention on performance and assess a method for overcoming the limitation with minimal algorithmic overhead. We demonstrate a method to achieve up to 31.8% improvement in performance and 10.4% reduction in energy on average for a finite set of application tasks when maximizing GPU execution concurrency.

Original language: English (US)
Title of host publication: Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016
Publisher: IEEE Computer Society
Pages: 1160-1169
Number of pages: 10
Volume: 2016-August
ISBN (Electronic): 9781509021406
DOI: https://doi.org/10.1109/IPDPSW.2016.154
State: Published - Aug 2 2016
Event: 30th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016 - Chicago, United States
Duration: May 23 2016 to May 27 2016

Other

Other: 30th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016
Country: United States
City: Chicago
Period: 5/23/16 to 5/27/16

Keywords

  • Concurrency
  • GPU performance
  • GPU utilization
  • Hyper-Q
  • Power efficiency
  • Resource sharing

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Networks and Communications
  • Hardware and Architecture
  • Software

  • Cite this

    Luley, R. S., & Qiu, Q. (2016). Effective utilization of CUDA hyper-Q for improved power and performance efficiency. In Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016 (Vol. 2016-August, pp. 1160-1169). [7529999] IEEE Computer Society. https://doi.org/10.1109/IPDPSW.2016.154