Efficient GPU multitasking with latency minimization and cache boosting

Kim Jiho, Chu Minsung, Park Yongjun

doi:10.1587/elex.14.20161158

Efficient GPU multitasking with latency minimization and cache boosting

DOI Web Site 被引用文献1件参考文献14件

Kim Jiho

School of Electronic and Electrical Engineering, Hongik University
Chu Minsung

School of Electronic and Electrical Engineering, Hongik University
Park Yongjun

School of Electronic and Electrical Engineering, Hongik University

抄録

<p>GPU spatial multitasking has been proven to be quite effective at executing different applications concurrently using SM partitioning. However, while it maximizes total throughput, latency-critical applications often cannot meet their deadlines due to the increased execution time. Furthermore, SM partitioning cannot allocate the appropriate L1 cache size per kernel. To solve these problems, this paper proposes a new application-aware resource allocation framework called GPU Fine-Tuner, for assigning appropriate resources to GPU kernels. To minimize the execution time of latency-constrained applications, it assigns them more SMs when performance is not affected. It also increases the cache size of SMs for cache-sensitive kernels using resource borrowing from neighbors for cache-insensitive kernels. Experimental results show that the Fine-Tuner outperforms GPU spatial multitasking with up to 15% less average latency without performance degradation.</p>

収録刊行物

IEICE Electronics Express

IEICE Electronics Express 14 (7), 20161158-20161158, 2017

一般社団法人電子情報通信学会

被引用文献 (1)*注記

参考文献 (14)*注記

詳細情報詳細情報について

CRID

1390282680195567488
NII論文ID

130005589255
DOI

10.1587/elex.14.20161158
ISSN

13492543
Web Site

https://www.jstage.jst.go.jp/article/elex/14/7/14_14.20161158/_pdf
本文言語コード

en
データソース種別
- JaLC
- Crossref
- CiNii Articles
抄録ライセンスフラグ
使用不可

書き出し

問題の指摘

ページトップへ

Efficient GPU multitasking with latency minimization and cache boosting

抄録

収録刊行物

被引用文献 (1)*注記

参考文献 (14)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について