-
- Yuki Iida
- Graduate School of Information Science and Engineering, Ritsumeikan University, Kusatsu Shiga, Japan
-
- Yusuke Fujii
- Graduate School of Information Science and Engineering, Ritsumeikan University, Kusatsu Shiga, Japan
-
- Takuya Azumi
- Graduate School of Engineering Science, Osaka University, Toyonaka Osaka, Japan
-
- Nobuhiko Nishio
- Graduate School of Information Science and Engineering, Ritsumeikan University, Kusatsu Shiga, Japan
-
- Shinpei Kato
- Graduate School of Information Science, Nagoya University, Nagoya Aichi, Japan
書誌事項
- タイトル別名
-
- Exploring Transparent Access to Remote GPUs
抄録
<jats:p>Graphics processing units (GPUs) are increasingly used for high-performance computing. Programming frameworks for general-purpose computing on GPUs (GPGPU), such as CUDA and OpenCL, are also maturing. Driving this trend is the recent proliferation of mobile devices such as smartphones and wearable computers. These devices are increasingly incorporating computationally intensive applications that involve some form of environmental recognition such as augmented reality (AR) or voice recognition. However, devices with low computational power cannot satisfy such demanding computing requirements. The CPU load of these devices could be reduced by offloading computation onto GPUs on the cloud. This paper presents GPUrpc, a remote procedure call (RPC) extension to Gdev, which is a rich set of runtime libraries and device drivers for achieving first-class GPU resource management. GPUrpc allows developers to use CUDA for GPGPU development work. Existing research uses RPCs based on the CUDA application programming interfaces (APIs); hence, all CUDA APIs require communication. To reduce communication overhead, we use an RPC based on a low-level API than CUDA API and reduced API that does not require communication. Our evaluation conducted on Linux and NVIDIA GPUs shows that the basic performance of our prototype implementation is reliable in comparison with the existing method. Evaluation using the Rodinia benchmark suite designed for research in heterogeneous parallel computing showed that GPUrpc is effective for applications such as image processing and data mining. GPUrpc also can improve power consumption to approximately 1/6 that of CPU processing for performing 512 × 512 matrix multiplication.</jats:p>
収録刊行物
-
- ACM Transactions on Embedded Computing Systems
-
ACM Transactions on Embedded Computing Systems 16 (1), 1-25, 2016-10-13
Association for Computing Machinery (ACM)
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1360285711253740928
-
- DOI
- 10.1145/2950056
-
- ISSN
- 15583465
- 15399087
-
- データソース種別
-
- Crossref
- KAKEN