2차원 구조 대비 3차원 구조 GPU의 메모리 접근 효율성 분석

Hyung-Gyu Jeon,Jong-Myon Kim,Jin-Woo Ahn,Cheol-Hong Kim

doi:10.9708/jksci.2012.17.7.001

Abstract

최근 반도체 공정 기술이 발달함에 따라 단일 프로세서에 적재되는 코어의 수가 크게 증가하였고, 이는 프로세서의 성능을 급격하게 향상시키는 계기가 되고 있다. 특히, 많은 수의 코어들로 구성된 GPU(Graphics Processing Unit)는 대규모 병렬성을 활용하여 연산처리 성능을 크게 향상시키고 있다. 하지만, 주 메모리 접근 지연시간이 GPU의 성능 향상을 제약하는 심각한 요인 중 하나로 제기되는 상황이다. 본 논문에서는 3차원 구조를 통한 GPU의 메모리 접근 효율성 향상에 대한 정량적 분석과 3차원 구조 적용 시 발생 가능한 문제점에 대하여 살펴보고자 한다. 일반적으로 메모리 명령어 비율은 평균적으로 전체 명령어의 30%를 차지하고, 메모리 명령어 중에서 주 메모리 접근과 관련된 글로벌/로컬 메모리 명령어가 차지하는 비율 또한 평균 60%이므로 주 메모리로의 접근 지연시간을 크게 감소시키는 3차원 구조를 적용한다면 GPU의 성능 또한 크게 향상시킬 수 있을 것으로 예상된다. 그러나 본 논문에서 수행한 실험 결과에 따르면 메모리 병목현상으로 인해 3차원 구조 GPU의 성능이 2차원 구조 GPU에 비해 크게 향상되지는 않음을 확인할 수 있다. 분석 결과에 의하면, 3차원 구조 GPU는 2차원 구조 GPU와 비교하여 메모리 병목현상으로 인한 성능 지연이 최대 245%까지 증가하기 때문이다. 본 논문에서는 3차원 구조 GPU를 대상으로 메모리 접근의 효율성과 문제점을 함께 분석함으로써, 3차원 GPU에 적합한 메모리 구조를 설계하기 위한 가이드라인을 제시하고자 한다. As process technology scales down, the number of cores integrated into a processor increases dramatically, leading to significant performance improvement. Especially, the GPU(Graphics Processing Unit) containing many cores can provide high computational performance by maximizing the parallelism. In the GPU architecture, the access latency to the main memory becomes one of the major reasons restricting the performance improvement. In this work, we analyze the performance improvement of the 3D GPU architecture compared to the 2D GPU architecture quantitatively and investigate the potential problems of the 3D GPU architecture. In general, memory instructions account for 30% of total instructions, and global/local memory instructions constitutes 60% of total memory instructions. Therefore, the performance of the 3D GPU is expected to be improved significantly compared to the 2D GPU by reducing the delay of memory instructions. However, according to our experimental results, the 3D architecture improves the GPU performance only by 2% compared to the 2D architecture due to the memory bottleneck, since the performance reduction due to memory bottleneck in the 3D GPU architecture increases by 245% compared to the 2D architecture. This paper provides the guideline for suitable memory design by analyzing the efficiency of the memory architecture in 3D GPU architecture.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

2차원 구조 대비 3차원 구조 GPU의 메모리 접근 효율성 분석

Abstract

Talk to us

Similar Papers

More From: Journal of the Korea Society of Computer and Information

Lead the way for us

Journal: Journal of the Korea Society of Computer and Information	Publication Date: Jul 31, 2012
Citations: 20

Similar Papers

Impact of Clock Frequency and Number of Cores on GPU Performance
Hong Jun Choi ... Dong Oh Son
-
Hong Jun Choi, et. al.Hong Jun Choi ... Dong Oh Son
01 Oct 2014
01 Oct 2014

Accelerating genetic algorithms with GPU computing: A selective overview
John Runwei Cheng ... Mitsuo Gen
Computers & Industrial Engineering | VOL. 128
John Runwei Cheng, et. al.John Runwei Cheng ... Mitsuo Gen
29 Dec 2018
Computers & Industrial Engineering | VOL. 128

Evaluation of CPU and GPU architectures for spectral image analysis algorithms
Virginie Fresse ... Dominique Houzet
-
Virginie Fresse, et. al.Virginie Fresse ... Dominique Houzet
23 Jan 2011
23 Jan 2011

Impact of memory bottleneck on the performance of graphics processing units
Jong Myon Kim ... Cheol Hong Kim
-
Jong Myon Kim, et. al.Jong Myon Kim ... Cheol Hong Kim
09 Dec 2015
09 Dec 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

2차원 구조 대비 3차원 구조 GPU의 메모리 접근 효율성 분석

Abstract

Talk to us

Similar Papers

More From: Journal of the Korea Society of Computer and Information