CSIRO Publishing
PASA
http://asa.astronomy.org.au/
  Publications of the Astronomical Society of Australia
 
Open Access Article, Vol 28(1)

Astrophysical Supercomputing with GPUs: Critical Decisions for Early Adopters*

Christopher J. Fluke A B, David G. Barnes A, Benjamin R. Barsdell A, Amr H. Hassan A

A Centre for Astrophysics and Supercomputing, Swinburne University of Technology, PO Box 218, Hawthorn, VIC 3122, Australia
B Corresponding author. Email: cfluke@swin.edu.au
 
  


Abstract

General-purpose computing on graphics processing units (GPGPU) is dramatically changing the landscape of high performance computing in astronomy. In this paper, we identify and investigate several key decision areas, with the goal of simplifying the early adoption of GPGPU in astronomy. We consider the merits of OpenCL as an open standard that reduces the risks associated with coding in a native, vendor-specific programming environment, and present a GPU programming philosophy based on using brute force solutions. We assert that effective use of new GPU-based supercomputing facilities will require a change in approach from astronomers. This will likely include improved programming training, an increased need for software development best practice through the use of profiling and related optimisation tools, and a greater reliance on third-party code libraries. As with any new technology, those willing to take the risks and invest the time and effort to become early adopters of GPGPU in astronomy stand to reap great benefits.

Keywords: methods: numerical — methods: n-body simulations — gravitational lensing



1 http://kolob.ziti.uni-heidelberg.de/ .

2 1 flop = 1 floating-point operation; 1 flop/s = 1 floating-point operation/second.

3 http://silkroad.bao.ac.cn/ .

4 http://www.lanl.gov/ .

5 Their work was published prior to the release of CUDA; some of the implementation issues they raise have since been resolved.

6 Online versions of volumes 1–3 are freely available from http://developer.nvidia.com/page/home.html .

7 NVIDIA CUDA: http://www.nvidia.com/object/cuda_home_new.html .

8 Other architecture-specific SDKs include the ATI Stream SDK (AMD) for programming ATI Radeon GPUs and the Cell Broadband Engine SDK (IBM).

9 Khronos OpenCL: http://www.khronos.org/opencl/ .

10 Khronos: http://www.khronos.org/ .

11 SiSoftware CUDA and OpenCL comparison: http://www.sisoftware.info/?d=qa&f=gpu_opencl&l=en&a= .

12 Only one of the two GPUs on the Radeon card was used in these tests.

13 i.e. the difference between OpenCL and CUDA kernel execution performance is typically a factor of 10–100 smaller than the gain achieved by using GPUs instead of CPUs.

14 We note that at the time of code development, OpenCL had not been publicly released, hence our choice of CUDA.

15 http://www.gnu.org/software/gsl .

16 A single processor of a 2 quad-core Clovertown Processor.

17 We note that a naïve use of the Mathematica Minimize function also returns an incorrect global minimum value f(1.4454, 28.89434) = –0.284.

18 The standalone C implementation was a single-core code; we report quad-core timings by assuming perfect scaling, which is reasonable for this task.

19 http://developer.nvidia.com.

20 http://www.fftw.org.

21 http://www.astro.caltech.edu/~tjp/pgplot.

22 http://www.numerical-recipes.com/.

23 http://code.google.com/p/cudpp/.

24 http://www.ittvis.com/ProductServices/IDL.aspx.

25 http://www.wolfram.com/products/mathematica.

26 http://www.python.org/.

27 http://www.txcorp.com/products/GPULib/.

28 http://mathema.tician.de/software/pycuda.

29 http://www.cs.lth.se/home/Calle_Lejdfors/pygpu.

30 http://www.caps-enterprise.com .

* Research undertaken as part of the Commonwealth Cosmology Initiative (CCI: www.thecci.org ), an international collaboration supported by the Australian Research Council.
   
    

CSIRO

© CSIRO 1996-2013