-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME
28 lines (22 loc) · 886 Bytes
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
Fahrenholtz, Samuel J., et al. "A model evaluation study for treatment planning of laser-induced thermal therapy." International Journal of Hyperthermia (2015).
https://www.ncbi.nlm.nih.gov/pubmed/26368014
Usage
=====
$ make
/opt/apps/cuda/5.5//bin/nvcc -g -G -ptx -gencode=arch=compute_20,code=sm_20 steadyStatePennesLaser.cu
(from MATLAB)
>> exampleSSGPU
.
.
iter 500
Elapsed time is 204.292927 seconds.
Publish
=======
opts.outputDir = '.';
opts.format = 'pdf';
publish('exampleSSGPU',opts)
Design Pattern
==============
grid stride loop design pattern, 1-d grid
http://devblogs.nvidia.com/parallelforall/cuda-pro-tip-write-flexible-kernels-grid-stride-loops/
- By using a loop, you can support any problem size even if it exceeds the largest grid size your CUDA device supports. Moreover, you can limit the number of blocks you use to tune performance.