Difference between revisions of "Cuda"
From Ghoulwiki
Ghoulsblade (talk | contribs) |
Revision as of 17:04, 29 September 2007
- http://www.litec-computer.de/PC-Komponenten/Grafikkarten/PCI-express/nVidia/Gigabyte-GV-NX85T256H-8500GT-512MB-Dual-DVI-TV-out-passiv::13515.html
- http://www.litec-computer.de/PC-Komponenten/Grafikkarten/PCI-express/nVidia/ASUS-EN8500GT-SILENT-MAGIC-HTD-512MB-DVI-TV-out-passiv::13117.html
- note: no 88xx (no atomic operations), only 85xx or 86xx; as much RAM as possible (512 MB); no cards with "128-bit memory interface only"
- svn+ssh://ghoulsblade@zwischenwelt.org/var/svn/robertprojarbeit
- http://zwischenwelt.org/svn/robertprojarbeit
- nvidia-cuda-forum http://forums.nvidia.com/index.php?showforum=62 (search for 8600)
- nvidia-cuda-hp http://developer.nvidia.com/object/cuda.html
- FAQ : http://forums.nvidia.com/index.php?showtopic=36286&hl=8600 (many interesting programming tips)
- SIMD : http://en.wikipedia.org/wiki/Vector_processor
- samples http://developer.download.nvidia.com/compute/cuda/sdk/website/samples.html
- cuda 1.0 announcement 26.june : http://forums.nvidia.com/index.php?showtopic=39030&hl=8600
- $(CUDA_BIN_PATH)\nvcc.exe -arch sm_11 -ccbin "$(VCInstallDir)bin" -c -DWIN32 -D_CONSOLE -D_MBCS -Xcompiler /EHsc,/W3,/nologo,/Wp64,/O2,/Zi,/MT -I"$(CUDA_INC_PATH)" -I./ -I../../common/inc -o $(ConfigurationName)\myproj.obj myproj.cu
##### ##### ##### ##### #####
device 0 name : GeForce 8500 GT
261888k totalGlobalMem
16k sharedMemPerBlock
8k regsPerBlock
32 warpSize
256k memPitch
512 maxThreadsPerBlock
512 maxThreadsDim[0]
512 maxThreadsDim[1]
64 maxThreadsDim[2]
63k maxGridSize[0]
63k maxGridSize[1]
1 maxGridSize[2]
64k totalConstMem
1 major
1 minor
1371k clockRate
256 textureAlignment

##### ##### ##### ##### #####
device 1 name : GeForce 8500 GT
261824k totalGlobalMem
16k sharedMemPerBlock
8k regsPerBlock
32 warpSize
256k memPitch
512 maxThreadsPerBlock
512 maxThreadsDim[0]
512 maxThreadsDim[1]
64 maxThreadsDim[2]
63k maxGridSize[0]
63k maxGridSize[1]
1 maxGridSize[2]
64k totalConstMem
1 major
1 minor
1371k clockRate
256 textureAlignment
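A listing like the one above can be produced with `cudaGetDeviceProperties`. The following is only a minimal sketch, not the program that generated this output. The helper names `formatK` and `supportsGlobalAtomics` are hypothetical, and the "k" rule (integer division by 1024) is a guess based on values such as `63k maxGridSize[0]`. The atomics check reflects the fact that atomic operations on global memory need compute capability 1.1 or higher, which is why the log shows `1 major` / `1 minor`.

```cuda
#include <cstdio>
#include <string>
#include <cuda_runtime.h>

// Hypothetical helper (assumption): values >= 1024 are printed as whole
// multiples of 1024 with a "k" suffix, e.g. 65535 -> "63k".
static std::string formatK(size_t v) {
    char buf[32];
    if (v >= 1024) std::snprintf(buf, sizeof(buf), "%zuk", v / 1024);
    else           std::snprintf(buf, sizeof(buf), "%zu", v);
    return std::string(buf);
}

// Atomic operations on global memory require compute capability 1.1 or higher.
static bool supportsGlobalAtomics(int major, int minor) {
    return major > 1 || (major == 1 && minor >= 1);
}

int main() {
    int count = 0;
    cudaError_t err = cudaGetDeviceCount(&count);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceCount: %s\n", cudaGetErrorString(err));
        return 1;
    }
    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp p;
        err = cudaGetDeviceProperties(&p, dev);
        if (err != cudaSuccess) {
            std::fprintf(stderr, "device %d: %s\n", dev, cudaGetErrorString(err));
            continue;
        }
        // Print a subset of the fields in "value name" order, one per line.
        std::printf("device %d name : %s\n", dev, p.name);
        std::printf("%s totalGlobalMem\n", formatK(p.totalGlobalMem).c_str());
        std::printf("%s sharedMemPerBlock\n", formatK(p.sharedMemPerBlock).c_str());
        std::printf("%s regsPerBlock\n", formatK((size_t)p.regsPerBlock).c_str());
        std::printf("%d warpSize\n", p.warpSize);
        std::printf("%d maxThreadsPerBlock\n", p.maxThreadsPerBlock);
        for (int i = 0; i < 3; ++i)
            std::printf("%s maxGridSize[%d]\n", formatK((size_t)p.maxGridSize[i]).c_str(), i);
        std::printf("%d major\n%d minor\n", p.major, p.minor);
        std::printf("global atomics : %s\n",
                    supportsGlobalAtomics(p.major, p.minor) ? "yes" : "no");
    }
    return 0;
}
```

Built with `nvcc`, this prints one block per device; on a compute capability 1.0 card the last line would read `global atomics : no`.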
N=65536
SX=2048
SY=64
SZ=2
I0=32
DATASIZE_IN_RAW=2304kb
DATASIZE_IN_STATE=256kb
DATASIZE_IN_INDEX=136kb
DATASIZE_IN_TOTAL=2696kb
DATASIZE_OUT_TOTAL=16384kb
0.74 sec : reading data from file
assert passed : (INDEXPOS_0(I0) == INDEXSTART_1-1)
assert passed : (INDEXPOS_1(I0-1,I0) == INDEXSTART_2-1)
assert passed : (INDEXPOS_2(I0-1,I0-1,I0) == INDEX_END-1)
assert passed : (sz < 255)
-2.478893,-2.459100,-2.459100,...,4.097229
0.09 sec : generating index data
0.02 sec : allocate and init device mem
9.55 sec : exec kernel on device
0.00 sec : receive results from device
atom[0]=57100 atom[1]=0 iNumResults=57100 kMaxResults=2097152
check : with index on cpu...
check : with index on cpu: iNumResults=57100
17.29 sec : check : with index on cpu

Press ENTER to exit...
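The `atom[0]` counter and the `kMaxResults` limit in the log suggest that the kernel appends its hits to a fixed-size output buffer through an atomic counter, and that the count is then verified on the CPU. The actual kernel is not shown here, so the following is only a sketch of that general pattern. The predicate `isHit`, the kernel `collectHits`, the test data, and the buffer sizes are all hypothetical stand-ins.

```cuda
#include <cstdio>
#include <cstdlib>
#include <vector>
#include <cuda_runtime.h>

// Abort with a message if a CUDA runtime call fails.
#define CHECK(call) do { cudaError_t e_ = (call); if (e_ != cudaSuccess) { \
    std::fprintf(stderr, "%s failed: %s\n", #call, cudaGetErrorString(e_)); \
    std::exit(1); } } while (0)

// Hypothetical predicate standing in for the real match test (assumption).
__host__ __device__ inline bool isHit(float v) { return v > 0.5f; }

// Each thread tests one element; a hit reserves a unique output slot via atomicAdd.
__global__ void collectHits(const float* in, unsigned n, unsigned* counter,
                            unsigned* out, unsigned maxResults) {
    unsigned i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n || !isHit(in[i])) return;
    unsigned slot = atomicAdd(counter, 1u);  // needs compute capability >= 1.1
    if (slot < maxResults) out[slot] = i;    // hits beyond capacity are counted but not stored
}

// CPU reference count used to check the GPU result.
static unsigned countHitsCpu(const std::vector<float>& in) {
    unsigned c = 0;
    for (size_t i = 0; i < in.size(); ++i) if (isHit(in[i])) ++c;
    return c;
}

int main() {
    const unsigned n = 65536, maxResults = 65536;
    std::vector<float> h(n);
    for (unsigned i = 0; i < n; ++i) h[i] = (i % 4) * 0.25f;  // synthetic test data

    float* dIn = 0; unsigned* dCounter = 0; unsigned* dOut = 0;
    CHECK(cudaMalloc(&dIn, n * sizeof(float)));
    CHECK(cudaMalloc(&dCounter, sizeof(unsigned)));
    CHECK(cudaMalloc(&dOut, maxResults * sizeof(unsigned)));
    CHECK(cudaMemcpy(dIn, h.data(), n * sizeof(float), cudaMemcpyHostToDevice));
    CHECK(cudaMemset(dCounter, 0, sizeof(unsigned)));

    const unsigned threads = 256;
    collectHits<<<(n + threads - 1) / threads, threads>>>(dIn, n, dCounter, dOut, maxResults);
    CHECK(cudaGetLastError());
    CHECK(cudaDeviceSynchronize());

    unsigned gpuCount = 0;
    CHECK(cudaMemcpy(&gpuCount, dCounter, sizeof(unsigned), cudaMemcpyDeviceToHost));
    unsigned cpuCount = countHitsCpu(h);
    std::printf("gpu iNumResults=%u, cpu iNumResults=%u\n", gpuCount, cpuCount);

    CHECK(cudaFree(dIn)); CHECK(cudaFree(dCounter)); CHECK(cudaFree(dOut));
    return gpuCount == cpuCount ? 0 : 1;
}
```

The counter can exceed `maxResults`, so the number of valid entries in `out` is the smaller of the two. The order of entries in `out` is not deterministic, which is why the check compares counts rather than positions.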