Difference between revisions of "Cuda"

From Ghoulwiki
Jump to: navigation, search
Line 56: Line 56:
 
       1371k clockRate
 
       1371k clockRate
 
       256 textureAlignment
 
       256 textureAlignment
 +
</pre>
 +
 +
<pre>
 +
N=65536
 +
SX=2048
 +
SY=64
 +
SZ=2
 +
I0=32
 +
DATASIZE_IN_RAW=2304kb
 +
DATASIZE_IN_STATE=256kb
 +
DATASIZE_IN_INDEX=136kb
 +
DATASIZE_IN_TOTAL=2696kb
 +
DATASIZE_OUT_TOTAL=16384kb
 +
0.74 sec : reading data from file
 +
assert passed : (INDEXPOS_0(I0) == INDEXSTART_1-1)
 +
assert passed : (INDEXPOS_1(I0-1,I0) == INDEXSTART_2-1)
 +
assert passed : (INDEXPOS_2(I0-1,I0-1,I0) == INDEX_END-1)
 +
assert passed : (sz < 255)
 +
-2.478893,-2.459100,-2.459100,...,4.097229
 +
0.09 sec : generating index data
 +
0.02 sec : allocate and init device mem
 +
9.55 sec : exec kernel on device
 +
0.00 sec : receive results from device
 +
atom[0]=57100 atom[1]=0 iNumResults=57100 kMaxResults=2097152
 +
check : with index on cpu...
 +
check : with index on cpu: iNumResults=57100
 +
17.29 sec : check : with index on cpu
 +
 +
Press ENTER to exit...
 
</pre>
 
</pre>

Revision as of 17:04, 29 September 2007


  • $(CUDA_BIN_PATH)\nvcc.exe -arch sm_11 -ccbin "$(VCInstallDir)bin" -c -DWIN32 -D_CONSOLE -D_MBCS -Xcompiler /EHsc,/W3,/nologo,/Wp64,/O2,/Zi,/MT -I"$(CUDA_INC_PATH)" -I./ -I../../common/inc -o $(ConfigurationName)\myproj.obj myproj.cu


##### ##### ##### ##### #####device 0
name : GeForce 8500 GT
    261888k totalGlobalMem
        16k sharedMemPerBlock
         8k regsPerBlock
        32 warpSize
       256k memPitch
       512 maxThreadsPerBlock
       512 maxThreadsDim[0]
       512 maxThreadsDim[1]
        64 maxThreadsDim[2]
        63k maxGridSize[0]
        63k maxGridSize[1]
         1 maxGridSize[2]
        64k totalConstMem
         1 major
         1 minor
      1371k clockRate
       256 textureAlignment
##### ##### ##### ##### #####device 1
name : GeForce 8500 GT
    261824k totalGlobalMem
        16k sharedMemPerBlock
         8k regsPerBlock
        32 warpSize
       256k memPitch
       512 maxThreadsPerBlock
       512 maxThreadsDim[0]
       512 maxThreadsDim[1]
        64 maxThreadsDim[2]
        63k maxGridSize[0]
        63k maxGridSize[1]
         1 maxGridSize[2]
        64k totalConstMem
         1 major
         1 minor
      1371k clockRate
       256 textureAlignment
N=65536
SX=2048
SY=64
SZ=2
I0=32
DATASIZE_IN_RAW=2304kb
DATASIZE_IN_STATE=256kb
DATASIZE_IN_INDEX=136kb
DATASIZE_IN_TOTAL=2696kb
DATASIZE_OUT_TOTAL=16384kb
0.74 sec : reading data from file
assert passed : (INDEXPOS_0(I0) == INDEXSTART_1-1)
assert passed : (INDEXPOS_1(I0-1,I0) == INDEXSTART_2-1)
assert passed : (INDEXPOS_2(I0-1,I0-1,I0) == INDEX_END-1)
assert passed : (sz < 255)
-2.478893,-2.459100,-2.459100,...,4.097229
0.09 sec : generating index data
0.02 sec : allocate and init device mem
9.55 sec : exec kernel on device
0.00 sec : receive results from device
atom[0]=57100 atom[1]=0 iNumResults=57100 kMaxResults=2097152
check : with index on cpu...
check : with index on cpu: iNumResults=57100
17.29 sec : check : with index on cpu

Press ENTER to exit...