GPUOcelot
|
#include <cuda_runtime.h>
Number of asynchronous engines
Device can map host memory with cudaHostAlloc/cudaHostGetDevicePointer
Clock frequency in kilohertz
Compute mode (See cudaComputeMode)
Device can possibly execute multiple kernels concurrently
Device can concurrently copy memory and execute a kernel. Deprecated. Use instead asyncEngineCount.
Device has ECC support enabled
Device is integrated as opposed to discrete
Specified whether there is a run time limit on kernels
Size of L2 cache in bytes
Major compute capability
Maximum size of each dimension of a grid
int cudaDeviceProp::maxSurface2D[2] |
int cudaDeviceProp::maxSurface3D[3] |
Maximum 1D texture size
Maximum 1D layered texture dimensions
Maximum 2D texture dimensions
Maximum 2D layered texture dimensions
Maximum 3D texture dimensions
Maximum size of each dimension of a block
Maximum number of threads per block
Maximum resident threads per multiprocessor
Global memory bus width in bits
Peak memory clock frequency in kilohertz
Maximum pitch in bytes allowed by memory copies
Minor compute capability
Number of multiprocessors on device
char cudaDeviceProp::name |
ASCII string identifying device
PCI bus ID of the device
PCI device ID of the device
PCI domain ID of the device
32-bit registers available per block
Shared memory available per block in bytes
Alignment requirements for surfaces
1 if device is a Tesla device using TCC driver, 0 otherwise
Alignment requirement for textures
Constant memory available on device in bytes
Global memory available on device in bytes
Device shares a unified address space with the host
Warp size in threads