|  | 
Copies from host memory to device memory. dstDevice and srcHost are the base addresses of the destination and source, respectively. ByteCount specifies the number of bytes to copy. 
cuMemcpyHtoDAsync() is asynchronous and can optionally be associated with a stream by passing a non-zero hStream argument. It only works on page-locked memory and returns an error if a pointer to pageable memory is passed as input. 
 Parameters:
  
    dstDevice - Destination device pointer
    srcHost - Source host pointer
    ByteCount - Size of memory copy in bytes
    hStream - Stream identifier
 Returns: CUDA_SUCCESS, CUDA_ERROR_DEINITIALIZED, CUDA_ERROR_NOT_INITIALIZED, CUDA_ERROR_INVALID_CONTEXT, CUDA_ERROR_INVALID_VALUE 
 Note: This function may also return error codes from previous asynchronous launches.
 See also: cuArray3DCreate, cuArray3DGetDescriptor, cuArrayCreate, cuArrayDestroy, cuArrayGetDescriptor, cuMemAlloc, cuMemAllocHost, cuMemAllocPitch, cuMemcpy2D, cuMemcpy2DAsync, cuMemcpy2DUnaligned, cuMemcpy3D, cuMemcpy3DAsync, cuMemcpyAtoA, cuMemcpyAtoD, cuMemcpyAtoH, cuMemcpyAtoHAsync, cuMemcpyDtoA, cuMemcpyDtoD, cuMemcpyDtoDAsync, cuMemcpyDtoH, cuMemcpyDtoHAsync, cuMemcpyHtoA, cuMemcpyHtoAAsync, cuMemcpyHtoD, cuMemFree, cuMemFreeHost, cuMemGetAddressRange, cuMemGetInfo, cuMemHostAlloc, cuMemHostGetDevicePointer, cuMemsetD2D8, cuMemsetD2D8Async, cuMemsetD2D16, cuMemsetD2D16Async, cuMemsetD2D32, cuMemsetD2D32Async, cuMemsetD8, cuMemsetD8Async, cuMemsetD16, cuMemsetD16Async, cuMemsetD32, cuMemsetD32Async 
 
     |