![]() |
QMCPACK
|
Public Member Functions | |
Queue () | |
~Queue () | |
template<class DSC > | |
void | enqueueH2D (DSC &dataset, typename DSC::size_type size=0, typename DSC::size_type offset=0) |
template<class DSC > | |
void | enqueueD2H (DSC &dataset, typename DSC::size_type size=0, typename DSC::size_type offset=0) |
void | sync () |
cudaStream_t | getNative () |
Private Attributes | |
cudaStream_t | hstream_ |
Definition at line 25 of file QueueCUDA.hpp.
|
inline |
Definition at line 28 of file QueueCUDA.hpp.
References qmcplusplus::cudaErrorCheck(), and cudaStreamCreate.
|
inline |
Definition at line 30 of file QueueCUDA.hpp.
References qmcplusplus::cudaErrorCheck(), and cudaStreamDestroy.
|
inline |
Definition at line 53 of file QueueCUDA.hpp.
References qmcplusplus::cudaErrorCheck(), cudaMemcpyAsync, and cudaMemcpyDeviceToHost.
|
inline |
Definition at line 34 of file QueueCUDA.hpp.
References qmcplusplus::cudaErrorCheck(), cudaMemcpyAsync, and cudaMemcpyHostToDevice.
|
inline |
Definition at line 73 of file QueueCUDA.hpp.
Referenced by DelayedUpdateCUDA< T, T_FP >::getInvRow(), DelayedUpdateCUDA< T, T_FP >::initializeInv(), and DelayedUpdateCUDA< T, T_FP >::updateInvMat().
|
inline |
Definition at line 71 of file QueueCUDA.hpp.
References qmcplusplus::cudaErrorCheck(), and cudaStreamSynchronize.
Referenced by DelayedUpdateCUDA< T, T_FP >::getInvRow(), DelayedUpdateCUDA< T, T_FP >::initializeInv(), and DelayedUpdateCUDA< T, T_FP >::updateInvMat().
|
private |
Definition at line 76 of file QueueCUDA.hpp.