Derived class of GPUSender that handles MPI calls that should NOT use GPU direct communication. The implementation moves data from the GPU to the CPU and then sends it using regular MPI.
#include <GpuOwnerOverlapCopy.hpp>
|
using | X = GpuVector<field_type> |
|
|
| GPUObliviousMPISender (const OwnerOverlapCopyCommunicationType &cpuOwnerOverlapCopy) |
|
void | copyOwnerToAll (const X &source, X &dest) const override |
| copyOwnerToAll copies source to the CPU, calls OwnerOverlapCopyCommunicationType::copyOwnerToAll on the copied data, and copies the result back to the GPU.
|
|
| GPUSender (const OwnerOverlapCopyCommunicationType &cpuOwnerOverlapCopy) |
|
void | project (X &x) const |
| project projects x onto the owned subspace.
|
|
void | dot (const X &x, const X &y, field_type &output) const |
| dot computes the dot product of x and y over the owned indices, then sums the result across MPI processes.
|
|
field_type | norm (const X &x) const |
| norm computes the l^2-norm of x across processes.
|
|
|
std::once_flag | m_initializedIndices |
|
std::unique_ptr< GpuVector< int > > | m_indicesOwner |
|
std::unique_ptr< GpuVector< int > > | m_indicesCopy |
|
const OwnerOverlapCopyCommunicationType & | m_cpuOwnerOverlapCopy |
|
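The reductions listed above (project, dot and norm) can be illustrated with a host-only sketch. This is not the library's implementation. The names `RankData`, `projectToOwned`, `ownedDot`, `globalDot` and `globalNorm` are hypothetical, plain `std::vector` stands in for `GpuVector`, the explicit index lists are an assumption about the kind of data that members such as `m_indicesOwner` and `m_indicesCopy` might hold, and the sum across MPI processes is simulated by looping over per-rank data rather than calling MPI. The sketch also assumes that projecting onto the owned subspace means zeroing the entries a rank does not own.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical per-rank data: local entries of x and y, plus the local
// indices that this rank owns (all other local entries are overlap copies).
struct RankData {
    std::vector<double> x;
    std::vector<double> y;
    std::vector<std::size_t> owned;
};

// Assumed meaning of "project": zero every entry listed as a copy,
// leaving only the entries owned by this rank.
inline void projectToOwned(std::vector<double>& x,
                           const std::vector<std::size_t>& copies)
{
    for (std::size_t i : copies) {
        assert(i < x.size());
        x[i] = 0.0;
    }
}

// Local part of the dot product. Only owned entries contribute, so an entry
// that is duplicated in the overlap is counted once in the global sum.
inline double ownedDot(const std::vector<double>& x,
                       const std::vector<double>& y,
                       const std::vector<std::size_t>& owned)
{
    assert(x.size() == y.size());
    double sum = 0.0;
    for (std::size_t i : owned) {
        assert(i < x.size());
        sum += x[i] * y[i];
    }
    return sum;
}

// Stand-in for the global reduction: a real run would sum the local
// results across MPI processes instead of looping over ranks.
inline double globalDot(const std::vector<RankData>& ranks)
{
    double total = 0.0;
    for (const auto& r : ranks) {
        total += ownedDot(r.x, r.y, r.owned);
    }
    return total;
}

// l^2-norm as the square root of the global dot product of x with itself.
inline double globalNorm(const std::vector<RankData>& ranks)
{
    double total = 0.0;
    for (const auto& r : ranks) {
        total += ownedDot(r.x, r.x, r.owned);
    }
    return std::sqrt(total);
}
```

Restricting the local sum to owned indices is what keeps the global result independent of how much overlap the partitioning introduces.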
template<class field_type, int block_size, class OwnerOverlapCopyCommunicationType>
class Opm::gpuistl::GPUObliviousMPISender< field_type, block_size, OwnerOverlapCopyCommunicationType >
Derived class of GPUSender that handles MPI calls that should NOT use GPU direct communication. The implementation moves data from the GPU to the CPU and then sends it using regular MPI.
- Template Parameters
-
field_type | is float or double |
block_size | is the block size of the block elements in the matrix |
OwnerOverlapCopyCommunicationType | is typically a Dune::LinearOperator::communication_type |
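The staging pattern described above (device to host, CPU communication, host to device) might look roughly like the following host-only sketch. Everything here is an assumption for illustration: `FakeDeviceVector`, `FakeCpuCommunication` and `stagedCopyOwnerToAll` are hypothetical names, no real GPU or MPI calls are made, and the fake communicator simply overwrites selected entries with preset values to mimic data arriving from the owning process.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in for a GPU vector. Storage is a plain std::vector;
// the copy methods mimic explicit device <-> host transfers.
class FakeDeviceVector {
public:
    explicit FakeDeviceVector(std::vector<double> data) : m_data(std::move(data)) {}

    std::vector<double> copyToHost() const { return m_data; }

    void copyFromHost(const std::vector<double>& host)
    {
        assert(host.size() == m_data.size());
        m_data = host;
    }

private:
    std::vector<double> m_data;
};

// Hypothetical CPU-side communicator. It pretends that the entries listed in
// copyIndices are received from their owners with the values in receivedValues.
struct FakeCpuCommunication {
    std::vector<std::size_t> copyIndices;
    std::vector<double> receivedValues;

    void copyOwnerToAll(const std::vector<double>& source, std::vector<double>& dest) const
    {
        assert(copyIndices.size() == receivedValues.size());
        dest = source;
        for (std::size_t k = 0; k < copyIndices.size(); ++k) {
            assert(copyIndices[k] < dest.size());
            dest[copyIndices[k]] = receivedValues[k];
        }
    }
};

// Sketch of the three-step staging: copy to host, communicate on the CPU,
// copy the result back to the (fake) device.
template <class CpuComm>
void stagedCopyOwnerToAll(const CpuComm& comm,
                          const FakeDeviceVector& source,
                          FakeDeviceVector& dest)
{
    const std::vector<double> hostSource = source.copyToHost(); // 1. device -> host
    std::vector<double> hostDest = dest.copyToHost();           //    host buffer for the result
    comm.copyOwnerToAll(hostSource, hostDest);                  // 2. regular CPU communication
    dest.copyFromHost(hostDest);                                // 3. host -> device
}
```

The cost of this approach is two extra transfers per call, which is the trade-off for not requiring GPU-aware MPI.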
◆ copyOwnerToAll()
template<class field_type , int block_size, class OwnerOverlapCopyCommunicationType >