Reply To: multi-block approach

#6711
hungryfish
Participant

The run itself works fine on a single NVIDIA A100.
It was compiled with the ‘gpu_only’ config, without MPI: CUDA_ARCH := 80, PARALLEL_MODE := NONE

[prepareGeometry] Prepare Geometry … OK
[prepareLattice] Prepare Lattice …
[prepareLattice] Prepare Lattice … OK
[main] starting simulation…

[Timer] step=7755; percent=1.10059; passedTime=458.053; remTime=41160.6; MLUPs=3508.08
[LatticeStatistics] step=7755; t=1.10059; uMax=1.67154; avEnergy=-nan; avRho=-nan
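
A quick way to check log lines like the two above for non-finite values is sketched below. It is only a minimal sketch, assuming the `[Tag] key=value; key=value` layout shown in this log; the helper names `parse_log_line` and `non_finite_keys` are made up for illustration and are not part of any library.

```python
import math


def parse_log_line(line):
    """Split '[Tag] key=value; key=value' into (tag, {key: float})."""
    tag, _, rest = line.partition("]")
    fields = {}
    for item in rest.split(";"):
        key, sep, value = item.partition("=")
        if sep:
            # float() also accepts 'nan' and '-nan', so NaN entries survive parsing
            fields[key.strip()] = float(value)
    return tag.lstrip("[").strip(), fields


def non_finite_keys(fields):
    """Return the sorted names of all fields that are NaN or infinite."""
    return sorted(k for k, v in fields.items() if not math.isfinite(v))


# The statistics line copied from the log above
stats_line = (
    "[LatticeStatistics] step=7755; t=1.10059; uMax=1.67154; "
    "avEnergy=-nan; avRho=-nan"
)
tag, stats = parse_log_line(stats_line)
print(tag, non_finite_keys(stats))
```

For the line above this flags avEnergy and avRho, which are both printed as -nan in the log.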

Wed Jun 29 17:04:26 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 515.48.07    Driver Version: 515.48.07    CUDA Version: 11.7     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100-SXM...  Off  | 00000000:0E:00.0 Off |                    0 |
| N/A   42C    P0   240W / 400W |  64895MiB / 81920MiB |    100%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
….
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A     23303      C   ./sphereFlow                    64893MiB |
+-----------------------------------------------------------------------------+

You said ‘the VTK Output logic is independent of the specific platform’, so I will keep searching for the cause.