Reply To: Turbulence in closed space
I have been able to run a quick benchmark using the nozzle3d example on OpenLB 1.4.
As you can see, the maximum speedup using OMP is about x7.7. Surprisingly, this maximum speedup with OMP is achieved using only 17 threads out of 144. adding more than 17 threads increases the runtime.
On the contrary, the maximum speedup using MPI if about x26 and is achieved using 72 processes (for 72 cores).
I hope this can useful for you!