Re: Use of a Nvidia K40 in OLB
Due to recent bot attacks we have changed the sign-up process. If you want to participate in our forum, first register on this website and then send a message via our contact form.
› Forums › Lattice Boltzmann Methods › General Topics › Use of a Nvidia K40 in OLB › Re: Use of a Nvidia K40 in OLB
September 5, 2018 at 4:22 pm
#2925
Markus Mohrhard
Participant
Hey Laurent,
most likely you have 2 E5-2620v4 which are 8 core/16 thread CPUs. In total you will see 32 virtual cores but there are only 16 physical cores, so the best performance will be reached with 16 MPI jobs. With anything above 16 you have additional context switches that take time and limit the available cache per core.
For additional information about that I recommend to read about Simultaneous multithreading (SMT) or Hyper-threading (HT).
Regards,
Markus
