On craSHi in Kiel (a ``CRAY SV1 20-32768 SN9542'', now out of service) CO5BOLD could use all 4 processors per board. Documentation about the system and the compiler can be found with the CRAYdoc system. The new configure script still includes a branch for this system even if has never been tested on that machine.
In some cases the non-default versions of loops in the CO5BOLD code vectorize better and are preferred over the standard ones:
-F: Enable macro expansion
-Otask1: Parallelization: Enable tasking (in this case OpenMP).
-Oinline3: Optimization: enable (high level) of inlining.
-Ovector3 -Oscalar3: General optimization
-Drhd_hyd_roe1d_l01=1: Optimization: Choose non-standard set of routines for Roe solver. See Sect. 3.7.
-Drhd_hyd_entropyfix_p01=1: Optimization: version with masks (weights). See Sect. 3.7
-Drhd_hyd_upwind_p01=1: Optimization: version with masks (weights).
-Drhd_shortrad_operator_l01=2: Optimization: short characteristics operator with masks (weights).
OMP PARALLELstatement outside of outer loop in