TEST
e7e5ba4e4688lammps-icms

TEST

Benchmarks #
in.intel.lj - Atomic fluid (LJ Benchmark)
in.intel.rhodo - Protein (Rhodopsin Benchmark)
in.intel.lc - Liquid Crystal w/ Gay-Berne potential
in.intel.sw - Silicon benchmark with Stillinger-Weber
in.intel.tersoff - Silicon benchmark with Tersoff
in.intel.water - Coarse-grain water benchmark using Stillinger-Weber # #################

For Haswell (Xeon v3) architectures, depending on the compiler version,
it may give better performance to compile for an AVX target (with -xAVX
compiler option) instead of -xHost or -xCORE-AVX2 for some of the
workloads. In most cases, FMA sensitive routines will still use AVX2
(MKL and SVML detect the processor at runtime). For Broadwell (Xeon v4)
architectures, -xCORE-AVX2 or -xHost will work best for all. #################

# Example for running benchmarks:

export LMP_CORES=28

export OMP_NUM_THREADS=2

export LMP_BIN=../../lmp_intel_cpu

export LMP_ROOT=../../../

source /opt/intel/parallel_studio_xe_2016.2.062/psxevars.sh export I_MPI_PIN_DOMAIN=core export I_MPI_FABRICS=shm # For single node

mpirun -np $LMP_CORES $LMP_BIN -in in.lc_generate_restart -log none

export bench=in.intel.lj

mpirun -np $LMP_CORES $LMP_BIN -in $bench -log none

mpirun -np $LMP_CORES $LMP_BIN -in $bench -log none -pk omp 0 -sf omp

mpirun -np $LMP_CORES $LMP_BIN -in $bench -log none -pk intel 0 -sf intel

To run with USER-INTEL and automatic load balancing to 1 coprocessor #################

mpirun -np $LMP_CORES $LMP_BIN -in $bench -log none -pk intel 1 -sf intel

If using PPPM (in.intel.rhodo) on Intel Xeon Phi x200 series processors #################

mpirun -np $LMP_CORES $LMP_BIN -in $bench -log none -pk intel 0 omp 3 lrt yes -sf intel