USER-INTEL

Install.sh
README
TEST/
angle_charmm_intel.cpp
angle_charmm_intel.h
angle_harmonic_intel.cpp
angle_harmonic_intel.h
bond_fene_intel.cpp
bond_fene_intel.h
bond_harmonic_intel.cpp
bond_harmonic_intel.h
dihedral_charmm_intel.cpp
dihedral_charmm_intel.h
dihedral_harmonic_intel.cpp
dihedral_harmonic_intel.h
dihedral_opls_intel.cpp
dihedral_opls_intel.h
fix_intel.cpp
fix_intel.h
fix_nh_intel.cpp
fix_nh_intel.h
fix_npt_intel.cpp
fix_npt_intel.h
fix_nve_asphere_intel.cpp
fix_nve_asphere_intel.h
fix_nve_intel.cpp
fix_nve_intel.h
fix_nvt_intel.cpp
fix_nvt_intel.h
fix_nvt_sllod_intel.cpp
fix_nvt_sllod_intel.h
improper_cvff_intel.cpp
improper_cvff_intel.h
improper_harmonic_intel.cpp
improper_harmonic_intel.h
intel_buffers.cpp
intel_buffers.h
intel_intrinsics.h
intel_preprocess.h
intel_simd.h
math_extra_intel.h
nbin_intel.cpp
nbin_intel.h
npair_full_bin_intel.cpp
npair_full_bin_intel.h
npair_half_bin_newton_intel.cpp
npair_half_bin_newton_intel.h
npair_half_bin_newton_tri_intel.cpp
npair_half_bin_newton_tri_intel.h
npair_intel.cpp
npair_intel.h
pair_buck_coul_cut_intel.cpp
pair_buck_coul_cut_intel.h
pair_buck_coul_long_intel.cpp
pair_buck_coul_long_intel.h
pair_buck_intel.cpp
pair_buck_intel.h
pair_eam_intel.cpp
pair_eam_intel.h
pair_gayberne_intel.cpp
pair_gayberne_intel.h
pair_lj_charmm_coul_long_intel.cpp
pair_lj_charmm_coul_long_intel.h
pair_lj_cut_coul_long_intel.cpp
pair_lj_cut_coul_long_intel.h
pair_lj_cut_intel.cpp
pair_lj_cut_intel.h
pair_lj_long_coul_long_intel.cpp
pair_lj_long_coul_long_intel.h
pair_sw_intel.cpp
pair_sw_intel.h
pair_tersoff_intel.cpp
pair_tersoff_intel.h
pppm_disp_intel.cpp
pppm_disp_intel.h
pppm_intel.cpp
pppm_intel.h
verlet_lrt_intel.cpp
verlet_lrt_intel.h

README

                          LAMMPS Intel(R) Package
                     --------------------------------
                     
             W. Michael Brown (Intel) michael.w.brown at intel.com
                   William McDoniel (RWTH Aachen University)
                   Rodrigo Canales (RWTH Aachen University)
                  Markus Höhnerbach (RWTH Aachen University)
                           Stan Moore (Sandia)
                   Ahmed E. Ismail (RWTH Aachen University)
                   Paolo Bientinesi (RWTH Aachen University)
                          Anupama Kurpad (Intel)
                          Biswajit Mishra (Shell)

This package provides LAMMPS styles that:

   1. include support for single and mixed precision in addition to double
   2. include modifications to support vectorization for key routines
   3. include modifications for data layouts to improve cache efficiency
   4. include modifications to support offload to Intel(R) Xeon Phi(TM) coprocessors
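
As a rough illustration of the precision modes, mixed precision is taken here to mean doing the per-pair arithmetic in single precision while accumulating totals in double. The sketch below is not code from this package; the function sum_inverse_square and the template parameter names flt_t and acc_t are illustrative assumptions.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical kernel (not from the package): flt_t is the type used for the
// per-element arithmetic, acc_t is the type used for the running total.
template <class flt_t, class acc_t>
acc_t sum_inverse_square(const std::vector<double> &r) {
  acc_t total = 0;
  for (std::size_t i = 0; i < r.size(); ++i) {
    const flt_t ri = static_cast<flt_t>(r[i]);
    const flt_t term = flt_t(1) / (ri * ri);  // per-element work in flt_t
    total += static_cast<acc_t>(term);        // accumulation in acc_t
  }
  return total;
}
```

Instantiating with <float,float> corresponds to single precision, <float,double> to mixed precision, and <double,double> to double precision.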

For Intel server processors codenamed "Skylake", the following flags should be added or changed in the Makefile depending on the Intel compiler version:

2017 update 2         - No changes needed
2017 updates 3 or 4   - Use -xCOMMON-AVX512 and not -xHost or -xCORE-AVX512
2018 or newer         - Use -xHost or -xCORE-AVX512 and -qopt-zmm-usage=high
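
One way to check whether the chosen flags actually enabled AVX-512 code generation is to look at the compiler's predefined macros. The following is a minimal sketch, not part of this package; the function name targeted_vector_isa is hypothetical, while __AVX512F__, __AVX2__, __AVX__, and __SSE2__ are macros predefined by the Intel, GCC, and Clang compilers when the corresponding instruction sets are targeted.

```cpp
#include <string>

// Hypothetical helper: reports the widest x86 vector instruction set the
// compiler was told to target, based on its predefined macros.
std::string targeted_vector_isa() {
#if defined(__AVX512F__)
  return "AVX-512";
#elif defined(__AVX2__)
  return "AVX2";
#elif defined(__AVX__)
  return "AVX";
#elif defined(__SSE2__)
  return "SSE2";
#else
  return "scalar or unknown";
#endif
}
```

Compiling this with the same flags as the LAMMPS build and printing the result shows which instruction set the compiler was targeting.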

When using the suffix command with "intel", intel styles will be used if they exist. If the suffix command is used with "hybrid intel omp" and the USER-OMP package is installed, USER-OMP styles will be used whenever USER-INTEL styles are not available. This allows most styles in LAMMPS to run with threading.
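
The fallback order described above can be sketched as a lookup that tries each suffix in turn and otherwise keeps the plain style. This is an illustration only, not the LAMMPS implementation; the function resolve_style and the set of available style names passed to it are hypothetical.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical helper (not LAMMPS code): returns the first available style
// name, trying "base/suffix" for each suffix in order and falling back to
// the unsuffixed base style if none of them exists.
std::string resolve_style(const std::string &base,
                          const std::vector<std::string> &suffixes,
                          const std::set<std::string> &available) {
  for (const std::string &s : suffixes) {
    const std::string candidate = base + "/" + s;
    if (available.count(candidate)) return candidate;
  }
  return base;
}
```

With suffixes "intel" then "omp", a style that has an intel variant resolves to that variant, a style with only an omp variant resolves to the omp one, and any other style is left unchanged.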

The Long-Range Thread mode (LRT) in the Intel package currently uses pthreads by default. If pthreads are not supported in the build environment, the compile flag "-DLMP_INTEL_NOLRT" will disable the feature to allow for builds without pthreads. Alternatively, "-DLMP_INTEL_LRT11" can be used to build with compilers that support threads using the C++11 standard. When using LRT mode, you might need to disable OpenMP affinity settings (e.g. export KMP_AFFINITY=none). LAMMPS will generate a warning if the settings need to be changed.
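
The effect of the two compile flags can be summarized with a small preprocessor sketch. It is not taken from the package source; the function name lrt_backend is hypothetical, and the precedence of LMP_INTEL_NOLRT over LMP_INTEL_LRT11 when both are defined is an assumption.

```cpp
#include <string>

// Sketch only: pick a threading backend for LRT mode from the compile flags
// named in this README. Giving NOLRT precedence over LRT11 is an assumption.
#if defined(LMP_INTEL_NOLRT)
// LRT disabled: no threading header is needed.
#elif defined(LMP_INTEL_LRT11)
#include <thread>     // C++11 threads
#else
#include <pthread.h>  // default: POSIX threads
#endif

// Hypothetical helper that reports which backend was selected at build time.
std::string lrt_backend() {
#if defined(LMP_INTEL_NOLRT)
  return "disabled";
#elif defined(LMP_INTEL_LRT11)
  return "c++11";
#else
  return "pthreads";
#endif
}
```

Building with neither flag reports "pthreads", matching the default described above.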

In order to use offload to Intel(R) Xeon Phi(TM) coprocessors, the flag -DLMP_INTEL_OFFLOAD should be set in the Makefile. Offload requires the use of Intel compilers.

For portability reasons, vectorization directives are currently only enabled for Intel compilers. Using other compilers may result in significantly lower performance. This behavior can be changed by defining LMP_SIMD_COMPILER for the preprocessor (see intel_preprocess.h).
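
A minimal sketch of how a preprocessor symbol can gate vectorization directives is shown below. It does not reproduce intel_preprocess.h; the macro SKETCH_SIMD and the function axpy are hypothetical, and the OpenMP "omp simd" directive is used as a stand-in for whatever directives the package actually emits.

```cpp
#include <cstddef>

// Assumption: the directive is enabled only when LMP_SIMD_COMPILER is defined
// (for example by compiling with -DLMP_SIMD_COMPILER).
#if defined(LMP_SIMD_COMPILER)
#define SKETCH_SIMD _Pragma("omp simd")
#else
#define SKETCH_SIMD
#endif

// Computes y[i] += a * x[i] for i in [0, n).
void axpy(std::size_t n, double a, const double *x, double *y) {
  SKETCH_SIMD
  for (std::size_t i = 0; i < n; ++i) y[i] += a * x[i];
}
```

Without the define the loop is left to the compiler's automatic vectorizer. With it, the directive takes effect only if the compiler is also told to honor OpenMP SIMD pragmas (for example -fopenmp-simd with GCC or -qopenmp-simd with Intel compilers).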

By default, when running with offload to Intel(R) coprocessors, affinity for host MPI tasks and OpenMP threads is set automatically within the code. This currently requires the use of system calls. To disable at build time, compile with -DINTEL_OFFLOAD_NOAFFINITY.

Vector intrinsics are temporarily being used for the Stillinger-Weber potential to allow for advanced features in the AVX512 instruction set to be exploited on early hardware. We hope to see compiler improvements for AVX512 that will eliminate this requirement, so it is not recommended to develop code based on the intrinsics implementation. Please e-mail the authors for more details.
