gpu

Makefile.fermi
Makefile.lammps.mac_ocl
Makefile.lammps.mingw-cross
Makefile.lammps.opencl
Makefile.lammps.standard
Makefile.lens
Makefile.lincoln
Makefile.linux
Makefile.linux.double
Makefile.linux.mixed
Makefile.linux_opencl
Makefile.longhorn
Makefile.mac
Makefile.mac_opencl
Makefile.mingw32-cross
Makefile.mingw32-cross-mpi
Makefile.mingw64-cross
Makefile.mingw64-cross-mpi
Makefile.serial
Makefile.serial_opencl
Makefile.xk7
Nvidia.makefile
Opencl.makefile
README
cudpp_mini/
geryon/
lal_answer.cpp
lal_answer.h
lal_atom.cpp
lal_atom.cu
lal_atom.h
lal_aux_fun1.h
lal_balance.h
lal_base_atomic.cpp
lal_base_atomic.h
lal_base_charge.cpp
lal_base_charge.h
lal_base_dipole.cpp
lal_base_dipole.h
lal_base_ellipsoid.cpp
lal_base_ellipsoid.h
lal_base_three.cpp
lal_base_three.h
lal_beck.cpp
lal_beck.h
lal_beck_ext.cpp
lal_born.cpp
lal_born.cu
lal_born.h
lal_born_coul_long.cpp
lal_born_coul_long.cu
lal_born_coul_long.h
lal_born_coul_long_ext.cpp
lal_born_coul_wolf.cpp
lal_born_coul_wolf.cu
lal_born_coul_wolf.h
lal_born_coul_wolf_ext.cpp
lal_born_ext.cpp
lal_buck.cpp
lal_buck.cu
lal_buck.h
lal_buck_coul.cpp
lal_buck_coul.cu
lal_buck_coul.h
lal_buck_coul_ext.cpp
lal_buck_coul_long.cpp
lal_buck_coul_long.cu
lal_buck_coul_long.h
lal_buck_coul_long_ext.cpp
lal_buck_ext.cpp
lal_cg_cmm.cpp
lal_cg_cmm.cu
lal_cg_cmm.h
lal_cg_cmm_ext.cpp
lal_cg_cmm_long.cpp
lal_cg_cmm_long.cu
lal_cg_cmm_long.h
lal_cg_cmm_long_ext.cpp
lal_charmm_long.cpp
lal_charmm_long.cu
lal_charmm_long.h
lal_charmm_long_ext.cpp
lal_colloid.cpp
lal_colloid.cu
lal_colloid.h
lal_colloid_ext.cpp
lal_coul_dsf.cpp
lal_coul_dsf.cu
lal_coul_dsf.h
lal_coul_dsf_ext.cpp
lal_coul_long.cpp
lal_coul_long.cu
lal_coul_long.h
lal_coul_long_ext.cpp
lal_device.cpp
lal_device.cu
lal_device.h
lal_dipole_lj.cpp
lal_dipole_lj.cu
lal_dipole_lj.h

README

   LAMMPS ACCELERATOR LIBRARY
--------------------------------
   
     W. Michael Brown (ORNL)
      Trung Dac Nguyen (ORNL)
        Peng Wang (NVIDIA)
     Axel Kohlmeyer (Temple)
       Steve Plimpton (SNL)
      Inderaj Bains (NVIDIA)

This directory has source files to build a library that LAMMPS links against when using the GPU package.

This library must be built with a C++ compiler, before LAMMPS is built, so LAMMPS can link against it.

Build the library using one of the provided Makefile.* files or create your own, specific to your compiler and system. For example:

make -f Makefile.linux

When you are done building this library, two files should exist in this directory:

libgpu.a          the library LAMMPS will link against
Makefile.lammps   settings the LAMMPS Makefile will import

Makefile.lammps is created by the make command, by copying one of the Makefile.lammps.* files. See the EXTRAMAKE setting at the top of the Makefile.* files.

IMPORTANT: You must examine the final Makefile.lammps to ensure it is correct for your system, else the LAMMPS build will likely fail.

Makefile.lammps has settings for 3 variables:

user-gpu_SYSINC = leave blank for this package
user-gpu_SYSLIB = CUDA libraries needed by this package
user-gpu_SYSPATH = path(s) to where those libraries are
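As a quick sanity check after the build, the two output files and the three settings can be inspected from the shell. The following is only a sketch: the function name check_gpu_build is hypothetical, and the pattern accepts the variable names with or without the "user-" prefix, since the spelling in your Makefile.lammps may differ.

```shell
# Hypothetical helper (not part of the library): verify that the build
# produced libgpu.a and Makefile.lammps in the given directory, then print
# the three *_SYSINC / *_SYSLIB / *_SYSPATH settings for manual review.
check_gpu_build() {
    dir="${1:-.}"
    rc=0
    for f in libgpu.a Makefile.lammps; do
        if [ ! -f "$dir/$f" ]; then
            echo "missing: $dir/$f" >&2
            rc=1
        fi
    done
    if [ -f "$dir/Makefile.lammps" ]; then
        # Accept both "gpu_SYS..." and "user-gpu_SYS..." spellings.
        grep -E '^(user-)?gpu_SYS(INC|LIB|PATH)[[:space:]]*=' "$dir/Makefile.lammps" ||
            echo "no gpu_SYS* settings found in $dir/Makefile.lammps" >&2
    fi
    return "$rc"
}
```

Running check_gpu_build with no argument inspects the current directory. The printed values still need to be compared by eye against where the CUDA libraries actually live on your system.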

Because you have the CUDA compilers on your system, you should have the needed libraries. If the CUDA development tools were installed in the standard manner, the settings in the Makefile.lammps.standard file should work.

        GENERAL NOTES
--------------------------------

This library, libgpu.a, provides routines for GPU acceleration of certain LAMMPS styles and neighbor list builds. Compilation of this library requires installing the CUDA GPU driver and CUDA toolkit for your operating system. Installation of the CUDA SDK is not necessary.

In addition to the LAMMPS library, the binary nvc_get_devices will also be built. This can be used to query the names and properties of GPU devices on your system.

A Makefile for OpenCL compilation is provided, but support for OpenCL use is not currently provided by the developers.

Details of the implementation are provided in:

Brown, W.M., Wang, P., Plimpton, S.J., Tharrington, A.N. Implementing Molecular Dynamics on Hybrid High Performance Computers - Short Range Forces. Computer Physics Communications. 2011. 182: p. 898-911.

and

Brown, W.M., Kohlmeyer, A., Plimpton, S.J., Tharrington, A.N. Implementing Molecular Dynamics on Hybrid High Performance Computers - Particle-Particle Particle-Mesh. Computer Physics Communications. 2012. 183: p. 449-459.

and

Brown, W.M., Masako, Y. Implementing Molecular Dynamics on Hybrid High Performance Computers - Three-Body Potentials. Computer Physics Communications. In press.

NOTE: Installation of the CUDA SDK is not required.

Current styles supporting GPU acceleration:

 1  beck
 2  born/coul/long
 3  born/coul/wolf
 4  born
 5  buck/coul/cut
 6  buck/coul/long
 7  buck
 8  colloid
 9  coul/dsf
10  coul/long
11  eam/alloy
12  eam/fs
13  eam
14  eam/lj
15  gauss
16  gayberne
17  lj96/cut
18  lj/charmm/coul/long
19  lj/class2/coul/long
20  lj/class2
21  lj/cut/coul/cut
22  lj/cut/coul/debye
23  lj/cut/coul/dsf
24  lj/cut/coul/long
25  lj/cut/coul/msm
26  lj/cut/coul/wolf/fsw
27  lj/cut/dipole/cut
28  lj/cut
29  lj/cut/tgpu
30  lj/expand
31  lj/sdk/coul/long
32  cg/cmm/coul/long
33  lj/sdk
34  cg/cmm
35  lj/sf/dipole/sf
36  mie/cut
37  morse
38  resquared
39  soft
40  sw
41  table
42  yukawa/colloid
43  yukawa
44  pppm


   MULTIPLE LAMMPS PROCESSES
--------------------------------

Multiple LAMMPS MPI processes can share GPUs on the system, but a single MPI process cannot use more than one GPU. In many cases, the best performance will be obtained by running as many MPI processes as there are CPU cores available, provided that the number of MPI processes is an integer multiple of the number of GPUs being used. See the LAMMPS user manual for details on running with GPU acceleration.
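The rule of thumb above can be turned into a small calculation. This is a sketch only: the function name mpi_procs_for_gpus is hypothetical, and the result is a starting point for tuning rather than a guaranteed optimum.

```shell
# Hypothetical helper: largest MPI process count that does not exceed the
# number of CPU cores and is an integer multiple of the number of GPUs.
# Usage: mpi_procs_for_gpus <cores> <gpus>
mpi_procs_for_gpus() {
    cores="$1"
    gpus="$2"
    if [ "$gpus" -lt 1 ] || [ "$cores" -lt "$gpus" ]; then
        echo "need at least one GPU and at least one core per GPU" >&2
        return 1
    fi
    # Integer division rounds down, so this drops any leftover cores.
    echo $(( cores / gpus * gpus ))
}
```

For example, mpi_procs_for_gpus 16 3 prints 15, leaving one core idle so that each of the three GPUs is shared by five processes. The value can then be passed as the process count to your MPI launcher.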

  BUILDING AND PRECISION MODES
--------------------------------

To build, edit the CUDA_ARCH, CUDA_PRECISION, and CUDA_HOME variables in one of the Makefiles. CUDA_ARCH should be set based on the compute capability of your GPU. This can be verified by running the nvc_get_devices executable after the build is complete. Additionally, the GPU package must be installed and compiled for LAMMPS. This may require editing the gpu_SYSPATH variable in the LAMMPS makefile.

Please note that the GPU library accesses the CUDA driver library directly, so it needs to be linked not only to the CUDA runtime library (libcudart.so) that ships with the CUDA toolkit, but also with the CUDA driver library (libcuda.so) that ships with the Nvidia driver. If you are compiling LAMMPS on the head node of a GPU cluster, this library may not be installed, so you may need to copy it over from one of the compute nodes (best into this directory).
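To find out whether the driver library is present on the machine where you compile, a search along the following lines may help. It is a sketch under assumptions: the function name find_libcuda is hypothetical, and the directories to search are whatever you pass in, since the install location depends on your distribution and driver package.

```shell
# Hypothetical helper: print the first directory among the arguments that
# contains a libcuda.so* file; return non-zero if none of them does.
# Usage: find_libcuda <dir> [<dir> ...]
find_libcuda() {
    for dir in "$@"; do
        for lib in "$dir"/libcuda.so*; do
            # An unmatched glob stays literal, so test that the file exists.
            if [ -e "$lib" ]; then
                echo "$dir"
                return 0
            fi
        done
    done
    echo "libcuda.so not found in: $*" >&2
    return 1
}
```

A call such as find_libcuda /usr/lib64 /usr/lib . checks two example system directories (assumed here, adjust for your system) and then this directory, which is where a copy taken from a compute node would be placed.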

The gpu library supports 3 precision modes as determined by the CUDA_PRECISION variable:

CUDA_PRECISION = -D_SINGLE_SINGLE  # Single precision for all calculations
CUDA_PRECISION = -D_DOUBLE_DOUBLE  # Double precision for all calculations
CUDA_PRECISION = -D_SINGLE_DOUBLE  # Accumulation of forces, etc. in double
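Instead of opening an editor, the precision mode can be switched from the command line. The sketch below assumes the variable is spelled CUDA_PRECISION in your Makefile, as in the build instructions above; the function name set_cuda_precision is hypothetical.

```shell
# Hypothetical helper: rewrite the CUDA_PRECISION line of a Makefile.
# Usage: set_cuda_precision Makefile.linux -D_SINGLE_DOUBLE
set_cuda_precision() {
    makefile="$1"
    flag="$2"
    # Only the three modes listed in this README are accepted.
    case "$flag" in
        -D_SINGLE_SINGLE|-D_DOUBLE_DOUBLE|-D_SINGLE_DOUBLE) ;;
        *) echo "unknown precision flag: $flag" >&2; return 1 ;;
    esac
    # Assumption: the variable is named CUDA_PRECISION in this Makefile.
    if ! grep -q '^CUDA_PRECISION[[:space:]]*=' "$makefile"; then
        echo "no CUDA_PRECISION line in $makefile" >&2
        return 1
    fi
    # Write to a temporary file first; "sed -i" is not portable.
    sed "s|^CUDA_PRECISION[[:space:]]*=.*|CUDA_PRECISION = $flag|" "$makefile" > "$makefile.tmp" &&
        mv "$makefile.tmp" "$makefile"
}
```

After changing the mode, rebuild the library so that the new setting takes effect.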

NOTE: PPPM acceleration can only be run on GPUs with compute capability>=1.1. You will get the error "GPU library not compiled for this accelerator." when attempting to run PPPM on a GPU with compute capability 1.0.

NOTE: Double precision is only supported on certain GPUs (with compute capability>=1.3). If you compile the GPU library for a GPU with compute capability 1.1 or 1.2, then only single precision FFTs are supported, i.e. LAMMPS has to be compiled with -DFFT_SINGLE. For details on configuring FFT support in LAMMPS, see http://lammps.sandia.gov/doc/Section_start.html#2_2_4

NOTE: For graphics cards with compute capability>=1.3 (e.g. Tesla C1060), make sure that -arch=sm_13 is set on the CUDA_ARCH line.

NOTE: For newer graphics cards (a.k.a. "Fermi", e.g. Tesla C2050), make sure that either -arch=sm_20 or -arch=sm_21 is set on the CUDA_ARCH line, depending on hardware and CUDA toolkit version.
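The notes above relate the compute capability of a card to the CUDA_ARCH flag and to double precision support. A minimal sketch of that mapping follows. Both function names are hypothetical, and the flag is derived purely from the capability number, so whether your CUDA toolkit version accepts it still has to be checked.

```shell
# Hypothetical helper: turn a compute capability such as "2.0" into the
# matching flag for the CUDA_ARCH line, e.g. "-arch=sm_20".
cuda_arch_flag() {
    case "$1" in
        [0-9].[0-9]) echo "-arch=sm_$(echo "$1" | tr -d '.')" ;;
        *) echo "expected a compute capability like 1.3 or 2.0, got: $1" >&2
           return 1 ;;
    esac
}

# Hypothetical helper: succeed if the given compute capability is >= 1.3,
# the threshold this README gives for double precision support.
supports_double() {
    major="${1%%.*}"
    minor="${1#*.}"
    [ "$major" -gt 1 ] || { [ "$major" -eq 1 ] && [ "$minor" -ge 3 ]; }
}
```

For a Tesla C1060 (compute capability 1.3), cuda_arch_flag 1.3 prints -arch=sm_13 and supports_double 1.3 succeeds. For a card with compute capability 1.1, supports_double 1.1 fails, which matches the single precision restriction described above.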

NOTE: The gayberne/gpu pair style will only be installed if the ASPHERE package has been installed.

NOTE: The cg/cmm/gpu and cg/cmm/coul/long/gpu pair styles will only be installed if the USER-CG-CMM package has been installed.

NOTE: The lj/cut/coul/long/gpu, cg/cmm/coul/long/gpu, coul/long/gpu, lj/charmm/coul/long/gpu and pppm/gpu styles will only be installed if the KSPACE package has been installed.

    EXAMPLE BUILD PROCESS
--------------------------------

cd ~/lammps/lib/gpu
emacs Makefile.linux
make -f Makefile.linux
./nvc_get_devices
cd ../../src
emacs ./MAKE/Makefile.linux
make yes-asphere
make yes-kspace
make yes-gpu
make linux
