- Queries
- All Stories
- Search
- Advanced Search
Feed Advanced Search
Advanced Search
Advanced Search
Jul 8 2019
Jul 8 2019
adding message when exiting
fourestey committed R1448:c089a2e48757: Mixed precision Grid Gradient Benchmark (authored by fourestey).
Mixed precision Grid Gradient Benchmark
mpi check functions
bug correction
bug correction
improved makefiles
fourestey committed R1448:2516410a2d5f: casting literals explicitely to prevent compilers to do it for us (badly) (authored by fourestey).
casting literals explicitely to prevent compilers to do it for us (badly)
update for merge
removing debugging comments
removing unecessary output
fourestey committed R1448:a2b6347c33d7: adding a counter to prevent comms when no images are found (authored by fourestey).
adding a counter to prevent comms when no images are found
fourestey committed R1448:3ffdb6e24087: renaming of the executable to match the benchmark (authored by fourestey).
renaming of the executable to match the benchmark
fourestey committed R1448:becfb19ab17f: reworked cleaner interface for gradient and grid gradient computations (authored by fourestey).
reworked cleaner interface for gradient and grid gradient computations
removing unecessary code + cleanup
fourestey committed R1448:0e63f280cf6f: update to match the new interface names (authored by fourestey).
update to match the new interface names
adding MPI support
adding MPI and GPU flags
small updates
update for intel version 17
1 rcp instead of 2 divisions
fourestey committed R1448:91fab2d45d91: update with a working MPI implementation (authored by fourestey).
update with a working MPI implementation
interface update update
inserting MPI
fourestey committed R1448:66a1b43eb6c1: removing absolute paths to inludes and libs (authored by fourestey).
removing absolute paths to inludes and libs
fourestey committed R1448:4903c7447257: further separation in the chi computation (authored by fourestey).
further separation in the chi computation
deleted default makefile
fourestey committed R1448:d682ff267726: separation between image localisation and chi computation (authored by fourestey).
separation between image localisation and chi computation
intel mpi loaded
changing type 5 to 1 for lenstool
version 5 inserted
fourestey committed R1448:c8eb608b6516: sepaating souce location and chi computation (authored by fourestey).
sepaating souce location and chi computation
inserting more tests
GPU-enable code makefile
removed default makefile
fourestey committed R1448:5d0a16f92e87: adding absolute path of the code for global variables (authored by fourestey).
adding absolute path of the code for global variables
fourestey committed R1448:6180e5151f85: GPU makefile to separate CPU and GPU versions (authored by fourestey).
GPU makefile to separate CPU and GPU versions
update after the rebase
update and bug corrections
update after the rebase
update after the rebase
update after the rebase
update after the rebase
update after the rebase
update after the rebase
update after the rebase
debugged makefile
deleted benchmark
local chi implementation
new makefiles
fourestey committed R1448:831d6da385cd: adding vector off flag (which could be unecessary) (authored by fourestey).
adding vector off flag (which could be unecessary)
fourestey committed R1448:d6d2b4011753: updated to reflect the fact that the GPU version of the gradient moved to src (authored by fourestey).
updated to reflect the fact that the GPU version of the gradient moved to src
fourestey committed R1448:89f204a094f2: updated to reflect the fact that the GPU version of the gradient moved to src (authored by fourestey).
updated to reflect the fact that the GPU version of the gradient moved to src
adding GPU library
fourestey committed R1448:07f53260e0dd: a little bit of tabulation to make things clearer (authored by fourestey).
a little bit of tabulation to make things clearer
updated benchmark
fourestey committed R1448:7b180f048cf8: adding already computed rotations into the gradient computation (authored by fourestey).
adding already computed rotations into the gradient computation
fourestey committed R1448:db407492f42e: added vectorization off toggle flag (authored by fourestey).
added vectorization off toggle flag
fourestey committed R1448:5c50c73d37cf: adding already computed rotations into the gradient computation (authored by fourestey).
adding already computed rotations into the gradient computation
updated for precomputed rotations
fourestey committed R1448:466c49a3c59f: de-vectorized loop added to the gradient computation (authored by fourestey).
de-vectorized loop added to the gradient computation
fourestey committed R1448:c650d42d352c: adding vanilla lentool chi compuation (authored by fourestey).
adding vanilla lentool chi compuation
adding tabulation for readability
fourestey committed R1448:cb182c4c7de3: adding rotation using precomputed cosine/sine (authored by fourestey).
adding rotation using precomputed cosine/sine
bug correction
update before merge
fourestey committed R1448:e8c11a67ece1: new methods to make the link with lenstool (authored by fourestey).
new methods to make the link with lenstool
adding lenstool call
fourestey committed R1448:e46c653d833f: updated avx version to remove overloaded operators (authored by fourestey).
updated avx version to remove overloaded operators
removed cuda includes
fourestey committed R1448:5a248b99f551: updated avx version to remove overloaded operators (authored by fourestey).
updated avx version to remove overloaded operators
removed cuda includes
new shared memory version
fourestey committed R1448:1f4f7c1066b1: updated to match the env variables defined in the root dir (authored by fourestey).
updated to match the env variables defined in the root dir
modified debug output
fourestey committed R1448:3510c27c0389: update with fastest version so far (2x wrt CPU) (authored by fourestey).
update with fastest version so far (2x wrt CPU)
fourestey committed R1448:8ed9f86f57d0: adding debug infos (commented for now) (authored by fourestey).
adding debug infos (commented for now)
fourestey committed R1448:6e94ecce88f4: several new shared memory version implemented (authored by fourestey).
several new shared memory version implemented
fourestey committed R1448:8b3f01b939af: adding openmp version and rebranding the old version (authored by fourestey).
adding openmp version and rebranding the old version
fourestey committed R1448:136e775b25a7: updated version with check between CPU and GPU version work. the gradient… (authored by fourestey).
updated version with check between CPU and GPU version work. the gradient…
adding openmp version
added env variables to makefiles
updated version
added env variables to makefiles
added nvvp files
fourestey committed R1448:b1bb9a1a50ce: added cuo files into exclusion list (authored by fourestey).
added cuo files into exclusion list
Jun 27 2019
Jun 27 2019
fourestey committed R1448:dc6607b87355: new makefiles to support compilation with gcc (authored by fourestey).
new makefiles to support compilation with gcc
c4science · Help