May 16 2018
updated working version
updated working version
now includes .txt files
Mar 13 2018
fourestey committed R1448:7ea4c72140cd: adding new precision-templated structure (authored by fourestey).
fourestey committed R1448:5d605474b40d: adding new precision-templated structure (authored by fourestey).
fourestey committed R1448:32bb2bde9545: NR approximation of the inverse of the square root (authored by fourestey).
updated intel compiler
Mar 6 2018
GPU version
Mar 5 2018
update with pseudo-working version
added GPU makefile
sanitization
sanitization
update with intel compilers
removing ldg accesses
correct values
sanitization
sanitization
NR implementation of the inv sqrt
Dec 13 2017
updated version of the scripts
fourestey committed R1448:baaf7ff134d9: Merge branch 'master' of ssh://c4science.ch/diffusion/1448/lenstool-hpc (authored by fourestey).
fourestey committed R1448:c9055b6bd7eb: adding fast sqrt reciprocal for SIS (authored by fourestey).
adding message when exiting
fourestey committed R1448:13bc4a5c42be: Mixed precision Grid Gradient Benchmark (authored by fourestey).
adding Ofast
mpi check functions
improved makefiles
fourestey committed R1448:8812ace8a01e: casting literals explicitly to prevent compilers from doing it for us (badly) (authored by fourestey).
bug correction
bug correction
Sep 19 2017
removing debugging comments
fourestey committed R1448:bd665a40b39d: update to match the new interface names (authored by fourestey).
update for merge
fourestey committed R1448:551dc5107aee: renaming of the executable to match the benchmark (authored by fourestey).
fourestey committed R1448:710b03a2c887: adding a counter to prevent comms when no images are found (authored by fourestey).
adding MPI and GPU flags
fourestey committed R1448:6c430c0511f0: reworked cleaner interface for gradient and grid gradient computations (authored by fourestey).
removing unnecessary code + cleanup
removing unnecessary output
interface update
adding MPI support
small updates
1 rcp instead of 2 divisions
update for intel version 17
fourestey committed R1448:b4d31d3a502e: update with a working MPI implementation (authored by fourestey).
inserting MPI
Sep 1 2017
fourestey committed R1448:36a96a0ba5e2: removing absolute paths to includes and libs (authored by fourestey).
changing type 5 to 1 for lenstool
fourestey committed R1448:5628838bdd36: further separation in the chi computation (authored by fourestey).
version 5 inserted
deleted default makefile
intel mpi loaded
fourestey committed R1448:9c98f3443238: separation between image localisation and chi computation (authored by fourestey).
fourestey committed R1448:d573f83aa1c8: separating source location and chi computation (authored by fourestey).
Aug 23 2017
inserting more tests
Aug 9 2017
removed default makefile
fourestey committed R1448:1e1277c0bb93: GPU makefile to separate CPU and GPU versions (authored by fourestey).
fourestey committed R1448:438781a4b01c: adding absolute path of the code for global variables (authored by fourestey).
update and bug corrections
GPU-enabled code makefile
Aug 8 2017
update after the rebase
update after the rebase
update after the rebase
update after the rebase
update after the rebase
update after the rebase
update after the rebase
local chi implementation
debugged makefile
update after the rebase
deleted benchmark
new makefiles
fourestey committed R1448:80cf6f51f522: updated to reflect the fact that the GPU version of the gradient moved to src (authored by fourestey).
fourestey committed R1448:f276cf92246e: updated to reflect the fact that the GPU version of the gradient moved to src (authored by fourestey).
adding GPU library
fourestey committed R1448:8ba845f84bb6: adding vector off flag (which could be unnecessary) (authored by fourestey).
fourestey committed R1448:5d9a50fb8af5: adding already computed rotations into the gradient computation (authored by fourestey).
updated benchmark
fourestey committed R1448:cd0e99a7f312: a little bit of tabulation to make things clearer (authored by fourestey).
fourestey committed R1448:7685f14978af: added vectorization off toggle flag (authored by fourestey).
updated for precomputed rotations
adding tabulation for readability
fourestey committed R1448:52bbec23e4b6: adding already computed rotations into the gradient computation (authored by fourestey).
fourestey committed R1448:b01eb5ff7652: de-vectorized loop added to the gradient computation (authored by fourestey).
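Several entries in this log mention an "NR approximation of the inverse of the square root" and a "fast sqrt reciprocal". The sketch below illustrates the general Newton-Raphson technique those messages appear to refer to. It is not the repository's code: the function name `inv_sqrt_nr`, the exponent-based initial guess, and the default iteration count are all assumptions made for illustration.

```python
import math

def inv_sqrt_nr(x, iterations=8):
    """Approximate 1/sqrt(x) for x > 0 with Newton-Raphson iterations.

    Hypothetical helper for illustration only; not taken from lenstool-hpc.
    """
    if x <= 0.0:
        raise ValueError("x must be positive")
    # Initial guess from the binary exponent: x = m * 2**e with 0.5 <= m < 1,
    # so 2**(-(e // 2)) lies within a factor of sqrt(2) of the true value.
    _, e = math.frexp(x)
    y = math.ldexp(1.0, -(e // 2))
    for _ in range(iterations):
        # Newton step for f(y) = 1/y**2 - x; it needs only multiplies and
        # subtractions, with no division and no square root.
        y = y * (1.5 - 0.5 * x * y * y)
    return y

if __name__ == "__main__":
    for value in (0.1, 2.0, 9.0):
        print(value, inv_sqrt_nr(value), 1.0 / math.sqrt(value))
```

Each step roughly squares the relative error, so a better starting guess (for example a hardware reciprocal-square-root estimate) would likely need only one or two iterations. The eight iterations used here only compensate for the crude exponent-based guess.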