Graphpreparemerge
Graph
preparemerge
History Graph
History Graph
Commit | Author | Details | Committed | ||||
---|---|---|---|---|---|---|---|
e1f0e469bd87 | fourestey | 1 rcp instead of 2 divisions | Sep 14 2017 | ||||
9b2798179842 | fourestey | update for intel version 17 | Sep 14 2017 | ||||
b4d31d3a502e | fourestey | update with a working MPI implementation | Sep 14 2017 | ||||
36ce9340f7d5 | fourestey | inserting MPI | Sep 14 2017 | ||||
a282c3906a23 | Christoph Schaefer | Clean GPU Model 5 implementation | Sep 1 2017 | ||||
63a0f0e0a288 | Christoph Schaefer | Merge remote-tracking branch 'origin/lenstool_mpi' into cleanGPU | Sep 1 2017 | ||||
5628838bdd36 | fourestey | further separation in the chi computation | Sep 1 2017 | ||||
36a96a0ba5e2 | fourestey | removing absolute paths to inludes and libs | Sep 1 2017 | ||||
589813446d5b | fourestey | update | Sep 1 2017 | ||||
8411b11a7301 | fourestey | changing type 5 to 1 for lenstool | Sep 1 2017 | ||||
e32259c143e6 | fourestey | deleted default makefile | Sep 1 2017 | ||||
74c7b3d026c5 | fourestey | version 5 inserted | Sep 1 2017 | ||||
eb2ab0268c19 | fourestey | intel mpi loaded | Sep 1 2017 | ||||
590823430bc0 | Christoph Schaefer | working gradient GPU function table | Aug 31 2017 | ||||
7dfbda11fa1e | Christoph Schaefer | Finished float CPU and works, perferct results | Aug 30 2017 | ||||
9c98f3443238 | fourestey | separation between image localisation and chi computation | Aug 29 2017 | ||||
d573f83aa1c8 | fourestey | sepaating souce location and chi computation | Aug 26 2017 | ||||
ee0994fcf2a8 | Christoph Schaefer | finished version, weird result | Aug 25 2017 | ||||
0e0e0b5e42bd | fourestey | update | Aug 23 2017 | ||||
31683f6bfbc6 | fourestey | inserting more tests | Aug 23 2017 | ||||
c20d6dfe17cc | Christoph Schaefer | Merge branch 'master' of https://c4science.ch/diffusion/1448/lenstool-hpc | Aug 14 2017 | ||||
dc6cc7b15333 | Christoph Schaefer | found bug | Aug 14 2017 | ||||
75862f226826 | fourestey | removed default makefile | Aug 9 2017 | ||||
1e1277c0bb93 | fourestey | GPU makefile to separate CPU and GPU versions | Aug 9 2017 | ||||
0c72f5929fed | fourestey | GPU-enable code makefile | Aug 9 2017 | ||||
438781a4b01c | fourestey | adding absolute path of the code for global variables | Aug 9 2017 | ||||
fb407754ab97 | fourestey | update and bug corrections | Aug 9 2017 | ||||
5dd3b25e8004 | fourestey | update after the rebase | Aug 8 2017 | ||||
10cd8384e610 | fourestey | update after the rebase | Aug 8 2017 | ||||
7f19b7de35b6 | fourestey | update after the rebase | Aug 8 2017 | ||||
306c9b5d4bb3 | fourestey | update after the rebase | Aug 8 2017 | ||||
b9821734f043 | fourestey | update after the rebase | Aug 8 2017 | ||||
4056484a9127 | fourestey | update after the rebase | Aug 8 2017 | ||||
a86032cc53f4 | fourestey | update after the rebase | Aug 8 2017 | ||||
c611033d3d7c | fourestey | update after the rebase | Aug 8 2017 | ||||
05bb15a6c406 | fourestey | update | Aug 8 2017 | ||||
ffe90c1a9b9f | fourestey | deleted benchmark | Aug 8 2017 | ||||
8416c26a60c0 | fourestey | debugged makefile | Aug 8 2017 | ||||
a186ef05c834 | fourestey | local chi implementation | Aug 8 2017 | ||||
e28030db2fcc | fourestey | update | Aug 8 2017 | ||||
52d2417b3b1a | fourestey | new makefiles | Aug 8 2017 | ||||
80cf6f51f522 | fourestey | updated to reflect the fact that the GPU version of the gradient moved to src | Aug 8 2017 | ||||
f276cf92246e | fourestey | updated to reflect the fact that the GPU version of the gradient moved to src | Aug 8 2017 | ||||
76fff640d16a | fourestey | adding GPU library | Aug 8 2017 | ||||
cd0e99a7f312 | fourestey | a little bit of tabulation to make things clearer | Aug 8 2017 | ||||
8ba845f84bb6 | fourestey | adding vector off flag (which could be unecessary) | Aug 8 2017 | ||||
149cc04dccc0 | fourestey | updated benchmark | Aug 8 2017 | ||||
d993aacd4595 | fourestey | cleanup | Aug 8 2017 | ||||
b01eb5ff7652 | fourestey | de-vectorized loop added to the gradient computation | Aug 8 2017 | ||||
5d9a50fb8af5 | fourestey | adding already computed rotations into the gradient computation | Aug 8 2017 | ||||
9bcd8c2895be | fourestey | updated for precomputed rotations | Aug 8 2017 | ||||
52bbec23e4b6 | fourestey | adding already computed rotations into the gradient computation | Aug 8 2017 | ||||
7685f14978af | fourestey | added vectorization off toggle flag | Aug 8 2017 | ||||
223d0b03be9e | fourestey | adding tabulation for readability | Aug 8 2017 | ||||
87d47626fa76 | fourestey | adding rotation using precomputed cosine/sine | Aug 8 2017 | ||||
adcb913ab4db | fourestey | bug correction | Aug 8 2017 | ||||
f89d13ecb2b0 | fourestey | adding vanilla lentool chi compuation | Aug 8 2017 | ||||
e19a37232c4b | fourestey | update before merge | Aug 8 2017 | ||||
da6059698213 | fourestey | new methods to make the link with lenstool | Aug 8 2017 | ||||
63903aab0538 | fourestey | adding lenstool call | Aug 8 2017 | ||||
ce535ac958fc | fourestey | updated avx version to remove overloaded operators | Aug 8 2017 | ||||
de25f0af5356 | fourestey | removed cuda includes | Aug 8 2017 | ||||
c6d976421b0f | fourestey | updated avx version to remove overloaded operators | Aug 8 2017 | ||||
ce180071f842 | fourestey | removed cuda includes | Aug 8 2017 | ||||
05b9c0688629 | fourestey | update | Aug 8 2017 | ||||
c4391fd69959 | fourestey | update | Aug 8 2017 | ||||
bf74068b09fa | fourestey | new shared memory version | Aug 8 2017 | ||||
bf4340e5afcf | fourestey | update | Aug 8 2017 | ||||
89fa8d775e3a | fourestey | update | Aug 8 2017 | ||||
8f3e2b3ee56f | fourestey | update | Aug 8 2017 | ||||
8d95413ab0a3 | fourestey | updated to match the env variables defined in the root dir | Aug 8 2017 | ||||
6f1510a90976 | fourestey | modified debug output | Aug 8 2017 | ||||
d8d45487cc53 | fourestey | update with fastest version so far (2x wrt CPU) | Aug 8 2017 | ||||
5c3ce095294b | fourestey | several new shared memory version implemented | Aug 8 2017 | ||||
dee834c2d14c | fourestey | updated version with check between CPU and GPU version work. the gradient… | Aug 8 2017 | ||||
12a16ee988a0 | fourestey | adding openmp version and rebranding the old version | Aug 8 2017 | ||||
393b276d95f7 | fourestey | adding openmp version | Aug 8 2017 | ||||
5dc4d685136b | fourestey | adding debug infos (commented for now) | Aug 8 2017 | ||||
bb0b0ebecf62 | fourestey | updated version | Aug 8 2017 | ||||
545031b9d90c | fourestey | added env variables to makefiles | Aug 8 2017 | ||||
7701e1e8a276 | fourestey | added env variables to makefiles | Aug 8 2017 | ||||
0ca53974d7e0 | fourestey | added nvvp files | Aug 8 2017 | ||||
7305f1eaacc7 | fourestey | added cuo files into exclusion list | Aug 8 2017 | ||||
eba67e4791d5 | Christoph Schaefer | merge 3, problem with lenstool | Mar 8 2017 | ||||
c8a4b458969b | Christoph Schaefer | resolved merge | Feb 23 2017 | ||||
f41fbe45c73f | Christoph Schaefer | resolved merge | Feb 23 2017 | ||||
f1e9d2705e47 | Christoph Schaefer | updated Makefile with env variables | Feb 23 2017 | ||||
49db4dcbe135 | Christoph Schaefer | greina0 preparation | Feb 23 2017 | ||||
9df9e1a1fc10 | schaefer | played auround with chi | Feb 23 2017 | ||||
538f4071ca78 | schaefer | added gradient grid that handles multiple GPU usage 2 | Feb 21 2017 | ||||
b83c2d705661 | schaefer | added gradient grid that handles multiple GPU usage | Feb 21 2017 | ||||
8ed3ea71f869 | schaefer | few corrections, startend on multi GPU method | Feb 20 2017 | ||||
30af87e0951b | schaefer | relieved register pressure on GPUs by precalculating cos sin values | Feb 16 2017 | ||||
aba442ee867c | schaefer | added basic GPU version | Feb 16 2017 | ||||
e339389554c0 | schaefer | GridGradientBenchmark finished for CPU | Feb 15 2017 | ||||
15fae33691a4 | schaefer | finished cleaning up chi_CPU function, changed lens_SOA structure, implemented… | Jan 31 2017 | ||||
db232b68318b | schaefer | unsorted lens function + big benchmark = 219 Img | Jan 31 2017 | ||||
730c79dea1a8 | schaefer | starting GPU | Jan 30 2017 | ||||
53b8a1923fc7 | schaefer | integrating chi_CPU version into master | Jan 30 2017 | ||||
a6eebe218f9d | schaefer | integrating chi_CPU version into master | Jan 30 2017 |
c4science · Help