Feb 22 2017
added env variables to makefiles
added env variables to makefiles
environment files
added nvvp files
added env variables to makefiles
2000x2000 image by default
fourestey committed R1448:69b2e7ce07ce: added cuo files into exclusion list (authored by fourestey).
Jan 2 2017
calling the right version
fourestey committed R1448:a710e43288c0: lense type dispatcher enabled for hand coded avx version (authored by fourestey).
Jan 1 2017
fourestey committed R1448:5bdc6523dd48: inserting guard to allow linking with the lenstool library (authored by fourestey).
fourestey committed R1448:edb40afe60bf: added possibility to link with the lenstool library using the -D__WITH_LIBTOOL… (authored by fourestey).
Dec 30 2016
fourestey committed R1448:300a1434e86d: added improved version of lense derivative selection (authored by fourestey).
Dec 25 2016
fourestey committed R1448:59584d508582: added function to call the correct lense type gradient function correctly (authored by fourestey).
useful functions
Dec 22 2016
fourestey committed R1448:22e37e26a423: reverting to original configuration (authored by fourestey).
interface change update
fourestey committed R1448:feb189992011: interface cleanup + SIS implementation for SOA (authored by fourestey).
deleted binary file
interface change update
Dec 21 2016
fourestey committed R1448:43bf390e02a9: updated library + gradient benchmark with lense 81 auto-vectorized version (authored by fourestey).
clean repo
Dec 14 2016
fourestey committed R1448:3ef0946ab029: added chi benchmark + general updates (authored by fourestey).
Dec 13 2016
bug fix, precision test passed
fourestey committed R1448:cd86e73dbb27: gradient benchmark using jauzac input (authored by fourestey).
updated makefile to create library
Dec 12 2016
fourestey committed R1448:6003e8cd2da6: added sources for the different versions of the derivative (authored by fourestey).
Nov 29 2016
fourestey committed R1448:adf7e3bb3043: updated (ie working) AVX512F version (authored by fourestey).
fourestey committed R1448:930d2f468ef7: added toggle to remove benchmarks baselines (authored by fourestey).
fourestey committed R1448:d3ad35d03fb0: added fine level threading using openmp (authored by fourestey).
AVX512F (KNL) version added
fourestey committed R1448:d19ea01e5116: using native flag for the compilation architecture (authored by fourestey).
Nov 27 2016
fourestey committed R1448:d2d2f9a13cb1: adding the unix timestamp to the directory file (authored by fourestey).
removed binary file
Nov 16 2016
fourestey committed R1448:5bbd1066b045: jauzac benchmark implemented, avx and scalar version (3x for now). (authored by fourestey).
Nov 12 2016
fourestey committed R1448:372e20ceadd1: update with scalar and vector (avx) version (authored by fourestey).
Nov 7 2016
better output
fourestey committed R1448:c4dbca031d4d: major overhaul of the benchmark to include real-life data from jauzac paper (authored by fourestey).
removed unnecessary old file
setup header file
adding new jauzac benchmark
Nov 4 2016
fourestey committed R1448:c2f3e43b96e0: adding vector operators for the intel compilers to ensure backward compatibility (authored by fourestey).
Nov 2 2016
hand optimized simd math operators
fourestey committed R1448:91114be53829: added hand optimized inverse and sqrt operations + cleanup (authored by fourestey).
gradient file header
corrected executable name
higher precision in result
Oct 31 2016
fourestey committed R1448:182455bb75cd: vectorization update with RCP + Newton-Raphson method to increase the precision (authored by fourestey).
Oct 28 2016
fourestey committed R1448:871038e9d982: hand vectorized version added: 50% faster than the scalar one probably because… (authored by fourestey).
Oct 25 2016
new values
big test comparison
timer added
full loop expansion
Oct 24 2016
fourestey committed R1448:d1d407485259: added inlined first version of the kernel (authored by fourestey).
Oct 22 2016
fourestey committed R1448:5b30d8deeb61: modified makefile to take compiler into account + small update (authored by fourestey).
separated main and compute kernels
Sep 13 2016
Apr 15 2016
Mar 22 2016
authorship
Mar 16 2016
fourestey committed R31:cce22d66b421: overhaul: new approach with better register usage (authored by fourestey).
fourestey committed R31:32a5179769d1: authorship + overhaul: better use of the registers and better prefetchers (authored by fourestey).
added permuted 8x4 dgemm kernel
authorship + better prefetchers
new kernels update
Mar 15 2016
fourestey committed R31:1c2049842f9e: better prefetchers + use of more registers (authored by fourestey).
added unrolled 4x4 permutation code
adding .gitignore
admin awarded R31:b0279640793d: dgemm for neon - initial commit a Like token.
dgemm for neon - initial commit
Mar 14 2016