diff options
author | Martin Kroeker <martin@ruby.chemie.uni-freiburg.de> | 2019-08-11 23:23:27 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2019-08-11 23:23:27 +0200 |
commit | 5f36f18148603facb6c3540e673610d6b24cbfbb (patch) | |
tree | e3928c979c5f1acaaf3faf1acdaf546f27270164 | |
parent | d47fe78b0e5453dea32741e7cf14f2be10387ba8 (diff) | |
download | openblas-5f36f18148603facb6c3540e673610d6b24cbfbb.tar.gz openblas-5f36f18148603facb6c3540e673610d6b24cbfbb.tar.bz2 openblas-5f36f18148603facb6c3540e673610d6b24cbfbb.zip |
Update with 0.3.7 changesv0.3.7upstream/0.3.7
-rw-r--r-- | Changelog.txt | 43 |
1 files changed, 43 insertions, 0 deletions
diff --git a/Changelog.txt b/Changelog.txt index 8df35d5c3..205ca02e2 100644 --- a/Changelog.txt +++ b/Changelog.txt @@ -1,5 +1,48 @@ OpenBLAS ChangeLog ==================================================================== +Version 0.3.7 +11-Aug 2019 + +common: + * having the gmake special variables TARGET_ARCH or TARGET_MACH + defined no longer causes build failures in ctest or utest + * defining NO_AFFINITY or USE_TLS to 0 in gmake builds no longer + has the same effect as setting them to 1 + * a new test program was added to allow checking the library for + thread safety + * a new option USE_LOCKING was added to ensure thread safety when + OpenBLAS itself is built without multithreading but will be + called from multiple threads. + * a build failure on Linux with glibc versions earlier than 2.5 + was fixed + * a runtime error with CPU enumeration (and NO_AFFINITY not set) + on glibc 2.6 was fixed + * NO_AFFINITY was added to the CMAKE options (and defaults to being + active on Linux, as in the gmake builds) + +x86_64: + * the build-time logic for detection of AVX512 availability in + the processor and compiler was fixed + * gmake builds on OSX now set the internal name of the library to + libopenblas.0.dylib (consistent with CMAKE) + * the Haswell DGEMM kernel received a significant speedup through + improved prefetch and load instructions + * performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly + increased by avoiding vpermpd instructions + * the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled + to fix remaining errors in DGEMM, DSYMM and DTRMM + +## POWER: + * added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970 + * added optimized kernels for POWER9 single and double precision complex BLAS3 + * added optimized kernels for POWER9 SGEMM and STRMM + +## ARMV7: + * fixed the softfp implementations of xAMAX and IxAMAX + * removed the predefined -march= flags on both ARMV5 and ARMV6 as + they were appropriate for only a subset of platforms + +==================================================================== Version 0.3.6 29-Apr-2019 |