Age | Commit message (Expand) | Author | Files | Lines |
2019-07-01 | Merge pull request #2172 from quickwritereader/develop | Martin Kroeker | 7 | -251/+6416 |
2019-07-01 | cgemm/ctrmm power9 | AbdelRauf | 5 | -4/+6132 |
2019-06-25 | Fix build on FreeBSD/powerpc64. | Piotr Kubaj | 86 | -190/+190 |
2019-06-19 | Update dtrmm_kernel_16x4_power8.S | kavanabhat | 1 | -2/+0 |
2019-06-17 | new sgemm 8x16 | AbdelRauf | 2 | -247/+284 |
2019-06-06 | Merge pull request #2153 from quickwritereader/develop | Martin Kroeker | 8 | -744/+4460 |
2019-06-05 | conflict resolve | AbdelRauf | 1 | -1/+1 |
2019-06-05 | power9 zgemm ztrmm optimized | AbdelRauf | 4 | -1305/+2597 |
2019-06-04 | sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fi... | AbdelRauf | 8 | -2416/+2062 |
2019-05-30 | improved zgemm power9 based on power8 | AbdelRauf | 6 | -22/+2800 |
2019-05-02 | Merge pull request #2107 from quickwritereader/develop | Martin Kroeker | 6 | -4/+8251 |
2019-05-01 | conflict resolve | AbdelRauf | 3 | -7/+7 |
2019-04-29 | Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into d... | AbdelRauf | 2 | -2/+2 |
2019-04-29 | sgemm/strmm | AbdelRauf | 4 | -3/+8250 |
2019-04-23 | Merge pull request #2072 from martin-frbg/sum | Martin Kroeker | 2 | -0/+898 |
2019-04-09 | Add in runtime CPU detection for POWER. | Rashmica Gupta | 2 | -32/+32 |
2019-03-30 | Add POWER implementation of ?sum | Martin Kroeker | 2 | -0/+898 |
2019-03-29 | Merge branch 'develop' into develop | Martin Kroeker | 2 | -2/+2 |
2019-03-29 | power9 makefile. dgemm based on power8 kernel with following changes : 32x un... | AbdelRauf | 28 | -27/+6063 |
2019-02-13 | Fix out-of-bounds memory access in gemm_beta | Martin Kroeker | 1 | -1/+1 |
2019-02-13 | Fix out-of-bounds memory access in gemm_beta | Martin Kroeker | 1 | -1/+1 |
2019-02-04 | Note for unused kernels | Ubuntu | 2 | -0/+13 |
2019-02-04 | NBMAX=4096 for gemvn, added sgemvn 8x8 for future | Ubuntu | 3 | -2/+509 |
2019-02-01 | sgemv cgemv pairs | Ubuntu | 10 | -32/+2691 |
2019-01-17 | crot fix | Ubuntu | 1 | -36/+54 |
2019-01-16 | Merge branch 'develop' into develop | Abdelrauf | 1 | -0/+3 |
2019-01-16 | Added missing Blas1 single fp {saxpy, caxpy, cdot, crot(refactored version of... | Ubuntu | 11 | -48/+1802 |
2018-05-23 | Use the new zrot.c on POWER8 for crot as well | Martin Kroeker | 1 | -1/+1 |
2018-04-23 | Use generic zrot.c on ppc64/POWER6 to work around utest failure from … (#1535) | Martin Kroeker | 1 | -0/+3 |
2018-03-27 | power8:Added initial zgemv_(t|n) ,i(d|z)amax,i(d|z)amin,dgemv_t(transposed),zrot | QWR QWR | 9 | -8/+4454 |
2018-02-18 | dgemm_ncopy_4_ save/restore | the mslm | 2 | -249/+160 |
2018-02-16 | power8 ?gemm_tcopy save/restore | the mslm | 7 | -495/+238 |
2017-11-28 | Add trivially optimized DSDOT for POWER8 | martin | 2 | -9/+47 |
2017-09-28 | Save and restore VSX registers | Martin Kroeker | 15 | -89/+884 |
2017-06-14 | Optimise sscal for POWER9 | Matt Brown | 1 | -40/+40 |
2017-06-14 | Optimise srot for POWER9 | Matt Brown | 1 | -32/+32 |
2017-06-14 | Optimise sdot for POWER9 | Matt Brown | 1 | -32/+32 |
2017-06-14 | Optimise sasum for POWER9 | Matt Brown | 1 | -16/+16 |
2017-06-14 | Optimise casum for POWER9 | Matt Brown | 1 | -16/+16 |
2017-06-14 | Optimise cswap for POWER9 | Matt Brown | 1 | -64/+64 |
2017-06-14 | Optimise sswap for POWER9 | Matt Brown | 1 | -32/+32 |
2017-06-14 | Optimise scopy for POWER9 | Matt Brown | 1 | -32/+32 |
2017-06-14 | Optimise ccopy for POWER9 | Matt Brown | 1 | -64/+64 |
2017-04-04 | Power8 inline assembly tweaks | Alan Modra | 3 | -44/+45 |
2017-02-13 | Power8 inline assembly fixes | Martin Kroeker | 38 | -3640/+3314 |
2016-09-29 | Remove explicit include of complex.h | Martin Kroeker | 1 | -1/+0 |
2016-08-18 | Refs #946. Use nrm2 reference implementation for Power8. | Zhang Xianyi | 1 | -4/+4 |
2016-08-18 | Refs #929. Deal with zero and NaNs for scale. | Zhang Xianyi | 2 | -1/+18 |
2016-05-23 | optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S | Werner Saar | 2 | -338/+371 |
2016-05-22 | optimized dtrsm_kernel_LT for POWER8 | Werner Saar | 2 | -2/+45 |