summaryrefslogtreecommitdiff
path: root/kernel
AgeCommit message (Expand)AuthorFilesLines
2019-08-30Merge pull request #2243 from quickwritereader/developMartin Kroeker12-162/+217
2019-08-30 fix uninitialized variables iAbdelRauf2-4/+4
2019-08-30caxpy and cdot are using vec_vsx_ldAbdelRauf2-50/+69
2019-08-30cgemv using vec_vsx_ld instead of letting gcc to decideAbdelRauf2-79/+115
2019-08-29alignedAbdelRauf12-29/+29
2019-08-28Keep both PGI/SUN and default code paths to avoid breaking Clang/WIndowsMartin Kroeker1-3/+17
2019-08-28Make x86_64 zdot compile with PGI and Sun C againMartin Kroeker1-5/+7
2019-08-15Add multithreading support to the x86_64 zdot kernel (#2222)Martin Kroeker1-14/+72
2019-08-13Merge pull request #2216 from martin-frbg/issue2214Martin Kroeker1-2/+2
2019-08-13Fix unwanted case-sensitivity in x86 LSAME for (AMD) processors without CMOV Martin Kroeker1-2/+2
2019-08-09Merge pull request #2206 from martin-frbg/zen-dtrmmMartin Kroeker1-14/+10
2019-08-09Merge pull request #2199 from martin-frbg/zen-dtrsmMartin Kroeker1-20/+16
2019-08-03Replace most vpermpd calls in the Haswell DTRSM_RN kernelMartin Kroeker1-20/+16
2019-07-28Replace vpermpd with vpermilpd in the Haswell DTRMM kernelMartin Kroeker1-14/+10
2019-07-28Merge pull request #2196 from wjc404/developMartin Kroeker1-9/+325
2019-07-28Add files via uploadwjc4041-9/+325
2019-07-23Merge pull request #2190 from martin-frbg/zdot-zenMartin Kroeker1-8/+16
2019-07-22Replace vpermpd with vpermilpdMartin Kroeker1-8/+16
2019-07-21Update dgemm_kernel_4x8_haswell.Swjc4041-7/+7
2019-07-21Update dgemm_kernel_4x8_haswell.Swjc4041-4/+4
2019-07-20Add files via uploadwjc4041-2/+2
2019-07-20Add files via uploadwjc4041-26/+19
2019-07-20Add files via uploadwjc4041-7/+43
2019-07-19Update dgemm_kernel_4x8_haswell.Swjc4041-2/+9
2019-07-19Add files via uploadwjc4041-1/+28
2019-07-17Update dgemm_kernel_4x8_haswell.Swjc4041-12/+12
2019-07-17Update dgemm_kernel_4x8_haswell.Swjc4041-5/+37
2019-07-17Update dgemm_kernel_4x8_haswell.Swjc4041-3/+4
2019-07-17Update dgemm_kernel_4x8_haswell.Swjc4041-1/+1
2019-07-17Update dgemm_kernel_4x8_haswell.Swjc4041-20/+20
2019-07-17Update dgemm_kernel_4x8_haswell.Swjc4041-12/+12
2019-07-17Update dgemm_kernel_4x8_haswell.S for zen2wjc4041-66/+54
2019-07-01Merge pull request #2172 from quickwritereader/developMartin Kroeker7-251/+6416
2019-07-01cgemm/ctrmm power9AbdelRauf5-4/+6132
2019-06-25Fix build on FreeBSD/powerpc64.Piotr Kubaj86-190/+190
2019-06-19Update dtrmm_kernel_16x4_power8.Skavanabhat1-2/+0
2019-06-17new sgemm 8x16AbdelRauf2-247/+284
2019-06-06Merge pull request #2153 from quickwritereader/developMartin Kroeker8-744/+4460
2019-06-05conflict resolveAbdelRauf1-1/+1
2019-06-05power9 zgemm ztrmm optimizedAbdelRauf4-1305/+2597
2019-06-04sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fi...AbdelRauf8-2416/+2062
2019-05-30improved zgemm power9 based on power8AbdelRauf6-22/+2800
2019-05-30Use generic kernels for complex (I)AMAX to support softfpMartin Kroeker1-8/+8
2019-05-30Ensure correct output for DAMAX with softfpMartin Kroeker1-1/+5
2019-05-29Separate implementations of AMAX and IAMAX on armMartin Kroeker2-12/+453
2019-05-09Replace ISMIN and ISAMIN kernels on all x86_64 platforms (#2125)Martin Kroeker2-52/+58
2019-05-05Merge pull request #2111 from martin-frbg/issue1955Martin Kroeker1-2/+2
2019-05-05Disable DGEMMINCOPY as well for nowMartin Kroeker1-1/+1
2019-05-04Disable the SkyLakeX DGEMMITCOPY kernel as wellMartin Kroeker1-1/+1
2019-05-02Merge pull request #2107 from quickwritereader/developMartin Kroeker6-4/+8251