summaryrefslogtreecommitdiff
path: root/kernel
AgeCommit message (Expand)AuthorFilesLines
2022-01-18Merge pull request #3492 from binebrank/arm_sve_zgemmMartin Kroeker32-104/+7903
2022-01-18update armv8sve + contributorsBine Brank1-21/+32
2022-01-17adapt CMakeBine Brank1-16/+40
2022-01-16adapt Makefile for SVE trsmBine Brank1-0/+128
2022-01-16fix ztrsm lt/ut copyBine Brank2-2/+2
2022-01-15add sve ztrsmBine Brank13-27/+542
2022-01-15fix sve dtrsm kernelsBine Brank7-80/+79
2022-01-11add remaining sve trsm copy kernelsBine Brank3-0/+341
2022-01-10trsm_lncopy_sveBine Brank1-0/+114
2022-01-10sve trsmRN and trsmRTBine Brank3-0/+603
2022-01-09add trsm_kernel_LT_sveBine Brank2-4/+307
2022-01-09sve trsm_kernel_LNBine Brank1-0/+301
2022-01-09Merge pull request #3508 from snadampal/v1_n2Martin Kroeker2-0/+378
2022-01-07OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specificsSunita Nadampalli2-0/+378
2022-01-06fix makefile.L3Bine Brank1-6/+6
2022-01-05combine zchemm into single fileBine Brank6-219/+135
2022-01-05adapt CMake for SVEBine Brank1-12/+38
2022-01-05sve copy functions for cgemm chemm zsymmBine Brank6-0/+669
2022-01-05add cgemm ctrmm sve kernelsBine Brank2-0/+1880
2022-01-05modify sve zgemmcopy kernelsBine Brank2-4/+1
2022-01-05update configuration of kernels for A64FX and ARMV8SVEBine Brank2-24/+59
2022-01-05configure Makefile for sveBine Brank1-6/+78
2022-01-04fix sve ztrmm kernelBine Brank1-4/+4
2022-01-04ztrmm sve copy functionsBine Brank4-26/+26
2022-01-03add sve zhemm copy routinesBine Brank3-2/+215
2022-01-02add sve ztrmmBine Brank3-6/+1044
2021-12-30ztrmm sve copy kernelsBine Brank4-0/+574
2021-12-29fix zgemm kernelBine Brank3-34/+29
2021-12-26zgemm sve copy routinesBine Brank2-0/+157
2021-12-26sve zgemm kernelBine Brank1-411/+131
2021-12-25added macros for sve zgemm kernelsBine Brank1-0/+1159
2021-12-24fix function typecastMartin Kroeker1-1/+1
2021-12-24fix function typecastMartin Kroeker1-1/+1
2021-12-21fix function typecastsMartin Kroeker6-6/+6
2021-12-21prepare kernel for sve zgemmBine Brank1-8/+17
2021-12-21loongarch64: Optimize dgemm_kernelgxw6-1/+6172
2021-12-15fix bug in zscal functionWu Zhigang1-1/+22
2021-12-12Merge pull request #3475 from wjc404/optimize-A53-dgemmMartin Kroeker3-2/+900
2021-12-12Merge pull request #3474 from rafaelcfsousa/rafael/cmake_powerMartin Kroeker1-4/+4
2021-12-12optimize cgemm on ARM cortex A53 & cortex A55Jia-Chen3-2/+900
2021-12-11Merge pull request #3464 from binebrank/arm_sve_sgemmMartin Kroeker14-54/+3959
2021-12-11fix UNROLL_MN and add to targets for SVEBine Brank2-26/+23
2021-12-11adjust Makefile.L3 for SVEBine Brank1-0/+32
2021-12-09Fix error cmake (small kernels)Rafael Cardoso Fernandes Sousa1-4/+4
2021-12-06roll back DGEMM kernels to 4x8 when compiling for DYNAMIC_ARCHMartin Kroeker1-0/+6
2021-12-05sgemm v2x8 SVE kernelBine Brank1-0/+1683
2021-12-05strmm sve v1x8 kernelBine Brank1-0/+1008
2021-12-03Merge pull request #3466 from rafaelcfsousa/rafael/small_matrix_p10Martin Kroeker10-0/+9006
2021-12-03Merge pull request #3455 from cenewcombe/developMartin Kroeker1-2/+14
2021-11-29trmm sve copy fucntions for single precisionBine Brank4-6/+66