Age | Commit message | Author | Files | Lines
2017-07-01 | arm: add softfp support in kernel/arm/swap_vfp.S | Ashwin Sekhar T K | 2 | -5/+42
2017-07-01 | arm: add softfp support in kernel/arm/nrm2_vfp*.S | Ashwin Sekhar T K | 4 | -12/+24
2017-06-30 | arm: add softfp support in kernel/arm/*dot_vfp.S | Ashwin Sekhar T K | 5 | -24/+66
2017-06-30 | arm: add softfp support in kernel/arm/rot_vfp.S | Ashwin Sekhar T K | 2 | -6/+23
2017-06-30 | arm: add softfp support in kernel/arm/axpy_vfp.S | Ashwin Sekhar T K | 2 | -16/+65
2017-06-30 | arm: add softfp support in kernel/arm/asum_vfp.S | Ashwin Sekhar T K | 2 | -5/+13
2017-06-30 | arm: Use assembly implementations based on the ARM abi | Ashwin Sekhar T K | 2 | -177/+60
With the softfp ABI, assembly implementations are used only for those APIs that have no floating-point argument or return value. With the hard ABI, all assembly implementations are used.
2017-06-30 | generic: add some generic gemm and trmm kernels | Ashwin Sekhar T K | 3 | -0/+1416
Added generic 4x4 and 4x2 gemm kernels
Added generic 4x2 trmm kernel
2017-06-30 | arm: Determine the abi from compiler if not specified on command line | Ashwin Sekhar T K | 4 | -34/+34
If the ARM ABI is not explicitly specified on the command line, it is set to softfp or hard according to the compiler environment. This assumes that the compiler sets the defines __ARM_PCS and __ARM_PCS_VFP accordingly.
2017-06-28 | Add ReLAPACK to Makefiles | Martin Kroeker | 2 | -7/+18
2017-06-28 | Restore ReLAPACK test folder | Martin Kroeker | 19 | -0/+950
2017-06-28 | Add Elmar Peise's ReLAPACK | Martin Kroeker | 82 | -0/+20579
2017-06-23 | Add Microsoft Windows 10 UWP build support | Neil Shipp | 7 | -35/+73
2017-06-23 | Merge branch 'arm_soft_fp_abi' into develop | Zhang Xianyi | 9 | -17/+123
2017-06-23 | Merge pull request #1211 from neilsh-msft/develop | Zhang Xianyi | 6 | -22/+76
Add 64bit support for Microsoft Visual Studio
2017-06-22 | Reorder dependencies to allow in-place build to succeed the first time. | Neil Shipp | 1 | -15/+7
2017-06-22 | Avoid truncating cblas.h when compiling gencblas target | Neil Shipp | 1 | -1/+2
2017-06-21 | Revert changes to sed and awk | Neil Shipp | 1 | -14/+2
2017-06-21 | Add 64bit support for Microsoft Visual Studio | Neil Shipp | 6 | -19/+92
2017-06-14 | Optimise sscal for POWER9 | Matt Brown | 1 | -40/+40
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise srot for POWER9 | Matt Brown | 1 | -32/+32
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise sdot for POWER9 | Matt Brown | 1 | -32/+32
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise sasum for POWER9 | Matt Brown | 1 | -16/+16
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise casum for POWER9 | Matt Brown | 1 | -16/+16
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise cswap for POWER9 | Matt Brown | 1 | -64/+64
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise sswap for POWER9 | Matt Brown | 1 | -32/+32
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise scopy for POWER9 | Matt Brown | 1 | -32/+32
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-14 | Optimise ccopy for POWER9 | Matt Brown | 1 | -64/+64
Use lxvd2x instruction instead of lxvw4x. lxvd2x performs far better on the new POWER architecture than lxvw4x.
2017-06-01 | Fix installation of header files with cmake (#1186) | Martin Kroeker | 2 | -2/+79
* Fix installation of header files with cmake
  Install only the required header files, with openblas_config.h preprocessed like in Makefile.install
  Fixes #1184
* Update CMakeLists.txt
  Escape remaining semicolons in awk argument list (to get it working on Windows as well)
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
* Add files via upload
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
  see if it is the single quotes that cause the problem on windows
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
* Use C utility instead of awk for header generation in cmake builds
* Update CMakeLists.txt
* Fix generation and installation of header files
  Generate openblas_config.h and f77blas.h with same contents as in plain Makefile builds and install only the public header files
2017-06-01 | Merge pull request #1190 from oviradoi/utest_make_complex | Martin Kroeker | 1 | -33/+333
Update test to use openblas_make_complex_float and openblas_make_comp…
2017-05-30 | Update test to use openblas_make_complex_float and openblas_make_complex_double functions | Ovidiu Radoi | 1 | -33/+333
2017-05-28 | Merge pull request #1189 from pawosm-arm/flang | Martin Kroeker | 1 | -0/+4
build: Flang has the same interface as PGI
2017-05-27 | build: Flang has the same interface as PGI | Paul Osmialowski | 1 | -0/+4
Signed-off-by: Paul Osmialowski <pawel.osmialowski@arm.com>
2017-05-26 | Merge pull request #1188 from pawosm-arm/flang | Martin Kroeker | 4 | -3/+54
build: Flang compiler support
2017-05-25 | build: LLVM: Add Flang compiler support and enable OpenMP for Clang | Paul Osmialowski | 4 | -3/+54
Signed-off-by: Paul Osmialowski <pawel.osmialowski@arm.com>
2017-05-24 | Merge pull request #1187 from mine260309/develop | Zhang Xianyi | 2 | -3/+3
build: fix libxlmass errors building on Power CPU
2017-05-24 | build: fix libxlmass errors building on Power CPU | Lei YU | 2 | -3/+3
The IBM MASS library has been upgraded to 8.1.5, and 8.1.3 is not available. Update README.md and Makefile.power to use version 8.1.5 of libxlmass.
2017-05-10 | Merge pull request #1182 from martin-frbg/martin-frbg-patch-1 | Martin Kroeker | 1 | -2/+8
Build shared library on Android without SONAME versioning
2017-05-10 | Build shared library on Android without SONAME versioning | Martin Kroeker | 1 | -2/+8
Android does not support versioned SONAME entries, ref. #1173
2017-05-06 | Merge pull request #1178 from jcowgill/mips-fixes | Martin Kroeker | 2 | -31/+4
MIPS threading fixes
2017-05-06 | Merge pull request #1179 from jcowgill/memory-fixes | Martin Kroeker | 1 | -3/+3
Fixes to driver/others/memory.c
2017-05-05 | memory: Fix buffer overflow when position == NUM_BUFFERS | James Cowgill | 1 | -1/+1
2017-05-05 | mips: remove incorrect blas_lock implementations | James Cowgill | 2 | -27/+0
MIPS 32-bit currently has an empty blas_lock implementation, which is worse than nothing at all. MIPS 64-bit does have a blas_lock implementation, but it is broken. Remove them and fall back to the generic version in common.h, which should do the right thing on MIPS.
2017-05-05 | mips: implement MB and WMB | James Cowgill | 2 | -4/+4
The MIPS architecture has weak memory ordering and therefore requires suitable memory barriers when doing lock-free programming with multiple threads (just like ARM does). This commit implements those barriers for MIPS and MIPS64 using GCC builtins, which is probably the easiest way.
2017-05-05 | memory: switch loop condition around in blas_memory_free | James Cowgill | 1 | -2/+2
Before this commit, the "position < NUM_BUFFERS" loop condition in blas_memory_free was completely optimized away by GCC. This is because the condition can only be false after undefined behavior has already been invoked (reading past the end of an array). As a consequence of this bug, GCC also removed the subsequent if statement and all the code after the error label, because all of it was dead. This commit switches the loop condition around so that it works as intended.
2017-05-05 | Merge pull request #1175 from martin-frbg/lapack_143 | Martin Kroeker | 4 | -4/+16
Fix workspace computation in LAPACKE ?tpmqrt
2017-05-05 | Merge pull request #1176 from staticfloat/sf/dynamic_arch | Martin Kroeker | 2 | -1/+7
Fix DYNAMIC_ARCH=1 breaking builds on non-x86 platforms
2017-05-04 | Force `DYNAMIC_ARCH` to empty when `DYNAMIC_CORE` is not set | Elliot Saba | 1 | -1/+2
2017-05-04 | Add Makefile debugging trick so that we can inspect runtime Makefile variables | Elliot Saba | 1 | -0/+5
2017-05-04 | Fix workspace computation for side=L | Martin Kroeker | 1 | -1/+4
From netlib PR#144