- Feb 10, 2022
-
-
Szabolcs Nagy authored
This directory will contain optimized code that is maintained separately from existing routines with different requirements.
-
Szabolcs Nagy authored
Document contributor requirements.
-
Szabolcs Nagy authored
The outgoing license was MIT only. The new dual license allows using the code under Apache-2.0 WITH LLVM-exception license too.
-
- Feb 17, 2021
-
-
Szabolcs Nagy authored
* String routine changes * Added AArch64 ILP32 ABI support. * Fixed SVE strnlen return value. * Added MTE related __mtag_tag_region. * Added MTE related __mtag_tag_zero_region. * Minor code cleanups.
-
- Nov 16, 2020
-
-
Szabolcs Nagy authored
* New math routines * Scalar erff and erf using fma.
-
- Aug 14, 2020
-
-
Szabolcs Nagy authored
* Bug fixes * strcmp-mte nul check * strncmp-mte with large size * arm memcpy with large size (CVE-2020-6096) * String routines performance improvements * strlen * memmove with backward copy * Benchmarking code for strings and memory routines * strlen
-
- May 29, 2020
-
-
Szabolcs Nagy authored
* New functionality (64-bit Arm) * string: Optimized MTE variants of strlen, strnlen, strchr, strchrnul, strrchr, memchr, memrchr, strcpy, stpcpy, strcmp, strncmp * string: Changes to support BTI * string: New optimized memrchr, strnlen * Performance improvements (Neoverse N1) * strchr/strchrnul: 21% improvement on long strings * strrchr: 11% improvement * strnlen: 130% improvement on long strings, 50% on short strings * Benchmark and tests * string: New memcpy benchmark * string: Cleanup testsuite and improve test coverage
-
- Feb 28, 2020
-
-
Szabolcs Nagy authored
New functionality * string: New strrchr and stpcpy routines * string: New Memory Tagging Extension (MTE) variants of strlen and strchr * math: New vector version of pow(double) * networking: Optimized ones' complement checksum for 32-bit and 64-bit Arm Performance improvements * string: Improved memcpy and memmove (SIMD and non-SIMD) for 64-bit Arm * string: Improved memset for 64-bit Arm
-
- Feb 27, 2020
-
-
Ola Liljedahl authored
Add scalar and NEON ones' complement checksumming implementations for AArch64 and Armv7-A.
-
- Jan 03, 2020
-
-
Szabolcs Nagy authored
This Assignment Agreement has to be filled in, signed and sent to optimized-routines-assignment@arm.com by Contributors before their contributions can be accepted into optimized-routines.
-
- Nov 27, 2019
-
-
Szabolcs Nagy authored
Mention releases.
-
- Oct 08, 2019
-
-
Szabolcs Nagy authored
-
Szabolcs Nagy authored
-
- Jul 22, 2019
-
-
Szabolcs Nagy authored
The string directory is for optimized string functions, currently it is just a skeleton with dummy code.
-
Szabolcs Nagy authored
To allow subprojects other than math, the build system and directory layout is changed: all math related code, tools and tests are under the math directory now, new subprojects should be similarly self- contained. The top level Makefile design is still kept, but the subproject build directories provide their own Dir.mk with the build rules for the subproject. The user interface of config.mk is kept for now, in the future subproject specific flags and make variables may be added for finer grained control.
-
- Jul 18, 2019
-
-
Szabolcs Nagy authored
math/single contained code for systems without double precision fpu and rem_pio2 is not used currently and likely will be designed differently when double precision trigonometric functions are added.
-
Szabolcs Nagy authored
Scripts are known to work with sollya 6.0.
-
- Nov 22, 2018
-
-
Szabolcs Nagy authored
-
- May 16, 2018
-
-
Szabolcs Nagy authored
Add a single makefile such that it will rarely need modifications with regular source code changes. The usual configure step for build environment detection is done manually by editing config.mk, it is expected to be simple make variable changes (mostly CFLAGS), later a simple configure script can be added to generate config.mk if necessary. Update the README.
-
- Nov 19, 2015
-
-
George Lander authored
-