aosp12/external/FXdiv
hcl 7be3fd486c init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
..
bench init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
cmake init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
include init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
test init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
.gitignore init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
.travis.yml init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
Android.bp init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
BUILD.bazel init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
CMakeLists.txt init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
LICENSE init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
METADATA init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
MODULE_LICENSE_MIT init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
OWNERS init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
README.md init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
TEST_MAPPING init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
WORKSPACE init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
configure.py init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00
confu.yaml init from android-12.1.0_r8 2023-01-09 17:11:35 +08:00

README.md

FXdiv

MIT License Build Status

Header-only library for division via fixed-point multiplication by inverse

On modern CPUs and GPUs integer division is several times slower than multiplication. FXdiv implements an algorithm to replace an integer division with a multiplication and two shifts. This algorithm improves performance when an application performs repeated divisions by the same divisor.

Features

  • Integer division for uint32_t, uint64_t, and size_t
  • Header-only library, no installation or build required
  • Compatible with C99, C++, OpenCL, and CUDA
  • Uses platform-specific compiler intrinsics for optimal performance
  • Covered with unit tests and microbenchmarks

Example

#include <fxdiv.h>

/* Division of array by a constant: reference implementation */
void divide_array_c(size_t length, uint32_t array[], uint32_t divisor) {
  for (size_t i = 0; i < length; i++) {
    array[i] /= divisor;
  }
}

/* Division of array by a constant: implementation with FXdiv */
void divide_array_fxdiv(size_t length, uint32_t array[], uint32_t divisor) {
  const struct fxdiv_divisor_uint32_t precomputed_divisor =
    fxdiv_init_uint32_t(divisor);
  for (size_t i = 0; i < length; i++) {
    array[i] = fxdiv_quotient_uint32_t(array[i], precomputed_divisor);
  }
}

Status

Currently working features:

Platform uint32_t uint64_t size_t
x86-64 gcc Works Works Works
x86-64 clang Works Works Works
x86-64 MSVC Works Works Works
x86 gcc Works Works Works
x86 clang Works Works Works
x86 MSVC Works Works Works
ARMv7 gcc Works Works Works
ARMv7 clang Works Works Works
ARMv7 MSVC* Compiles Compiles Compiles
ARM64 gcc Works Works Works
ARM64 clang Works Works Works
ARM64 MSVC* Compiles Compiles Compiles
PPC64 gcc Works Works Works
WAsm clang Works Works Works
Asm.js clang Works Works Works
PNaCl clang Works Works Works
CUDA Untested Untested Untested
OpenCL Untested Untested Untested

*ARMv7 and ARM64 builds with MSVC are presumed to work, but were only verified to compile successfully

References

  • Granlund, Torbjörn, and Peter L. Montgomery. "Division by invariant integers using multiplication." In ACM SIGPLAN Notices, vol. 29, no. 6, pp. 61-72. ACM, 1994. Available: gmplib.org/~tege/divcnst-pldi94.pdf