Optimize `fmod` with a method using integer multiplication #1002

quaternic · 2025-08-01T00:39:01Z

This is a retry of #898. One of the problems there was that it would have added overhead and regressed performance for typical inputs.

Unlike that PR, this doesn't aim for sub-linear scaling; the cost of evaluating `fmod(x, y)` is still roughly proportional to `log2(|x/y|)`. However, the constant factor is much better. Running the random benchmarks locally, I got wall-time reductions of:

fmodf16:  -56.9%
fmodf:    -85.0%
fmod:     -95.4%
fmodf128: -98.7%
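
For readers unfamiliar with why the cost scales with `log2(|x/y|)`: after splitting both operands into an integer significand and an exponent, the remainder reduces to computing `(x << e) % y`, where `e` is the exponent difference. A minimal sketch of the textbook one-bit-per-iteration version follows. The name `shift_mod_bitwise` and the fixed `u64` width are assumptions for illustration only; this is neither the previous libm code nor the method this PR adds.

```rust
/// Illustrative baseline (hypothetical helper, not taken from libm):
/// computes (x << e) % y by shifting in one bit per iteration.
/// Assumes x < y and y < 2^63 so that doubling x cannot overflow.
fn shift_mod_bitwise(mut x: u64, e: u32, y: u64) -> u64 {
    assert!(y != 0 && y < (1u64 << 63) && x < y);
    for _ in 0..e {
        // Invariant: x < y, so 2 * x < 2 * y and one subtraction is enough.
        x <<= 1;
        if x >= y {
            x -= y;
        }
    }
    x
}
```

For example, `shift_mod_bitwise(5, 10, 7)` returns 3, since 5120 = 7 * 731 + 3. The loop runs `e` times, which is the linear scaling described above; a better constant factor means doing less work per bit of `e`.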

New utilities in libm::support:

- `trait NarrowingDiv` for dividing `u2N / uN` when the quotient fits in `uN`
- a reasonable implementation of that for up to `u256 / u128`
- `fn linear_mul_reduction<U>(x: U, mut e: u32, y: U) -> U` computes `(x << e) % y` with the new method
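
To make the idea of a narrowing division concrete, here is a small sketch for `u64 / u32`. The trait name `NarrowingDivSketch`, its method, and `shift_mod_chunked` are hypothetical; the PR's actual `NarrowingDiv` signature is not shown in this description and may differ. The reduction below uses a plain wide division per chunk, whereas `linear_mul_reduction` is described as using integer multiplication, so treat this only as an illustration of the interface, not of the new method.

```rust
/// Hypothetical stand-in for a narrowing division: divide a double-width
/// value by a single-width one when the quotient fits in single width.
trait NarrowingDivSketch: Sized {
    type Wide;
    /// Returns (quotient, remainder), or None if the quotient would not fit.
    fn checked_narrowing_div_rem(wide: Self::Wide, d: Self) -> Option<(Self, Self)>;
}

impl NarrowingDivSketch for u32 {
    type Wide = u64;
    fn checked_narrowing_div_rem(wide: u64, d: u32) -> Option<(u32, u32)> {
        // The quotient fits in u32 exactly when the high half is below d.
        if d == 0 || (wide >> 32) >= d as u64 {
            return None;
        }
        let d = d as u64;
        Some(((wide / d) as u32, (wide % d) as u32))
    }
}

/// Computes (x << e) % y by shifting in up to 32 bits per step.
/// Assumes x < y, which guarantees every quotient fits in u32.
fn shift_mod_chunked(mut x: u32, mut e: u32, y: u32) -> u32 {
    assert!(y != 0 && x < y);
    while e > 0 {
        let step = e.min(32);
        // x < y implies (x << step) < (y << 32), so the division narrows.
        let wide = (x as u64) << step;
        let (_q, r) = u32::checked_narrowing_div_rem(wide, y)
            .expect("quotient fits because x < y");
        x = r;
        e -= step;
    }
    x
}
```

Compared with a one-bit-per-iteration loop, this performs about `e / 32` divisions, so the cost is still linear in `e` but with a smaller constant.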

quaternic added 3 commits August 1, 2025 02:25

define and implement trait NarrowingDiv for unsigned integer division

fac05ed

include more basics in DInt

fd2aed8

optimize fmod performance

d749e82

quaternic force-pushed the fmod-reduce-opt branch from e795649 to d749e82 August 1, 2025 00:53
