Improve numerical stability of batch normalization on CPUs #3113

reunanen · 2025-09-01T12:00:20Z

Problem: The CPU implementation of batch normalization can lose numerical precision when calculating the mean and variance (for example, when the data is not zero mean).

Solution: Use Welford's algorithm, which is numerically more stable.

Note that the CUDA implementation should already be doing this (or something similar at least), so this change should make the CPU implementation better match the CUDA one.
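For readers unfamiliar with the technique, here is a minimal sketch of Welford's single-pass update next to the textbook `E[x^2] - E[x]^2` formula. It is not the code from this PR: `running_moments`, `welford_variance` and `naive_variance` are hypothetical names chosen for illustration, and the sketch assumes the biased (population) variance over a flat list of `float` values rather than a tensor.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper for illustration only; not taken from the PR.
// Accumulates mean and variance in one pass using Welford's update.
struct running_moments
{
    std::size_t n = 0;
    float mean = 0;
    float m2 = 0;  // sum of squared deviations from the current mean

    void add(float x)
    {
        ++n;
        const float delta = x - mean;        // deviation from the old mean
        mean += delta / static_cast<float>(n);
        m2 += delta * (x - mean);            // uses the updated mean
    }

    // Biased (population) variance; an assumption made for this sketch.
    float variance() const
    {
        assert(n > 0);
        return m2 / static_cast<float>(n);
    }
};

float welford_variance(const std::vector<float>& values)
{
    running_moments rm;
    for (float x : values)
        rm.add(x);
    return rm.variance();
}

// Textbook single-pass formula E[x^2] - E[x]^2, shown for comparison.
// It subtracts two large, nearly equal numbers when the mean is far from zero.
float naive_variance(const std::vector<float>& values)
{
    assert(!values.empty());
    float sum = 0, sum_sqr = 0;
    for (float x : values)
    {
        sum += x;
        sum_sqr += x * x;
    }
    const float mean = sum / static_cast<float>(values.size());
    return sum_sqr / static_cast<float>(values.size()) - mean * mean;
}
```

As a usage example, the inputs `10000, 10001, 10002, 10003` have a true population variance of 1.25. With `float` accumulators the squares are on the order of 1e8, where single precision can no longer resolve differences of that size, so `naive_variance` is likely to be badly off, whereas `welford_variance` only ever works with small deviations from the running mean.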

reunanen · 2025-09-01T12:35:40Z

Note that the CI error is addressed separately in PR #3112.

reunanen added 2 commits September 1, 2025 10:51

Make test cases test_batch_normalize and `test_batch_normalize_conv` fail by introducing non-zero mean

3dd187c

Improve numerical stability of cpu::batch_normalize and `cpu::batch_normalize_conv` by computing variances using Welford's algorithm

25bcd14
