Reproducibility of particle Gibbs is not guaranteed on x86 #2446

penelopeysm · 2024-12-19T12:30:28Z

Not for the first time, x86 CI gives us mysterious problems, in that some numerical tests fail on x86 despite all RNGs being thoroughly seeded (i.e. the results should be fully deterministic).

For example, using this runtests.jl on x64 and x86 GitHub runners will give different results at the end, see https://github.com/penelopeysm/Shaymin.jl/actions/runs/12412993388/job/34653933016

using Test
using StableRNGs
using Random
using Turing

Random.seed!(468)

@testset verbose = true "Shaymin.jl" begin
    @testset "using global seed" begin
        x1 = randn(3)
        @info x1
    end

    @testset "stablerng" begin
        x2 = randn(StableRNG(468), 3)
        @info x2
    end

    @testset "pg"  begin
        @model function f(y)
            a ~ Normal(0, 1)
            y ~ Normal(a, 1)
        end
        Random.seed!(468)
        alg = PG(15)
        chain = sample(StableRNG(468), f(1.5), alg, 50; progress=false)
        @show mean(chain[:a])
    end
end

# x64
[ Info: [0.07200886749732076, -0.0740437565595174, 0.6327762377562545]
[ Info: [1.2876157288026433, -0.2953479054222536, -1.205615981210787]
mean(chain[:a]) = 0.7475257036106626

# x86
[ Info: [0.07200886749732076, -0.0740437565595174, 0.6327762377562545]
[ Info: [1.2876157288026433, -0.2953479054222536, -1.205615981210787]
mean(chain[:a]) = 0.7973086809553678

Note that:

The results are deterministic if run repeatedly on the same architecture, so the problem isn't that the implementation doesn't use the provided rng;
The first two testsets with Random.randn(10) and rand(StableRNG(468), 10) are deterministic across architectures, so it's not a mistake in the implementation of the random number generator.

The text was updated successfully, but these errors were encountered:

penelopeysm · 2024-12-19T13:04:29Z

Is it related to this?

    @testset "advancedps" begin
        x4 = randn(AdvancedPS.TracedRNG(), 3)
        @info x4
    end

x64:

[ Info: [0.9001334534074001, -0.21170514711276572, 0.04622435546537583]

x86:

[ Info: [-0.29490566974498955, 0.02019167249744647, 1.7979388207251714]

TracedRNG isn't deterministic even on the same architecture, though, so that doesn't match up with previous observations.

yebai · 2024-12-19T16:44:45Z

TracedRNG isn't deterministic even on the same architecture, so that doesn't match up with previous observations.

We will consider transferring TracedRNG to Libtask in #2427 and improve it so it is reproducible across architectures. cc @willtebbutt

penelopeysm · 2024-12-20T16:27:56Z

I think we should keep this one open; we haven't figured out the root cause yet (#2449 only really plasters over it 😄 ).

penelopeysm · 2024-12-20T16:28:37Z

(I've been spending a little bit of time on narrowing it down, but haven't quite figured it out yet.)

penelopeysm added the tests label Dec 19, 2024

penelopeysm changed the title ~~Reproducibility of specific tests is not guaranteed on x86~~ Reproducibility of particle Gibbs is not guaranteed on x86 Dec 19, 2024

yebai mentioned this issue Dec 19, 2024

Significantly improve the Libtask library using ideas from Mooncake / ReverseDiff #2427

Open

penelopeysm mentioned this issue Dec 20, 2024

Increase atol on specific tests for x86 #2449

Merged

yebai closed this as completed in #2449 Dec 20, 2024

penelopeysm reopened this Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reproducibility of particle Gibbs is not guaranteed on x86 #2446

Reproducibility of particle Gibbs is not guaranteed on x86 #2446

penelopeysm commented Dec 19, 2024 •

edited

Loading

penelopeysm commented Dec 19, 2024 •

edited

Loading

yebai commented Dec 19, 2024

penelopeysm commented Dec 20, 2024

penelopeysm commented Dec 20, 2024

Reproducibility of particle Gibbs is not guaranteed on x86 #2446

Reproducibility of particle Gibbs is not guaranteed on x86 #2446

Comments

penelopeysm commented Dec 19, 2024 • edited Loading

penelopeysm commented Dec 19, 2024 • edited Loading

yebai commented Dec 19, 2024

penelopeysm commented Dec 20, 2024

penelopeysm commented Dec 20, 2024

penelopeysm commented Dec 19, 2024 •

edited

Loading

penelopeysm commented Dec 19, 2024 •

edited

Loading