Conversation

arrjon
Member

@arrjon arrjon commented Sep 8, 2025

This pull request introduces compositional sampling support to the BayesFlow framework, enabling diffusion models to handle multiple compositional conditions efficiently. The main changes span the continuous approximator, diffusion model, and inference network modules, adding new methods and refactoring existing ones to support compositional structures in sampling, inference, and diffusion processes.

Larger changes include:

  • Added a new compositional_sample method to ContinuousApproximator, which generates samples with compositional structure and handles flattening, reshaping, and prior score computation for multiple compositional conditions. A supporting internal method, _compositional_sample, was also introduced.
  • In DiffusionModel, implemented compositional diffusion support including:
    • New compositional_bridge and compositional_velocity methods for compositional score calculation (see the sketch after this list).
    • _compute_individual_scores helper for handling multiple compositional conditions.
    • _inverse_compositional method for inverse compositional diffusion sampling.
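
For context, the compositional score behind these methods follows from factorizing a posterior over multiple independent conditions: p(theta | y_1..n) is proportional to p(theta)^(1 - n) * prod_i p(theta | y_i). Below is a minimal NumPy sketch of this identity (function names are hypothetical and do not match the PR's internals; in diffusion time the identity only holds approximately, which is what the density bridge accounts for):

import numpy as np

def compositional_score(individual_scores, prior_score, n_conditions):
    # individual_scores: shape (n_conditions, ...), holding grad log p(theta | y_i)
    # prior_score: grad log p(theta), with the trailing shape of a single score
    # Taking log-gradients of the factorization above gives:
    # (1 - n) * prior score + sum of the individual scores.
    return (1.0 - n_conditions) * prior_score + np.sum(individual_scores, axis=0)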

The idea is that the workflow now has a compositional_sample method, which expects conditions of shape (n_datasets, n_conditions, ...). We can then perform compositional sampling with diffusion models.
compositional_sample allows setting a mini_batch_size for memory-efficient computation of the compositional score. This does not work with the jax backend, however, since jax does not like stochasticity in its integrators that cannot be precomputed. We could support only fixed step sizes here, though?
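
To make the jax limitation concrete: with a mini_batch_size, every integration step draws a fresh random subset of conditions and rescales the partial sum, roughly as in the following sketch (names hypothetical; score_fn stands in for one evaluation of the network). It is this per-step randomness that cannot be precomputed for jax:

import numpy as np

def minibatch_condition_scores(theta, conditions, score_fn, mini_batch_size):
    n = len(conditions)
    # fresh random subset of conditions at every integration step
    idx = np.random.choice(n, size=mini_batch_size, replace=False)
    partial_sum = sum(score_fn(theta, conditions[i]) for i in idx)
    # rescale so the subsample is an unbiased estimate of the full sum over conditions
    return partial_sum * (n / mini_batch_size)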

To compute the compositional score we need access to the score of the prior. Here we need to handle the adapter carefully so that we compute the correct score. In the current draft, I am not sure I computed the prior score correctly. Some ideas would be great: currently it fails for jax because the adapter converts things to numpy back and forth, but it works for torch.
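
As a concrete example of what compute_prior_score has to provide, and why the adapter matters: for a standard normal prior the score is analytic, but if the adapter standardizes the parameters before they reach the network, the score has to be transformed by the change of variables. A sketch assuming an affine adapter transform (all names hypothetical):

import numpy as np

def prior_score(theta):
    # analytic score of a standard normal prior: grad_theta log N(theta; 0, I) = -theta
    return -theta

def prior_score_in_network_space(z, mu, sigma):
    # if the adapter standardizes, z = (theta - mu) / sigma, the change of
    # variables gives grad_z log p(z) = sigma * grad_theta log p(theta)
    theta = mu + sigma * z
    return sigma * prior_score(theta)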

@arrjon arrjon self-assigned this Sep 8, 2025
@arrjon arrjon requested a review from stefanradev93 September 8, 2025 15:13

@stefanradev93
Contributor

Hi Jonas, this is great! One general design question that I would like to discuss is whether to add the new capabilities to the existing classes or to inherit from the existing classes and add the new methods there, e.g., as in CompositionalApproximator, CompositionalDiffusionModel, etc. The latter has the advantage that the existing interfaces remain more compact, but it introduces the need for new classes. @vpratz @paul-buerkner Since the interface is already working well (except for JAX), I think it's a good time to discuss.

@paul-buerkner
Contributor

Where can I see examples of its use, and how would it alternatively look if the structure were different?

@arrjon
Member Author

arrjon commented Sep 16, 2025

So at the moment, the compositional part is only relevant during inference. You train a diffusion model, and then you can do the following:

# training_data.shape = (n_datasets, ...), so no conditions
# sim_data.shape = (n_datasets, n_conditions, ...)

workflow.approximator.inference_network.integrate_kwargs = {
    'method': 'euler_maruyama',
    'steps': 200,
    'mini_batch_size': 2,  # how many conditions are used per step to estimate the compositional score
    'compositional_d1': 0.05,  # density bridge
}

posterior_samples = workflow.compositional_sample(
    num_samples=100,
    conditions={'sim_data': test_data_comp_trials['sim_data']},
    compute_prior_score=prior_score,
)
# posterior_samples.shape = (n_datasets, n_samples, n_parameters)

This implementation is based on the compositional approach described here.

Defining CompositionalApproximator and CompositionalDiffusionModel would essentially only change the code organization: you would have to specify the correct approximator already during training (even though there is no difference), or load the trained diffusion model into the CompositionalApproximator after training before you can do inference (which would mean defining a new workflow and loading the model, so not too difficult). A nice point about CompositionalDiffusionModel would be that we could set default settings which differ from the standard diffusion model, e.g., using a stochastic sampler rather than a deterministic one.
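
For the discussion, a minimal sketch of what the subclass variant might look like (purely illustrative; the import path and constructor signature are assumptions):

from bayesflow.networks import DiffusionModel  # import path assumed

class CompositionalDiffusionModel(DiffusionModel):
    # Hypothetical subclass: same network, but with default settings suited
    # to compositional sampling, e.g. a stochastic sampler by default.
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.integrate_kwargs = {'method': 'euler_maruyama', 'steps': 200}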

@paul-buerkner
Contributor

Thank you. That makes sense. Is there any practical use case where we would want to use the same diffusion model for both standard and compositional sampling?

@arrjon
Member Author

arrjon commented Sep 16, 2025

I think the usual case is that you know from the beginning that you want a compositional model. Only in rare cases, where you get new data after you have trained a network, might you consider switching from standard diffusion to compositional diffusion.

However, at the moment a CompositionalApproximator might be overkill as we only have a single inference network suitable for this task.

@paul-buerkner
Contributor

So you would suggest a new diffusion model class but not a new approximator class. I would personally be fine with that. That said, is there anything else beyond the approximator.compositional_sample method that would have to be added to the approximator class? If not, I think keeping the existing approximator is fine and checking inside its compositional sampling method whether the employed inference network supports compositional sampling.
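
A rough sketch of that check (the attribute used for detection is an assumption; any marker of compositional support would do):

class ContinuousApproximator:
    # ... existing methods ...

    def compositional_sample(self, num_samples, conditions, compute_prior_score, **kwargs):
        # only proceed if the employed inference network implements
        # compositional sampling (currently only the diffusion model)
        if not hasattr(self.inference_network, '_inverse_compositional'):
            raise NotImplementedError(
                'The employed inference network does not support compositional sampling.'
            )
        return self._compositional_sample(num_samples, conditions, compute_prior_score, **kwargs)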

@stefanradev93
Contributor

@arrjon Can you post a minimal interface example for the latest version (model definition and sampling) to discuss with the others?
