FIX-#7675: Allow backend switching to backends other than provided arguments #7679
base: main
Conversation
…n provided arguments Signed-off-by: Jonathan Shi <jonathan.shi@snowflake.com>
modin/core/storage_formats/base/query_compiler_calculator.py
I have some minor comments. Also, I have some questions:
- Have you run any benchmarks with this change? Does it solve the pathological merge case that motivated it?
- Is it possible or likely that we switch to an unexpected and/or suboptimal backend during multi-dataset operations? E.g., could we switch to Ray for a Snowflake-pandas merge? Is there a good way to test whether this happens in practice?
I ran the pathological merge as a sanity check and it went from 140s -> 6s (it takes around 2s with hybrid disabled, but there's some thrashing because an unnecessary switch occurs after
Right now we don't allow automatic switching to Ray, as it's omitted from
@disable_logging
def max_cost(self) -> int:
@classmethod
def max_cost(cls) -> int:
So max_cost may be implemented using some innate knowledge of the object, so /technically/ it cannot be a class method. In a practical sense I don't think we ever set this to anything other than COST_IMPOSSIBLE, so I'm wondering if the function should be removed in favor of just using COST_IMPOSSIBLE as the static return value.
max_cost is the maximum cost allowed by this query compiler across all data movements. This method sets a normalized upper bound for situations where multiple data frames from different engines all need to move to the same engine. The value returned by this method can exceed QCCoercionCost.COST_IMPOSSIBLE.
We set it to COST_IMPOSSIBLE * 1e10 in the Snowflake QC. I turned it into a class method so that backends w/o arguments present in the operation (for example, calculating cost for the Cloud backend in pd.concat([df_pico1, df_pico2])) can report a max cost.
Should I leave the original max_cost method as-is, and introduce an alternate static/classmethod max_cost for these cases?
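The trade-off being debated above can be sketched with stand-in classes rather than Modin's real query compiler hierarchy. The class names, the COST_IMPOSSIBLE value, and the 1e10 multiplier below are taken from the thread's discussion but are otherwise illustrative assumptions, not Modin's actual implementation:

```python
COST_IMPOSSIBLE = 1_000_000  # stand-in sentinel; Modin's real constant may differ


class BaseQueryCompiler:
    @classmethod
    def max_cost(cls) -> int:
        # As a classmethod, a backend can report its cap even when no
        # argument in the operation belongs to it (the pd.concat case above).
        return COST_IMPOSSIBLE


class SnowflakeQueryCompiler(BaseQueryCompiler):
    @classmethod
    def max_cost(cls) -> int:
        # The thread says the Snowflake QC raises the cap well above
        # COST_IMPOSSIBLE so that moving many frames to it stays viable.
        return int(COST_IMPOSSIBLE * 1e10)


# A caster can now query a backend's cap without any live instance of it:
caps = {
    qc.__name__: qc.max_cost()
    for qc in (BaseQueryCompiler, SnowflakeQueryCompiler)
}
```

This is exactly what an instance-based max_cost cannot do: with `def max_cost(self)`, the caster would need a constructed query compiler for a backend that contributes no arguments to the operation.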
from modin.logging.metrics import emit_metric

def all_switchable_backends() -> list[str]:
I don't understand why this cannot be part of envvars
We could make this configurable, but I just refactored this out from a function in the QC caster:
modin/modin/core/storage_formats/pandas/query_compiler_caster.py
Lines 800 to 807 in b002708
for backend in Backend.get_active_backends():
    if backend in ("Ray", "Unidist", "Dask"):
        # Disable automatically switching to these engines for now, because
        # 1) _get_prepared_factory_for_backend() currently calls
        #    _initialize_engine(), which starts up the ray/dask/unidist
        #    processes
        # 2) we can't decide to switch to unidist in the middle of execution.
        continue
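The refactor described in this comment can be sketched as a standalone helper. The exclusion list and the reasons in the comments come from the excerpt above; `get_active_backends` is simulated here rather than imported from Modin's config, so the names are assumptions:

```python
# Engines Modin cannot automatically switch to, per the excerpt above.
_NON_SWITCHABLE = ("Ray", "Unidist", "Dask")


def get_active_backends() -> list[str]:
    # Stand-in for Backend.get_active_backends(); real values come from
    # Modin's configuration.
    return ["Pandas", "Ray", "Dask", "Python_Test"]


def all_switchable_backends() -> list[str]:
    # Skip engines we cannot automatically switch to: preparing their
    # factories would start the ray/dask/unidist processes, and unidist
    # cannot be chosen in the middle of execution.
    return [b for b in get_active_backends() if b not in _NON_SWITCHABLE]
```

Pulling the loop out of the caster this way also gives a single place to hang a future environment variable, if the exclusion list ever needs to be configurable.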
""" | ||
Calculate which query compiler we should cast to. | ||
Switching calculation is performed as follows: |
+1 to the documentation here.
).io_cls.query_compiler_cls,
)
if preop_switch:
    # Initialize backend data for any backends not found among query compiler arguments.
Can we create a new environment variable for this behavior, maybe defaulting to on, so we can perf test with it on and off?
preop_switch is set for individual functions registered by register_function_for_pre_op_switch when BackendCalculator is initialized. What would you want the environment variable to do?
@sfc-gh-mvashishtha After doing some more testing, I realized it made more sense to only switch to other backends if we explicitly registered a function as a switch point, as is the case for 0/1-argument functions. I've updated the code to reflect this.
What do these changes do?
After this PR, AutoSwitchBackend has 2 separate behaviors for functions with multiple query compiler arguments. For example, after calling pd.concat([A1, A2]), we previously would only consider switching to the backends of the query compilers of arguments A1 and A2. Now, after calling register_function_for_pre_op_switch(class_name=None, backend="Backend_A", method="concat"), Modin may move arguments to some third backend Backend_B.
- flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
- black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
- git commit -s
- docs/development/architecture.rst is up-to-date