Skip to content

Fix pointer arguments in addFunction + wasm64 #24693

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

RReverser
Copy link
Collaborator

@RReverser RReverser commented Jul 13, 2025

Without this fix, any usages of addFunction with pointers in signatures would add broken functions to the webassembly table, which would fail with errors like "Cannot mix BigInt and other types, use explicit conversions" when invoked.

To fix this, we need to do essentially the opposite of what we do in dynCall and convert all pointer arguments to numbers before invoking the actual JS function, and the pointer result to bigint before returning to wasm.

@RReverser RReverser requested review from kripken and sbc100 July 13, 2025 01:27
Without this fix, any usages of addFunction  - e.g. dylink - would add broken functions to the webassembly table, which result in errors like "Cannot mix BigInt and other types, use explicit conversions" when invoked.

To fix this, we need to do essentially the opposite of what we do in `dynCall` and convert all pointer arguments to numbers before invoking the actual JS function, and the pointer result to bigint before returning to wasm.
@RReverser RReverser force-pushed the add-function-p-wasm64 branch from e08b2bd to 3d8535f Compare July 13, 2025 12:07
}
''')

self.do_runf('main.c', '')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a little surprised we don't have any existing tests of addFunction being used with JS functions with 'p' in their signatures.

I'd like to take a moment to try and figure out why no existing test covered this.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, as long as it doesn't block the fix too long.

Copy link
Collaborator Author

@RReverser RReverser Jul 15, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suspect it's possible that even if there were any tests, they 1) weren't doing pointer arithmetic and 2) weren't returning a pointer from JS to Wasm, only the other way around.

Without these conditions, e.g. if you have a simple tests that only logs the passed argument, the issues wouldn't trigger.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm just a little suprised we don't run into this issue with our exist dynamic linking tests. The main user or addFunction for JS function with a signature passed are in dynamic linking, firstly in dlsym and secondly (which I imagine occurs way more often) in reportUndefinedSymbols which will fill in missing GOT entries using JS functions. I'm surprised we don't run into this there.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh wait.. I think its because JS libraries functions are modified in place when they are constructed, right? So this change is not necessary for JS library functions at all, only for JS functions that come from elsewhere?

Is that right?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, fair enough. Given that it's possible to use addFunction from user code, and that it lives in its own libaddfunction.js, I assumed it's part of public API, but it sounds like it's just one of many functions that isn't marked as __internal: true for backward compat / legacy reasons rather than intended to be used by external developers?

If so, I'm happy to also close this and fix just my individual usage of addFunction in the Embind PR where I needed it.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I think its does have some users, but I'd like to discourage its usage if possible. I'd rather see dynamic dispatch mechanisms be built on the JS side instead of by injecting into (modifying the wasm table at runtime). (I think only the dynamic linker should be doing that really, but I've never really articulated that in any official docs anywhere).

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm I'm finding even more usecases for addFunction where it would save quite a bit of JS code as well as make dispatch faster, pretty much win-win.

we would need to make sure this code didn't create extra wrapping for JS library functions (perhaps by removing the 'p' elements from the .sig which we generate at compile time).

So I think I'll spend some more time sorting this out after all.

I'd rather see dynamic dispatch mechanisms be built on the JS side

Pretty much the opposite preference here, the more I play with it - if we can avoid extra JS work and perform "direct" typed calls, why not do that?

I'm tempted to say we shouldn't bother landing this since (I hope) mode usage of addFunction is internal and not user-visible.

I also remembered that it's actually been a documented API for invoking JS callbacks from Wasm: https://emscripten.org/docs/porting/connecting_cpp_and_javascript/Interacting-with-code.html#calling-javascript-functions-as-function-pointers-from-c

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pretty much the opposite preference here, the more I play with it - if we can avoid extra JS work and perform "direct" typed calls, why not do that?

But is there any real performance advantage over having an additional lookup table on the JS side?

Also, remember that addFunction does not currently work with threading, is that a limitation you are OK with?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

But is there any real performance advantage over having an additional lookup table on the JS side?

There is for variadic functions, anywhere where you otherwise have to encode type info and number of arguments out of band.

Also, remember that addFunction does not currently work with threading, is that a limitation you are OK with?

Yeah that's fine, I'm only lazily storing it in thread-locals.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants