Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to broadcast engines in spark #1406

Open
bennydean opened this issue Jun 28, 2024 · 0 comments
Open

Unable to broadcast engines in spark #1406

bennydean opened this issue Jun 28, 2024 · 0 comments

Comments

@bennydean
Copy link

Hello,
I am attempting to follow the example seen here for using the library with spark data frames: https://microsoft.github.io/presidio/samples/deployments/spark/

However, I receive the following error which seems to stem from trying to reference the broadcasted engine in a UDF. I have tried researching this quite a bit with no luck. Any suggestions would be very helpful.

Error: pyspark.errors.exceptions.base.PySparkRuntimeError: [BROADCAST_VARIABLE_NOT_LOADED] Broadcast variable 37 not loaded.
Environment: Databricks runtime 14.3 LTS

Code: Pretty much copied from example line for line, just changed the lines in which I read the data into a data frame.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant