Add new hook spec for better integration model for plugins with ParallelRunners #4769

SajidAlamQB · 2025-05-23T13:30:25Z

Description

Related to: #4692

This PR introduces enhancements to the ParallelRunner, to provide a better mechanism for plugins and hooks that need to operate in a multi-process environment. These changes are aim to enabling plugins like kedro-viz to correctly collect data (e.g., dataset statistics) when pipelines are executed in parallel.

Overview:

Enable Shared State for Hooks: Hooks should now maintain and update shared state across multiple processes spawned by ParallelRunner.
Better Hook Execution in Parallel: Provides a better mechanism for executing hooks in subprocesses by ensuring they are picklable and properly initialised.
Centralised Control: ParallelRunner now has more control over the hook environment in its subprocesses.
Plugin Extensibility: Offers a better integration path for plugin developers whose tools need to interact with ParallelRunner.

Development notes

Introduced two new hook specifications:

on_parallel_runner_start(manager: SyncManager, catalog: CatalogProtocol): Allows hooks to be notified when ParallelRunner initialises. It provides access to the runner's multiprocessing.SyncManager (for creating shared state like dictionaries or lists that are safe for inter-process communication) and the main DataCatalog.
get_picklable_hook_implementations_for_subprocess(): Enables hooks to provide a "picklable" version of themselves that ParallelRunner can safely send to and register within its worker subprocesses. This is crucial because PluginManager instances and many complex hook objects are not inherently picklable.

ParallelRunner Enhancements:

When _run is called, ParallelRunner now triggers the on_parallel_runner_start hook, making its SyncManager and catalog available to interested plugins. This allows plugins to set up any necessary shared data structures before task execution begins.

_prepare_subprocess_hook_manager, has been added. This method:

Calls the get_picklable_hook_implementations_for_subprocess hook on the main process's PluginManager.
Collects any picklable hook instances provided by plugins.
Creates a new, simplified PluginManager (with tracing disabled to ensure picklability) for use in subprocesses and registers these picklable hooks.
This prepared PluginManager (or a _NullPluginManager if no picklable hooks are found) is then passed to each Task executed by the ParallelRunner.

This replaces the previous mechanism where Task instances attempted to re-initialise a PluginManager and re-register hooks independently within each subprocess.

Task Simplification (kedro/runner/task.py):

Removed the _run_node_synchronization and _bootstrap_subprocess static methods, along with the parallel attribute and associated logic in execute().

The responsibility of providing a hook manager to tasks running in parallel now lies solely in ParallelRunner.
If a Task receives no hook_manager (which should mainly happen if ParallelRunner provides a _NullPluginManager), it logs a warning and proceeds with a _NullPluginManager, so node execution is not blocked.

Hook Manager Creation (kedro/framework/hooks/manager.py):

_create_hook_manager now accepts an enable_tracing argument. This is used by ParallelRunner to create a picklable hook manager for subprocesses by disabling tracing.
RunnerSpecs is now added to the list of specifications when a hook manager is created.

Minor Changes:

The __del__ method now checks if _manager exists before calling shutdown.

Impact:

This is a feature enhancement and there is no regressions for runners like SequentialRunner.
Plugins wishing to use shared state with ParallelRunner will need to implement the new RunnerSpecs hooks.
The way hooks are managed within ParallelRunner subprocesses is fundamentally changed, moving away from per-task re-initialisation to a centrally prepared, picklable set of hooks.

How to test:

Check out the Kedro develop branch with these PR changes and check out the corresponding kedro-viz branch that uses these new Kedro hooks (the one where DatasetStatsHook implements on_parallel_runner_start and get_picklable_hook_implementations_for_subprocess) and install them.
Use a Kedro project (like the spaceflights starter)
Ensure kedro-viz is installed and its DatasetStatsHook is active.
Run the pipeline using ParallelRunner: kedro run --runner=ParallelRunner
The pipeline should complete without errors related to hook management, SyncManager, or pickling of hooks/managers.
After the run, check the stats.json is filled for datasets processed by different parallel workers.

Developer Certificate of Origin

We need all contributions to comply with the Developer Certificate of Origin (DCO). All commits must be signed off by including a Signed-off-by line in the commit message. See our wiki for guidance.

If your PR is blocked due to unsigned commits, then you must follow the instructions under "Rebase the branch" on the GitHub Checks page for your PR. This will retroactively add the sign-off to all unsigned commits and allow the DCO check to pass.

Checklist

Read the contributing guidelines
Signed off each commit with a Developer Certificate of Origin (DCO)
Opened this PR as a 'Draft Pull Request' if it is work-in-progress
Updated the documentation to reflect the code changes
Added a description of this change in the RELEASE.md file
Added tests to cover my changes
Checked if this change will affect Kedro-Viz, and if so, communicated that with the Viz team

Signed-off-by: Sajid Alam <[email protected]>

…evelop

astrojuanlu · 2025-05-23T14:21:30Z

Since this has been mentioned in the context of kedro-org/kedro-viz#2310, would like to flag a couple of things:

To what extent pluggy is even compatible with parallel or thread execution in Kedro? In Can plugins be executed concurrently? pytest-dev/pluggy#436 we read

@/dAnjou: can plugins be executed concurrently somehow?
@/RonnyPfannschmidt (pluggy author): Currently it's not sanely possible

And there haven't been any significant updates. Maybe the trigger of the hooks should be at a higher level so that we never have to even distribute the plugin or hook manager to the subprocesses or threads?

Besides from my question above, I know this is a draft but how would users with custom runners make theirs compatible with our on_parallel_runner_start ? For example, let's say that they're not using our kedro.runner.parallel_runner.ParallelRunner, but something else that happens to work in a similar fashion.

SajidAlamQB · 2025-05-29T10:15:52Z

Besides from my question above, I know this is a draft but how would users with custom runners make theirs compatible with our on_parallel_runner_start ? For example, let's say that they're not using our kedro.runner.parallel_runner.ParallelRunner, but something else that happens to work in a similar fashion.

The current design avoids concurrent use of a single PluginManager instance across processes. Instead, ParallelRunner equips subprocesses with new, PluginManager instances containing only picklable hook implementations. These specific hook implementations (like DatasetStatsHook) are then responsible for using process safe mechanisms for any state that needs to be coordinated. The core pluggy PluginManager in the main process does not have its hooks called concurrently by different processes.

…evelop

Signed-off-by: Sajid Alam <[email protected]>

…evelop

rashidakanchwala

the code looks great. this will hopefully solve the problem we have on 'Run Status with Parallel Runners' -- do we have any tests for this; it would be nice to see some intergration tests.

astrojuanlu · 2025-06-04T08:33:51Z

@SajidAlamQB addressed my first question above, but not the second

Besides from my question above, I know this is a draft but how would users with custom runners make theirs compatible with our on_parallel_runner_start ? For example, let's say that they're not using our kedro.runner.parallel_runner.ParallelRunner, but something else that happens to work in a similar fashion.

SajidAlamQB · 2025-06-04T09:33:18Z

Besides from my question above, I know this is a draft but how would users with custom runners make theirs compatible with our on_parallel_runner_start ? For example, let's say that they're not using our kedro.runner.parallel_runner.ParallelRunner, but something else that happens to work in a similar fashion.

These are good points. Firstly, plugins wishing to support parallel execution using shared state (like the updated DatasetStatsHook) will now rely on the runner triggering these new RunnerSpecs hooks (on_parallel_runner_start and get_picklable_hook_implementations_for_subprocess).

For custom parallel runners (not inheriting from kedro.runner.parallel_runner.ParallelRunner) to achieve full compatibility with these advanced plugins, they would need to adopt a similar pattern to our ParallelRunner:

Manage Shared State
Trigger on_parallel_runner_start: Call this hook on their main process's PluginManager.
Handle Picklable Hooks for Workers: Call get_picklable_hook_implementations_for_subprocess on the main PluginManager, and then equip each worker unit with its own PluginManager instance populated with these picklable hook implementations.

Essentially, the new RunnerSpecs and ParallelRunner's implementation give a opinionated interface for this kind of shared state hook integration. While custom runners aren't forced to implement this, plugins designed for this pattern won't have their full parallel functionality enabled with those runners if this interface isn't supported.

I think Documenting this pattern for users developing custom runners who want compatibility with pluings is definitely a good idea.

DimedS · 2025-06-04T14:42:29Z

Thanks for the great proposal, @SajidAlamQB - this is definitely a complex and important problem.

In my opinion, it should be possible to reach a state where hooks "just work" across all runners, without requiring any special handling from plugin authors. If we agree that's achievable, it would be better to aim for that direction, even if it requires more significant restructuring.

Overall, I think this is a great topic for a deeper discussion in a dedicated Tech Design session.

merelcht · 2025-06-05T08:20:56Z

This is definitely not a trivial change and the amount of code still needed on the Viz side (https://github.com/kedro-org/kedro-viz/pull/2336/files) to make this work for just one hook (DatasetStatsHook), makes me wonder if this is the way to go. I totally agree with what @DimedS is saying:

In my opinion, it should be possible to reach a state where hooks "just work" across all runners, without requiring any special handling from plugin authors.

If every hook someone implements needs to have all this code to handle the case for execution with the ParallelRunner that's not a good experience. And I'd guess also too complicated for most people.

Also, following on what @rashidakanchwala said, from what I understand this wouldn't "just" make the run status code work as is right? We'd also need to update those hooks to pass state?

Can you schedule a discussion for this asap? Then we can decide on whether we'll give it another try for 1.0.0 or leave it as is and accept that hooks don't work with the ParallelRunner.

SajidAlamQB · 2025-06-11T13:48:32Z

Following our technical design session today, I'm closing this PR based on the team's decision to pursue a research-first approach.

The team consensus was:

For 1.0: Document current limitations clearly
Post-1.0: Research simpler solutions that don't require plugin developers to have deep multiprocessing experience.

Read TD outcomes here: kedro-org/kedro-viz#1801 (comment)

SajidAlamQB added 2 commits May 23, 2025 11:03

parallelrunner enhancements for plugin integration

e5805ec

Signed-off-by: Sajid Alam <[email protected]>

lint

877d715

Signed-off-by: Sajid Alam <[email protected]>

SajidAlamQB self-assigned this May 23, 2025

SajidAlamQB mentioned this pull request May 23, 2025

Better integration with kedro-viz and ParallelRunners kedro-org/kedro-viz#2336

Closed

5 tasks

Merge branch 'develop' into dev/add-new-hook-spec-for-parallelunner-d…

9121c6c

…evelop

Merge branch 'develop' into dev/add-new-hook-spec-for-parallelunner-d…

a936189

…evelop

SajidAlamQB marked this pull request as ready for review May 29, 2025 10:16

SajidAlamQB requested a review from merelcht as a code owner May 29, 2025 10:16

Update specs.py

78f8694

Signed-off-by: Sajid Alam <[email protected]>

SajidAlamQB requested a review from ElenaKhaustova May 29, 2025 14:06

Merge branch 'develop' into dev/add-new-hook-spec-for-parallelunner-d…

203d5eb

…evelop

SajidAlamQB requested review from rashidakanchwala, DimedS, ankatiyar and ravi-kumar-pilla May 29, 2025 15:07

rashidakanchwala reviewed Jun 4, 2025

View reviewed changes

astrojuanlu mentioned this pull request Jun 10, 2025

Viz hook is broken with ParallelRunner [Blocked by Framework] kedro-org/kedro-viz#1801

Open

1 task

SajidAlamQB closed this Jun 11, 2025

SajidAlamQB deleted the dev/add-new-hook-spec-for-parallelunner-develop branch June 11, 2025 14:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add new hook spec for better integration model for plugins with ParallelRunners #4769

Add new hook spec for better integration model for plugins with ParallelRunners #4769

Uh oh!

SajidAlamQB commented May 23, 2025 •

edited

Loading

Uh oh!

astrojuanlu commented May 23, 2025

Uh oh!

SajidAlamQB commented May 29, 2025

Uh oh!

rashidakanchwala left a comment

Uh oh!

astrojuanlu commented Jun 4, 2025

Uh oh!

SajidAlamQB commented Jun 4, 2025

Uh oh!

DimedS commented Jun 4, 2025

Uh oh!

merelcht commented Jun 5, 2025

Uh oh!

SajidAlamQB commented Jun 11, 2025

Uh oh!

Uh oh!

Add new hook spec for better integration model for plugins with ParallelRunners #4769

Add new hook spec for better integration model for plugins with ParallelRunners #4769

Uh oh!

Conversation

SajidAlamQB commented May 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Development notes

Impact:

How to test:

Developer Certificate of Origin

Checklist

Uh oh!

astrojuanlu commented May 23, 2025

Uh oh!

SajidAlamQB commented May 29, 2025

Uh oh!

rashidakanchwala left a comment

Choose a reason for hiding this comment

Uh oh!

astrojuanlu commented Jun 4, 2025

Uh oh!

SajidAlamQB commented Jun 4, 2025

Uh oh!

DimedS commented Jun 4, 2025

Uh oh!

merelcht commented Jun 5, 2025

Uh oh!

SajidAlamQB commented Jun 11, 2025

Uh oh!

Uh oh!

SajidAlamQB commented May 23, 2025 •

edited

Loading