feat: release tokio runtime on driver/executor exit by wForget · Pull Request #4734 · apache/datafusion-comet

wForget · 2026-06-26T05:31:23Z

Which issue does this PR close?

Rationale for this change

The tokio runtime worker threads attach to non-daemon JVM threads, and these threads are not detached or shut down, which prevents the JVM from exiting.

What changes are included in this PR?

How are these changes tested?

The first run below is on this commit and completed successfully. The second is on the last commit of the main branch and hung.

https://github.com/wForget/benchmarks-spark-native/actions/runs/28219472994/job/83600457105
https://github.com/wForget/benchmarks-spark-native/actions/runs/28220915819/job/83601883415

andygrove

LGTM. Thanks @wForget

mbutrovich

Thanks for tracking this down, the diagnosis is correct. The runtime was a
static, statics are never dropped, and the tokio workers stay attached to the
JVM as non-daemon threads for their whole lifetime, so the JVM cannot exit. The
Mutex<Option<Runtime>> + take() change is the right fix for that leak, and
the iceberg-rust side looks right too since iceberg only borrows our runtime
handle and never spawns threads of its own.

Two things I would like to see before merge. First, a note on why
shutdown_background() over a bounded shutdown_timeout, since the background
variant detaches the workers asynchronously after we return. Second, a short
comment on the double-release in local mode where both plugins share one runtime.

The one larger question is whether plugin shutdown() is enough on its own. It
only runs on a clean SparkContext.stop(), so a JVM that exits without stopping
the context would still hang. Attaching the workers as daemon threads would make
the fix robust to that case. That could be a follow-up if you would rather keep
this PR focused on the teardown path.

mbutrovich · 2026-06-26T13:50:14Z

+/// Must not be called from within the runtime's own worker threads, otherwise the shutdown
+/// would deadlock/panic.
+pub fn release_runtime() {
+    let runtime = TOKIO_RUNTIME.lock().take();


Nice catch on the root cause. The runtime lived in a static, and Rust never
drops statics, so the worker threads never exited. Moving to
Mutex<Option<Runtime>> + take() is what finally lets the runtime drop. That
is the real fix.
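
A minimal, std-only sketch of the pattern described above, not the PR's actual code. `FakeRuntime`, `init_runtime`, and `DROPS` are hypothetical stand-ins, and `std::sync::Mutex` is used here in place of whatever mutex the PR uses. It only illustrates that a value held directly in a static is never dropped, while `take()` moves it out of the static so that its `Drop` runs and the worker thread exits.

```rust
use std::sync::atomic::{AtomicBool, AtomicUsize, Ordering};
use std::sync::{Arc, Mutex};
use std::thread::{self, JoinHandle};
use std::time::Duration;

/// Counts how many times a FakeRuntime has been dropped (for illustration only).
static DROPS: AtomicUsize = AtomicUsize::new(0);

/// Hypothetical stand-in for a runtime: it owns a single worker thread.
struct FakeRuntime {
    stop: Arc<AtomicBool>,
    worker: Option<JoinHandle<()>>,
}

impl FakeRuntime {
    fn new() -> Self {
        let stop = Arc::new(AtomicBool::new(false));
        let flag = Arc::clone(&stop);
        // The worker spins until it is told to stop.
        let worker = thread::spawn(move || {
            while !flag.load(Ordering::Acquire) {
                thread::sleep(Duration::from_millis(1));
            }
        });
        FakeRuntime { stop, worker: Some(worker) }
    }
}

impl Drop for FakeRuntime {
    fn drop(&mut self) {
        // Signal the worker and wait for it to exit.
        self.stop.store(true, Ordering::Release);
        if let Some(handle) = self.worker.take() {
            let _ = handle.join();
        }
        DROPS.fetch_add(1, Ordering::SeqCst);
    }
}

/// The static itself is never dropped, but the Option inside it can be emptied.
static RUNTIME: Mutex<Option<FakeRuntime>> = Mutex::new(None);

fn init_runtime() {
    let mut slot = RUNTIME.lock().unwrap();
    if slot.is_none() {
        *slot = Some(FakeRuntime::new());
    }
}

/// Returns true if this call actually released a runtime.
fn release_runtime() -> bool {
    // The lock guard is a temporary and is released at the end of this statement.
    let runtime = RUNTIME.lock().unwrap().take();
    // `runtime` is dropped when this function returns, which runs Drop above.
    runtime.is_some()
}
```

Calling `init_runtime()` followed by `release_runtime()` in this sketch stops and joins the worker. Without the `take()`, the `FakeRuntime` would stay inside the static for the life of the process.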

One question on shutdown_background(). It signals the workers and returns
without joining them, so the JVM-exit only unblocks asynchronously as each
worker runs its thread-local detach on the way out. Did you consider
shutdown_timeout(...) instead? It would make teardown deterministic, and the
plugin shutdown thread is not latency sensitive. Not blocking, just curious
about the rationale so we can capture it in the doc comment.
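
To make the trade-off concrete, here is a std-only analogy rather than tokio's implementation. `Worker`, `spawn_worker`, `shutdown_in_background`, and `shutdown_with_timeout` are hypothetical names. The sketch assumes a single worker that does some exit work (standing in for the thread-local detach) after it sees the stop signal.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::mpsc::{self, Receiver};
use std::sync::Arc;
use std::thread;
use std::time::Duration;

/// Hypothetical single-threaded "pool"; this is not tokio.
struct Worker {
    stop: Arc<AtomicBool>,
    done: Receiver<()>,
}

/// Spawns a worker that, once signalled, spends `exit_work` cleaning up
/// before reporting that it has finished.
fn spawn_worker(exit_work: Duration) -> Worker {
    let stop = Arc::new(AtomicBool::new(false));
    let flag = Arc::clone(&stop);
    let (tx, done) = mpsc::channel();
    thread::spawn(move || {
        while !flag.load(Ordering::Acquire) {
            thread::sleep(Duration::from_millis(1));
        }
        thread::sleep(exit_work); // simulated exit-time cleanup
        let _ = tx.send(()); // ignored if nobody is listening any more
    });
    Worker { stop, done }
}

/// Background style: signal and return immediately.
/// The caller learns nothing about when the worker actually exits.
fn shutdown_in_background(worker: Worker) -> Receiver<()> {
    worker.stop.store(true, Ordering::Release);
    worker.done
}

/// Bounded style: signal, then wait at most `limit` for the worker to exit.
/// Returns true if the worker finished within the limit.
fn shutdown_with_timeout(worker: Worker, limit: Duration) -> bool {
    worker.stop.store(true, Ordering::Release);
    worker.done.recv_timeout(limit).is_ok()
}
```

In this sketch the bounded variant tells the caller whether teardown completed, at the cost of blocking for up to `limit`. The background variant never blocks, but the worker may still be running after the call returns.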

mbutrovich · 2026-06-26T13:50:47Z

+  }
+
+  override def shutdown(): Unit = {
+    logInfo("CometExecutorPlugin shutdown")


In local mode both plugins live in the same JVM and share the one native
runtime, so both shutdown() paths call NativeBase.release(). It works
because release_runtime() does take(), which makes the second call a no-op.
Could we add a short comment noting that the double release is expected and
safe? It is not obvious when reading either plugin on its own. The executor
plugin shuts down before the driver plugin in local mode, so the executor wins
the take().
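
The double release can be illustrated with a small std-only sketch. `SharedRuntime`, `install`, and `release` are hypothetical names, and the caller labels exist only to show which side wins. It assumes both plugins funnel into the same `take()` on one shared slot.

```rust
use std::sync::Mutex;

/// Hypothetical placeholder for the one native runtime shared by both plugins.
struct SharedRuntime;

static SHARED: Mutex<Option<SharedRuntime>> = Mutex::new(None);

fn install() {
    *SHARED.lock().unwrap() = Some(SharedRuntime);
}

/// Returns Some(caller) if this call performed the release,
/// or None if the slot was already empty (the expected no-op).
fn release(caller: &'static str) -> Option<&'static str> {
    let taken = SHARED.lock().unwrap().take();
    taken.map(|_runtime| caller)
}
```

With the shutdown order described above, `release("executor")` returns `Some("executor")` and the later `release("driver")` returns `None`, so the second call has nothing left to tear down.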

mbutrovich · 2026-06-26T13:51:59Z

@@ -118,7 +117,7 @@ use std::sync::OnceLock;
 #[cfg(feature = "jemalloc")]
 use tikv_jemalloc_ctl::{epoch, stats};



docs/source/contributor-guide/development.md is now stale in two spots and
could be updated separately once this lands:

Line 87 calls the runtime a Lazy<Runtime> static. It is OnceLock today and
becomes Mutex<Option<Runtime>> here, torn down on plugin shutdown.

Line 60 says the AttachGuard detaches the thread when dropped. The
attachment is actually cached in thread-local storage and only released when
the worker thread exits. That detail is exactly why the runtime has to be shut
down for the JVM to exit, so it is worth correcting.

feat: release tokio runtime on driver/executor exit

fb1f23c

wForget force-pushed the release-tokio-rutime-on-exit branch from b23f457 to fb1f23c Compare June 26, 2026 05:36

fix check-jar-contents

6b987cb

wForget marked this pull request as ready for review June 26, 2026 08:00

mbutrovich self-requested a review June 26, 2026 13:39

andygrove approved these changes Jun 26, 2026


mbutrovich reviewed Jun 26, 2026


wForget wants to merge 2 commits into apache:main from wForget:release-tokio-rutime-on-exit


3 participants
