[SPARK-54197][K8S] Improve `ExecutorsPodsLifecycleManager` not to request to delete if `deletionTimestamp` exists by dongjoon-hyun · Pull Request #52902 · apache/spark

dongjoon-hyun · 2025-11-05T20:48:08Z

What changes were proposed in this pull request?

The current code that handles deletion of Failed or Succeeded executor Pods keeps calling the Kubernetes API to delete the objects until the Kubelet has started terminating the Pod (that is, until the status of the object is terminating).

However, depending on configuration, the ExecutorPodsLifecycleManager loop might run multiple times before the Kubelet starts deleting the Pod object, resulting in unnecessary DELETE calls to the Kubernetes API, which are particularly expensive since they are served from etcd.

The Kubernetes API documentation at https://kubernetes.io/docs/reference/using-api/api-concepts/ states:

> When a client first sends a delete to request the removal of a resource, the .metadata.deletionTimestamp is set to the current time. Once the .metadata.deletionTimestamp is set, external controllers that act on finalizers may start performing their cleanup work at any time, in any order.

Based on this, we can assume that once the deletionTimestamp is set on a Pod, the Pod will eventually be terminated without the need for additional DELETE calls.
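The idea can be illustrated with a minimal, self-contained sketch. This is not the Spark implementation (which is Scala and uses a real Kubernetes client); `Pod`, `FakeApi`, `should_request_delete`, and `run_lifecycle_loop` are hypothetical names invented for this example, and the timestamp value is an arbitrary placeholder. It only shows the guard: a finished Pod is sent a DELETE only if its `deletionTimestamp` is not yet set.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Pod:
    """Hypothetical, simplified stand-in for a Kubernetes Pod object."""
    name: str
    phase: str  # e.g. "Running", "Succeeded", "Failed"
    deletion_timestamp: Optional[str] = None  # mirrors .metadata.deletionTimestamp


@dataclass
class FakeApi:
    """Hypothetical client that records DELETE calls instead of sending them."""
    delete_calls: List[str] = field(default_factory=list)

    def delete(self, pod: Pod) -> None:
        self.delete_calls.append(pod.name)
        # Assumption: mimic the API server setting the timestamp on first delete.
        if pod.deletion_timestamp is None:
            pod.deletion_timestamp = "2025-11-05T20:48:08Z"  # placeholder value


def should_request_delete(pod: Pod) -> bool:
    """Request deletion only for finished Pods not already marked for deletion."""
    finished = pod.phase in ("Succeeded", "Failed")
    return finished and pod.deletion_timestamp is None


def run_lifecycle_loop(api: FakeApi, pods: List[Pod], iterations: int) -> None:
    """Simulate several passes of a lifecycle loop over the same Pods."""
    for _ in range(iterations):
        for pod in pods:
            if should_request_delete(pod):
                api.delete(pod)
```

In this sketch, running three passes over one Failed Pod records a single DELETE call; without the `deletion_timestamp` check, the same Pod would receive one DELETE per pass.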

Why are the changes needed?

This change eliminates redundant calls to the Kubernetes API, which at scale might lead to excessive load on etcd.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

This patch includes unit tests.

Was this patch authored or co-authored using generative AI tooling?

No.

Closes #52898


dongjoon-hyun · 2025-11-05T20:51:17Z

To the reviewers: this PR is a quick workaround to help the following community PR. Its author will set up his GitHub Actions later.

[SPARK-54197][K8S] Improve ExecutorsPodsLifecycleManager not to request to delete if deletionTimestamp exists #52898

Could you review this PR, @peter-toth and @attilapiros ?

dongjoon-hyun · 2025-11-05T21:16:46Z

I fixed the main code compilation and unit test. Let's see the K8s integration test result, @atosatto


dongjoon-hyun · 2025-11-06T00:16:39Z

Thank you, @HyukjinKwon and @attilapiros .

…uest to delete if `deletionTimestamp` exists

Closes #52898

Closes #52902 from dongjoon-hyun/driver-do-not-call-delete-for-terminating-pods-master.

Lead-authored-by: Dongjoon Hyun <dongjoon@apache.org>
Co-authored-by: Andrea Tosatto <atosatto@apple.com>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
(cherry picked from commit 3b368ca)
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

dongjoon-hyun · 2025-11-06T01:57:46Z

Merged to master/4.1 for Apache Spark 4.1.0.

Welcome to the Apache Spark community, @atosatto !

I added you to the Apache Spark contributor group and assigned SPARK-54197 to you.


ExecutorsPodsLifecycleManager: do not call delete requests if the deletionTimestamp is set on the Pod

bd311aa

github-actions bot added the KUBERNETES label Nov 5, 2025

Fix compilation and test case

c104ed2

HyukjinKwon approved these changes Nov 5, 2025


attilapiros reviewed Nov 5, 2025


...ore/src/main/scala/org/apache/spark/scheduler/cluster/k8s/ExecutorPodsLifecycleManager.scala (outdated)

Update resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/ExecutorPodsLifecycleManager.scala

Co-authored-by: Attila Zsolt Piros <2017933+attilapiros@users.noreply.github.com>

73a5ea6

dongjoon-hyun closed this in 3b368ca Nov 6, 2025

dongjoon-hyun deleted the driver-do-not-call-delete-for-terminating-pods-master branch November 6, 2025 01:57

peter-toth mentioned this pull request Nov 6, 2025

[SPARK-54198][K8S] Delete Kubernetes executor pods only once per event processing interval #52899

Closed

dongjoon-hyun wants to merge 3 commits into apache:master from dongjoon-hyun:driver-do-not-call-delete-for-terminating-pods-master


4 participants
