Stop restarting Pelican if it crashes too frequently #2316

Closed
matyasselmeci wants to merge 1 commit into PelicanPlatform:main from matyasselmeci:pr/startlimit

@matyasselmeci

Contributor

Disable the systemd auto-restart of Pelican services if they've exited after less than 60 seconds, 3 times in a row. After that, the admin will need to run `systemctl start SERVICE` by hand.
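
For illustration, a drop-in along the following lines is one way to express a start limit like the one described. The diff itself is not shown in this thread, so the file path, the `Restart=` policy, and the exact values are assumptions; `RestartSec=20` is inferred from the 20-second restart delay mentioned later in the conversation. `StartLimitIntervalSec=` and `StartLimitBurst=` are standard systemd `[Unit]` directives.

```ini
# Hypothetical drop-in, e.g. /etc/systemd/system/SERVICE.service.d/startlimit.conf
[Unit]
# Rate-limit window for start attempts (assumed value)
StartLimitIntervalSec=60
# Number of starts permitted within that window (assumed value)
StartLimitBurst=3

[Service]
# Assumed restart policy and delay between automatic restarts
Restart=on-failure
RestartSec=20
```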

(PelicanPlatform#2315)
@matyasselmeci requested a review from brianhlin May 14, 2025 22:15
@matyasselmeci linked an issue May 14, 2025 that may be closed by this pull request
@matyasselmeci added labels May 14, 2025: enhancement (new feature or request); cache, origin, director, registry (issues relating to the respective component)

@brianhlin left a comment

Contributor

LGTM

@matyasselmeci

Contributor Author

This does not work the way I thought it did. systemd stops restarting the service if it starts three times within 60 seconds (for any reason, manual or automatic). Since there is a 20-second delay between restarts, this means that:

  1. the process basically has to crash immediately for the limit to kick in
  2. if the service goes into the 'failed, started too often' state, the admin has to know to run `systemctl daemon-reload` or `systemctl reset-failed` before systemd will let them start the service again.
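
The timing argument in the first point can be checked with a small simulation. This is a simplified sketch and not systemd's actual code: it assumes a fixed window that opens at the first start, permits at most `burst` starts inside it, and resets once more than `interval_s` seconds have passed. The function name and the default values (60-second window, burst of 3, 20-second restart delay) are assumptions taken from the numbers in this thread.

```python
def trips_start_limit(runtime_s, restart_delay_s=20.0, interval_s=60.0,
                      burst=3, max_starts=1000):
    """Return True if a service that exits runtime_s seconds after every
    start would eventually be refused a restart under the assumed model."""
    window_begin = None  # time at which the current rate-limit window opened
    count = 0            # starts counted inside the current window
    t = 0.0              # time of the next start attempt
    for _ in range(max_starts):
        if window_begin is None or t - window_begin > interval_s:
            # Window expired (or first start): open a new one.
            window_begin = t
            count = 1
        elif count < burst:
            # Still under the burst limit inside the current window.
            count += 1
        else:
            # Too many starts inside the window: the start is refused.
            return True
        # The service runs, exits, and is restarted after the delay.
        t += runtime_s + restart_delay_s
    return False


if __name__ == "__main__":
    print(trips_start_limit(0))   # True: exits instantly, starts at 0, 20, 40, 60
    print(trips_start_limit(1))   # False: starts at 0, 21, 42, 63, window resets
```

Under these assumptions the limit only trips when `burst * (runtime_s + restart_delay_s) <= interval_s`, which with the numbers above leaves no room for the service to run at all before exiting.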

@bbockelm

Collaborator

@matyasselmeci - what do you want to do with this PR in the end? Per the last comment, should we just close it out?

@matyasselmeci

Contributor Author

Yeah, I'll close it.

@matyasselmeci deleted the pr/startlimit branch July 16, 2025 02:16

Development

Successfully merging this pull request may close these issues.

SystemD: stop restarting Pelican if it exits too frequently

3 participants