Impact of the bug
WMAgent --> RetryManager Component of the agent
Describe the bug
Every once in a while the retry manager crashes while loading the job report for a job in cooloff. The error message boils down to: "pickle data was truncated".
How to reproduce it
Steps to reproduce the behavior:
No idea
Expected behavior
If the job report is not accessible, the job should still be retried, up to a maximum of three times.
Additional context and error message
Every job that gets retried needs to unpickle the job report to check the exit code. If unpickling the report fails, the retry manager crashes, which blocks every other job that requires a retry. The solution should be to allow the retry without checking the exit code when unpickling Report.pkl fails.
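A minimal sketch of the proposed behavior, not the actual RetryManager code. The names load_report_or_none, should_retry, MAX_BLIND_RETRIES and the exit_code_allows_retry callback are hypothetical and only illustrate the idea: catch the unpickling failure instead of crashing, and fall back to a capped retry that skips the exit code check.

```python
import pickle

# Hypothetical cap, taken from the "maximum of three times" in the expected behavior.
MAX_BLIND_RETRIES = 3


def load_report_or_none(report_path):
    """Return the unpickled job report, or None if it cannot be read."""
    try:
        with open(report_path, "rb") as handle:
            return pickle.load(handle)
    except (pickle.UnpicklingError, EOFError, OSError):
        # UnpicklingError covers "pickle data was truncated"; EOFError covers
        # an empty file; OSError covers a missing or unreadable file.
        return None


def should_retry(report_path, retry_count, exit_code_allows_retry):
    """Decide whether a job in cooloff may be retried.

    exit_code_allows_retry is a hypothetical callable that receives the
    loaded report and returns True if its exit code permits a retry.
    """
    report = load_report_or_none(report_path)
    if report is None:
        # No usable report: retry without the exit code check, up to the cap.
        return retry_count < MAX_BLIND_RETRIES
    return exit_code_allows_retry(report)
```

With this shape, a single corrupted Report.pkl only affects the decision for its own job, and the component can continue with the other jobs waiting for a retry.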