[HTC] Intermittent OSDF hold errors

Identified

We have confirmed user reports of jobs going on hold with a message similar to "Transfer input files failure at ... using protocol osdf ... Unable to read (...Path...); permission denied ...".
The issue appears to be intermittent. We are working to identify the root cause, which we believe is related to a system being overloaded.

If you encounter this hold message, wait a few minutes, then release or resubmit the affected jobs.
Let us know at chtc@cs.wisc.edu if this is significantly impacting your work.
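
For reference, here is a minimal sketch of how the release step might look from the command line. It assumes the standard HTCondor tools `condor_q` and `condor_release` are available on your access point, and that the hold reason contains the text "osdf" as in the message quoted above. The helper name `release_osdf_holds` is hypothetical; adjust the pattern if your hold message differs.

```shell
# Hypothetical helper: review held jobs, then release those held for OSDF transfer errors.
release_osdf_holds() {
    # List your held jobs together with the reason each one is on hold.
    condor_q -hold

    # Release only the held jobs whose hold reason mentions "osdf" (case-insensitive).
    # Assumption: the HoldReason text matches the message quoted in this notice.
    condor_release -constraint 'regexp("osdf", HoldReason, "i")'
}
```

After waiting a few minutes, run `release_osdf_holds` once. Jobs held for unrelated reasons are left on hold so they can be reviewed separately.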
Posted Apr 03, 2025 - 11:57 CDT
This incident affects: High Throughput Computing (HTC) System (File Transfers).