Investigating - We have received reports of issues with file transfers from ResearchDrive in HTCondor jobs for users with the ResearchDrive/CHTC integration.

*** Please report any issues with file transfer between CHTC and ResearchDrive to chtc@cs.wisc.edu as we are investigating the cause ***

Feb 13, 2026 - 17:06 CST
Monitoring - We've implemented a fix for the CUDA_VISIBLE_DEVICES issues on various GPU machines. Please resubmit your jobs, and email chtc@cs.wisc.edu if you still experience this issue.
Feb 06, 2026 - 10:18 CST
Update - We are continuing to investigate this issue.
Feb 04, 2026 - 16:38 CST
Investigating - We have received reports of issues with the CUDA_VISIBLE_DEVICES environment variable being set incorrectly on certain GPU jobs. We are investigating the issue and will update this page once more information is known.
Feb 02, 2026 - 13:57 CST

About This Site

This page provides information about unplanned downtimes and scheduled maintenance for services offered by the Center for High Throughput Computing.

High Throughput Computing (HTC) System: Degraded Performance (99.95% uptime over the past 90 days)
Access Points: Operational (99.83% uptime over the past 90 days)
CHTC Pool: Operational (100.0% uptime over the past 90 days)
External Pools (OSPool, Campus HTCondor Pools): Operational (100.0% uptime over the past 90 days)
Staging and Projects Space: Operational (99.99% uptime over the past 90 days)
File Transfers: Degraded Performance (99.95% uptime over the past 90 days)
High Performance Computing (HPC) System: Operational (100.0% uptime over the past 90 days)
Login Nodes: Operational (100.0% uptime over the past 90 days)
Cluster Nodes and Jobs: Operational (100.0% uptime over the past 90 days)
Central Software Installations: Operational (100.0% uptime over the past 90 days)
Home and Scratch File Systems: Operational (100.0% uptime over the past 90 days)
Data Transfer Tools: Operational (100.0% uptime over the past 90 days)
Globus Endpoint: Operational (100.0% uptime over the past 90 days)
CHTC Internal Infrastructure: Operational (100.0% uptime over the past 90 days)
Tiger Cluster: Operational (100.0% uptime over the past 90 days)
RT Email/Ticket Support System: Operational (100.0% uptime over the past 90 days)

Feb 20, 2026

No incidents reported today.

Feb 19, 2026

No incidents reported.

Feb 18, 2026
Resolved - This incident has been resolved.
Feb 18, 17:13 CST
Monitoring - Users reported that jobs using Docker containers failed with a pull rate limit message. Additionally, users building Apptainer images that use a Docker image as a base encountered a "conveyor failed to get" message. Both problems stem from an automatic update that rewrote the network configuration.

We have resolved the broken configuration. Users are encouraged to resubmit their jobs or rebuild their containers. If you encounter the pull rate limit message again, please wait a few hours before retrying.

Contact chtc@cs.wisc.edu with any concerns.

Feb 18, 10:56 CST
Resolved - This incident has been resolved.
Feb 18, 10:50 CST
Monitoring - We have created a containerized install of CST 2025 and are testing the install. Users interested in testing CST 2025 should email us at chtc@cs.wisc.edu.
Feb 10, 17:11 CST
Update - We are continuing to investigate this issue.
Jan 29, 10:20 CST
Investigating - Users of the licensed software CST may encounter this error: "modeler_AMD64: line 154: Aborted (core dumped) "${CST_REGSVR32}"". This occurs on most Execution Points, with the exception of build machines.

We are currently investigating.

Jan 29, 10:19 CST
Feb 17, 2026
Resolved - This incident has been resolved.
Feb 17, 09:20 CST
Monitoring - A fix has been implemented and we are monitoring the results.
Feb 16, 17:05 CST
Investigating - We have reports that job submissions are failing on HTC Access Point ap2001.chtc.wisc.edu with the following error:

Submitting job(s)Failed to process job credential requests (1): 'CRED: startCommand to CredD failed!
'; BAILING OUT.
ERROR: condor_submit failed; aborting.

We are investigating the issue and will share updates via this status page.

Feb 16, 08:41 CST
Feb 16, 2026
Resolved - The license was updated the day after this incident was posted; we forgot to update the status page!
Feb 16, 11:50 CST
Identified - The MATLAB license for CHTC has expired and needs to be renewed. We are working with campus IT to do so.
In the meantime, users submitting MATLAB jobs will encounter the error "License checkout failed".

Feb 5, 09:16 CST
Feb 15, 2026

No incidents reported.

Feb 14, 2026

No incidents reported.

Feb 13, 2026

Unresolved incident: UWDF/ResearchDrive.

Feb 12, 2026

No incidents reported.

Feb 11, 2026

No incidents reported.

Feb 10, 2026
Feb 9, 2026

No incidents reported.

Feb 8, 2026

No incidents reported.

Feb 7, 2026

No incidents reported.

Feb 6, 2026

Unresolved incident: GPU CUDA_VISIBLE_DEVICES Errors.