In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Mar 06, 2026 - 06:00 CST
Scheduled - We will be shutting down mrudolphgpu4001 to repair a faulty GPU. The machine will be brought back up with 7 GPUs while we get the GPU replaced.
Mar 6, 2026 06:00 - Mar 13, 2026 07:00 CDT

About This Site

This page provides information about unplanned downtimes and scheduled maintenance for services offered by the Center for High Throughput Computing

High Throughput Computing (HTC) System Operational
90 days ago
99.92 % uptime
Today
Access Points Operational
90 days ago
99.86 % uptime
Today
CHTC Pool Operational
90 days ago
100.0 % uptime
Today
External Pools (OSPool, Campus HTCondor Pools) Operational
90 days ago
100.0 % uptime
Today
Staging and Projects Space Operational
90 days ago
100.0 % uptime
Today
File Transfers Operational
90 days ago
99.77 % uptime
Today
High Performance Computing (HPC) System Operational
90 days ago
100.0 % uptime
Today
Login Nodes Operational
90 days ago
100.0 % uptime
Today
Cluster Nodes and Jobs Operational
90 days ago
100.0 % uptime
Today
Central Software Installations Operational
90 days ago
100.0 % uptime
Today
Home and Scratch File Systems Operational
90 days ago
100.0 % uptime
Today
Data Transfer Tools Operational
90 days ago
100.0 % uptime
Today
Globus Endpoint Operational
90 days ago
100.0 % uptime
Today
CHTC Internal Infrastructure Operational
90 days ago
100.0 % uptime
Today
Tiger Cluster Operational
90 days ago
100.0 % uptime
Today
RT Email/Ticket Support System Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Mar 6, 2026
Resolved - This incident has been resolved.
Mar 6, 09:36 CST
Monitoring - We've implemented a fix for the CUDA_VISIBLE_DEVICES issues on various GPU machines. Please resubmit jobs and email chtc@cs.wisc.edu if you are still experiencing issues with it.
Feb 6, 10:18 CST
Update - We are continuing to investigate this issue.
Feb 4, 16:38 CST
Investigating - We have received reports of issues with the CUDA_VISIBLE_DEVICES environment variable being set incorrectly on certain GPU jobs. We are investigating the issue and will update this page once more information is known.
Feb 2, 13:57 CST
Resolved - This incident has been resolved.
Mar 6, 09:36 CST
Investigating - We have received some reports of issues with file transfers from ResearchDrive in HTCondor jobs for folks with the ResearchDrive/CHTC integration.

*** Please report any issues with file transfer between CHTC and ResearchDrive to chtc@cs.wisc.edu as we are investigating the cause ***

Feb 13, 17:06 CST
Resolved - This incident has been resolved.
Mar 6, 08:58 CST
Identified - OSDF file transfers are currently not working. Users may see a hold message like, "Details: failed to get namespace information for remote URL ... error while querying the director". This is due to an unexpected issue with an upgrade. We are working to resolve the issue.
Mar 5, 16:31 CST
Mar 5, 2026
Mar 4, 2026

No incidents reported.

Mar 3, 2026

No incidents reported.

Mar 2, 2026

No incidents reported.

Mar 1, 2026

No incidents reported.

Feb 28, 2026

No incidents reported.

Feb 27, 2026

No incidents reported.

Feb 26, 2026

No incidents reported.

Feb 25, 2026

No incidents reported.

Feb 24, 2026

No incidents reported.

Feb 23, 2026

No incidents reported.

Feb 22, 2026

No incidents reported.

Feb 21, 2026

No incidents reported.

Feb 20, 2026

No incidents reported.