Identified - The Gurobi license for CHTC has expired. We are working with campus IT to renew the license.
In the meantime, user jobs attempting to use the Gurobi license will likely fail due to a "license expired" error.

Apr 02, 2026 - 15:13 CDT

About This Site

This page provides information about unplanned downtimes and scheduled maintenance for services offered by the Center for High Throughput Computing

High Throughput Computing (HTC) System Degraded Performance
90 days ago
99.92 % uptime
Today
Access Points Operational
90 days ago
99.84 % uptime
Today
CHTC Pool Degraded Performance
90 days ago
100.0 % uptime
Today
External Pools (OSPool, Campus HTCondor Pools) Operational
90 days ago
100.0 % uptime
Today
Staging and Projects Space Operational
90 days ago
100.0 % uptime
Today
File Transfers Operational
90 days ago
99.77 % uptime
Today
High Performance Computing (HPC) System Operational
90 days ago
99.78 % uptime
Today
Login Nodes Operational
90 days ago
99.52 % uptime
Today
Cluster Nodes and Jobs Operational
90 days ago
99.79 % uptime
Today
Central Software Installations Operational
90 days ago
100.0 % uptime
Today
Home and Scratch File Systems Operational
90 days ago
99.81 % uptime
Today
Data Transfer Tools Operational
90 days ago
100.0 % uptime
Today
Globus Endpoint Operational
90 days ago
100.0 % uptime
Today
CHTC Internal Infrastructure Operational
90 days ago
100.0 % uptime
Today
Tiger Cluster Operational
90 days ago
100.0 % uptime
Today
RT Email/Ticket Support System Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Apr 3, 2026

No incidents reported today.

Apr 2, 2026
Resolved - This incident has been resolved.
Apr 2, 10:37 CDT
Monitoring - We identified the cause of the problem and have applied a fix. Initial tests appear successful. Let us know at chtc@cs.wisc.edu if you continue to encounter problems.
Apr 2, 09:28 CDT
Investigating - Confirmed user reports that job submission on ap2002.chtc.wisc.edu is failing.
We are investigating the issue and will provide updates as they become available.

Apr 2, 09:09 CDT
Apr 1, 2026

No incidents reported.

Mar 31, 2026
Completed - The scheduled maintenance has been completed.
Mar 31, 17:45 CDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Mar 31, 13:45 CDT
Scheduled - Network infrastructure is being updated in one of our server rooms. Jobs will be much slower to match, due to reduced capacity during this time. Jobs already running on hosts in this server room may fail due to timeouts. Any ongoing filetransfer or interactive sessions to these nodes will also be disrupted. Nodes impacted: E40XX, build4001, cxiaogpu4000, xhuangpgu4000
Mar 31, 13:37 CDT
Completed - The scheduled maintenance has been completed.
Mar 31, 13:00 CDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Mar 31, 07:00 CDT
Scheduled - Network infrastructure is being updated in one of our server rooms. Jobs will be much slower to match, due to reduced capacity during this time. Jobs already running on hosts in this server room may fail due to timeouts. Any ongoing filetransfer or interactive sessions to these nodes will also be disrupted. Nodes impacted: E40XX, build4001, cxiaogpu4000, xhuangpgu4000
Mar 27, 10:41 CDT
Resolved - This incident has been resolved.
Mar 31, 11:38 CDT
Monitoring - A fix has been implemented and we are monitoring the results.
Mar 31, 10:26 CDT
Investigating - Users reported being unable to access or log in to spark-login.chtc.wisc.edu since 7 PM on 3/30/2026. We are currently investigating.
Mar 31, 09:23 CDT
Mar 30, 2026

No incidents reported.

Mar 29, 2026

No incidents reported.

Mar 28, 2026

No incidents reported.

Mar 27, 2026

No incidents reported.

Mar 26, 2026
Completed - We have brought mrudophgpu4001 back online. Maintenance is completed.
Mar 26, 11:39 CDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Mar 26, 06:00 CDT
Scheduled - We will be powering down mrudolphgpu4001 to perform maintenance. Jobs will not match to this machine during this time.
Mar 20, 15:30 CDT
Mar 25, 2026

No incidents reported.

Mar 24, 2026

No incidents reported.

Mar 23, 2026

No incidents reported.

Mar 22, 2026

No incidents reported.

Mar 21, 2026

No incidents reported.

Mar 20, 2026
Resolved - Looks back to normal.
Mar 20, 13:09 CDT
Monitoring - We believe we've fixed the problem. We'll keep an eye on things to make sure it sticks.
Mar 20, 09:04 CDT
Identified - A system-wide update to our GPU machines has resulted in a mixup in the configuration of GPU machines prioritized for researchers. We are working to address the problem.
In the meantime, you may see reduced or unusually matchmaking behavior for GPU jobs that target prioritized machines, including backfill jobs.

Mar 19, 12:34 CDT