Investigating - GPU4002 is experiencing some transient issues and will be brought offline for maintenance.
Jul 10, 2025 - 15:35 CDT

About This Site

This page provides information about unplanned downtimes and scheduled maintenance for services offered by the Center for High Throughput Computing

High Throughput Computing (HTC) System Degraded Performance
90 days ago
99.97 % uptime
Today
Access Points Operational
90 days ago
99.94 % uptime
Today
CHTC Pool Degraded Performance
90 days ago
100.0 % uptime
Today
External Pools (OSPool, Campus HTCondor Pools) Operational
90 days ago
100.0 % uptime
Today
Staging and Projects Space Operational
90 days ago
100.0 % uptime
Today
File Transfers Operational
90 days ago
99.94 % uptime
Today
High Performance Computing (HPC) System Operational
90 days ago
99.97 % uptime
Today
Login Nodes Operational
90 days ago
99.98 % uptime
Today
Cluster Nodes and Jobs Operational
90 days ago
100.0 % uptime
Today
Central Software Installations Operational
90 days ago
100.0 % uptime
Today
Home and Scratch File Systems Operational
90 days ago
99.9 % uptime
Today
Data Transfer Tools Operational
90 days ago
100.0 % uptime
Today
Globus Endpoint Operational
90 days ago
100.0 % uptime
Today
CHTC Internal Infrastructure Operational
90 days ago
100.0 % uptime
Today
Tiger Cluster ? Operational
90 days ago
100.0 % uptime
Today
RT Email/Ticket Support System Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.

Scheduled Maintenance

[HPC] Downtime for maintenance Jul 14, 2025 07:00 - Jul 15, 2025 07:00 CDT

The HPC system (spark-login.chtc.wisc.edu) will be inaccessible as we perform maintenance for the Data Center. We expect this maintenance to take no longer than a day.
Posted on Jul 02, 2025 - 12:40 CDT
Jul 11, 2025

No incidents reported today.

Jul 10, 2025

Unresolved incident: GPU 4002 Downtime.

Jul 9, 2025

No incidents reported.

Jul 8, 2025

No incidents reported.

Jul 7, 2025

No incidents reported.

Jul 6, 2025

No incidents reported.

Jul 5, 2025

No incidents reported.

Jul 4, 2025

No incidents reported.

Jul 3, 2025

No incidents reported.

Jul 2, 2025

No incidents reported.

Jul 1, 2025
Resolved - This incident has been resolved.
Jul 1, 11:34 CDT
Investigating - A cooling outage in the campus datacenter that hosts our primary APs has brought them offline
Jul 1, 10:29 CDT
Jun 30, 2025

No incidents reported.

Jun 29, 2025

No incidents reported.

Jun 28, 2025

No incidents reported.

Jun 27, 2025

No incidents reported.