All Systems Operational

About This Site

This page provides information about unplanned downtimes and scheduled maintenance for services offered by the Center for High Throughput Computing

High Throughput Computing (HTC) System Operational
90 days ago
99.98 % uptime
Today
Access Points Operational
90 days ago
100.0 % uptime
Today
CHTC Pool Operational
90 days ago
100.0 % uptime
Today
External Pools (OSPool, Campus HTCondor Pools) Operational
90 days ago
100.0 % uptime
Today
Staging and Projects Space Operational
90 days ago
100.0 % uptime
Today
File Transfers Operational
90 days ago
99.94 % uptime
Today
High Performance Computing (HPC) System Operational
90 days ago
99.97 % uptime
Today
Login Nodes Operational
90 days ago
99.98 % uptime
Today
Cluster Nodes and Jobs Operational
90 days ago
100.0 % uptime
Today
Central Software Installations Operational
90 days ago
100.0 % uptime
Today
Home and Scratch File Systems Operational
90 days ago
99.9 % uptime
Today
Data Transfer Tools Operational
90 days ago
100.0 % uptime
Today
Globus Endpoint Operational
90 days ago
100.0 % uptime
Today
CHTC Internal Infrastructure Operational
90 days ago
100.0 % uptime
Today
Tiger Cluster ? Operational
90 days ago
100.0 % uptime
Today
RT Email/Ticket Support System Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Jun 13, 2025

No incidents reported today.

Jun 12, 2025
Completed - We have finished maintenance. Users can now access oconnor-ap.
Jun 12, 16:51 CDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jun 12, 11:00 CDT
Scheduled - We are retiring oconnor-ap and transitioning users to a new machine. Users will be unable to access oconnor-ap until maintenance is over.
Jun 12, 10:42 CDT
Jun 11, 2025
Completed - Our ticketing system has been successfully upgraded and is operational at this time.
Jun 11, 16:06 CDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jun 11, 00:00 CDT
Scheduled - CHTC will be switching to a new email support/ticketing system on June 11th. You can still email us at chtc@cs.wisc.edu, and all open tickets should carry over automatically.

Please note: We may experience brief downtime or delayed responses during the transition.

For the latest updates, check our status page:
https://status.chtc.wisc.edu

If there are any issues or delays during our system upgrade window, we will post status updates to the system Status Page.

Jun 9, 17:00 CDT
Jun 10, 2025
Completed - The HPC cluster outage scheduled for today has been canceled due to a change in server room maintenance plans. The Spark/HPC cluster is back up and operation at this time. We will announce the new date for maintenance once we know.
Jun 10, 12:17 CDT
Update - Scheduled maintenance is still in progress. We will provide updates as necessary.
Jun 9, 16:51 CDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.

For HTC Users: The HTC system is not expected to be affected during this downtime.

Jun 9, 16:49 CDT
Scheduled - The Spark HPC cluster will be unavailable during June 10 due to scheduled maintenance. No jobs will run or be accepted during this time and queued jobs should continue once the maintenance downtime has completed.

For the most up-to-date information, including when the system is back online, please refer to the system Status Page.

Jun 9, 15:00 CDT
Jun 9, 2025
Completed - This upgrade has been re-scheduled to later in the week.
Jun 9, 12:23 CDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jun 9, 00:00 CDT
Scheduled - CHTC will be switching to a new email support/ticketing system on June 9th. You can still email us at chtc@cs.wisc.edu, and all open tickets should carry over automatically.

Please note: We may experience brief downtime or delayed responses during the transition.

For the latest updates, check our status page:
https://status.chtc.wisc.edu

If there are any issues or delays during our system upgrade window, we will post status updates to the system Status Page.

May 30, 16:21 CDT
Jun 8, 2025

No incidents reported.

Jun 7, 2025

No incidents reported.

Jun 6, 2025

No incidents reported.

Jun 5, 2025

No incidents reported.

Jun 4, 2025

No incidents reported.

Jun 3, 2025
Resolved - This incident has been resolved.
Jun 3, 12:37 CDT
Update - We've put in place a hacky and rather temporary fix for the integration. That hopefully means things will work over the weekend, but until we resolve the underlying issue, this problem will reoccur sometime in the near future.
May 23, 18:17 CDT
Update - While we are still unable to identify the root cause, it appears to be originating from a combination of software updates and complex credential configurations. We are unlikely to resolve the issue before the weekend, and we will continue to provide updates next week.
May 23, 16:56 CDT
Investigating - Apparently the fix did not last for very long. Users may now encounter hold messages along the lines of "Details: Pelican Client Error: from uwdf-pelican-cache.uwdf-prod.chtc.io:8443: server returned 403 Forbidden". Good news is that this is different from before, bad news is that now we have to figure out why its different..
May 20, 15:04 CDT
Monitoring - We have implemented a fix and are monitoring the situation.
May 20, 13:37 CDT
Investigating - HTC users attempting to transfer files directly to and from their jobs from ResearchDrive using the Pelican plugin will encounter a hold message: "Details: error while querying the director at https://uwdf-director.chtc.wisc.edu: server returned 404 Not Found". We are currently investigating the problem and working on implementing a fix.
May 20, 09:53 CDT
Jun 2, 2025

No incidents reported.

Jun 1, 2025

No incidents reported.

May 31, 2025

No incidents reported.

May 30, 2025

No incidents reported.