Investigating - HPC users may receive a message when using SLURM commands, saying, "error: NodeNames=spark-a[237-262] CPUs=128 match no Sockets, Sockets*CoresPerSocket or Sockets*CoresPerSocket*ThreadsPerCore. Resetting CPUs."
This message does not affect jobs. We are investigating.

Nov 26, 2025 - 16:49 CST

About This Site

This page provides information about unplanned downtimes and scheduled maintenance for services offered by the Center for High Throughput Computing

High Throughput Computing (HTC) System Operational
90 days ago
99.92 % uptime
Today
Access Points Operational
90 days ago
99.92 % uptime
Today
CHTC Pool Operational
90 days ago
100.0 % uptime
Today
External Pools (OSPool, Campus HTCondor Pools) Operational
90 days ago
100.0 % uptime
Today
Staging and Projects Space Operational
90 days ago
100.0 % uptime
Today
File Transfers Operational
90 days ago
99.71 % uptime
Today
High Performance Computing (HPC) System Operational
90 days ago
99.99 % uptime
Today
Login Nodes Operational
90 days ago
99.98 % uptime
Today
Cluster Nodes and Jobs Operational
90 days ago
100.0 % uptime
Today
Central Software Installations Operational
90 days ago
100.0 % uptime
Today
Home and Scratch File Systems Operational
90 days ago
100.0 % uptime
Today
Data Transfer Tools Operational
90 days ago
100.0 % uptime
Today
Globus Endpoint Operational
90 days ago
100.0 % uptime
Today
CHTC Internal Infrastructure Operational
90 days ago
100.0 % uptime
Today
Tiger Cluster ? Operational
90 days ago
100.0 % uptime
Today
RT Email/Ticket Support System Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Dec 1, 2025

No incidents reported today.

Nov 30, 2025

No incidents reported.

Nov 29, 2025

No incidents reported.

Nov 28, 2025

No incidents reported.

Nov 27, 2025

No incidents reported.

Nov 26, 2025

Unresolved incident: [HPC] Warning message when using SLURM commands.

Nov 25, 2025
Resolved - This incident has been resolved.
Nov 25, 15:16 CST
Monitoring - We've implemented a fix and are monitoring the issue.
Nov 25, 14:37 CST
Investigating - Users are unable to log into or access learn.chtc.wisc.edu. When attempting to log in, users are prompted for their password three times before getting a "Permission denied" message. We are investigating.
Nov 25, 14:27 CST
Nov 24, 2025

No incidents reported.

Nov 23, 2025

No incidents reported.

Nov 22, 2025

No incidents reported.

Nov 21, 2025

No incidents reported.

Nov 20, 2025

No incidents reported.

Nov 19, 2025

No incidents reported.

Nov 18, 2025

No incidents reported.

Nov 17, 2025

No incidents reported.