Due to maintenance, the high-performance computing (HPC) clusters and their storage systems (/home and /scratch) will be unavailable:

  • Great Lakes: Monday, August 9, 2021, 6 a.m. – Wednesday, August 11, 2021, 5 p.m.
  • Armis2, Lighthouse: Tuesday, August 10, 2021, 6 a.m. – Wednesday, August 11, 2021, 5 p.m.
  • Cavium ThunderX: Monday, August 9, 2021, 6 a.m. – Wednesday, August 11, 2021, 5 p.m.

Before maintenance begins, use Globus File Transfer to copy any files you will need during the maintenance window to your local drive.

Contact arcts-support@umich.edu if you have any questions.

Planned Updates

Great Lakes

  • CentOS 7.9
  • Slurm 20.11
  • NVIDIA driver updates
  • OFED updates
  • InfiniBand updates
  • Policy change: Slurm jobs will no longer automatically re-queue after a hardware failure. To enable re-queueing for a job, add the --requeue option to its Slurm submission script.

Lighthouse and Armis2

  • CentOS 7.9
  • Slurm 20.11
  • NVIDIA driver updates
  • OFED updates
  • Policy change: Slurm jobs will no longer automatically re-queue after a hardware failure. To enable re-queueing for a job, add the --requeue option to its Slurm submission script.
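
As a minimal sketch of the re-queue policy change listed for Great Lakes, Armis2, and Lighthouse, a submission script could opt in as follows. The job name, time limit, and the describe_run helper are hypothetical and only for illustration. --requeue is the option named above, and SLURM_RESTART_COUNT is the environment variable that Slurm documents for restarted jobs; it is assumed here to be unset on a first run.

```shell
#!/bin/bash
# Hypothetical job name and time limit; adjust these for your own job.
#SBATCH --job-name=requeue-demo
#SBATCH --time=00:10:00
# Opt in to re-queueing after a hardware failure.
#SBATCH --requeue

# Hypothetical helper: report whether this run is a first attempt or a restart.
# Argument: the restart count (0 means the job has not been requeued).
describe_run() {
  local restart_count="$1"
  if [ "$restart_count" -gt 0 ]; then
    echo "Restart number ${restart_count} after a requeue"
  else
    echo "First run of this job"
  fi
}

# Assumption: SLURM_RESTART_COUNT is unset on the first run, so default to 0.
describe_run "${SLURM_RESTART_COUNT:-0}"
```

A job submitted this way with sbatch should be eligible to return to the queue if its node fails. Checking the restart count lets a job skip work it already finished, provided the job writes its own checkpoints.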

ThunderX

  • CentOS 7.9 (user-space software only; the kernel is not updated)
  • Hadoop update to 2.10.1

Update Details

Great Lakes, Armis2, and Lighthouse

Each component below lists the NEW version first, followed by the OLD version.

CentOS

  • NEW version: CentOS 7.9
    • kernel 3.10.0-1160.31.1.el7
    • glibc 2.17-324.el7_9
    • ucx-1.8.0-1.49224
  • OLD version: CentOS 7.9
    • kernel 3.10.0-1160.6.1.el7
    • glibc 2.17-317.el7
    • ucx-1.8.0-1.49224

Mlnx-ofa_kernel-modules

  • NEW version:
    • 5.1-OFED.5.1-2.5.8.1 (Great Lakes only)
      • Kver 3.10.0-1160.31.1.el7.x86_64
    • 4.9-OFED.4.9-3.1.5.0 (Armis2 and Lighthouse)
      • Kver 3.10.0-1160.31.1.el7.x86_64
  • OLD version:
    • 5.1-OFED.5.1.2.5.8.1 (Great Lakes only)
      • Kver.3.10.0_1160.6.1.el7.x86_64
    • 4.9-OFED.4.9-2.2.4.0 (Armis2 and Lighthouse)
      • Kver.3.10.0_1160.6.1.el7.x86_64

Slurm

  • NEW version: Slurm 20.11.8 compiled with:
    • PMIx
      • /opt/pmix/2.2.4
      • /opt/pmix/3.2.3
    • hwloc 1.11.8-4.el7 (OS provided)
    • ucx-1.8.0-1.49224 (Mellanox provided)
  • OLD version: Slurm 20.02.6 compiled with:
    • PMIx
      • /opt/pmix/2.2.4
      • /opt/pmix/3.2.1
    • hwloc 1.11.8-4.el7 (OS provided)
    • ucx-1.8.0-1.49224 (Mellanox provided)

PMIx LD config

  • NEW version: /opt/pmix/2.2.4/lib
  • OLD version: /opt/pmix/2.2.4/lib

PMIx versions available in /opt

  • NEW version: 1.2.5, 2.1.3, 2.2.4, 3.1.5, 3.2.1, 3.2.3
  • OLD version: 1.2.5, 2.1.3, 2.2.4, 3.1.5, 3.2.1

hwloc versions available in /opt

  • NEW version: 2.1.0, 2.2.0
  • OLD version: 2.1.0, 2.2.0

NVIDIA driver

  • NEW version: 465.19.01
  • OLD version: 460.32.03

Open OnDemand

  • NEW version: 1.8.20
  • OLD version: 1.8.18-1.el7