Nexus/Vulcan: Difference between revisions

From UMIACS
Jump to navigation Jump to search
(76 intermediate revisions by 4 users not shown)
Line 1: Line 1:
The [https://wiki.umiacs.umd.edu/cfar/index.php/Vulcan Vulcan] standalone cluster's compute nodes will fold into [[Nexus]] on Thursday, August 17th, 2023 during the scheduled [[MonthlyMaintenanceWindow | maintenance window]] for August (5-8pm).
The compute nodes from Vulcan's previous standalone cluster have folded into [[Nexus]] as of the scheduled [[MonthlyMaintenanceWindow | maintenance window]] for August 2023 (Thursday 08/17/2023, 5-8pm).


The Nexus cluster already has a large pool of compute resources made possible through leftover funding for the [[Iribe | Brendan Iribe Center]]. Details on common nodes already in the cluster (Tron partition) can be found [[Nexus/Tron | here]].
The Nexus cluster already has a large pool of compute resources made possible through college-level funding for UMIACS and CSD faculty. Details on common nodes already in the cluster (Tron partition) can be found [[Nexus/Tron | here]].
 
In addition, the Vulcan cluster's standalone submission nodes <code>vulcansub00.umiacs.umd.edu</code> and <code>vulcansub01.umiacs.umd.edu</code> will be retired on Thursday, September 21st, 2023 during that month's maintenance window (5-8pm), as they will no longer be able to submit jobs to Vulcan compute nodes after the August maintenance window. Please use <code>nexusvulcan00.umiacs.umd.edu</code> and <code>nexusvulcan01.umiacs.umd.edu</code> for any general purpose Vulcan compute needs after this time.
 
As of Friday, July 21st, 2023, two 1080 Ti compute nodes (<code>vulcan11</code> and <code>vulcan12</code>) have been moved into Nexus to give you a chance to test your new submission scripts now if you would like. Please continue to run your normal Vulcan workloads on the standalone Vulcan cluster for now, as two compute nodes will not be able to handle jobs from multiple users simultaneously. Only the <code>vulcan-dpart</code> and <code>vulcan-scavenger</code> partitions are available to test with.
 
Please see the [[#Timeline | Timeline]] section below for concrete dates in chronological order.


Please [[HelpDesk | contact staff]] with any questions or concerns.
Please [[HelpDesk | contact staff]] with any questions or concerns.


==Usage==
==Usage==
The Nexus cluster submission nodes that are allocated to Vulcan are <code>nexusvulcan00.umiacs.umd.edu</code> and <code>nexusvulcan01.umiacs.umd.edu</code>. '''You must use these nodes to submit jobs to Vulcan compute nodes after the August maintenance window.''' Submission from <code>vulcansub00.umiacs.umd.edu</code> or <code>vulcansub01.umiacs.umd.edu</code> will no longer work.
You can [[SSH]] to <code>nexusvulcan.umiacs.umd.edu</code> to log in to a submission node.


All partitions, QoSes, and account names from the standalone Vulcan cluster are being moved over to Nexus when the compute nodes move. However, please note that <code>vulcan-</code> will be prepended to all of the values that were present in the standalone Vulcan cluster to distinguish them from existing values in Nexus. The lone exception is the base account currently named <code>vulcan</code> in the standalone cluster (will retain same name).
If you store something in a local directory (/tmp, /scratch0) on one of the two submission nodes, you will need to connect to that same submission node to access it later. The actual submission nodes are:
* <code>nexusvulcan00.umiacs.umd.edu</code>
* <code>nexusvulcan01.umiacs.umd.edu</code>
 
All partitions, QoSes, and account names from the standalone Vulcan cluster have been moved over to Nexus. However, please note that <code>vulcan-</code> is prepended to all of the values that were present in the standalone Vulcan cluster to distinguish them from existing values in Nexus. The lone exception is the base account that was named <code>vulcan</code> in the standalone cluster (it is also named just <code>vulcan</code> in Nexus).


Here are some before/after examples of job submission with various parameters:
Here are some before/after examples of job submission with various parameters:
Line 22: Line 20:
! Nexus cluster submission command
! Nexus cluster submission command
|-
|-
|<code>srun --partition=dpart --qos=medium --account=abhinav --gres=gpu:rtxa4000:2 --pty bash</code>
|<code>srun --partition=dpart --qos=medium --account=abhinav --gres=gpu:gtx1080ti:2 --pty bash</code>
|<code>srun --partition=vulcan-dpart --qos=vulcan-medium --account=vulcan-abhinav --gres=gpu:rtxa4000:2 --pty bash</code>
|<code>srun --partition=vulcan-dpart --qos=vulcan-medium --account=vulcan-abhinav --gres=gpu:gtx1080ti:2 --pty bash</code>
|-
|-
|<code>srun --partition=cpu --qos=cpu --pty bash</code>
|<code>srun --partition=cpu --qos=cpu --pty bash</code>
|<code>srun --partition=vulcan-cpu --qos=vulcan-cpu --pty bash</code>
|<code>srun --partition=vulcan-cpu --qos=vulcan-cpu --account=vulcan --pty bash</code>
|-
|-
|<code>srun --partition=scavenger --qos=scavenger --account=vulcan --gres=gpu:4 --pty bash</code>
|<code>srun --partition=scavenger --qos=scavenger --account=vulcan --gres=gpu:4 --pty bash</code>
Line 32: Line 30:
|}
|}


Vulcan users (exclusively) can schedule non-interruptible jobs on the moved nodes with any non-scavenger job parameters. Please note that the <code>vulcan-dpart</code> partition will have a <code>GrpTRES</code> limit of 100% of the available cores/RAM on vulcan## nodes plus 50% of the available cores/RAM on legacy## nodes, so your job may need to wait if all available cores/RAM (or GPUs) are in use. It also has a max submission limit of 500 jobs simultaneously so as to not overload the cluster. This is codified by the partition QoS named '''vulcan'''.
Vulcan users (exclusively) can schedule non-interruptible jobs on Vulcan nodes with any non-scavenger job parameters. Please note that the <code>vulcan-dpart</code> partition has a <code>GrpTRES</code> limit of 100% of the available cores/RAM on all vulcan## in aggregate nodes plus 50% of the available cores/RAM on legacy## nodes in aggregate, so your job may need to wait if all available cores/RAM (or GPUs) are in use. It also has a max submission limit of 500 jobs per user simultaneously so as to not overload the cluster. This is codified by the partition QoS named '''vulcan'''.


Please note that the Vulcan compute nodes will also be added to the institute-wide <code>scavenger</code> partition in Nexus. Vulcan users will still have scavenging priority over these nodes via the <code>vulcan-scavenger</code> partition (i.e., all <code>vulcan-</code> queue jobs (other than <code>vulcan-scavenger</code>) can preempt both <code>vulcan-scavenger</code> and <code>scavenger</code> queue jobs, and <code>vulcan-scavenger</code> queue jobs can preempt <code>scavenger</code> queue jobs).
Please note that the Vulcan compute nodes are also in the institute-wide <code>scavenger</code> partition in Nexus. Vulcan users still have scavenging priority over these nodes via the <code>vulcan-scavenger</code> partition (i.e., all <code>vulcan-</code> partition jobs (other than <code>vulcan-scavenger</code>) can preempt both <code>vulcan-scavenger</code> and <code>scavenger</code> partition jobs, and <code>vulcan-scavenger</code> partition jobs can preempt <code>scavenger</code> partition jobs).


==Timeline==
==Compute Nodes==
Each event will be completed within the timeframe specified.
There are currently 46 [[Nexus/Vulcan/GPUs | GPU nodes]] available, named vulcan[00-45], running a mixture of NVIDIA RTX A6000, NVIDIA RTX A5000, NVIDIA RTX A4000, and a number of different older generation cards. There are also 4 CPU-only nodes available, named brigid[16-19].


{| class="wikitable"
All nodes are scheduled with the [[SLURM]] resource manager.
! Date
 
! Event
==Network==
|-
The network infrastructure supporting the Vulcan partition consists of:
| July 21st 2023
# One pair of network switches connected to each other via dual 100GbE links for redundancy, serving the following compute nodes:
| Two compute nodes <code>vulcan11</code> and <code>vulcan12</code> are moved into Nexus so submission can be tested
#* brigid[16-17],vulcan[29-45]: Two 100GbE links per node, one to each switch in the pair (redundancy).
|-
# One pair of network switches connected to the above pair of switches through several sets of intermediary switches, and to each other via dual 10GbE links for redundancy. The immediate connection to these sets of intermediary switches is via two 40GbE links to a pair of them, one between the first two switches in each pair and one between the second two switches in each pair for redundancy. This pair serves the following compute nodes:
| [[MonthlyMaintenanceWindow | August 17th 2023, 5-8pm]]
#* brigid[18-19],vulcan[00-28]: Two 10GbE links per node, one to each switch in the pair (redundancy).
| All other standalone Vulcan cluster compute nodes are moved into Nexus in corresponding <code>vulcan-</code> named partitions
 
|-
The fileserver hosting all Vulcan [[Nexus/Vulcan#Scratch_Directories | scratch]], [[Nexus/Vulcan#Project_Storage | project]], and [[Nexus/Vulcan#Datasets | dataset]] allocations first connects to a pair of intermediary switches and then the first pair of switches mentioned [[Nexus/Tron#Network | here (Tron page's network section)]]. It then connects to the first pair of switches mentioned on this page through a set of four (different) intermediary switches. The last hop from the four intermediary switches to the first pair of switches mentioned on this page is via 32 100GbE links, four from each switch in the set to each switch in the first pair mentioned on this page for redundancy and increased bandwidth.
| [[MonthlyMaintenanceWindow | September 21st 2023, 5-8pm]]
 
| <code>vulcansub00.umiacs.umd.edu</code> and <code>vulcansub01.umiacs.umd.edu</code> are taken offline
For a broader overview of the network infrastructure supporting the Nexus cluster, please see [[Nexus/Network]].
|-
|}


==Post-Migration==
==Partitions==
===Partitions===
There are three partitions available to general Vulcan [[SLURM]] users.  You must specify a partition when submitting your job.
There are three partitions available to general Vulcan [[SLURM]] users.  You must specify a partition when submitting your job.


* '''vulcan-dpart''' - This is the default partition. Job allocations are guaranteed.
* '''vulcan-dpart''' - This is the default partition. Job allocations are guaranteed. Only nodes with GPUs from architectures older than NVIDIA's [https://www.nvidia.com/en-us/data-center/ampere-architecture/ Ampere architecture] are included in this partition.
* '''vulcan-scavenger''' - This is the alternate partition that allows jobs longer run times and more resources but is preemptable when jobs in other <code>vulcan-</code> partitions are ready to be scheduled.
* '''vulcan-scavenger''' - This is the alternate partition that allows jobs longer run times and more resources but is preemptable when jobs in other non-scavenger-named <code>vulcan-</code> partitions are ready to be scheduled.
* '''vulcan-cpu''' - This partition is for CPU focused jobs. Job allocations are guaranteed.
* '''vulcan-cpu''' - This partition is for CPU focused jobs. Job allocations are guaranteed.


There are a few additional partitions available to other Vulcan users based on temporary requirements.
There are a few additional partitions available to subsets of Vulcan users based on specific requirements.
 
* '''vulcan-ampere''' - This partition contains nodes with GPUs from NVIDIA's [https://www.nvidia.com/en-us/data-center/ampere-architecture/ Ampere architecture]. Job allocations are guaranteed.
*: As of Thursday 02/29/2024 at 12pm, there is a 4 hour time limit on interactive jobs in this partition. If you need to run longer jobs, you will need to modify your workflow into a job that can be submitted as a batch script.
*: As of Thursday 03/21/2024 at 5pm, there is a limit of 4 CPUs and 48G memory maximum per GPU requested by a job. If you need to run jobs with more CPUs/memory, you will either need to request more GPUs in the job or use a different partition.
 
: Submission is restricted to the Slurm [[#Accounts | accounts]] of the faculty who invested in these nodes:
:* Abhinav Shrivastava (vulcan-abhinav)
:* Jia-Bin Huang (vulcan-jbhuang)
:* Christopher Metzler (vulcan-metzler)
:* Matthias Zwicker (vulcan-zwicker)
 
* '''vulcan-scavenger-multi''' - This partition allows multi-node jobs (up to 9 total nodes per job) and allows jobs more resources than the vulcan-scavenger partition, but only contains nodes with GTX 1080 Ti and RTX 2080 Ti GPUs in them. As with vulcan-scavenger, it is preemptable when jobs in other non-scavenged-named <code>vulcan-</code> partitions are ready to be scheduled.
*: Please contact Abhinav Shrivastava if you would like to be granted access to this partition.
 
==Accounts==
Vulcan has a base SLURM account <code>vulcan</code> which has a modest number of guaranteed billing resources available to all cluster users at any given time.  Other faculty that have invested in Vulcan compute infrastructure have an additional account provided to their sponsored accounts on the cluster.
 
If you do not specify an account when submitting your job, you will receive the '''vulcan''' account.  If your faculty sponsor has their own account, it is recommended to use that account for job submission.


===Accounts===
The current faculty accounts are:
Vulcan has a base SLURM account <code>vulcan</code> which has a modest number of guaranteed billing resources available to all cluster users at any given time.  Other faculty that have invested in the cluster have an additional account provided to their sponsored accounts on the cluster, which provides a number of guaranteed billing resources corresponding to the amount that they invested.  If you do not specify an account when submitting your job, you will receive the '''vulcan''' account.
* vulcan-abhinav
* vulcan-djacobs
* vulcan-jbhuang
* vulcan-lsd
* vulcan-metzler
* vulcan-rama
* vulcan-ramani
* vulcan-yaser
* vulcan-zwicker


<pre>
<pre>
Line 71: Line 92:
             Account                          Descr        Org
             Account                          Descr        Org
-------------------- ------------------------------ ----------
-------------------- ------------------------------ ----------
                 ...
                 ...                            ...        ...
               vulcan                        vulcan    vulcan
               vulcan                        vulcan    vulcan
       vulcan-abhinav  vulcan - abhinav shrivastava    vulcan
       vulcan-abhinav  vulcan - abhinav shrivastava    vulcan
       vulcan-djacobs          vulcan - david jacobs    vulcan
       vulcan-djacobs          vulcan - david jacobs    vulcan
        vulcan-janus                vulcan - janus    vulcan
       vulcan-jbhuang        vulcan - jia-bin huang    vulcan
       vulcan-jbhuang        vulcan - jia-bin huang    vulcan
           vulcan-lsd          vulcan - larry davis    vulcan
           vulcan-lsd          vulcan - larry davis    vulcan
Line 83: Line 103:
         vulcan-yaser          vulcan - yaser yacoob    vulcan
         vulcan-yaser          vulcan - yaser yacoob    vulcan
       vulcan-zwicker      vulcan - matthias zwicker    vulcan
       vulcan-zwicker      vulcan - matthias zwicker    vulcan
                 ...
                 ...                            ...        ...
</pre>
</pre>
Faculty can manage this list of users via our [https://intranet.umiacs.umd.edu/directory/secgroup/ Directory application] in the Security Groups section.  The security group that controls access has the prefix <code>vulcan_</code> and then the faculty username.  It will also list <code>slurm://nexusctl.umiacs.umd.edu</code> as the associated URI.


You can check your account associations by running the '''show_assoc''' command to see the accounts you are associated with.  Please [[HelpDesk | contact staff]] and include your faculty member in the conversation if you do not see the appropriate association.  
You can check your account associations by running the '''show_assoc''' command to see the accounts you are associated with.  Please [[HelpDesk | contact staff]] and include your faculty member in the conversation if you do not see the appropriate association.  
Line 92: Line 114:
       User          Account MaxJobs      GrpTRES                                                                              QOS
       User          Account MaxJobs      GrpTRES                                                                              QOS
---------- ---------------- ------- ------------- --------------------------------------------------------------------------------
---------- ---------------- ------- ------------- --------------------------------------------------------------------------------
       ...
       ...             ...    ...                                                                                            ...
  abhinav          abhinav      48                          vulcan-cpu,vulcan-default,vulcan-high,vulcan-medium,vulcan-scavenger
   abhinav          vulcan      48                                      vulcan-cpu,vulcan-default,vulcan-medium,vulcan-scavenger
   abhinav          vulcan      48                                      vulcan-cpu,vulcan-default,vulcan-medium,vulcan-scavenger
       ...
  abhinav  vulcan-abhinav      48                          vulcan-cpu,vulcan-default,vulcan-high,vulcan-medium,vulcan-scavenger
       ...              ...    ...                                                                                            ...
</pre>
</pre>


You can also see the total number of Track-able Resources (TRES) allowed for each account by running the following command. Please make sure you give the appropriate account that you are looking for.  
You can also see the total number of Track-able Resources (TRES) allowed for each account by running the following command. Please make sure you give the appropriate account that you are looking for. As shown below, there is a concurrent limit of 64 total GPUs for all users not in a contributing faculty group.


<pre>
<pre>
Line 105: Line 127:
---------- ---------- -------------------- -------------
---------- ---------- -------------------- -------------
               vulcan                        gres/gpu=64
               vulcan                        gres/gpu=64
                   ...
                   ...                                ...
</pre>
</pre>


===QoS===
==QoS==
You need to decide the QOS to submit with which will set a certain number of restrictions to your job.  If you do not specify a QoS when submitting your job, you will receive the '''vulcan-default''' QoS assuming you are using a Vulcan account.
You need to decide the QOS to submit with which will set a certain number of restrictions to your job.  If you do not specify a QoS when submitting your job using the <code>--qos</code> parameter, you will receive the '''vulcan-default''' QoS assuming you are using a Vulcan account.


The following <code>sacctmgr</code> command will list the current QOS.  Either the <code>vulcan-default</code>, <code>vulcan-medium</code>, or <code>vulcan-high</code> QOS is required for the vulcan-dpart partition.  This will be passed to all your submission commands as <code>--qos</code>.
The following <code>sacctmgr</code> command will list the current QOS.  Either the <code>vulcan-default</code>, <code>vulcan-medium</code>, or <code>vulcan-high</code> QOS is required for the vulcan-dpart partition.  Please note that only faculty accounts (see above) have access to the <code>vulcan-high</code> QoS.


The following example will show you the current limits that the QOS have.
The following example will show you the current limits that the QOS have. The output is truncated to show only relevant Vulcan QoS.
<pre>
<pre>
$ show_qos
$ show_qos
                 Name    MaxWall                        MaxTRES MaxJobsPU MaxSubmitPU                     MaxTRESPU             GrpTRES
                 Name    MaxWall                        MaxTRES MaxJobsPU                      MaxTRESPU  
-------------------- ----------- ------------------------------ --------- ----------- ------------------------------ --------------------
-------------------- ----------- ------------------------------ --------- ------------------------------  
                ...        ...                            ...      ...        ...                            ...                  ...
...
      vulcan-medium 3-00:00:00       cpu=8,gres/gpu=2,mem=64G         2
          vulcan-cpu 2-00:00:00               cpu=1024,mem=4T         4                               
        vulcan-high 1-12:00:00     cpu=16,gres/gpu=4,mem=128G         2
      vulcan-default 7-00:00:00       cpu=4,gres/gpu=1,mem=32G         2                              
      vulcan-default 7-00:00:00       cpu=4,gres/gpu=1,mem=32G         2
      vulcan-exempt 7-00:00:00     cpu=32,gres/gpu=8,mem=256G         2                              
    vulcan-scavenger 3-00:00:00    cpu=32,gres/gpu=8,mem=256G
        vulcan-high 1-12:00:00    cpu=16,gres/gpu=4,mem=128G        2                               
         vulcan-janus  3-00:00:00    cpu=32,gres/gpu=10,mem=256G
         vulcan-janus  3-00:00:00    cpu=32,gres/gpu=10,mem=256G                                        
       vulcan-exempt 7-00:00:00     cpu=32,gres/gpu=8,mem=256G         2
       vulcan-medium 3-00:00:00       cpu=8,gres/gpu=2,mem=64G         2                              
          vulcan-cpu 2-00:00:00               cpu=1024,mem=4T        4
      vulcan-sailon 3-00:00:00     cpu=32,gres/gpu=8,mem=256G                              gres/gpu=48
     vulcan-exclusive 30-00:00:00
     vulcan-scavenger 3-00:00:00    cpu=32,gres/gpu=8,mem=256G                                           
      vulcan-sailon 3-00:00:00    cpu=32,gres/gpu=8,mem=256G                                          gres/gpu=48
...
                ...         ...                            ...      ...        ...                            ...                  ...
 
$ show_partition_qos
                Name MaxSubmitPU                      MaxTRESPU              GrpTRES
-------------------- ----------- ------------------------------ --------------------
...
              vulcan        500                                cpu=1760,mem=15824G
    vulcan-scavenger         500                                                   
...
</pre>
</pre>


===Data Storage===
==Storage==
[https://wiki.umiacs.umd.edu/cfar/index.php/Vulcan/Storage All data storage that was available on the standalone Vulcan cluster] will continue to be available in Nexus.
Vulcan has the following storage available.  Please also review UMIACS [[FilesystemDataStorage | Filesystem Data Storage]] policies including any volume that is labeled as scratch.
 
Vulcan users can also request [[Nexus#Project_Allocations | Nexus project allocations]].
 
===Home Directories===
{{Nfshomes}}
 
===Scratch Directories===
Scratch data has no data protection including no snapshots and the data is not backed up. There are two types of scratch directories in the Vulcan compute infrastructure:
* Network scratch directory
* Local scratch directories
 
====Network Scratch Directory====
You have 300GB of scratch storage available at <code>/vulcanscratch/<username></code>.  '''It is not backed up or protected in any way.'''  This directory is '''automounted''' so you will need to <code>cd</code> into the directory or request/specify a fully qualified file path to access this.
 
You may request a temporary increase of up to 500GB total space for a maximum of 120 days without any faculty approval by [[HelpDesk | contacting staff]].  Once the temporary increase period is over, you will be contacted and given a one-week window of opportunity to clean and secure your data before staff will forcibly remove data to get your space back under 300GB.  If you need space beyond 500GB or for longer than 120 days, you will need faculty approval and/or a project directory.
 
This file system is available on all submission and computational nodes within the cluster.
 
====Local Scratch Directories====
Each computational node that you can schedule compute jobs on has one or more local scratch directories.  These are always named <code>/scratch0</code>, <code>/scratch1</code>, etc.  These are almost always more performant than any other storage available to the job.  However, you must stage their data within the confine of their job and stage the data out before the end of their job.
 
These local scratch directories have a tmpwatch job which will '''delete unaccessed data after 90 days''', scheduled via maintenance jobs to run once a month at 1am.  Different nodes will run the maintenance jobs on different days of the month to ensure the cluster is still highly available at all times.  Please make sure you secure any data you write to these directories at the end of your job.
 
===Datasets===
We have read-only dataset storage available at <code>/fs/vulcan-datasets</code>.  If there are datasets that you would like to see curated and made available, please see [[Datasets | this page]].
 
The list of Vulcan datasets we currently host can be viewed [https://info.umiacs.umd.edu/datasets/list/?q=Vulcan here].
 
===Project Storage===
Users within the Vulcan compute infrastructure can request project based allocations for up to 10TB for up to 180 days by [[HelpDesk | contacting staff]] with approval from the Vulcan faculty manager (Dr. Shrivastava).  These allocations will be available from <code>/fs/vulcan-projects</code> and <code>/fs/cfar-projects</code> under a name that you provide when you request the allocation.  Near the end of the allocation period, staff will contact you and ask if you would like to renew the allocation for up to another 180 days (requires re-approval from Dr. Shrivastava).
* If you are no longer in need of the storage allocation, you will need to relocate all desired data within two weeks of the end of the allocation period.  Staff will then remove the allocation.
* If you do not respond to staff's request by the end of the allocation period, staff will make the allocation temporarily inaccessible.
** If you do respond asking for renewal but the original faculty approver does not respond within two weeks of the end of the allocation period, staff will also make the allocation temporarily inaccessible.
** If one month from the end of the allocation period is reached without both you and the faculty approver responding, staff will remove the allocation.
 
Project storage is fully protected.  It has [[Snapshots | snapshots]] enabled and is [[NightlyBackups | backed up nightly]].
 
===Object Storage===
All Vulcan users can request project allocations in the [https://obj.umiacs.umd.edu/obj/help UMIACS Object Store]. Please [[HelpDesk | contact staff]] with a short project name and the amount of storage you will need to get started.
 
To access this storage, you'll need to use a [[S3Clients | S3 client]] or our [[UMobj]] command line utilities.
 
An example on how to use the umobj command line utilities can be found [[UMobj/Example | here]].  A full set of documentation for the utilities can be found on the [https://gitlab.umiacs.umd.edu/staff/umobj/blob/master/README.md#umobj umobj Gitlab page].


However, please note that the Nexus cluster uses [[NFShomes]] home directories - if your UMIACS account was created before February 22nd, 2023, you have been using <code>/fs/cfarhomes/<username></code> as your home directory on the standalone Vulcan cluster. While <code>/fs/cfarhomes</code> is available on Nexus, your shell initialization scripts from it will not automatically load. Please copy over anything you need to your <code>/fs/nfshomes/<username></code> directory at your earliest convenience, as <code>/fs/cfarhomes</code> may be retired in the coming year.
==Migration==
===Home Directories===
The [[Nexus]] uses [[NFShomes]] home directories - if your UMIACS account was created before February 22nd, 2023, you were using <code>/cfarhomes/<username></code> as your home directory on the standalone Vulcan cluster. While <code>/cfarhomes</code> is available on Nexus, your shell initialization scripts from it will not automatically load. Please copy over anything you need to your <code>/nfshomes/<username></code> directory at your earliest convenience, as <code>/cfarhomes</code> will be retired in a two phase process:
* Fri 11/17/2023, 5pm: cfarhomes directories are made read-only
* Thu 12/21/2023, 5-8pm ([[MonthlyMaintenanceWindow |monthly maintenance window]]): cfarhomes directories are taken offline

Revision as of 14:50, 13 December 2024

The compute nodes from Vulcan's previous standalone cluster have folded into Nexus as of the scheduled maintenance window for August 2023 (Thursday 08/17/2023, 5-8pm).

The Nexus cluster already has a large pool of compute resources made possible through college-level funding for UMIACS and CSD faculty. Details on common nodes already in the cluster (Tron partition) can be found here.

Please contact staff with any questions or concerns.

Usage

You can SSH to nexusvulcan.umiacs.umd.edu to log in to a submission node.

If you store something in a local directory (/tmp, /scratch0) on one of the two submission nodes, you will need to connect to that same submission node to access it later. The actual submission nodes are:

  • nexusvulcan00.umiacs.umd.edu
  • nexusvulcan01.umiacs.umd.edu

All partitions, QoSes, and account names from the standalone Vulcan cluster have been moved over to Nexus. However, please note that vulcan- is prepended to all of the values that were present in the standalone Vulcan cluster to distinguish them from existing values in Nexus. The lone exception is the base account that was named vulcan in the standalone cluster (it is also named just vulcan in Nexus).

Here are some before/after examples of job submission with various parameters:

Standalone Vulcan cluster submission command Nexus cluster submission command
srun --partition=dpart --qos=medium --account=abhinav --gres=gpu:gtx1080ti:2 --pty bash srun --partition=vulcan-dpart --qos=vulcan-medium --account=vulcan-abhinav --gres=gpu:gtx1080ti:2 --pty bash
srun --partition=cpu --qos=cpu --pty bash srun --partition=vulcan-cpu --qos=vulcan-cpu --account=vulcan --pty bash
srun --partition=scavenger --qos=scavenger --account=vulcan --gres=gpu:4 --pty bash srun --partition=vulcan-scavenger --qos=vulcan-scavenger --account=vulcan --gres=gpu:4 --pty bash

Vulcan users (exclusively) can schedule non-interruptible jobs on Vulcan nodes with any non-scavenger job parameters. Please note that the vulcan-dpart partition has a GrpTRES limit of 100% of the available cores/RAM on all vulcan## in aggregate nodes plus 50% of the available cores/RAM on legacy## nodes in aggregate, so your job may need to wait if all available cores/RAM (or GPUs) are in use. It also has a max submission limit of 500 jobs per user simultaneously so as to not overload the cluster. This is codified by the partition QoS named vulcan.

Please note that the Vulcan compute nodes are also in the institute-wide scavenger partition in Nexus. Vulcan users still have scavenging priority over these nodes via the vulcan-scavenger partition (i.e., all vulcan- partition jobs (other than vulcan-scavenger) can preempt both vulcan-scavenger and scavenger partition jobs, and vulcan-scavenger partition jobs can preempt scavenger partition jobs).

Compute Nodes

There are currently 46 GPU nodes available, named vulcan[00-45], running a mixture of NVIDIA RTX A6000, NVIDIA RTX A5000, NVIDIA RTX A4000, and a number of different older generation cards. There are also 4 CPU-only nodes available, named brigid[16-19].

All nodes are scheduled with the SLURM resource manager.

Network

The network infrastructure supporting the Vulcan partition consists of:

  1. One pair of network switches connected to each other via dual 100GbE links for redundancy, serving the following compute nodes:
    • brigid[16-17],vulcan[29-45]: Two 100GbE links per node, one to each switch in the pair (redundancy).
  2. One pair of network switches connected to the above pair of switches through several sets of intermediary switches, and to each other via dual 10GbE links for redundancy. The immediate connection to these sets of intermediary switches is via two 40GbE links to a pair of them, one between the first two switches in each pair and one between the second two switches in each pair for redundancy. This pair serves the following compute nodes:
    • brigid[18-19],vulcan[00-28]: Two 10GbE links per node, one to each switch in the pair (redundancy).

The fileserver hosting all Vulcan scratch, project, and dataset allocations first connects to a pair of intermediary switches and then the first pair of switches mentioned here (Tron page's network section). It then connects to the first pair of switches mentioned on this page through a set of four (different) intermediary switches. The last hop from the four intermediary switches to the first pair of switches mentioned on this page is via 32 100GbE links, four from each switch in the set to each switch in the first pair mentioned on this page for redundancy and increased bandwidth.

For a broader overview of the network infrastructure supporting the Nexus cluster, please see Nexus/Network.

Partitions

There are three partitions available to general Vulcan SLURM users. You must specify a partition when submitting your job.

  • vulcan-dpart - This is the default partition. Job allocations are guaranteed. Only nodes with GPUs from architectures older than NVIDIA's Ampere architecture are included in this partition.
  • vulcan-scavenger - This is the alternate partition that allows jobs longer run times and more resources but is preemptable when jobs in other non-scavenger-named vulcan- partitions are ready to be scheduled.
  • vulcan-cpu - This partition is for CPU focused jobs. Job allocations are guaranteed.

There are a few additional partitions available to subsets of Vulcan users based on specific requirements.

  • vulcan-ampere - This partition contains nodes with GPUs from NVIDIA's Ampere architecture. Job allocations are guaranteed.
    As of Thursday 02/29/2024 at 12pm, there is a 4 hour time limit on interactive jobs in this partition. If you need to run longer jobs, you will need to modify your workflow into a job that can be submitted as a batch script.
    As of Thursday 03/21/2024 at 5pm, there is a limit of 4 CPUs and 48G memory maximum per GPU requested by a job. If you need to run jobs with more CPUs/memory, you will either need to request more GPUs in the job or use a different partition.
Submission is restricted to the Slurm accounts of the faculty who invested in these nodes:
  • Abhinav Shrivastava (vulcan-abhinav)
  • Jia-Bin Huang (vulcan-jbhuang)
  • Christopher Metzler (vulcan-metzler)
  • Matthias Zwicker (vulcan-zwicker)
  • vulcan-scavenger-multi - This partition allows multi-node jobs (up to 9 total nodes per job) and allows jobs more resources than the vulcan-scavenger partition, but only contains nodes with GTX 1080 Ti and RTX 2080 Ti GPUs in them. As with vulcan-scavenger, it is preemptable when jobs in other non-scavenged-named vulcan- partitions are ready to be scheduled.
    Please contact Abhinav Shrivastava if you would like to be granted access to this partition.

Accounts

Vulcan has a base SLURM account vulcan which has a modest number of guaranteed billing resources available to all cluster users at any given time. Other faculty that have invested in Vulcan compute infrastructure have an additional account provided to their sponsored accounts on the cluster.

If you do not specify an account when submitting your job, you will receive the vulcan account. If your faculty sponsor has their own account, it is recommended to use that account for job submission.

The current faculty accounts are:

  • vulcan-abhinav
  • vulcan-djacobs
  • vulcan-jbhuang
  • vulcan-lsd
  • vulcan-metzler
  • vulcan-rama
  • vulcan-ramani
  • vulcan-yaser
  • vulcan-zwicker
$ sacctmgr show account format=account%20,description%30,organization%10
             Account                          Descr        Org
-------------------- ------------------------------ ----------
                 ...                            ...        ...
              vulcan                         vulcan     vulcan
      vulcan-abhinav   vulcan - abhinav shrivastava     vulcan
      vulcan-djacobs          vulcan - david jacobs     vulcan
      vulcan-jbhuang         vulcan - jia-bin huang     vulcan
          vulcan-lsd           vulcan - larry davis     vulcan
      vulcan-metzler         vulcan - chris metzler     vulcan
         vulcan-rama        vulcan - rama chellappa     vulcan
       vulcan-ramani     vulcan - ramani duraiswami     vulcan
        vulcan-yaser          vulcan - yaser yacoob     vulcan
      vulcan-zwicker      vulcan - matthias zwicker     vulcan
                 ...                            ...        ...

Faculty can manage this list of users via our Directory application in the Security Groups section. The security group that controls access has the prefix vulcan_ and then the faculty username. It will also list slurm://nexusctl.umiacs.umd.edu as the associated URI.

You can check your account associations by running the show_assoc command to see the accounts you are associated with. Please contact staff and include your faculty member in the conversation if you do not see the appropriate association.

$ show_assoc
      User          Account MaxJobs       GrpTRES                                                                              QOS
---------- ---------------- ------- ------------- --------------------------------------------------------------------------------
       ...              ...     ...                                                                                            ...
   abhinav           vulcan      48                                       vulcan-cpu,vulcan-default,vulcan-medium,vulcan-scavenger
   abhinav   vulcan-abhinav      48                           vulcan-cpu,vulcan-default,vulcan-high,vulcan-medium,vulcan-scavenger
       ...              ...     ...                                                                                            ...

You can also see the total number of Track-able Resources (TRES) allowed for each account by running the following command. Please make sure you give the appropriate account that you are looking for. As shown below, there is a concurrent limit of 64 total GPUs for all users not in a contributing faculty group.

$ sacctmgr show assoc account=vulcan format=user,account,qos,grptres
      User    Account                  QOS       GrpTRES
---------- ---------- -------------------- -------------
               vulcan                        gres/gpu=64
                  ...                                ...

QoS

You need to decide the QOS to submit with which will set a certain number of restrictions to your job. If you do not specify a QoS when submitting your job using the --qos parameter, you will receive the vulcan-default QoS assuming you are using a Vulcan account.

The following sacctmgr command will list the current QOS. Either the vulcan-default, vulcan-medium, or vulcan-high QOS is required for the vulcan-dpart partition. Please note that only faculty accounts (see above) have access to the vulcan-high QoS.

The following example will show you the current limits that the QOS have. The output is truncated to show only relevant Vulcan QoS.

$ show_qos
                Name     MaxWall                        MaxTRES MaxJobsPU                      MaxTRESPU 
-------------------- ----------- ------------------------------ --------- ------------------------------ 
...
          vulcan-cpu  2-00:00:00                cpu=1024,mem=4T         4                                
      vulcan-default  7-00:00:00       cpu=4,gres/gpu=1,mem=32G         2                                
       vulcan-exempt  7-00:00:00     cpu=32,gres/gpu=8,mem=256G         2                                
         vulcan-high  1-12:00:00     cpu=16,gres/gpu=4,mem=128G         2                                
        vulcan-janus  3-00:00:00    cpu=32,gres/gpu=10,mem=256G                                          
       vulcan-medium  3-00:00:00       cpu=8,gres/gpu=2,mem=64G         2                                
       vulcan-sailon  3-00:00:00     cpu=32,gres/gpu=8,mem=256G                              gres/gpu=48 
    vulcan-scavenger  3-00:00:00     cpu=32,gres/gpu=8,mem=256G                                          
...

$ show_partition_qos
                Name MaxSubmitPU                      MaxTRESPU              GrpTRES 
-------------------- ----------- ------------------------------ -------------------- 
...
              vulcan         500                                 cpu=1760,mem=15824G 
    vulcan-scavenger         500                                                     
...

Storage

Vulcan has the following storage available. Please also review UMIACS Filesystem Data Storage policies including any volume that is labeled as scratch.

Vulcan users can also request Nexus project allocations.

Home Directories

You have 30GB of home directory storage available at /nfshomes/<username>. It has both Snapshots and Backups enabled.

Home directories are intended to store personal or configuration files only. We encourage you to not share any data in your home directory. You are encouraged to utilize our GitLab infrastructure to host your code repositories.

NOTE: To check your quota on this directory, use the command df -h ~.

Scratch Directories

Scratch data has no data protection including no snapshots and the data is not backed up. There are two types of scratch directories in the Vulcan compute infrastructure:

  • Network scratch directory
  • Local scratch directories

Network Scratch Directory

You have 300GB of scratch storage available at /vulcanscratch/<username>. It is not backed up or protected in any way. This directory is automounted so you will need to cd into the directory or request/specify a fully qualified file path to access this.

You may request a temporary increase of up to 500GB total space for a maximum of 120 days without any faculty approval by contacting staff. Once the temporary increase period is over, you will be contacted and given a one-week window of opportunity to clean and secure your data before staff will forcibly remove data to get your space back under 300GB. If you need space beyond 500GB or for longer than 120 days, you will need faculty approval and/or a project directory.

This file system is available on all submission and computational nodes within the cluster.

Local Scratch Directories

Each computational node that you can schedule compute jobs on has one or more local scratch directories. These are always named /scratch0, /scratch1, etc. These are almost always more performant than any other storage available to the job. However, you must stage their data within the confine of their job and stage the data out before the end of their job.

These local scratch directories have a tmpwatch job which will delete unaccessed data after 90 days, scheduled via maintenance jobs to run once a month at 1am. Different nodes will run the maintenance jobs on different days of the month to ensure the cluster is still highly available at all times. Please make sure you secure any data you write to these directories at the end of your job.

Datasets

We have read-only dataset storage available at /fs/vulcan-datasets. If there are datasets that you would like to see curated and made available, please see this page.

The list of Vulcan datasets we currently host can be viewed here.

Project Storage

Users within the Vulcan compute infrastructure can request project based allocations for up to 10TB for up to 180 days by contacting staff with approval from the Vulcan faculty manager (Dr. Shrivastava). These allocations will be available from /fs/vulcan-projects and /fs/cfar-projects under a name that you provide when you request the allocation. Near the end of the allocation period, staff will contact you and ask if you would like to renew the allocation for up to another 180 days (requires re-approval from Dr. Shrivastava).

  • If you are no longer in need of the storage allocation, you will need to relocate all desired data within two weeks of the end of the allocation period. Staff will then remove the allocation.
  • If you do not respond to staff's request by the end of the allocation period, staff will make the allocation temporarily inaccessible.
    • If you do respond asking for renewal but the original faculty approver does not respond within two weeks of the end of the allocation period, staff will also make the allocation temporarily inaccessible.
    • If one month from the end of the allocation period is reached without both you and the faculty approver responding, staff will remove the allocation.

Project storage is fully protected. It has snapshots enabled and is backed up nightly.

Object Storage

All Vulcan users can request project allocations in the UMIACS Object Store. Please contact staff with a short project name and the amount of storage you will need to get started.

To access this storage, you'll need to use a S3 client or our UMobj command line utilities.

An example on how to use the umobj command line utilities can be found here. A full set of documentation for the utilities can be found on the umobj Gitlab page.

Migration

Home Directories

The Nexus uses NFShomes home directories - if your UMIACS account was created before February 22nd, 2023, you were using /cfarhomes/<username> as your home directory on the standalone Vulcan cluster. While /cfarhomes is available on Nexus, your shell initialization scripts from it will not automatically load. Please copy over anything you need to your /nfshomes/<username> directory at your earliest convenience, as /cfarhomes will be retired in a two phase process:

  • Fri 11/17/2023, 5pm: cfarhomes directories are made read-only
  • Thu 12/21/2023, 5-8pm (monthly maintenance window): cfarhomes directories are taken offline