HPC Storage¶
Overview¶
The University’s Research Data Center provides data storage for active analysis on the high-performance computers (HPCs). Using central computing storage services and resources, University researchers, faculty researchers, and post-doctoral researchers are able to:
- Share research data in a collaborative environment with other UArizona affiliates on the HPC system.
- Store large-scale computational research data.
- Request additional storage for further data analysis.
The storage is mounted as a filesystem and all the clusters have access to the same filesystems.
Every user has access to individual and shared storage on the system where they can host data for active analyses. A summary of these locations is shown below:
Path |
Description | Quota |
Duration |
---|---|---|---|
/home/uxx/netid |
An individual storage allocation provided for every HPC user | 50 GB | Accessible for the duration of user's account |
/groups/pi_netid |
A communal storage allocation provided for every research group | 500 GB | Accessible for the duration of a PI's account |
/xdisk/pi_netid |
Temporary communal storage available for every group on request. See xdisk section below for details. | 200 GB to 20 TB | Up to 300 days |
/tmp |
Local storage available on individual compute nodes. | \(<\) 800 GB to 1.4 TB | Only accessible for the duration of a job's run |
Managing permissions
If you're working with other members of your group and need to make your files more accessible, see our bash cheat sheet. This offers an overview on Linux file permissions, ownership, and some tips for working with collaborators.
Best Practices¶
The shared file system on HPC is the location for everything in /home
, /groups
, and /xdisk
. The /tmp
directory is also available to users, and refers to the local disk on each node. Your I/O activity can have dramatic activity on other users. Extreme read/write activity can cause bottlenecks and may be cancelled without warning. It is generally best to limit I/O whenever possible to avoid straining the system. The HPC consult team is available to help optimize workflows that may be impacted by I/O.
-
Be aware of I/O load.
Running multiple instances of jobs performing significant I/O activity may be detrimental to the system, especially if these occur within the same subdirectories. It may be best to read in data at the beginning of a workflow, perform the entire analysis, then write at the very end. Reconfiguring your workflow to limit I/O may cost some time up front, but will most likely be made back through faster job completion.
If you are running array jobs, please be cognizant of your I/O activity.
-
Use /tmp for working space
If you have multiple jobs that will use the same data, consider copying it to
/tmp
and run multiple jobs. This can increase performance and reduce I/O load. -
Avoid storing many files in a single directory
Hundreds of files is probably ok; tens of thousands is not.
-
Avoid opening and closing files repeatedly in tight loops
If possible, open files once at the beginning of your workflow/program, then close them at the end.
-
Watch your quotas
You are limited in capacity and exceeding your storage quotas may have unintended side effects (e.g., login issues, data loss, or failed jobs). See the section below on checking your storage usage.
-
Avoid frequent snapshot files
This can stress the storage.
-
Use parallel I/O
Some modules enable parallelized file operations, such as
phdf5
.
Checking Your Storage Usage¶
To check your storage usage, on a compute node, file transfer node, or login node, use the command uquota
. This will show you all the spaces you have access to, their quotas, and current usage.
(puma) [netid@junonia ~]$ uquota
used soft limit hard limit
/groups/pi_netid 6.6G 500.0G 500.0G
/home 37.1G 50.0G 50.0G
/xdisk/pi_netid 12.9G 9.8T 9.8T
You can check your storage allocation through our online user portal by navigating to the Storage tab and clicking Check Disk Quotas:
xdisk¶
What is xdisk?¶
xdisk is a temporary storage allocation available to all faculty members (PIs) and offers up to 20 TB of usable space for their group for up to 300 days. PIs may only have one active xdisk at a time.
A PI can request an allocation either via the command line or through our web portal (no paperwork necessary!). Only PIs may request, alter, or delete an allocation from the command line. However, members of their research group may be delegated management rights allowing them to manage a group's xdisk on their PI's behalf through our web portal.
Once an xdisk allocation is created, it is immediately available for use. Groups can find their allocations under /xdisk/pi_netid
. By default, a subdirectory is created for each group member under /xdisk/pi_netid/netid
. If a group member is added after the allocation is created, a directory is not automatically created for them. To add one, reach out to our consultants.
Because xdisk allocations are temporary, they will be removed as soon as their time limit is reached. Warnings will be sent to every group member at their netid@arizona.edu addresses beginning one month before the expiration. It is the group's responsibility to renew xdisk allocations or copy files to an alternate storage location prior to the expiration date. Once an xdisk allocation expires, the contents are deleted.
PIs may request a new xdisk allocation immediately after their previous one has expired. This ensures groups will always have access to increased storage on HPC on a rolling basis with the requirement that housekeeping be done once per academic year.
Requesting, Modifying, and Deleting an Allocation¶
XDisk management is limited to PIs and delegates
Only PIs and trusted delegates can create, delete, and modify xdisk allocations. The CLI interface is restricted to PIs only. Delegates may manage their groups allocation through the user portal. For more information on adding group delegates and how they can use the portal on their PI's behalf, see: Delegating Group Management Rights.
User Portal¶
Warning
If a group has an active xdisk allocation, a new one cannot be created until the active allocation expires or is deleted.
PIs or delegates can request an xdisk allocation at any time through the user portal. Under the Storage tab, select Create XDisk
This will open a web form where you can enter your size and duration requirements in GB and days, respectively. The maximum size that can be requested is 20000 GB and the maximum duration is 300 days. In addition, specify the desired group ownership for the allocation from the Group dropdown menu This will determine file permissions and who has access. Once you click , your allocation should immediately be available.
PIs or delegates may manage their xdisk allocation at any time through the user portal. Under the Storage tab, either select Update XDisk Size or Update XDisk Duration, depending on the property you would like to update.
This will open a form which will allow you to modify the size and duration of your xdisk. Xdisk allocations cannot be increased beyond 20000 GB and the maximum duration of 300 days. Note: the Group field may only be modified at the time of the allocation's creation.
PIs or delegates may delete their xdisk allocation at any time through the user portal. Under the Storage tab, select Delete XDisk
Clicking this link will open a window with a prompt. Type confirm and then select to complete the process.
If you would like to request a new xdisk, you may do so as soon as the request is processed. Note: sometimes processing the request can take a few minutes, depending on the number of files and the size of the allocation.
CLI Commands (PIs only)¶
Warning
The xdisk CLI commands are usable by PIs only. Group delegates can manage allocations via the user portal after switching to their PI's account.
xdisk
is a locally written utility for PI's to create, delete, resize, and extend xdisk allocations. Any PIs who wish to utilize the CLI to manage their allocations can do so using the syntax shown below:
xdisk Function | Command | Examples |
---|---|---|
Display xdisk help |
|
|
View Current Information |
|
|
Create an xdisk |
|
|
Extend xdisk Expiration Date |
|
|
Resize an xdisk Allocation |
|
|
Delete an xdisk Allocation |
|
|