HPC-Scripts

Scripts to help Unix Administrators and Users manage High Performance Computing (HPC) environments.

HPC Cleanup Scripts

Scratch Filesystem Cleanup (expirefiles.py)

HPC systems have very large, fast parallel filesystems where users can generate and use terabytes of data during computation. These are commonly called "scratch" filesystems.

Unfortunately, users tend to leave files around in these filesystems rather than backing them up to long-term storage such as HSM. Quotas can mitigate this, but people can still leave unneeded files around and hog space within their quota. Also, since "scratch" filesystems are typically not backed up, leaving files in them is not safe.

Given a filesystem, expirefiles finds all files that have not been accessed in a specified number of days. It has options to warn users by email about files that are about to be expired (removed).

Exceptions for usernames and file paths are supported, so that certain files can be exempted from later deletion.
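The selection logic described above can be sketched roughly as follows. This is a minimal illustration, not the code in expirefiles.py: the function name `find_expired` and its parameters are hypothetical, owners are exempted by numeric UID rather than by username (a username could be resolved with `pwd.getpwnam(name).pw_uid`), and path exceptions are assumed to be shell-style glob patterns.

```python
import fnmatch
import os
import time


def find_expired(root, max_age_days, exempt_uids=(), exempt_patterns=(), now=None):
    """Yield files under root whose last access is older than max_age_days.

    Hypothetical helper: exempt_uids and exempt_patterns stand in for the
    username and path exceptions mentioned in the README.
    """
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400  # seconds per day
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                st = os.lstat(path)  # lstat: do not follow symlinks
            except OSError:
                continue  # file vanished or is unreadable; skip it
            if st.st_uid in exempt_uids:
                continue  # owner is exempt
            if any(fnmatch.fnmatch(path, pat) for pat in exempt_patterns):
                continue  # path is exempt
            if st.st_atime < cutoff:
                yield path
```

For example, `list(find_expired("/scratch", 30, exempt_patterns=["/scratch/shared/*"]))` would list candidates without deleting anything; the `/scratch` paths are placeholders. Note that access times are only meaningful if the filesystem is not mounted with `noatime`.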

For more details see expirefiles.

HPC Head Node Scripts

In a typical HPC environment, users log in to head nodes (also referred to as login nodes), from which they submit their batch jobs.

HPC Head Node Abuse Detection (goodcitizen.sh)

Sometimes users run CPU intensive jobs on the head nodes rather than submitting batch jobs to PBS/Torque.

The goodcitizen.sh script detects users who are running CPU intensive jobs and notifies them via email to use interactive batch jobs instead.
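As a rough sketch of this kind of check (goodcitizen.sh itself is a shell script, and its actual logic may differ), the Python below parses `ps` output and groups processes above a CPU threshold by user. The function names, the 50% threshold, and the exempt-user list are assumptions made for illustration. On Linux, the `pcpu` value reported by `ps` is averaged over the process lifetime, so it is not an instantaneous reading.

```python
import subprocess


def parse_ps(text):
    """Parse lines of 'user pid pcpu comm' into (user, pid, pcpu, comm) tuples."""
    rows = []
    for line in text.splitlines():
        parts = line.split(None, 3)
        if len(parts) != 4:
            continue  # malformed or empty line
        user, pid, pcpu, comm = parts
        try:
            rows.append((user, int(pid), float(pcpu), comm.strip()))
        except ValueError:
            continue  # non-numeric pid or pcpu; skip
    return rows


def cpu_hogs(rows, threshold=50.0, exempt_users=("root",)):
    """Group processes at or above threshold percent CPU by owning user."""
    hogs = {}
    for user, pid, pcpu, comm in rows:
        if pcpu >= threshold and user not in exempt_users:
            hogs.setdefault(user, []).append((pid, pcpu, comm))
    return hogs


def snapshot():
    """Collect the current process table; empty '=' headers suppress the header row."""
    cmd = ["ps", "-e", "-o", "user=", "-o", "pid=", "-o", "pcpu=", "-o", "comm="]
    out = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
    return parse_ps(out)
```

Calling `cpu_hogs(snapshot())` on a head node would return a dictionary keyed by username, which a notifier could then turn into one email per user.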

Other checks can be added, for example:

"watch qstat" detection - users sometimes overload the PBS/Torque scheduler by continually polling the status of their jobs with watch qstat.

For more details on configuration see goodcitizen.
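The "watch qstat" check mentioned above could be approached along these lines. This is only a sketch under stated assumptions: the input is taken to be the output of `ps -e -o user= -o args=`, the function name `find_qstat_watchers` is hypothetical, and nothing here is taken from goodcitizen.sh.

```python
import os


def find_qstat_watchers(ps_text):
    """Return sorted usernames running 'watch ... qstat ...'.

    Expects one process per line in the form 'user command args...'.
    """
    watchers = set()
    for line in ps_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # need a user, 'watch', and at least one argument
        user, argv = fields[0], fields[1:]
        if os.path.basename(argv[0]) != "watch":
            continue
        # Match qstat whether it is given bare or with a full path.
        if any(os.path.basename(tok) == "qstat" for tok in argv[1:]):
            watchers.add(user)
    return watchers and sorted(watchers) or []
```

Comparing basenames means `/usr/bin/watch /opt/pbs/bin/qstat` is caught as well as a plain `watch qstat`; a user who merely runs `qstat` once is not flagged.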

HPC Hierarchical Storage Management (HSM) Scripts

Most HPC facilities use Hierarchical Storage Management (HSM). This usually consists of a quota-based, NFS-mounted online disk cache in front of a much larger offline tape backend. Users copy data to the cache and the HSM moves the data offline to tape in the background.

HSM Chunk Small Files Into Large Files (chunkybackup.sh)

As can be expected, copying lots of small files to HSM storage is not particularly efficient. Small files are typically not big enough to be moved to tape automatically and will remain in the cache forever. This is why chunkybackup.sh was written: it allows users of an HPC facility to easily "chunk up" their smaller data files.
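The chunking idea can be illustrated with a short Python sketch. It is not a port of chunkybackup.sh: the function names, the size limits, and the use of uncompressed tar archives are all assumptions chosen for illustration.

```python
import os
import tarfile


def plan_chunks(sizes, small_limit, chunk_limit):
    """Group small files into chunks of roughly chunk_limit bytes.

    sizes is an iterable of (path, size_in_bytes). Files of small_limit
    bytes or more are skipped, on the assumption that they are large
    enough to be copied to HSM individually.
    """
    chunks, current, current_size = [], [], 0
    for path, size in sorted(sizes):
        if size >= small_limit:
            continue
        if current and current_size + size > chunk_limit:
            chunks.append(current)  # close the current chunk
            current, current_size = [], 0
        current.append(path)
        current_size += size
    if current:
        chunks.append(current)
    return chunks


def write_chunks(chunks, dest_dir, prefix="chunk"):
    """Write each chunk as an uncompressed tar archive and return the archive paths."""
    archives = []
    for index, paths in enumerate(chunks):
        archive = os.path.join(dest_dir, f"{prefix}-{index:04d}.tar")
        with tarfile.open(archive, "w") as tar:
            for path in paths:
                tar.add(path)
        archives.append(archive)
    return archives
```

The planning step is kept separate from the writing step so that the grouping can be inspected before any archive is created.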

For more details see chunkybackup.

dannysheehan/HPC-Scripts

Folders and files

Latest commit

History

Repository files navigation

HPC-Scripts

HPC Cleanup Scripts

Scratch Filesystem Cleanup (expirfiles.py)

HPC Head Node Scripts

HPC Head Node Abuse Detection (goodcitizen.sh)

HPC Hierarchical Storage Management (HSM) Scripts

HSM Chunk Small Files Into Large Files (chunkybackup.sh)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages