Others
Ansys¶
Both Ansys Workbench and Ansys Electronic Desktop are available on Katana. The most user friendly way to run Ansys is to use Katana OnDemand.
Ansys Batch Jobs
Note: The version of Ansys available via myAccess MUST NOT be used for any research including generating files to be used on Katana.
Once you are familiar with how running jobs on Katana works you can run them in a batch job which mean that your jobs don't need any input from you.
Ansys CFX
If you want to use Ansys CFX on Katana then you will need to upload the .cfx and .def files to Katana.
A brief batch script example is given below. For more detail visit our GitHub page. (Note: You need to join the UNSW GitHub organisation to access this repository)
#!/bin/bash
#<RESOURCE REQUESTS>
cd $PBS_O_WORKDIR
module load intel-mpi/2021.7.1
module load ansys/2021r1
cfx5solve -batch -def <filename>.def -part $NCPUS -start-method "Intel MPI Local Parallel"
Ansys Fluent
Ansys Fluent input can also be generated locally and transferred to Katana to run in a batch job with a brief bash script shown here snd more complete examples in our Github repository. (Note: You will need to join the UNSW GitHub organisation to access this repo).
#!/bin/bash
#<RESOURCE REQUESTS>
cd $PBS_O_WORKDIR
module load ansys/2021r1
module load intel-mpi/2021.7.1
fluent 3d -g -t $NCPUS -i fluent.in > output.out
Ansys on Gadi
If you have an account on Gadi, the Supercomputer located at NCI, you can also use Ansys there once you join the on once you have an account. UNSW has a institutional licence that requies you join the relevant software group as detailed in NCI's documentation.
Biosciences¶
There are a number of Bioscience software packages installed. If you are having trouble finding a software package it might be within another package such as the omes mentioned on this page.
Datasets
To avoid multiple users needing to download datasets for themselves we have downloaded some of the most commonly used ones multiple versions
of the NCBI nr and nt databases into the /data
directory.
Stand alone
Blast+: module load blast-plus/2.12.0
Mothur: module load mothur/1.48.0
Python Module
Any of the Python versions that you see when running the module command on Katana will include BioPython and Snakemake.
module avail python
R Module
module avail r
As Bioconductor is installed within R the best approach is to load the latest version of R to reduce the possibility of dependency issues. Then you can install it in your personal R library by following the instructions at the Bioconductor web site. An example of how to install Bioconductor and some Bioconductor packages is shown below.
> if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
> BiocManager::install("Biobase")
> BiocManager::install(c("GenomicFeatures", "AnnotationDbi"))
> BiocManager::install("DESeq2")
Perl Module
Any of the Perl versions that you see when running the module command on Katana will include BioPerl. If you do not use the
module command to access Perl then you won't have be able to use BioPerl and other Perl tools.
module avail perl
Conda If you would like to install software using Conda there are instructions on how to do it on the Python page.
Note
Some software packages like Bioconductor do not work well if installed in Conda due to software dependencies and the time that it takes for a new version of the software to be included in Conda.
Comsol¶
The most user friendly way to run Comsol interactively is to use Katana OnDemand.
Note
You will need to belong to a group that owns a COMSOL licence (mech, spree, quantum, biomodel)
Comsol Batch Jobs
Note: The version of COMSOL available via myAccess MUST NOT be used for any research including generating files to be used on Katana.
An example comsol batch job file is available in our GitHub repository. (Note: You need to join the UNSW GitHub organisation to access this repository) as well as being presented below.
#!/bin/bash
#<RESOURCE REQUESTS>
mkdir -p ${TMPDIR}/comsol
export MY_COMSOL_DIR=/srv/scratch/$USER/comsoldir
module load comsol/5.6-spree
comsol -nn 1 -np $NCPUS \
-recoverydir ${MY_COMSOL_DIR}/recoveries \
-tmpdir ${TMPDIR}/comsol \
batch \
-inputfile ${MY_COMSOL_DIR}/MyModel.mph \
-outputfile ${MY_COMSOL_DIR}/MyModelOut.mph \
-batchlog ${MY_COMSOL_DIR}/MyModel.log
Intel Compilers and Software Libraries¶
Research Technology Services has a licence for Intel Compiler Collection which can be accessed by loading a module and contains 4 groups of software, namely compilers, libraries, a debugger and MPI. This software has been optimised by Intel to take advantage of the specific capabilities of the different intel CPUs installed in the Intel based clusters.
- Compilers (module name: intel-compilers)
- Intel C Compiler (icc)
- Intel C++ Compiler (icpc)
- Intel Fortran Compiler (ifort)
- Libraries
- Intel Math Kernel Library (MKL) (module name: intel-mkl)
- Intel Threading Building Blocks (TBB) (module name: intel-tbb)
- Intel Integrated Performance Primitives (IPP)
- Debugger
- Intel Debugger (idbc)
Java¶
Java is installed as part of the Operating System but that version of Java can change without warning leading to reproducable concerns. Because of this risk we recommend using one of the versions of Java available via the module command.
module avail java
Each Java module sets
_JAVA_TOOL_OPTIONS -Xmx1g
This sets the heap memory to 1GB. If you need more, set the environment variable _JAVA_OPTIONS
which overrides _JAVA_TOOL_OPTIONS
export _JAVA_OPTIONS="-Xmx5g"
Matlab¶
Running interactively PBS_O_WORKDIR You can run an interactive session of Matlab using Katana OnDemand for a graphical session or using the qsub command for a text based session using the commands below.
Batch Jobs
You can run Matab within a batch job. The example below shows the flags used to start Matlab without a graphical interface. Your job will start in your Katana home directory so we are assuming that your Matlab script is in your home directory.
module load matlab/R2022a
matlab -nodisplay -nosplash -r scriptfile
If your Matlab script is not located in your home directory you can either provide the full path to the Matlab command or change to the directory within your batch file. The advantage of changing directory is that it makes it easy to save any results into the same directory.
To provide a full path you can use the following example:
module load matlab/R2022a
matlab -nodisplay -nosplash -r /path/to/script/scriptfile
If you want to change to a different directory you can use the following example.
cd /path/to/script/
module load matlab/R2022a
matlab -nodisplay -nosplash -r scriptfile
If your files are in the same directory that you submitted the batch job from then you can use the variable $PBS_O_WORKDIR
, which contains the location that the job was submitted, in your script.
cd $PBS_O_WORKDIR
module load matlab/R2022a
matlab -nodisplay -nosplash -r scriptfile
Later versions of matlab provide the '-batch' flag as an alternative.
module load matlab/R2022a
matlab -batch -r scriptfile
Operating Systems¶
Katana nodes currently run Rocky Linux. To find out exactly which version you can use the command cat /etc/redhat-release
.
Research software is installed in environment modules. This enables multiple versions of the same software to be installed, and each user can choose which version they wish to use.
Perl¶
The default version of Perl on Katana is 5.26.3, which is provided by Rocky Linux and can be found at /usr/bin/perl
.
We have also installed Perl via an environment module.
It is common for Perl scripts to begin with:
#!/usr/bin/perl
However, that will restrict you to the default version of Perl supplied with the Linux distribution. If, instead of using that line, you use the following:
#!/usr/bin/env perl
This will then look in your path so if you load Perl via an environment module it will not be necessary to modify your scripts.
Stata¶
Stata is available as a module.
When using Stata in a pbs batch script, the syntax is
stata -b do StataClusterWorkshop.do
If you wish to load or install additional Stata modules or commands you should use findit command within Stata on your local computer to find the command that you are looking for.
You should then create a directory called myadofiles
in your home directory and copy the .ado (and possibly the .hlp) file into that directory. Now that the command is there
it just remains to tell Stata to look in that directory which can be done by using the following Stata command.
sysdir set PERSONAL $HOME/myadofiles
tmux¶
When you login to Katana using the terminal, it is a "live" session - if you close the terminal or turn off your computer the session will close. If you have a long running program such as
downloading a large data set or have an interactive session running from the command line and you need to go somewhere else then you may
end up losing all of your work and need to start again. In order to stop this from happening you can use tmux
to create an interruptible session when you first connect to Katana.
Note
You will need to take note of the Katana login node that you have logged in to (katana1, katana2 or katana3) as you will need to connect to the same login node to return to your tmux
session.
To start tmux, type tmux
at the terminal which will create a new session will start with a green information band at the bottom of the screen. Anything you start in this session
will keep running even if you disconnect or are disconnected from that session for any reason unless the server needs to be restarted.
When you wish to reconnect you login to the server that you used last time and reconnect by using the command tmux a
.
tmux
has a number of other useful features such as multiple sessions and split screens. More information on features and how to use tmux
is available
on the tmux website.
Zip¶
Compressing Large Directories
If you want to compress large directories or directories with a large number of files, we recommend using tgzme which was developed by one of our researchers, Dr. Edwards.
The software is based on the command:
tar -c $DIRECTORY | pigz > $DIRECTORY.tar.gz
We thank Dr. Edwards
for his contribution.