Skip to content

Others

Ansys

Both Ansys Workbench and Ansys Electronic Desktop are available on Katana. The most user friendly way to run Ansys is to use Katana OnDemand.

Ansys Batch Jobs

Note: The version of Ansys available via myAccess MUST NOT be used for any research including generating files to be used on Katana.

Once you are familiar with how running jobs on Katana works you can run them in a batch job which mean that your jobs don't need any input from you.

Ansys CFX

If you want to use Ansys CFX on Katana then you will need to upload the .cfx and .def files to Katana.

A brief batch script example is given below. For more detail visit our GitHub page. (Note: You need to join the UNSW GitHub organisation to access this repository)

   #!/bin/bash
   #<RESOURCE REQUESTS>

   cd $PBS_O_WORKDIR

   module load intel-mpi/2021.7.1
   module load ansys/2021r1

   cfx5solve -batch -def <filename>.def -part $NCPUS -start-method "Intel MPI Local Parallel"

Ansys Fluent

Ansys Fluent input can also be generated locally and transferred to Katana to run in a batch job with a brief bash script shown here snd more complete examples in our Github repository. (Note: You will need to join the UNSW GitHub organisation to access this repo).

   #!/bin/bash
   #<RESOURCE REQUESTS>

   cd $PBS_O_WORKDIR

   module load ansys/2021r1
   module load intel-mpi/2021.7.1

   fluent 3d -g -t $NCPUS -i fluent.in > output.out

Ansys on Gadi

If you have an account on Gadi, the Supercomputer located at NCI, you can also use Ansys there once you join the on once you have an account. UNSW has a institutional licence that requies you join the relevant software group as detailed in NCI's documentation.

Biosciences

There are a number of Bioscience software packages installed. If you are having trouble finding a software package it might be within another package such as the omes mentioned on this page.

Datasets

To avoid multiple users needing to download datasets for themselves we have downloaded some of the most commonly used ones multiple versions of the NCBI nr and nt databases into the /data directory.

Stand alone

Blast+: module load blast-plus/2.12.0

Mothur: module load mothur/1.48.0

Python Module

Any of the Python versions that you see when running the module command on Katana will include BioPython and Snakemake. module avail python

R Module

module avail r

As Bioconductor is installed within R the best approach is to load the latest version of R to reduce the possibility of dependency issues. Then you can install it in your personal R library by following the instructions at the Bioconductor web site. An example of how to install Bioconductor and some Bioconductor packages is shown below.

    > if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")
    > BiocManager::install("Biobase")
    > BiocManager::install(c("GenomicFeatures", "AnnotationDbi"))
    > BiocManager::install("DESeq2")

Perl Module Any of the Perl versions that you see when running the module command on Katana will include BioPerl. If you do not use the module command to access Perl then you won't have be able to use BioPerl and other Perl tools. module avail perl

Conda If you would like to install software using Conda there are instructions on how to do it on the Python page.

Note

Some software packages like Bioconductor do not work well if installed in Conda due to software dependencies and the time that it takes for a new version of the software to be included in Conda.

Comsol

The most user friendly way to run Comsol interactively is to use Katana OnDemand.

Note

You will need to belong to a group that owns a COMSOL licence (mech, spree, quantum, biomodel)

Comsol Batch Jobs

Note: The version of COMSOL available via myAccess MUST NOT be used for any research including generating files to be used on Katana.

An example comsol batch job file is available in our GitHub repository. (Note: You need to join the UNSW GitHub organisation to access this repository) as well as being presented below.

console
    #!/bin/bash
    #<RESOURCE REQUESTS>

    mkdir -p ${TMPDIR}/comsol

    export MY_COMSOL_DIR=/srv/scratch/$USER/comsoldir

    module load comsol/5.6-spree

    comsol -nn 1 -np $NCPUS \
    -recoverydir ${MY_COMSOL_DIR}/recoveries \
    -tmpdir ${TMPDIR}/comsol \
    batch \
    -inputfile ${MY_COMSOL_DIR}/MyModel.mph \
    -outputfile ${MY_COMSOL_DIR}/MyModelOut.mph \
    -batchlog ${MY_COMSOL_DIR}/MyModel.log

Intel Compilers and Software Libraries

Research Technology Services has a licence for Intel Compiler Collection which can be accessed by loading a module and contains 4 groups of software, namely compilers, libraries, a debugger and MPI. This software has been optimised by Intel to take advantage of the specific capabilities of the different intel CPUs installed in the Intel based clusters.

  • Compilers (module name: intel-compilers)
    • Intel C Compiler (icc)
    • Intel C++ Compiler (icpc)
    • Intel Fortran Compiler (ifort)
  • Libraries
    • Intel Math Kernel Library (MKL) (module name: intel-mkl)
    • Intel Threading Building Blocks (TBB) (module name: intel-tbb)
    • Intel Integrated Performance Primitives (IPP)
  • Debugger
    • Intel Debugger (idbc)

Java

Java is installed as part of the Operating System but that version of Java can change without warning leading to reproducable concerns. Because of this risk we recommend using one of the versions of Java available via the module command.

module avail java

Each Java module sets

    _JAVA_TOOL_OPTIONS -Xmx1g

This sets the heap memory to 1GB. If you need more, set the environment variable _JAVA_OPTIONS which overrides _JAVA_TOOL_OPTIONS

    export _JAVA_OPTIONS="-Xmx5g"

Matlab

Running interactively PBS_O_WORKDIR You can run an interactive session of Matlab using Katana OnDemand for a graphical session or using the qsub command for a text based session using the commands below.

Batch Jobs

You can run Matab within a batch job. The example below shows the flags used to start Matlab without a graphical interface. Your job will start in your Katana home directory so we are assuming that your Matlab script is in your home directory.

module load matlab/R2022a

matlab -nodisplay -nosplash -r scriptfile

If your Matlab script is not located in your home directory you can either provide the full path to the Matlab command or change to the directory within your batch file. The advantage of changing directory is that it makes it easy to save any results into the same directory.

To provide a full path you can use the following example:

module load matlab/R2022a

matlab -nodisplay -nosplash -r /path/to/script/scriptfile

If you want to change to a different directory you can use the following example.

cd /path/to/script/
module load matlab/R2022a

matlab -nodisplay -nosplash -r scriptfile

If your files are in the same directory that you submitted the batch job from then you can use the variable $PBS_O_WORKDIR, which contains the location that the job was submitted, in your script.

cd $PBS_O_WORKDIR
module load matlab/R2022a

matlab -nodisplay -nosplash -r scriptfile

Later versions of matlab provide the '-batch' flag as an alternative.

module load matlab/R2022a

matlab -batch -r scriptfile

Operating Systems

Katana nodes currently run Rocky Linux. To find out exactly which version you can use the command cat /etc/redhat-release.

Research software is installed in environment modules. This enables multiple versions of the same software to be installed, and each user can choose which version they wish to use.

Perl

The default version of Perl on Katana is 5.26.3, which is provided by Rocky Linux and can be found at /usr/bin/perl.

We have also installed Perl via an environment module.

It is common for Perl scripts to begin with:

#!/usr/bin/perl

However, that will restrict you to the default version of Perl supplied with the Linux distribution. If, instead of using that line, you use the following:

#!/usr/bin/env perl

This will then look in your path so if you load Perl via an environment module it will not be necessary to modify your scripts.

Stata

Stata is available as a module.

When using Stata in a pbs batch script, the syntax is

stata -b do StataClusterWorkshop.do

If you wish to load or install additional Stata modules or commands you should use findit command within Stata on your local computer to find the command that you are looking for. You should then create a directory called myadofiles in your home directory and copy the .ado (and possibly the .hlp) file into that directory. Now that the command is there it just remains to tell Stata to look in that directory which can be done by using the following Stata command.

sysdir set PERSONAL $HOME/myadofiles

tmux

When you login to Katana using the terminal, it is a "live" session - if you close the terminal or turn off your computer the session will close. If you have a long running program such as downloading a large data set or have an interactive session running from the command line and you need to go somewhere else then you may end up losing all of your work and need to start again. In order to stop this from happening you can use tmux to create an interruptible session when you first connect to Katana.

Note

You will need to take note of the Katana login node that you have logged in to (katana1, katana2 or katana3) as you will need to connect to the same login node to return to your tmux session.

To start tmux, type tmux at the terminal which will create a new session will start with a green information band at the bottom of the screen. Anything you start in this session will keep running even if you disconnect or are disconnected from that session for any reason unless the server needs to be restarted.

When you wish to reconnect you login to the server that you used last time and reconnect by using the command tmux a.

tmux has a number of other useful features such as multiple sessions and split screens. More information on features and how to use tmux is available on the tmux website.

Zip

Compressing Large Directories

If you want to compress large directories or directories with a large number of files, we recommend using tgzme which was developed by one of our researchers, Dr. Edwards.

The software is based on the command:

tar -c $DIRECTORY | pigz > $DIRECTORY.tar.gz

We thank Dr. Edwards for his contribution.