node00
/scratch/
Login using ssh
by typing the following command in respective terminal
ssh <hgcc_user_ID>@hgcc.genetics.emory.edu
Terminal App is available on Mac OS X and Linux system
Under Windows 10, open up a Powershell (search for “Powershell” in the search tool next to the Start menu) and type ssh. (Older versions of the operating system may require the installation of a third party ssh client, like Putty.)
mkdir ~/RHPC/
/home/jyang/
to local mount directory ~/RHPC/
. Replace jyang
by your HGCC user ID.
sshfs jyang@hgcc.genetics.emory.edu:/home/jyang/ ~/RHPC/ -o auto_cache -ovolname=HGCC -o follow_symlinks
Command rsync
is recommended
rsync [options] [SOURCE] [DESTINATION]
Command scp
can also do the job
scp <path_to_file> <username>@hgcc.genetics.emory.edu:<destination_path_on_hgcc>
rsync
is recommended for copying data on clustercp
also copys datarm
mkdir
mv
ls
ls -l -h
vi
or nano
to edit text files on the clusterless
, cat
gzip
zcat [file_name.gz] | less -S
man
to see user manual/instructions of Linux tools, e.g., man ls
|
to take output from the command before |
as the input for the command after |
alias
to set up your own shortcup of commands. See Examples.man [command]
would give help page for the following commands, e.g., man rsync
. Type q
to exit from the help page.qlogin
under headnode or the command qlogin -l h=[node07.local]
with specification of interactive compute node. qsub
under headnode.
-pe smp 2
: qsub –q b.q –cwd –j y –N [jobname] –pe smp 2 [job commands]
. qsub
is limited up to 500 jobsqsub -t
. Array jobs need to be written to account for the single job ID in an Array job. See Instructions./scratch/
on each compute node to avoid extensive I/O between compute node memory and head node storage disks.
/mnt/YangFSS/data
, /mnt/YangFSS/data2
/home/[userid]
is likely to be located on our lab data storage disksqstat -f
qstat -f -u '*'
qdel [jobid]
or all of your jobs by qdel -u [userid]
/sw/hgcc/Data
. Do not make another copy of the data unless you need to make changes.qlogin
first, and then use command module avail
to list all installed software modules on HGCC${PATH}
by which [software_name]
module load [software_name]
module list
R
in the current session to start an R session after loading R by module load R/4.0.3
python
in the session to start a python session after loading Anaconda3 by module load Anaconda3/5.2.0
python
with command which python
plink -h
to see plink user mannual after loading plink by module load plink/1.90b53
which
can also be used to check if a software is available under the ${PATH}
directory in current session module unload [software_name]
module purge
~/.local/
pip --user
. Or search "install without root access", "install with specificed installation directory". The installation directory should be those you have writting access to. qlogin
, as many C++ libraries only available on compute nodes/mnt/YangFSS/data/bin
by the following commands
cd /mnt/YangFSS/data/bin
ln -s [tool directory]
Add /mnt/YangFSS/data/bin
into the ${PATH}
environment variable by including the following command line in your ~/.bashrc
file
export PATH=/mnt/YangFSS/data/bin:$PATH
.bashrc
filebashrc
file under your home directory to save tying of commands.
/mnt/icebreaker/data2/
/mnt/YangFSS/data/
. All lab member home directories are located in this data drive. Consider using our data drive 2 if you need more storage space.Data Drive 2 /mnt/YangFSS/data2/
.
mkdir -p /mnt/YangFSS/data2/jyang
cd /mnt/YangFSS/data2/jyang
ls
df -h
du -h -d1