Introduction to
High-Performance Computing (HPC)
Computer components
CPU : Central Processing Unit
cores : individual processing units within a CPU
Storage : Disk drives
HDD : Hard Disk Drive
SSD : Solid State Drive
Memory : a small amount of volatile (temporary) information storage
Computer components (my Macbook Pro)
Model Name: MacBook Pro
Number of Processors: 1
Total Number of Cores: 4
Memory: 16 GB
Data storage: 512 GB
“High Performance Computing most generally
refers to the practice of aggregating computing
power in a way that delivers much higher
performance than one could get out of a typical
desktop computer or workstation in order to
solve large problems in science, engineering, or
business.”
http://insidehpc.com/hpc-basic-training/what-is-hpc/
Computer resources required for NGS Data Analysis
100s of cores for processing!
100s of Gigabytes or even Petabytes of storage!
100s of Gigabytes of memory!
High-Performance Computing
Provides all the resources to run the desired Omics analysis
in one place.
Provides software that is unavailable or unusable on your
computer/local system
HPC cluster structure
HPC cluster components
Nodes: Individual computers in the cluster
Cores (threads): individual processing units available within each
CPU of each Node
e.g. a “Node” with eight “quad”-core CPUs = 32 cores for that node.
Shared disk: storage that can be shared (and accessed) by all
nodes
Parallel Computing
“Parallel computing is a form of computation in
which many calculations are carried out
simultaneously, operating on the principle that
large problems can often be divided into
smaller ones, which are then solved
concurrently ("in parallel”)."
http://en.wikipedia.org/wiki/Parallel_computing
High-Performance Computing: For 1 sample
[Diagram: Serial vs. Multithreaded. In the serial case the input is processed by a single CPU core from start to finish; in the multithreaded case the same input is split across many CPU cores that work simultaneously and produce a single output.]
Faster and more efficient…
NGS data analysis is very amenable to this strategy
High-Performance Computing: For 3 samples
[Diagram: Serial vs. Multithreaded & Serial vs. Multithreaded and Parallel. Three samples can be processed one after another on a single core, multithreaded but still one sample at a time, or multithreaded with all samples running in parallel on separate sets of cores/nodes.]
HPC Cluster
• multi-user, shared resource
• lots of nodes = lots of processing capacity + lots of memory
• a system like this requires constant maintenance and upkeep,
and there is an associated cost
Introduction to High Performance
Computing for New Users
gRED IT
Information in slides courtesy of
Slaton Lipscomb
Architect, gRED IT
lipscomb.slaton@gene.com
Welcome to HPC @ Genentech!
• Old cluster = “Rescomp” is being phased out
• New cluster = “Rosalind” is in early production stage
• New cluster includes the following changes:
✦ the Lmod environment module system
✦ the Slurm scheduler
✦ automated software build and installation framework with
EasyBuild
✦ CentOS 7 (Linux)
http://go.gene.com/rosalind
Rosalind Tech Specs
• 4 login nodes
• ~8500 cores on compute nodes
✦ 291 nodes (RAM: 256 GB)
✦ 6 high-memory nodes (RAM: 1 TB)
✦ 8 gpu nodes (RAM: 128 GB)
• ~12 PB of new storage (on Rosalind)
+ existing 45 PB (Isilon NAS)
Using the cluster!
1. Logging in to remote machines (securely)
• When logging in we use the “ssh” command;
ssh stands for Secure SHell
• ssh is a protocol for secure data transfer, i.e. the data is
encrypted as it travels between your computer and the cluster
(remote computer)
• Commonly used commands that use the ssh protocol for data
transfer are scp and sftp
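For example, scp can copy files to and from the cluster; the file names below are placeholders, and the scratch path is the one described later in these slides:
% scp sample1.fq username@rosalind.gene.com:/gstore/scratch/u/username/    # local -> cluster
% scp username@rosalind.gene.com:/gstore/scratch/u/username/results.txt .  # cluster -> local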
Log in!
Open a terminal
ssh username@rosalind.gene.com
Welcome to Rosalind!
Where are you when you log in?
username@nl002:~ %
Logged into login node nl002. These nodes are not meant for
heavy lifting!
username@nl002:~ % pwd
Land in the home directory. The “~” is short for the path to a
user’s home directory, i.e. /gstore/home/username
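For example, running pwd right after login prints the home directory path:
username@nl002:~ % pwd
/gstore/home/username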
Interactive Sessions
The srun command can be used to start an interactive
session. It sends a request to the scheduler to give you
access to a compute node.
username@nl002:~ % srun --pty -p defq
--qos=interactive --mem 8G bash
“srun --pty” is how interactive sessions are started
“-p defq” is the partition
“--qos=interactive” is the Slurm QoS policy
“--mem 8G” is the memory requested
username@nc026:~ %
No longer on the login node, but using the compute node nc026
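When finished, exit the shell to end the interactive session and return to the login node:
username@nc026:~ % exit
username@nl002:~ %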
2. Using installed software
LMOD: Software Modules
• Many tools that are available on the cluster are installed as
environment modules
• The LMOD system adds the path to a given software
package, as well as its dependencies, into the $PATH
environment variable
• Allows for clean, easy loading, unloading and version
tracking/switching.
LMOD: Software Modules
Using the LMOD system:
% module avail     # list the software modules currently available
% module spider    # more detailed, searchable listing of available software
% echo $PATH
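module spider can also search for a specific tool; fastqc is used here only as an example:
% module spider fastqc    # show available fastqc versions and how to load them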
Loading/Unloading Modules
• Loading modules
% module load fastqc OR % ml fastqc
• Which module version is loaded (if at all)?
% which fastqc
% echo $PATH
• Need help with the module?
% module help fastqc
• Unloading modules
% module unload fastqc
• Dump all modules
% module purge
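A short, hedged example session; the version string below is hypothetical, so check module avail for what is actually installed:
% ml                   # "ml" with no arguments lists currently loaded modules
% ml fastqc/0.11.9     # load a specific version (version number is illustrative)
% module purge         # unload everything when done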
3. The Job Scheduler, Slurm
Submitting Jobs
In an “interactive session”, programs can be called directly.
% fastqc -t 2 file1_1.fq file1_2.fq
What if you wanted to run the program and come back later to
check on it?
You can do this by submitting a batch job with sbatch
% sbatch mybowtiejob.sh
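On submission, sbatch prints a job ID that can be used to track the job; the number shown here is just an example:
Submitted batch job 1234567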
Simple Linux Utility for Resource Management
(SLURM)
• Fairly allocates access to resources (compute nodes) to users for
some duration of time so they can perform work
• Provides a framework for starting, executing, and monitoring
batch jobs
• Manages a queue of pending jobs; ensures that no single user or
group monopolizes the cluster
Choosing the proper resources for your job with
the appropriate SBATCH options
The “sbatch” way of submitting jobs (1/2)
% sbatch -p defq --qos=short -n 2 --wrap="fastqc -t 2 file1.fq file2.fq"
Arguments used:
• -p (partition depends on which node type you want to use)
• --qos (Job runtime/walltime policies selected via Slurm QoS policy)
• -n (number of cores)
• --wrap (specifying the command you want to run)
-p available:
○ defq => compute nodes
○ himem => high-memory nodes (1 TB)
○ gpu => GPU nodes
--qos available:
○ veryshort = 10 min
○ short = 2 hr
○ medium = 24 hr
○ long = 3 days
○ verylong = 14 days
○ interactive = 6 days
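As a further sketch (not from the original slides), the same one-liner style can target the high-memory partition with a longer QoS; the wrapped command, file names, and memory value are illustrative only:
% sbatch -p himem --qos=long -n 4 --mem=200G --wrap="bowtie2-build hg19.fa hg19"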
The “sbatch” way of submitting jobs (2/2)
Recommended: write a job submission script
% sbatch completeSlurmJob.run
Creating a job submission script
#! /bin/sh                   #Always at the top of the script
#SBATCH -p defq
#SBATCH --qos=short
#SBATCH -n 4
#SBATCH --mem=8G
#SBATCH -o %j.out
#SBATCH -e %j.err
#SBATCH -J bowtie2_run1
#SBATCH --mail-type=ALL
#SBATCH --mail-user=<email>
module load bowtie2
bowtie2 -p 4 -x hg19 -1 file1_1.fq -2 file1_2.fq -S file1.sam
Save as myJobScript.run, then run as: % sbatch myJobScript.run
Note: The sbatch options are specified differently here as
compared to the first method
sbatch options
#SBATCH -p #partition
#SBATCH --qos=<name> #QOS for length of job
#SBATCH -n X #number of cores
#SBATCH -N 1 #confine cores to 1 node (default: 1)
#SBATCH -J name_of_job #job name (default: name of job script)
#SBATCH -o %j.out #out file
#SBATCH -e %j.err #error file
-p available:
○ defq => compute nodes
○ himem => high-memory nodes (1 TB)
○ gpu => GPU nodes
--qos available:
○ veryshort = 10 min
○ short = 2 hr
○ medium = 24 hr
○ long = 3 days
○ verylong = 14 days
○ interactive = 6 days
Managing jobs and getting information about
submitted/running jobs
Job Monitoring
% squeue -u <username> -t RUNNING    (or -t PENDING)
% squeue -u <username> -p <partition>
% squeue -u <username> --start
Detailed job info:
% scontrol show jobid <jobid>
Completed job statistics:
% sacct -j <jobid>
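For a quick summary of resource usage, sacct also accepts a --format list of standard Slurm fields, for example:
% sacct -j <jobid> --format=JobID,JobName,Partition,Elapsed,MaxRSS,State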
Cancelling/Pausing Jobs
% scancel <jobid>
% scancel -t PENDING -u <username>
% scancel --name JOBNAME
% scontrol hold <jobid> #pause pending jobs
% scontrol release <jobid> #resume
% scontrol requeue <jobid> #cancel and rerun
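To cancel every job you currently have in the queue (use with care):
% scancel -u <username>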
4. Filesystems and storage
Filesystems and storage
• Storage on HPC systems is organized differently than on your
personal machine
• Physical disks are bundled together into a virtual volume; this
volume may represent a single filesystem, or may be divided up, or
partitioned, into multiple filesystems
• Filesystems are accessed over the internal network
/gstore (GPFS)
/gstore/home/username
✴ individual home directories
✴ path stored in $HOME
✴ 50 GB limit
/gstore/scratch/u/username
✴ path stored in $SCRATCH
✴ unlimited (within reason)
✴ deleted after 90 days if untouched
/gstore/data/some-project
✴ project or department specific
/gne (Isilon)
/gne/data/some-project
✴ project or department specific
[Diagram: these filesystems are mounted on the cluster nodes (login/compute) and are also accessible from your computer]
More direction on storage
1. /gstore/home/username :: should be used for scripts,
locally compiled or installed applications, documentation,
papers, and other small files.
2. /gstore/scratch/u/username :: should be used for all
active workflow (scientific computation) and in-progress
data.
3. /gstore/data/some-project & /gne/data/some-
project :: should be used for valuable intermediate or
results data as well as raw instrument data.
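For example (the analysis directory name below is a placeholder), finished results in scratch should be copied to a project directory before the 90-day cleanup removes them, and du can confirm how much space you are using:
% cp -r $SCRATCH/my-analysis /gstore/data/some-project/
% du -sh $HOME $SCRATCH    # total size of your home and scratch directories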
More information
• http://go.gene.com/rosalind
• Get an account at http://go.gene.com/hpchelp
• Storage guide: https://rochewiki.roche.com/confluence/display/SCICOMP/Storage+Guide#StorageGuide-gstorescratch
Thanks!
• Slides adapted from Kristina Holton at HMS-RC
These materials have been developed by members of the teaching team at the Harvard Chan
Bioinformatics Core (HBC). These are open access materials distributed under the terms of the
Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and
reproduction in any medium, provided the original author and source are credited.