Bioinformatics 1

The document outlines an introductory bioinformatics course led by Amir Mitchell, detailing the course layout, grading system, and key topics covered over eleven lessons. It emphasizes the integration of computer science and biology in analyzing genetic data, utilizing databases and tools like GCG for sequence analysis. Additionally, it highlights significant milestones in bioinformatics and resources such as the NCBI for further research and data access.

Uploaded by

HuongPham

Introduction to Bioinformatics

Questions & Help

• Amir Mitchell – lecturer.

• Itay Mayros, Einat Hazkani-Covo, and Shira Mintz – teaching assistants.

• Emails:
mitchel@post.tau.ac.il, itaymay@post.tau.ac.il,
einat@kimura.tau.ac.il, mintzshi@post.tau.ac.il

• Course site

Course Layout
• Eleven lessons – eleven weeks.
• Lecture, exercise, discussion.
• Presentations and exercises.
• Books and additional material.
• Missing lessons or exercises.
• Consultation hour.
• Personal gene/protein.

Final grade
• Final exam (80%):
– Multiple choice questions
– Open questions
– No online part

• Home assignment (20%)

Bioinformatics
• Buzzword …
Nanotechnology, Biotechnology …

Bioinformatics: Bioinformatics is the branch of computer science that
focuses on sub-domains of biology: research on genes and proteins.
Researchers in this field must use powerful computers and special
calculation methods to process the large body of complex data generated
by genetics. Using these tools, it was possible to sequence the human
genome.
(Source: Lexicon-encyclobio)

Two separate approaches
• Computer science - inventing tools,
developing algorithms.

• Biology - Utilizing tools for biological research.
1. Purely bioinformatics (comparing exon/intron
structure in human and mouse).
2. “Fairly” bioinformatics (Locating the active site of
an enzyme by identifying conserved residues in
the protein sequence).
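
The second example above (finding conserved residues) can be illustrated with a minimal sketch. It assumes the protein sequences have already been aligned to equal length, and the toy fragments below are made up for illustration, not taken from a real enzyme family. The function name `conserved_positions` is hypothetical.

```python
def conserved_positions(aligned_seqs):
    """Return 0-based column indices where all sequences share one residue.

    Assumes the sequences are already aligned (same length, '-' for gaps).
    Columns containing a gap are never reported as conserved.
    """
    if not aligned_seqs:
        return []
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("all aligned sequences must have the same length")

    conserved = []
    for i in range(length):
        column = {seq[i] for seq in aligned_seqs}  # distinct symbols in column i
        if len(column) == 1 and "-" not in column:
            conserved.append(i)
    return conserved


# Made-up aligned fragments, for illustration only.
toy_alignment = ["GDSGGPLV", "GDSGGPVI", "GDSGSPLA", "GDSG-PLV"]
print(conserved_positions(toy_alignment))
```

Fully conserved columns are only candidates for functionally important residues; confirming an active site still needs structural or experimental evidence.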
Research outline
Databases (public, local)

Retrieve data

Analysis

Results

Lab (wet biology) Literature

Databases & Tools
• Free shared databases (on-line, bioinfo unit)

• Internet based tools (PC)

• GCG package tools (unix)

GCG
• Commercial DNA and protein sequence
analysis package.

• Written by the Wisconsin Genetics Computer Group.

• Includes more than 130 separate tools.

GCG
• GCG runs in a Unix environment (OS).

• Same principles apply to all GCG programs

• On-line help

Divided work
PC (1)         Unix (2)                     Web
-              Databases (main ones only)   Databases (all)
Data storage   Data storage                 -
Tools          Tools                        Tools

(1) Access (Unix and web)
(2) Advanced analysis, user databases, web site

Lesson 1 – Introduction,
Unix environment
1. Administration
2. Introduction to Bioinformatics.
3. NCBI
4. Working in Unix environment

Lesson 2 – databases and text
based searching:
1. Databases: organization and entries.
2. Database problems.
3. Principles of database searching.
4. Unix and GCG.

Lesson 3 – pairwise alignment

1. Comparing two sequences.

2. Scoring: good and bad alignments.
3. Comparison methods.
4. Comparison programs.
5. Unix.
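
As a preview of what "scoring an alignment" means, here is a minimal sketch of a global alignment score computed by dynamic programming, in the style of the Needleman-Wunsch algorithm listed among the milestones. The scoring values (match +1, mismatch -1, gap -2) are assumptions chosen for illustration; real programs use substitution matrices and more elaborate gap penalties. The function name is hypothetical.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Best global alignment score of sequences a and b.

    Uses a simple linear gap penalty and keeps only one row of the
    dynamic-programming table at a time. Scoring values are illustrative.
    """
    # Row 0: aligning the empty prefix of a against prefixes of b (all gaps).
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # prefix of a against the empty prefix of b
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap inserted in b
            left = curr[j - 1] + gap  # gap inserted in a
            curr.append(max(diagonal, up, left))
        prev = curr
    return prev[-1]


print(global_alignment_score("GATTACA", "GATTACA"))  # identical sequences
print(global_alignment_score("ACGT", "ACT"))         # one gap is required
```

A higher score indicates a "better" alignment under the chosen scheme; changing the match, mismatch, or gap values can change which alignment is considered best.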

Lesson 4 – Sequence based
searching
1. DNA or protein sequences as search queries.

2. Problems with sequence search.

3. Methods for searching (fasta, blast).

Lesson 5 – Multiple sequence
alignment
1. Comparing multiple sequences.
2. Uses of multiple alignment.
3. Methods for multiple alignment, efficiency
and limitations.
4. Profiles and consensus sequences.
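
To make "profiles and consensus sequences" concrete, the sketch below counts residues per column of an existing multiple alignment (a very simple profile) and derives a majority-rule consensus. It assumes the alignment is already given; the function names, the toy sequences, and the choice of "X" for tied columns are illustrative assumptions, not a standard.

```python
from collections import Counter


def column_profile(aligned_seqs):
    """Return one Counter per alignment column with residue counts.

    Assumes equal-length, pre-aligned sequences; '-' is counted like
    any other symbol.
    """
    if not aligned_seqs:
        return []
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("all aligned sequences must have the same length")
    return [Counter(seq[i] for seq in aligned_seqs) for i in range(length)]


def consensus(aligned_seqs):
    """Majority-rule consensus; tied columns are marked 'X' (an assumption)."""
    result = []
    for counts in column_profile(aligned_seqs):
        best = max(counts.values())
        winners = [residue for residue, n in counts.items() if n == best]
        result.append(winners[0] if len(winners) == 1 else "X")
    return "".join(result)


# Made-up aligned sequences, for illustration only.
toy = ["ACGT", "ACGA", "ATGT"]
print(column_profile(toy)[1])  # counts for the second column
print(consensus(toy))
```

The consensus keeps only the most common residue per column, while the profile retains how variable each column is, which is why profiles are the more sensitive search tool.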

Lesson 6 – Phylogenies

1. Introduction to phylogeny.

2. Methods for constructing evolutionary trees.

3. Statistical analysis of constructed trees.

Lesson 7 – Protein families,
secondary databases
1. Dividing proteins into families.
2. Patterns.
3. Different approaches: motifs, fingerprints.
4. Different databases.
5. Consurf.

Lesson 8 – DNA sequence
analysis
1. Gene structure.
2. Gene finding.
3. Predicting gene features.
4. Consurf.
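
The simplest form of gene finding is a scan for open reading frames (ORFs). The sketch below is a deliberately naive illustration: it searches only the forward strand, uses the standard stop codons, ignores introns (so it is closer to the prokaryotic case), and the `min_codons` threshold is an arbitrary assumption. The function name is hypothetical, and real gene finders use far more evidence than this.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def find_orfs(dna, min_codons=2):
    """Return (start, end) pairs for forward-strand ORFs.

    Coordinates are 0-based, end-exclusive, and include the stop codon.
    An ORF runs from the first ATG in a frame to the next in-frame stop.
    min_codons counts codons before the stop (illustrative threshold).
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None  # close it and keep scanning this frame
    return sorted(orfs)


# Made-up sequence: ATG AAA TGA embedded in the third reading frame.
toy_dna = "CCATGAAATGAGG"
for start, end in find_orfs(toy_dna):
    print(start, end, toy_dna[start:end])
```

A complete scan would also examine the reverse complement, giving six reading frames in total.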

Lesson 9 - genomes
• Genome features.
• Prokaryotic and Eukaryotic genomes.
• Genome viewers
• Model organisms

Lesson 10 - Various tools
• Making things easy, useful tools for lab
work.

Lesson 11 - Summary
• Overview, Q&A before the exam.

Last comments
• Introduction only.

• Finding sites: links and Google.

• Biology background.

• Unix accounts.

• Terminology

Milestones in bioinformatics
1965 Theory of molecular evolution (Zuckerkandl & Pauling)
1967 Atlas of protein sequences (Dayhoff)
1970 Global alignment algorithm (Needleman, Wunsch)
1981 Local alignment algorithm (Smith, Waterman)
1981 Sequence motif concept (Doolittle)
1982 GenBank made public
1982 Phage lambda genome fully sequenced
1983 Database search algorithm (Wilbur, Lipman)
1985 Fast sequence similarity searching
1990 BLAST
1991 ESTs
* 1953 Watson and Crick

Milestones in bioinformatics
1995 First bacterial genome fully sequenced (H. influenzae)
1996 Yeast genome fully sequenced
1998 C. elegans genome fully sequenced
1999 Fruit fly genome fully sequenced
2000 Human genome fully sequenced (draft)

Today …
• Over 1500 fully sequenced genomes from
all domains of life.

• Numerous databases.

• Numerous tools.

Today …
• Archaea (16)
• Eukarya (20)
• Bacteria (139)
• Viruses (1500)

Examples
• Human, mouse, rat, zebrafish, Drosophila,
yeast, Anopheles, tomato, rice, wheat.

• E. coli (4 strains), M. tuberculosis, M. leprae.

• Mitochondria, chloroplast, plasmids.

Public interest:
Human Genome Project
• 2000 - Working draft of the genome, the work of 20
groups worldwide
(http://www.ncbi.nlm.nih.gov).
• 2003 - Obtain a complete, high-quality genomic
sequence.
• Determine the sequences of the 3 billion bases.
• Identify all the estimated 30,000 genes in human
DNA.

Human Genome Project

• Chromosome 21: 9 May 2000
• Chromosome 22: 2 Dec 1999
• Initial analysis: 15 Feb 2001

NCBI – at a glance
The biggest and most comprehensive site!
Includes numerous tools and databases!

NCBI - overview
Databases grouped around NCBI: PubMed, OMIM, Books, Expression
profiles, Structure, Nucleotides, Domains, Proteins, Taxonomy, Genomes.
* Cross references between the databases

NCBI
PubMed

• Citations, abstracts, full articles.

Books

• Online books, full text from books (Cell, Introduction to Genetic Analysis).

NCBI
OMIM

• Online Mendelian Inheritance in Man: a comprehensive database of
human genes and genetic disorders.

• Entries include textual information and, most importantly,
references to literature and sequences.

NCBI
GEO

• Gene Expression Omnibus: results from high-throughput
experiments (mRNA, DNA, and protein arrays).

NCBI
Genomes, Nucleotides, Proteins

• Sequence databases, divided into sections and sub-sections.

Domains

• Protein domains, both conserved sequence domains and 3D domains.

NCBI
Structure

• 3D structure of proteins (~20,000 entries).

Taxonomy

• Taxonomy of all organisms found in NCBI

NCBI - Interconnectivity
PubMed, OMIM, Books, Expression profiles, Structure, Nucleotides,
Domains, Proteins, Taxonomy and Genomes are all connected through NCBI.
* Cross references between the databases
