0% found this document useful (0 votes)

107 views33 pages

Cours 7: Annotation Des Génomes

After obtaining the sequence of a genome, the next step is to annotate the genome. Annotation involves using bioinformatics methods to establish an inventory of all genetic elements (location of genes, their translation, location of promoters, etc.) present in a genome and their functions. There are two levels of annotation: 1) Structural annotation, which involves inventorying and analyzing all elements in a genome, such as identifying protein-coding and RNA genes. 2) Functional annotation, which aims to identify the function of all genes detected in structural annotation. Structural annotation first locates genes by distinguishing coding from non-coding regions using computational programs combining different types of information, such as identifying open reading frames (ORFs) between start and stop cod

Uploaded by

Imene Semmar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

107 views33 pages

Cours 7: Annotation Des Génomes

Uploaded by

Imene Semmar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 33

Cours 7:

Annotation des génomes

Projet de séquençage d’un génome
ATGTCTCTGAGGAGCGGCGGGCGGCGGCGCGCGGACCCAGGCGCGGATGGCGAGGCCAGCAGGGATGATG
GCGCCACTTCCTCAGTTTCGGCACTCAAGCGCCTGGAACGGAGTCAGTGGACGGATAAGATGGATTTGCG
GTTTGGTTTTGAGCGGCTGAAGGAGCCTGGTGAGAAGACAGGCTGGCTCATTAACATGCATCCTACCGAG
ATTTTAGATGAAGATAAGCGCTTAGGCAGTGCAGTGGATTACTACTTTATTCAAGATGACGGAAGCAGAT
TTAAGGTGGCTTTGCCCTATAAACCGTATTTCTACATTGCGACCAGAAAGGGTTGTGAGCGAGAAGTTTC
ATCTTTTCTCTCCAAGAAGTTTCAGGGCAAAATTGCAAAAGTGGAGACTGTCCCCAAAGAGGATCTGGAC
TTGCCAAATCACTTGGTGGGTTTGAAGCGAAATTACATCAGGCTGTCCTTCCACACTGTGGAGGATCTTG
TCAAAGTGAGGAAGGAGATCTCCCCTGCCGTGAAGAAGAACAGGGAGCAGGATCACGCCAGCGACGCGTA
CACAGCTCTGCTTTCCAGTGTTCTGCAGAGGGGCGGTGTCATTACTGATGAAGAGGAAACCTCTAAGAAG
ATAGCTGACCAGTTGGACAACATTGTGGACATGCGCGAGTACGATGTTCCCTACCACATCCGCCTCTCCA
TTGACCTGAAGATCCACGTGGCTCATTGGTACAATGTCAGATACCGAGGAAATGCTTTTCCGGTAGAAAT
CACCCGCCGAGATGACCTTGTTGAACGACCTGACCCTGTGGTTTTGGCATTTGACATTGAGACGACCAAA
CTGCCCCTCAAGTTTCCTGATGCTGAGACAGACCAGATTATGATGATTTCCTACATGATCGATGGCCAGG
GCTACCTCATCACCAACAGGGAGATTGTTTCAGAAGATATTGAAGATTTTGAGTTCACCCCCAAGCCAGA
ATATGAAGGCCCCTTTTGTGTCTTCAATGAACCCGATGAGGCTCATCTGATCCAAAGGTGGTTTGAACAC
GTCCAGGAGACCAAACCCACCATCATGGTCACCTACAACGGGGACTTTTTTGACTGGCCATTTGTGGAGG
CCCGGGCAGCAGTCCACGGTCTGAGCATGCAGCAGGAGATAGGCTTCCAGAAGGACAGCCAGGGGGAGTA
CAAGGCGCCCCAGTGCATCCACATGGACTGCCTCAGGTGGGTGAAGAGGGACAGTTACCTTCCTGTGGGC
AGTCATAATCTCAAGGCGGCCGCCAAGGCCAAGCTAGGCTATGATCCCGTGGAGCTAGACCCGGAGGACA
TGTGCCGGATGGCCACGGAGCAGCCCCAGACTCTGGCCACGTATTCTGTGTCAGATGCTGTCGCCACTTA
CTACCTGTACATGAAGTACGTCCACCCATTCATCTTTGCTCTGTGCACCATTATTCCCATGGAGCCCGAC
GAGGTGCTGCGGAAGGGCTCTGGCACTCTGTGTGAGGCCTTGCTGATGGTGCAGGCCTTCCACGCCAACA
TCATCTTCCCCAACAAGCAAGAGCAGGAGTTCAATAAGCTGACGGACGACGGACACGTGCTGGACTCTGA
GACCTACGTCGGGGGCCACGTGGAGGCCCTCGAGTCTGGGGTTTTCCGCAGCGATATCCCTTGCCGGTTT
AGGATGAATCCTGCCGCCTTTGACTTCCTGCTGCAGCGGGTTGAGAAGACCTTGCGCCACGCCCTTGAGG
AAGAGGAGAAAGTGCCTGTGGAGCAAGTCACCAACTTTGAAGAGGTGTGTGATGAGATTAAGAGCAAGCT
3
TGCCTCCCTGAAGGACGTTCCCAGCCGCATCGAGTGTCCACTCATCTACCACCTGGACGTGGGGGCCTGA
On a une séquence d’un génome,
et aprés?

Quel sont donc les problemes?

– On ne sait pas combien de
genes il y’a
– On ne sait pas oû ils sont
ces genes
– On ne sait pas leurs
fonctions

Une fois la séquence d’un génome obtenue, il s’agit de la faire parler

Et aprés la séquence?
AGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAGCTAGAGATCCCTCAGACCCTTTGT
GGTAGTGTGGAAAATCTCTAGCAGTGGCGCCCGAACAGGGACTTAAAAGCGAAAGTAAGA
CCAGAGGAGATCTCTCGACGCAGGACTCGGCTTGCTGAAGTGCACTCGGCAAGAGGCGAG
AGGGGCGGCTGGTGAGTACGCCATTTTTTATTTGACTAGCGGAGGCTAGAAGGAGAGAGA
TGGGTGCGAGAGCGTCAATATTAAGAGGCGAAAAATTAGATAAATGGGAAAGAATTAGGT
TAAGGCCAGGGGGAAAGAAAAGCTATATGATATAGCACTTAATATGGGCAAGCAGGGAGC
TGGAAAGATTTGCACTCAACTCTGGCCTTTTAGAAACATCAGGAGGCTGTAAACAAATAA
TGAAACAGCTACAACCAGCTCTACAGACAGGAACAGAGGAACTTAAATCATTATATAACA Un discours étrange:
CAGTAGCAACTCTCTATTGTGTACATGAAAAAATAGAAGTACGAGACACCAAGGAAGCCT
TAGACAAGATAGAGGAAGAACAAAACAAAAGTCAGCAAAAAACACAGCAGGCAGCTGACG exprimé dans la langue de
GAAAGGTCAGTCAAAATTATCCTATAGTGCAGAATCTTCAAGGGCAAATGGTACATCAAG
CCATATCACCTAGAACCTTGAATGCATGGGTAAAAGTAATAGAGGAGAAGGCTTTTAGCC
l’ADN (ATCG)
CAGAGGTAATACCCATGTTTACAGCATTATCAGAAGGAGCCACCCCACAAGATTTAAACA
CCATGTTAAATACGGTGGGGGGACATCAAGCAGCCATGCAAATGTTAAAGGATACCATCA
ATGAAGAGGCTGCAGAATGGGATAGATTACATCCAGTACATGCGGGGCCTATTGCACCAG
GCCAAATGAGAGAACCAAGGGGAAGTGACATAGCAGGAACTACTAGTACCCTTCAGGAAC
AAATATCATGGATAACAGGTAACCCACCTATTCCAGTGGGAGAAATCTATAAAAGATGGA
TAATTCTGGGGTTAAACAAAATAGTGAGAATGTATAGCCCTGTCAGCATTTTGGACATAA
La séquence du génome peut être comparer à un livre écrit dans
une langue inconnue. Il faut donner un sens au texte du génome:
trouver les mots, les phrases, le sens de chaque phrase et les liens entre
elles.

Annoter un génome, c’est donner un sens à sa séquence : trouver les mots, la

ponctuation, le sens de chaque phrase et les liens entre elles …“ .

Ces analyses font partie de la discipline nommée Genomique.

Génomique

Génomique : discipline qui étudie les génomes, visant à

dresser l’inventaire de l’ensemble des gènes d’un
organisme, à les localiser sur les chromosomes, à
déterminer leur séquence et à étudier leur fonction.
Annotation des génomes
Suite au séquençage d’un génome, on doit procéder à l’étape
suivante qui est l’ annotation des génomes.

Annotation:
Etablir un inventaire (par des méthodes bioinformatiques) de tous
les éléments génétiques (emplacement de génes, leur traduction,
localisation de promoteurs etc…) présents dans un génome, ainsi
que leurs fonctions.
Annotation des génomes

Deux niveaux d’annotation:

1- Annotation structurelle: Inventaire et analyse des éléments

presents dans un génome
• Identification de tous les génes codants pour des protéines et
des ARN (ribosomique, de transfert).

• Identification de sites promoteurs, de terminaison de la

transcription et de la traduction, d’epissage, intron, exons…etc

2- Annotation fonctionnelle
Identifier la fonction de tous les gènes détectés lors
de la première étape
Annotation structurelle des
génomes

La localisation des gènes:

• C’est la première étape pour interpréter un génome

Distinction entre régions codantes et non codantes

• Réalisée par des programmes informatiques combinant différents types

d’informations.
Comment reconnaitre un gène?

caractéristiques des gènes:

Cadre de lecture ouvert
Open Reading frame (ORF)
Un «cadre de lecture ouvert » ou «Open Reading Frame (ORF)” est une region d’ADN
entre un codon START et un codon STOP
sequence codante
Coding sequence (CDS)
Si on établit qu’un ORF code pour une protéine ou un ARN on le
désigne comme séquence codante (CDS) = gène
Difference entre ORF et CDS

Un «cadre de lecture ouvert » ou «Open Reading

Frame (ORF)” est une region d’ADN entre un codon
START et un codon STOP

Une sequence codantes (CDS) = gène

code pour une proteine ou un ARN

Attention: Un gène (CoDing Sequence CDS) est un ORF,

mais un ORF ne correspond pas forcément a un gène!
Recherche d’ORFs:
Les phases de lecture possibles dans un ARNm
ARNm
+1 phase de lecture

+2 phase de lecture

+3 phase de lecture
Recherche d’ORFs
“Scanning” d’une séquence d’ADN pour identifier des ORF
(traduction a l’aveugle):
Chaque séquence d’ADN peut traduite en 6 phases de lectures
différentes:
3 dans un brin et 3 sur le brin complémentaire

+3
+2
+1
5’ 3’
3’ 5’
-1
-2
-3
16
Open reading frames - ORF

Spring 2002 Christophe Roos - 2/6 Gene

finding
Recherche des ORF

une séquence d’ADN

logiciel DNA strider
1000 2000 3000 4000 5000 6000 7000

3> 3>

2> 2>

1> 1>

<1 <1

<2 <2

<3 <3

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

Recherche des ORF

logiciel DNA strider (C. Marck)

1000 2000 3000 4000 5000 6000 7000

3> 3>

2> 2>

1> 1>

<1 <1

<2 <2

<3 <3

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

Recherche des ORF

1000 2000 3000 4000 5000 6000 7000

3> 3>

2> 2>

1> 1>

<1 <1

<2 <2

<3 <3

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

Recherche des ORF

1000 2000 3000 4000 5000 6000 7000

3> 3>

2> 2>

1> 1>

<1 <1

<2 <2

<3 <3

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

Recherche des ORF

1000 2000 3000 4000 5000 6000 7000

3> 3>

2> 2>

1> 1>

<1 <1

<2 <2

<3 <3

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture
- codon ATG Informations dans ces régions ?
- codon stop Promoteurs et terminateurs des gènes
Méthodes de prédiction de gènes (CDS)

Il existe 2 méthodes pour identifier les gènes

La méthode Ab initio La méthode comparative

ou “Codon Preference”
La prédiction des gènes par la
méthode du “Codon Preference
Les séquences codantes pour des gènes ont une particularité forte : code génetique
Escherichia coli vs Streptococcus pyogenes

UU UA
Phe U 1.15 1.57 Ser UCU 0.87 1.55 Tyr U 1.14 1.32 Cys UGU 0.89 1.41
UU
C 0.85 0.43 UCC 0.89 0.44 UAC 0.86 0.68 UGC 1.11 0.59
Usage des codons UU UA
(Codon Bias)Leu A 0.78 1.92 UCA 0.74 1.54 TER A 1.90 1.90 TER UGA 0.87 0.48
UU UA
G 0.77 1.30 UCG 0.92 0.32 G 0.23 0.62 Trp UGG 1.00 1.00
CU
U 0.62 1.26 Pro CCU 0.63 1.60 His CAU 1.14 1.33 Arg CGU 2.28 2.43
CUC 0.63 0.45 CCC 0.49 0.36 CAC 0.86 0.67 CGC 2.40 0.99
CU
A 0.22 0.68 CCA 0.76 1.71 Gln CAA 0.69 1.45 CGA 0.38 0.73
CU
G 2.98 0.39 CCG 2.11 0.33 CAG 1.31 0.55 CGG 0.59 0.33
AU AA
Ile U 1.52 1.91 Thr ACU 0.66 1.27 Asn U 0.90 1.35 Ser AGU 0.91 1.37
AU
C 1.26 0.78 ACC 1.74 0.90 AAC 1.10 0.65 AGC 1.66 0.78
AU AA
A 0.22 0.31 ACA 0.52 1.35 Lys A 1.53 1.48 Arg AGA 0.22 1.10
AU AA
Met G 1.00 1.00 ACG 1.07 0.49 G 0.47 0.52 AGG 0.12 0.41
GU La préférence est propre àGAl’espèce considérée
Val U 1.04 1.74 Ala GCU 0.64 1.84 Asp U 1.26 1.40 Gly GGU 1.35 1.62
Méthode comparative de
prediction des gènes
On compare la séquence des ORF avec les séquences d’autres espèces qui sont
disponibles dans les banques de données

La logique de cette approche est basée sur le fait que si la séquence d’un gène G X chez
l’espèce X est similaire a celle d’un gène Gy chez l’espèce Y dont la fonction est connue,
le fonction du gène GX est la même que celle du gène Gy..
• Les programmes d’identification des genes sont prédictifs.
• Ils génèrent des erreurs certains gènes échappent a la détection.
•Certain gènes prédit ne correspondent pas à de vrais gènes.

• Même pour les prédictions correspondent a des gènes réels, les limites précisent des
gènes sont parfois erronées
Qu’est ce qu’on peux connaitre sur un
organisme a partir de l’annotation
structurelle?
Statistiques des genomes
• Size: 0.49 Mb (Nanoarchaeum equitans ), ~9.2 Mb (Myxococcus xanthus )
• Chromosome: 1, but few bacteria have 2 (Vibrio);
circular and linear (Streptomyces)
• GC content: 26% (Buchnera) up to 72% (Streptomyces coelicolor )
• Codon usage: Different bacteria prefer to use specific codons over others
• Coding density: ~80-90% is coding, ~1 gene for every 1000 bases.
• Huge degree of diversity: genomes of closley related bacterial species
differ both in their contents and organizations (insertions, deletions,
rearrangements)

Codon
table
BP GENE
• Haemophilus influenzae 1,830,137 1,743
• Campylobacter jejuni 1,641,481 1,708
• Mycobacterium tuberculosis 4,115,291 3,924
• Neisseria meningitidis 2,184,406 2,121
• Escherichia coli 4,639,2214,288
Typically ~90% of the genome is coding
There is on average 1 gene/1000bp
Taille des génomes

• The genome size varies

significantly from
species to species.
Taille des génomes

Prise de conscience de plusieurs problèmes fondamentaux

Annotation fonctionnelle
Une fois que que tout les élément génétiques sur le génome
ont été identifiés, on doit passer a l’étape suivante: annotation
fonctionnelle

Computational Biology, Part 8: Protein Coding Regions
No ratings yet
Computational Biology, Part 8: Protein Coding Regions
40 pages
Ghosh and Mallik
No ratings yet
Ghosh and Mallik
68 pages
Gene Prediction
No ratings yet
Gene Prediction
17 pages
Gene Prediction
No ratings yet
Gene Prediction
5 pages
Genomes 4 (C-5, Genome Annotation)
No ratings yet
Genomes 4 (C-5, Genome Annotation)
16 pages
Omputational ENE Rediction: Cse/Bimm/Beng 181 M 24, 2011
No ratings yet
Omputational ENE Rediction: Cse/Bimm/Beng 181 M 24, 2011
45 pages
Lecture 2 Revision of Concepts NR
No ratings yet
Lecture 2 Revision of Concepts NR
12 pages
BioAlg10 9
No ratings yet
BioAlg10 9
69 pages
Lec (6) - Gene Prediction
No ratings yet
Lec (6) - Gene Prediction
19 pages
LAB 5 - Gene Discovery
No ratings yet
LAB 5 - Gene Discovery
10 pages
Gene Prediction
No ratings yet
Gene Prediction
15 pages
Lect9 GeneFinding 2010-03-25 LTK
No ratings yet
Lect9 GeneFinding 2010-03-25 LTK
84 pages
Assignment 3
No ratings yet
Assignment 3
8 pages
Section A Finding Public Biological Databases
No ratings yet
Section A Finding Public Biological Databases
4 pages
Gene Identification - I: Shivani Chandra Birla Institute of Scientific Research
No ratings yet
Gene Identification - I: Shivani Chandra Birla Institute of Scientific Research
35 pages
Gene Pridiction and Orf
No ratings yet
Gene Pridiction and Orf
34 pages
Lva1 App6891 PDF
No ratings yet
Lva1 App6891 PDF
33 pages
CUBT401 - 4 - Sequence and Genome Annotation
No ratings yet
CUBT401 - 4 - Sequence and Genome Annotation
66 pages
Chapter 2 (Human Genome Project)
No ratings yet
Chapter 2 (Human Genome Project)
22 pages
Comp Method Gene
No ratings yet
Comp Method Gene
10 pages
Genome Annotation
No ratings yet
Genome Annotation
58 pages
An Overview of Gene Identification
No ratings yet
An Overview of Gene Identification
9 pages
Genetics for Biology Students
No ratings yet
Genetics for Biology Students
12 pages
Module 1 - Session 3 - Part 2
No ratings yet
Module 1 - Session 3 - Part 2
36 pages
Lecture 8 Chapter 11
No ratings yet
Lecture 8 Chapter 11
61 pages
Ans .: DNA Annotation or Genome Annotation Is The Process of Identifying The
100% (1)
Ans .: DNA Annotation or Genome Annotation Is The Process of Identifying The
3 pages
Part of Notes For BIO 305 April 2025 From DR
No ratings yet
Part of Notes For BIO 305 April 2025 From DR
6 pages
Genomic Annotation
No ratings yet
Genomic Annotation
2 pages
Advanced Genetics & Protein Synthesis
No ratings yet
Advanced Genetics & Protein Synthesis
48 pages
PM703 Practical Biotechnology (2019) PM703 Practical Biotechnology (2019)
No ratings yet
PM703 Practical Biotechnology (2019) PM703 Practical Biotechnology (2019)
20 pages
Unit-6 Information Transfer
No ratings yet
Unit-6 Information Transfer
10 pages
Gene Prediction
No ratings yet
Gene Prediction
50 pages
Exercises 2013 Teachernotes
No ratings yet
Exercises 2013 Teachernotes
16 pages
Genetic Code, Translation, Gene Expression Regulation, HGP
100% (1)
Genetic Code, Translation, Gene Expression Regulation, HGP
26 pages
Lecture 4 - Open Reading Frame
No ratings yet
Lecture 4 - Open Reading Frame
16 pages
Lesson 2 Genetic Code KES
No ratings yet
Lesson 2 Genetic Code KES
19 pages
Gene Structure and Identification: Genes and Genomes Orfs and More Consensus Sequences Gene Finding
No ratings yet
Gene Structure and Identification: Genes and Genomes Orfs and More Consensus Sequences Gene Finding
16 pages
Lecture 1: Genes and The Genetic Code Bioinformatics: Definition?
No ratings yet
Lecture 1: Genes and The Genetic Code Bioinformatics: Definition?
4 pages
CL662 PW 02 Gene Finding
No ratings yet
CL662 PW 02 Gene Finding
39 pages
Annotation
No ratings yet
Annotation
2 pages
Bioinfo Unit-1
No ratings yet
Bioinfo Unit-1
4 pages
Gene, Proteins, and Genetic Code
No ratings yet
Gene, Proteins, and Genetic Code
37 pages
Rosales
No ratings yet
Rosales
27 pages
Biochimie PCEM1: Exercices et Études
100% (2)
Biochimie PCEM1: Exercices et Études
26 pages
Computational Problem For Practice
No ratings yet
Computational Problem For Practice
18 pages
Gene Annotation Compatible
No ratings yet
Gene Annotation Compatible
17 pages
BCM514 2019-3 Modifications
No ratings yet
BCM514 2019-3 Modifications
119 pages
Genome Functional Annotation
No ratings yet
Genome Functional Annotation
24 pages
Post Genomics
No ratings yet
Post Genomics
82 pages
Module 1 - Session 3 - Part 3
No ratings yet
Module 1 - Session 3 - Part 3
21 pages
Bioinformatics
No ratings yet
Bioinformatics
11 pages
HW 13
No ratings yet
HW 13
6 pages
4th Class
No ratings yet
4th Class
19 pages
Unit 6 - Bioinformatics
No ratings yet
Unit 6 - Bioinformatics
41 pages
Rna Gene Finding
No ratings yet
Rna Gene Finding
13 pages
BPS 3101 Mid 1 Study Guide
No ratings yet
BPS 3101 Mid 1 Study Guide
32 pages
UNIT-6 (BIOLOGY) - Debkanta
No ratings yet
UNIT-6 (BIOLOGY) - Debkanta
29 pages
(Ebook) The Lentil: Botany, Production and Uses by W. Erskine, F. Muehlbauer, A. Sarker, B. Sharma ISBN 9781845934873, 9781845934880, 1845934873, 1845934881 All Chapters Instant Download
100% (4)
(Ebook) The Lentil: Botany, Production and Uses by W. Erskine, F. Muehlbauer, A. Sarker, B. Sharma ISBN 9781845934873, 9781845934880, 1845934873, 1845934881 All Chapters Instant Download
81 pages
Ammonites
100% (4)
Ammonites
156 pages
Quantum Computing in Drug Design Enhancing Precision and Efficiency in Pharmaceutical Development
No ratings yet
Quantum Computing in Drug Design Enhancing Precision and Efficiency in Pharmaceutical Development
9 pages
Hypochlorous Acid
100% (2)
Hypochlorous Acid
25 pages
10.1007@s13204 020 01599 1
No ratings yet
10.1007@s13204 020 01599 1
10 pages
Genomic Medicine Principles and Practice 2nd Edition Dhavendra Kumar Instant Download
No ratings yet
Genomic Medicine Principles and Practice 2nd Edition Dhavendra Kumar Instant Download
82 pages
Gellan. Food Applications PDF
No ratings yet
Gellan. Food Applications PDF
13 pages
Report of Ms. Diana Pinto
No ratings yet
Report of Ms. Diana Pinto
1 page
Course Outline For NCM 114
100% (1)
Course Outline For NCM 114
3 pages
The Influence of Preparation Size On The Mechanical
No ratings yet
The Influence of Preparation Size On The Mechanical
4 pages
Encyclopedic Dictionary of Plant Breeding and Related Subjects 1st Edition Rolf H. J. Schlegel
100% (4)
Encyclopedic Dictionary of Plant Breeding and Related Subjects 1st Edition Rolf H. J. Schlegel
57 pages
SC 5 L 14 2-Plants and Animals
No ratings yet
SC 5 L 14 2-Plants and Animals
28 pages
Bio Syllabus Form 1
100% (1)
Bio Syllabus Form 1
3 pages
Lab # - Respiration II
No ratings yet
Lab # - Respiration II
4 pages
Understanding The Self
100% (1)
Understanding The Self
13 pages
Biochemistry Lab Report Part F
No ratings yet
Biochemistry Lab Report Part F
11 pages
EINC Retdem
No ratings yet
EINC Retdem
11 pages
Molecular Biology Techniques Manual PDF
No ratings yet
Molecular Biology Techniques Manual PDF
96 pages
Reading Comprehension Exercise 1 and Discussion
No ratings yet
Reading Comprehension Exercise 1 and Discussion
12 pages
The Animal Organ System - Lesson Outline
No ratings yet
The Animal Organ System - Lesson Outline
5 pages
Seri Final Notes
No ratings yet
Seri Final Notes
179 pages
Poster Presentation
No ratings yet
Poster Presentation
1 page
Animals Guess
No ratings yet
Animals Guess
1 page
Diabetes Complications in Elderly
No ratings yet
Diabetes Complications in Elderly
15 pages
Year 3 SKCTL English UASA
No ratings yet
Year 3 SKCTL English UASA
9 pages
Diversity of Cyperaceae in Brazil
No ratings yet
Diversity of Cyperaceae in Brazil
12 pages
Gut Thyroid Connection How To Fix Your Thyroid Through Your Gut
No ratings yet
Gut Thyroid Connection How To Fix Your Thyroid Through Your Gut
11 pages
Ayele Hegena Anabo-Indigenous People Rights and Protected Area Management
No ratings yet
Ayele Hegena Anabo-Indigenous People Rights and Protected Area Management
25 pages
ACTIVITY 3 BIOMEDICAL PERSPECTIVE IN GENDER AND SEXUALiTY
No ratings yet
ACTIVITY 3 BIOMEDICAL PERSPECTIVE IN GENDER AND SEXUALiTY
9 pages
From DNA To Potein
No ratings yet
From DNA To Potein
6 pages

Cours 7: Annotation Des Génomes

Uploaded by

Cours 7: Annotation Des Génomes

Uploaded by

Cours 7:

Annotation des génomes

Quel sont donc les problemes?

Une fois la séquence d’un génome obtenue, il s’agit de la faire parler

Annoter un génome, c’est donner un sens à sa séquence : trouver les mots, la

Ces analyses font partie de la discipline nommée Genomique.

Génomique : discipline qui étudie les génomes, visant à

Deux niveaux d’annotation:

1- Annotation structurelle: Inventaire et analyse des éléments

• Identification de sites promoteurs, de terminaison de la

La localisation des gènes:

• C’est la première étape pour interpréter un génome

• Réalisée par des programmes informatiques combinant différents types

caractéristiques des gènes:

Un «cadre de lecture ouvert » ou «Open Reading

Une sequence codantes (CDS) = gène

Attention: Un gène (CoDing Sequence CDS) est un ORF,

Spring 2002 Christophe Roos - 2/6 Gene

une séquence d’ADN

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

logiciel DNA strider (C. Marck)

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

1000 2000 3000 4000 5000 6000 7000

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

1000 2000 3000 4000 5000 6000 7000

1000 2000 3000 4000 5000 6000 7000

- 6 phases de lecture - codon ATG - codon stop

1000 2000 3000 4000 5000 6000 7000

1000 2000 3000 4000 5000 6000 7000

Il existe 2 méthodes pour identifier les gènes

La méthode Ab initio La méthode comparative

• The genome size varies

Prise de conscience de plusieurs problèmes fondamentaux

You might also like