Annotation d’un gène bactérien
Gataagtatctggcggatatttatcagcttgcccggcagcgtctggcgaacgtgggtgttgagcaaat
tttcggcggcgaccgttgtacatatacggaaaatgagactttcttctcttatcgtcgcgacaagacca
ccggtcgtatggcaagtttcatttggctgatataacctaaagaatcaagacgatccggtacgcgtgat
tttcttttcacattaatctggtcaataaccttgaataattgagggatgacctcatttaatctccagta
+1
gcaactttgatccgttatgggaggagttatgcgtctggatcgtcttactaataaattccagcttgctc
ttgccgatgcccaatcacttgcactcgggcacgacaaccaatttatcgaaccacttcatttaatgagc
gccctgctgaatcaggaagggggttcggttagtcctttattaacatccgctggcataaatgctggcca
gttgcgcacagatatcaatcaggcattaaatcgtttaccgcaggttgaaggtactggtggtgatgtcc
agccatcacaggatctggtgcgcgttcttaatctttgcgacaagctggcgcaaaaacgtggtgataac
tttatctcgtcagaactgttcgttctggcggcacttgagtctcgcggcacgctggccgacatcctgaa
agcagcaggggcgaccaccgccaacattactcaagcgattgaacaaatgcgtggaggtgaaagcgtga
acgatcaaggtgctgaagaccaacgtcaggctttgaaaaaatataccatcgaccttaccgaacgagcc
gaacagggcaaactcgatccggtgattggtcgtgatgaagaaattcgccgtaccattcaggtgctgca
acgtcgtactaaaaataacccggtactgattggtgaacccggcgtcggtaaaactgccatcgttgaag
gtctggcgcagcgtattatcaacggcgaagtgccggaagggttgaaaggccgccgggtactggcgctg
gatatgggcgcgctggtggctggggcgaaatatcgcggtgagtttgaagaacgtttaaaaggcgtgct
taacgatcttgccaaacaggaaggcaacgtcatcctatttatcgacgaattacataccatggtcggcg
cgggtaaagccgatggcgcaatggacgccggaaacatgctgaaaccggcgctggcgcgtggtgaattg
cactgcgtaggtgccacgacgcttgacgaatatcgccagtacattgaaaaagatgctgcgctggaacg
tcgtttccagaaagtgtttgttgccgagccttctgttgaagataccattgcgattctgcgtggcctga
aagaacgttacgaattgcaccaccatgtgcaaattactgacccggcaattgttgcagcggcgacgttg
tctcatcgctacattgctgaccgtcagctgccggataaagccatcgacctgatcgatgaagcagcatc
cagcattcgtatgcagattgactcaaaaccagaagaactcgaccgactcgatcgtcgtatcatccagc
tcaaactggaacaacaggcgttaatgaaagagtctgatgaagccagtaaaaaacgtctggatatgctc
aacgaagaactgagcgacaaagaacgtcagtactccgagttagaagaagagtggaaagcagagaaggc
atcgctttctggtacgcagaccattaaagcggaactggaacaggcgaaaatcgctattgaacaggctc
gccgtgtgggggacctggcgcggatgtctgaactgcaatacggcaaaatcccggaactggaaaagcaa
ctggaagccgcaacgcagctcgaaggcaaaactatgcgtctgttgcgtaataaagtgaccgacgccga
aattgctgaagtgctggcgcgttggacggggattccggtttctcgcatgatggaaagcgagcgcgaaa
aactgctgcgtatggagcaagaactgcaccatcgcgtaattggtcagaacgaagcggttgatgcggta
tctaacgctattcgtcgtagccgtgcggggctggcggatccaaatcgcccgattggttcattcctgtt
cctcggcccaactggtgtggggaaaacagagctttgtaaggcgctggcgaactttatgtttgatagcg
acgaggcgatggtccgtatcgatatgtccgagtttatggagaaacactcggtgtctcgtttggttggt
gcgcctccgggatatgtcggttatgaagaaggtggctacctgaccgaagcggtgcgtcgtcgtccgta
ttccgtcatcctgctggatgaagtggaaaaagcgcatccggatgtcttcaacattctgttgcaggtac
tggatgatgggcgtctgactgacgggcaagggagaacggtcgacttccgtaatacggtcgtcattatg
acctctaacctcggttccgatctgattcaggaacgcttcggtgaactggattatgcgcacatgaaaga
gctggtgctcggtgtggtaagccataacttccgtccggaattcattaaccgtatcgatgaagtggtgg
tcttccatccgctgggtgaacagcacattgcctcgattgcgcagattcagttgaaacgtctgtacaaa
cgtctggaagaacgtggttatgaaatccacatttctgacgaggcgctgaaactgctgagcgagaacgg
ttacgatccggtctatggtgcacgtcctctgaaacgtgcaattcagcagcagatcgaaaacccgctgg
cacagcaaatactgtctggtgaattggttccgggtaaagtgattcgcctggaagttaatgaagaccgg
attgtcgccgtccagtaaatgataaaacgagccc ttc ggggctcgt ttttgtctataagttagacgga
aaagactatatttaagatgttttgcctgaaaagtgagcgaacgataaagtttttatatttttcgcttg
tcaggccggaataactccctataatgcgccaccactgacacggaacaacggcaaacacgccgccgggt
cggcggggttctcctgagaatctcaacagagaaaagcaaagaaatgcttgactctgtagcgggaaggc
gtattatgcacaccccgcgccgctgagaaaaagcgaagcggcactg
ORF
Shine-Dalgarno : fixation ribosome
Rouge : TATA box (région -10) ou primobox : tttaat et région -35 :
ttgaat
+1 : initiation transcription débute à AAC…
ORF commence bien au second ATG
Terminaison rho dépendante ou rho indépendante (nécessite pas le
facteur rho)-> présence d’un terminateur intrinsèque
ARNold => identifie séquence terminateur : ACGAGCCC
Terminateurs
TTC = boucle => formation épingle à cheveux