0% found this document useful (0 votes)

88 views6 pages

Mid Bioinfor

The document is a mid-term examination for a Bioinformatics course at the International University, Vietnam National University, HCMC, dated 5/11/2021. It consists of two parts: a paper-based exam with short answer, multiple choice, matching, and calculation questions, and a computer-based exam involving BLAST and ORF Finder tasks. The exam is open book and requires students to answer questions immediately after they are posed.

Uploaded by

Khánh My Đỗ Bùi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

88 views6 pages

Mid Bioinfor

Uploaded by

Khánh My Đỗ Bùi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

THE INTERNATIONAL UNIVERSITY (IU) – VIETNAM NATIONAL UNIVERSITY – HCMC

MID-TERM EXAMINATION – CLASS

Date: 5/11/2021
Duration: 90 minutes

Student ID: .......BTBTIU19107....... Name: ….Lê Phước Quyền…….

SUBJECT: BIOINFORMATICS
Dean of School of Lecturer Proctor 1 Score
Biotechnology Signature:
Signature:

Proctor 2

Full name: Full name:

Dr. Nguyen Minh Thanh
(Sign and write
full name)

Instruction:
1. This is an open book examination
2. Student must answer right after a question

Part 1 – Paper-based exam (50 points)

A. Short answer (8 points)
1. Arrange the E-value in ascending order (lowest to highest): 8e-146, 7e-52, 0.0, 3e-45. (2 points)
0.0, 8e-146, 7e-52, 3e-45
2. PAM195 is a matrix where an average of -----75%---- amino acids have changed during evolution.
(write percentage in the blank). (2 points)
3. N50 is the length of contig/scaffold at which ------50%------ of the bases in a given assembly reside.
(write number or percentage in the blank) (2 points)
4. Give a name of assembly method that merges short reads to create a novel full-length DNA
sequence with no-prior reference sequence available. (2 points) De novo assembly

1
B. Multiple choice (20 points) (Please highlight your choice in yellow)

5. A heterotrimer contains
a. One subunit
b. Two identical subunits
c. Two identical subunits & one different subunit
d. Three different subunits
e. c&d

6. Which of the following is incorrect about next-generation sequencing (NGS)

technologies?
a. Fast
b. NGS generate a huge number of reads per run
c. NGS reduces the cost of sequencing dramatically
d. NGS reads are typically long
e. Lower accuracy in comparison with Sanger technology

7. The order of study of genome based on NGS technologies is

a. DNA library preparation – sequencing – trimming – de novo assembly – BUSCO
performance – chromosome assembly – annotation.
b. Sequencing – DNA library preparation – trimming – de novo assembly – BUSCO
performance – chromosome assembly – annotation.
c. DNA library preparation – annotation – trimming – de novo assembly – BUSCO
performance – chromosome assembly – sequencing.
d. DNA library preparation – sequencing – trimming – chromosome assembly – de
novo assembly – BUSCO performance – annotation.
e. DNA library preparation – trimming – sequencing – de novo assembly – BUSCO
performance – chromosome assembly – annotation.

8. Which of the following is incorrect about primary database?

a. Raw sequence data with some basic information
b. Redundancy
c. Contain only protein sequences
d. Majority of protein sequences derived from computational translation.
e. All above

9. You have two distantly related proteins. Which BLOSUM or PAM matrix is best suited to
compare them?
a. BLOSUM45 or PAM250
b. BLOSUM45 or PAM1
c. BLOSUM80 or PAM250
d. BLOSUM80 or PAM1
e. Non-best suited with the above options.

2
10. Which of the following is incorrect about typical basic BLAST output?
a. The subject sequences are listed from the highest similarity at the top to
progressively lower similarities going down the list.
b. The subject sequences are listed from the highest similarity at the top together with
the highest E-values.
c. The subject sequences are listed from the highest similarity at the top together with
the highest bit scores.
d. The E-values are listed from lowest value at the top to increasingly higher values
going down the list.
e. The bit scores are listed from the highest value at the top to progressively lower
values going down the list.

11. What is the difference between RefSeq and Gen-Bank?

a. RefSeq includes publicly available DNA sequences submitted from individual
laboratories and sequencing projects.
b. GenBank provides nonredundant curated data.
c. GenBank sequences are derived from RefSeq.
d. RefSeq sequences are derived from GenBank and provide nonredundant curated
data.
e. There is no difference between two databases.

12. Which of the following is correct about normalized BLAST scores (also called bit scores):
a. are unitless;
b. are not related to the scoring matrix that is used;
c. can be compared between different BLAST searches, even if different scoring
matrices are used;
d. can be compared between different BLAST searches, but only if the same scoring
matrices are used.
e. cannot be compared between different BLAST searches, if different scoring
matrices are used;

13. It is extremely difficult for intrinsic (ab initio) gene‐finding algorithms to predict protein‐
coding genes in eukaryotic genomic DNA. What is the main problem?
a. exon/intron borders are hard to predict;
b. introns may be many kilobases in length;
c. the GC content of coding regions is not always differentiated from the GC content
of noncoding regions;
d. All of the above.
e. None of the above.

3
14. Most sequencing technologies produce raw data in what format?
a. FASTA;
b. FASTQ;
c. FASTG;
d. FASTX;
e. FASTQC.

C. Matching (match an appropriate term with each definition) (10 points)

Terms:
DNA sequencing Accession number Contig
Chromosome Consensus sequence Coding sequence
Paralogs Orthologs Read

Definitions:
Definitions Terms
15. A unique identification is given to mark the entry of a sequence (protein Accession number
or nucleic acid) to a primary or secondary database.
16. A method determines the nucleotide sequence of a DNA molecule. DNA sequencing
17. Contiguous segment of a DNA that was generated by joining Contig
overlapping reads.
18. Part of the DNA that is transcribed into mRNA during transcription and Coding sequence
then translated into protein.
19. Homologous proteins that perform the same function in different Orthologs
species.

D. Calculation (12 points)

Consider the following alignment (| is perfect match, : is similar match, . is dissimilar match):

Question Answer
20. What is the length of the alignment? 17
21. What is the percent identity? 5/17 = 29.4 %
22. What is the percent similarity? 7/17 = 41.2 %
23. What is the percent gap? 8/17 = 47.1%

4
Part 2 – Computer-based exam (50 points)
Question 24: (14 points)
Set up an appropriate BLAST for the sequence with the file name as Unknown sequence_A. Answer
the following questions?
Question Answer
a. What is the accession number of the closest match NP_001187174
to the query sequence?
b. Protein name Somatotropin precursor
c. Length of peptide 200 aa
d. The common name of the species Channel catfish
e. The scientific name of the species Ictalurus punctatus
f. Function of the protein The function stimulate the liver and
other tissues to secrete IGF-1, which
stimulates both the differentiation and
proliferation of myoblasts.

Question 25: (36 points)

Run ORF Finder (https://www.ncbi.nlm.nih.gov/orffinder/) to predict potential genes in an unknown
sequence with the file name as Unknown sequence_B. Answer:
a. How many are Open Reading Frames (ORFs) found? (2 points)

b. The following information of the largest ORF: (7 points)

Question Answer
Label ORF143
Strand (+ or -) +
Frame number 2
Position of start nucleotide 39014
Position of stop nucleotide 42259
Length of ORF 3246 nt
Length of polypeptide translated from this frame 1081 aa

5
Navigate to ORF200 and perform a protein BLAST (BLASTP) for the polypeptide sequence
translated from ORF200, choose Non-redundant protein sequences (nr) database. Answer:

c. The following information of ORF200 from ORF Finder: (11 points)

Question Answer
Strand (+ or -) +
Frame number 3
Position of start nucleotide 32361
Position of stop nucleotide 34049
Length of ORF 1689 nt
Length of polypeptide translated from this frame 562 aa
Write the first five amino acids MRGCV
Write the nucleotide sequence of the coding strand ATGCGCGGGTGCGTA
that corresponds to the first five amino acids
Write the nucleotide sequence of the template TACGCGCCCACGCAT
strand that corresponds to the first five amino acids
Note: the template strand is the strand that is complementary to the coding strand

d. Results of the best hit from BLASTP: (16 points)

Question Answer
Accession number AEI75106
Max score 1033
E-value 0.0
Percent identity 100.00 %
Length of polypeptide 518 aa
Protein name Putative 30S ribosomal protein S1
Organism name Candidatus Tremblaya princeps PCIT

GOOD LUCK!

Exam Year Questions and Answers
No ratings yet
Exam Year Questions and Answers
8 pages
Exam Year Questions and Answers
No ratings yet
Exam Year Questions and Answers
8 pages
Bioinformatics Tutorial
No ratings yet
Bioinformatics Tutorial
12 pages
BC434 Mid'25-26
No ratings yet
BC434 Mid'25-26
2 pages
BIO Final22 Questionssol
No ratings yet
BIO Final22 Questionssol
16 pages
Computational Biology Problem Set
No ratings yet
Computational Biology Problem Set
10 pages
Bioinfo Key
No ratings yet
Bioinfo Key
3 pages
Sequence Similarity Search with BLAST
No ratings yet
Sequence Similarity Search with BLAST
19 pages
Bioinformatics
No ratings yet
Bioinformatics
11 pages
University of Kwazulu-Natal Bioinformatics Gene320 3 May 2016 Test 2 Duration 100 Minutes Total Marks: 70
No ratings yet
University of Kwazulu-Natal Bioinformatics Gene320 3 May 2016 Test 2 Duration 100 Minutes Total Marks: 70
6 pages
Bioinformatics Exam Guide
No ratings yet
Bioinformatics Exam Guide
6 pages
Assignment 2 - Database Searching - 19 Mar. 2024
No ratings yet
Assignment 2 - Database Searching - 19 Mar. 2024
4 pages
Practical 2 Sequence Alignment
No ratings yet
Practical 2 Sequence Alignment
8 pages
Biochem 225internal 2005
No ratings yet
Biochem 225internal 2005
1 page
TY-Exercise 4 (35) (Updated)
No ratings yet
TY-Exercise 4 (35) (Updated)
7 pages
Bif501-Mid File-By Asmat Khan Niazi
No ratings yet
Bif501-Mid File-By Asmat Khan Niazi
49 pages
Bioinfo
No ratings yet
Bioinfo
8 pages
Bioinformatics for Biochem Students
No ratings yet
Bioinformatics for Biochem Students
6 pages
Bioinformatics BLAST Assignment
100% (3)
Bioinformatics BLAST Assignment
5 pages
DNA, RNA, and Protein Analysis Guide
No ratings yet
DNA, RNA, and Protein Analysis Guide
12 pages
Bioinformatics Assingment - New Kandy - Draft
100% (1)
Bioinformatics Assingment - New Kandy - Draft
14 pages
M-SC BIOINFORMATICS
No ratings yet
M-SC BIOINFORMATICS
27 pages
Module in Tics
No ratings yet
Module in Tics
20 pages
BIO206 Term Test 2 Key
No ratings yet
BIO206 Term Test 2 Key
15 pages
18GEO104T
No ratings yet
18GEO104T
2 pages
BS10003 Mid-Spring23 QP Final
0% (1)
BS10003 Mid-Spring23 QP Final
4 pages
BIF Problems
No ratings yet
BIF Problems
2 pages
SLR-VC-47 P: Seat No. M.Sc. (Semester - I) (CBCS) Examination Nov/Dec-2018 Bioinformatics Basic Bioinformatics
No ratings yet
SLR-VC-47 P: Seat No. M.Sc. (Semester - I) (CBCS) Examination Nov/Dec-2018 Bioinformatics Basic Bioinformatics
38 pages
BI MSE 1 2 Marks
No ratings yet
BI MSE 1 2 Marks
6 pages
Solnlug
No ratings yet
Solnlug
10 pages
Part B PH.D Question Paper Course Title: Bioinformatics Course Code: A
No ratings yet
Part B PH.D Question Paper Course Title: Bioinformatics Course Code: A
7 pages
Asm 4
No ratings yet
Asm 4
12 pages
Biot 306 - 2017
No ratings yet
Biot 306 - 2017
4 pages
Ia1 Comp Bio QP 2025
No ratings yet
Ia1 Comp Bio QP 2025
2 pages
Chbe 473/594B Homework #1 Spring 2013 (Due Jan. 31, 2011 in Class) 1. Multiple Choice (Only One Correct Answer) (3' For Each Problem)
No ratings yet
Chbe 473/594B Homework #1 Spring 2013 (Due Jan. 31, 2011 in Class) 1. Multiple Choice (Only One Correct Answer) (3' For Each Problem)
6 pages
قواعد البيانات الحيوية - امتحان نهائي
No ratings yet
قواعد البيانات الحيوية - امتحان نهائي
19 pages
Bioinformatics Assingment - B8.Docx Alex Presly-37
No ratings yet
Bioinformatics Assingment - B8.Docx Alex Presly-37
10 pages
Assign 4 - GR5 - S22324
No ratings yet
Assign 4 - GR5 - S22324
9 pages
Quiz Dna
100% (3)
Quiz Dna
8 pages
Bioinformatics Cheat Sheet
No ratings yet
Bioinformatics Cheat Sheet
4 pages
TY-Exercise 4
No ratings yet
TY-Exercise 4
8 pages
Bio Informatics 02 (TYPES OF BLAST)
No ratings yet
Bio Informatics 02 (TYPES OF BLAST)
2 pages
Aanchal Maurya Bioinformatics 2
No ratings yet
Aanchal Maurya Bioinformatics 2
24 pages
Bioinformatics Questions Based On The Exit Exam
No ratings yet
Bioinformatics Questions Based On The Exit Exam
7 pages
Bioinformatics Answers
100% (1)
Bioinformatics Answers
13 pages
Bioinformatics Tutorial 2019
No ratings yet
Bioinformatics Tutorial 2019
54 pages
Bioinformatics Tools: Stuart M. Brown, PH.D Dept of Cell Biology NYU School of Medicine
No ratings yet
Bioinformatics Tools: Stuart M. Brown, PH.D Dept of Cell Biology NYU School of Medicine
50 pages
BIOT643 Midterm Exam Summer 2016
No ratings yet
BIOT643 Midterm Exam Summer 2016
4 pages
Test Exam With Answers
No ratings yet
Test Exam With Answers
11 pages
CL662 HW3
No ratings yet
CL662 HW3
5 pages
Bi183 HW2
No ratings yet
Bi183 HW2
4 pages
General Biology Exam Questions
100% (1)
General Biology Exam Questions
4 pages
W9-SIO1003 Practical 4-Questions
No ratings yet
W9-SIO1003 Practical 4-Questions
6 pages
Lecture 4
No ratings yet
Lecture 4
106 pages
Rosales
No ratings yet
Rosales
27 pages
Part B PH.D Question Paper Course Title: Bioinformatics Course Code: C
No ratings yet
Part B PH.D Question Paper Course Title: Bioinformatics Course Code: C
7 pages
Lecture - 02 - Comparative Sequence Analysis
No ratings yet
Lecture - 02 - Comparative Sequence Analysis
28 pages
Bioinformatics Worksheet: Bacteria ID & Sequence Analysis
No ratings yet
Bioinformatics Worksheet: Bacteria ID & Sequence Analysis
5 pages
Search Sequence Database
No ratings yet
Search Sequence Database
6 pages
Bmri2013 268249
No ratings yet
Bmri2013 268249
10 pages
Production and Market Comparison of Urok
No ratings yet
Production and Market Comparison of Urok
12 pages
PharmBT Le Huynh Khanh Doan Assignment 1
No ratings yet
PharmBT Le Huynh Khanh Doan Assignment 1
7 pages
Isolation, Production, Assay and Characterization of Fibrinolytic Enzymes (Nattokinase and Streptokinase) From Bacteria
No ratings yet
Isolation, Production, Assay and Characterization of Fibrinolytic Enzymes (Nattokinase and Streptokinase) From Bacteria
4 pages
Students on Ice: Future Leaders
No ratings yet
Students on Ice: Future Leaders
29 pages
Wang, 2021
No ratings yet
Wang, 2021
15 pages
Cell Organelles & Compartmentalization
No ratings yet
Cell Organelles & Compartmentalization
82 pages
Cell Theory & Cell Structures Guide
No ratings yet
Cell Theory & Cell Structures Guide
3 pages
Aseptic Technique & Culture Media Guide
No ratings yet
Aseptic Technique & Culture Media Guide
4 pages
Ravichandran 2019
No ratings yet
Ravichandran 2019
99 pages
Research Scientist Profile: Molecular Biology & Cancer
No ratings yet
Research Scientist Profile: Molecular Biology & Cancer
8 pages
TPJF Manuscript Template
No ratings yet
TPJF Manuscript Template
22 pages
Earth's Evolving Systems 2nd Edition (Ebook PDF) Instant Download
100% (1)
Earth's Evolving Systems 2nd Edition (Ebook PDF) Instant Download
56 pages
Official MATATAG Weekly Lesson Log Format
88% (8)
Official MATATAG Weekly Lesson Log Format
3 pages
Anatomy & Physiology: Nervous System
No ratings yet
Anatomy & Physiology: Nervous System
22 pages
Untitled
No ratings yet
Untitled
4 pages
Final Examination in People and The Earths Ecosystem
No ratings yet
Final Examination in People and The Earths Ecosystem
3 pages
Grade 5 Science Lesson Plan: Animal Reproduction
No ratings yet
Grade 5 Science Lesson Plan: Animal Reproduction
7 pages
Rekap Nik & Sip PKM Suri
No ratings yet
Rekap Nik & Sip PKM Suri
6 pages
Slip Test - 2 (+2) and Slip Test - 1 (+1) Time Table & Syllabus July 2025
No ratings yet
Slip Test - 2 (+2) and Slip Test - 1 (+1) Time Table & Syllabus July 2025
4 pages
BioTech Worksheet
No ratings yet
BioTech Worksheet
10 pages
Worksheet - Adaptation in Plants
No ratings yet
Worksheet - Adaptation in Plants
2 pages
Correlation and Path Coefficient Analysis in Fodder Maize
No ratings yet
Correlation and Path Coefficient Analysis in Fodder Maize
6 pages
Oligonucleotide Chemist Resume
No ratings yet
Oligonucleotide Chemist Resume
2 pages
Microbiology Chapter 1 For Student
No ratings yet
Microbiology Chapter 1 For Student
107 pages
Course Result Slip
No ratings yet
Course Result Slip
2 pages
15 To 20
No ratings yet
15 To 20
5 pages
Biology For The AP&#174 Course - James Morris - Exported
No ratings yet
Biology For The AP&#174 Course - James Morris - Exported
455 pages
Front-Matter Foundations of Fisheries Science
No ratings yet
Front-Matter Foundations of Fisheries Science
16 pages
Sas Bahasa Inggris
No ratings yet
Sas Bahasa Inggris
6 pages
Biochemical Circuits: Engineering Insights
No ratings yet
Biochemical Circuits: Engineering Insights
3 pages
Nervous System Reflexes Worksheet
100% (2)
Nervous System Reflexes Worksheet
4 pages
Grade-07 Science Chapter10 Respiration-In-Organisms
No ratings yet
Grade-07 Science Chapter10 Respiration-In-Organisms
9 pages
Evolutionary Journey of Humankind
No ratings yet
Evolutionary Journey of Humankind
3 pages