Species Concepts and Molecular Taxonomy
Species Concepts and Molecular Taxonomy
last class
§ All
the
naturalists
are
not
sa9sfied
with
any
one
of
the
concepts,
but
every
naturalist
vaguely
knows
while
talks
about
species
§ The
biological
concept
of
species
defines
as
the
members
of
the
popula9on
that
freely
interbreed
and
produce
fer9le
progeny
§ All
concepts
are
valid
to
date
but
s9ll
considered
as
flawed
1. Typological
species
concept
2. Nominalis9c
species
concept
3. Biological
species
concept
4. Evolu9onary
species
concept
Molecular
Taxonomy
Narendran
2009
Taxonomy
To
be
members
of
the
same
species,
if
>70%
of
their
genomes
hybridize
It
can
dis9nguish
related
and
unrelated
species
But
it
has
limita9on
to
use
for
assessing
intra-‐species
rela9onships
Labor-‐intensive
technique
§ require
large
amounts
of
input
DNA
Polymerase
Chain
ReacJon
(PCR)
1993
Gel
electrophoresis
Ao10.98
Ao11.00
Ao20.98
Ao21.00
Advantages
§ Discriminate
a
range
of
species
simultaneously
§ DifferenJaJon
by
electrophoresis,
no
need
sequencing
or
amplicons
processing
Disadvantages
§ Size
of
amplicon
needs
to
vary
substanJally
to
enable
discriminaJon
§ ITS
regions
contain
repeJJve
regions
cause
PCR
products
with
mulJple
bands
Summary
Taxonomic
Characters
Examples
§ Bird
feather
§ Morphological
Characters__
§ Fish
scale
§ Drosophila
sex
combs
Advantages:
§ Useful
for
discrimina9ng
closely
related
or
cryp9c
species
as
it
targets
mul9ple
gene9c
loci
§ DNA
fingerprint
generate
in
a
single
reac9on
§ Data
used
for
phylogene9c
reconstruc9on
in
some
instances
Disadvantages
§ Reproducibility
issues
§ Can’t
use
on
mixed
sample
RFLP
(Restric9on
Fragment
Length
Polymorphisms)
Involves
discrimina9on
of
species
based
on
restric9on
profile
of
amplicons
RFLP
(Restric9on
Fragment
Length
Polymorphisms)
RFLP
(Restric9on
Fragment
Length
Polymorphisms)
Involves
discrimina9on
of
species
based
on
restric9on
profile
of
amplicons
Advantages
§ Simultaneously
discriminate
a
range
of
species
§ Used
on
a
range
of
geneJc
markers
(i.e.,
not
restricted
to
size
variable
markers)
§ Provide
an
addiJonal
level
of
discriminaJon
if
differenJaJon
based
on
size
fails
§ Able
to
detect
new
types
in
some
instances
Disadvantages
§ Requires
digesJon
of
PCR
product
§ MutaJons
some
Jme
result
in
unidenJfied
RFLP
AFLP
(Amplified
Fragment
Length
Polymorphism)
Involves
liga9on
of
adaptors
to
digested
DNA
followed
by
PCR
using
primers
that
are
par9ally
adaptor
and
par9ally
gene-‐specific
AFLP
(Amplified
Fragment
Length
Polymorphism)
Gel
picture
AFLP
(Amplified
Fragment
Length
Polymorphism)
Advantages
§ Useful
for
discrimina9ng
closely
related
or
cryp9c
species
as
it
targets
mul9ple
gene9c
loci
Disadvantages
§ Requires
manipula9ons
in
addi9on
to
PCR
§ Can’t
use
on
mixed
sample
Specific
Nuclear
DNA
markers
Variable
Number
of
Tandem
Repeat
(VNTR)
DNA
is
repeated
hundreds
of
Jmes
in
human
genome.
They
vary
in
number
in
different
loci
and
differently
in
individuals.
Advantages
§ Targets
mulJple
geneJc
loci
that
useful
for
closely
related
or
crypJc
species.
§ ParJcularly
useful
for
tracing
populaJons
§ Automated
fragments
analysis
by
using
fluorescent
primers
§ MulJplexed
PCR
for
detecJon
of
fragment
analysis
§ Some
microsatellite
assays
apply
across
a
number
of
different
species
Disadvantages
§ Ini9al
assay
development
is
9me
consuming
§ Can’t
use
on
mixed
sample
Figure 1. Development and use of microsatellite markers for species identification.
Microsatellite
analysis
Microsatellite
CACACACA
Extract
Or genomic Transform
DNA Digest Ligate cut plasmids
DNA DNA into plasmid into bacteria
vector and grow
colonies
GTGTGTGT
Peaks indicate
GACTAAATCTGGTCTTATTT length of individual
CCGATTTCACACACAGATT amplicons
GTTGAGCAGTAAGGTATGA
Perform Perform
multiplex PCR fragment analysis
targeting multiple on fluorescent
microsatellites PCR products
Microsatellites occur at numerous loci within genomes and several of these markers
Jenkins
are usually
et
al.
Insects
2012
Microsatellite
analysis
With
advancing
technology,
LocaJon
of
the
COI
gene
in
the
human
mitochondrial
genome.
COI
or
COX1
is
one
of
the
three
cytochrome
c
oxidase
subunit
mitochondrial
genes.
COI
is
a
gene
used
as
a
DNA
barcode
to
idenJfy
animal
species.
The
COI
gene
sequence
is
suitable
for
this
role
because:
§ its
muta9on
rate
is
fast
enough
to
dis9nguish
closely
related
species
§ its
sequence
is
conserved
among
conspecifics.
>2%
sequence
divergence
is
typically
detected
between
closely
related
animal
species,
sugges9ng
that
the
barcode
is
effec9ve
for
many
animals.
DNA
barcodes
are
segments
of
approximately
600
base
pairs
of
COI
gene
(5’
region)
which
help
in
cataloguing
the
biodiversity.
DNA
barcoding
Disadvantages
§ Need
database
of
sequences
for
comparison
§ Prior
knowledge
of
the
barcoding
region
is
required
§ Individual
sequences
not
provide
sufficient
data
when
studying
crypJc
species
§ Mitochondrial
genes
are
maternally
inherited
that
decreased
barcode
diversity
Jenkins
et
al.
Insects
2012
Advantages
of
Molecular
data
§ Heritable
§ Unambiguous
§ Quan9ta9ve
§ Assessment
(homology)
is
easier
than
morphology
traits
§ Abundant
and
robust
to
evolu9onary
distance
§ Take
less
9me
How
do
you
give
a
name
to
a
virus?
Naming
the
coronavirus
disease
(COVID-‐19)
and
the
virus
that
causes
it
Virus:
Severe
Acute
Respiratory
Syndrome
Coronavirus
2
(SARS-‐CoV-‐2)
Why
do
the
viruses
have
different
names?
§ Different
processes
and
purposes
for
naming
viruses
and
diseases
§ Viruses
are
named
based
on
their
gene9c
structure
to
facilitate
the
development
of
diagnos9c
tests,
vaccines,
and
medicines
§ Viruses
are
named
by
the
Interna9onal
Commi_ee
on
Taxonomy
of
Viruses
(ICTV).
§ This
name
was
chosen
because
the
virus
is
gene9cally
related
to
the
coronavirus
responsible
for
the
SARS
outbreak
of
2003.
WHO
2022
Nobel
Prize
in
Physiology
or
Medicine
7 MAY 2010 VOL 328 SCIENCE www.sciencemag.org
RESEARCH ARTICLE
changed parts of their g
tors of these groups.
A Han- Han-
B
Neandertals French Chinese PNG Yoruba San Neandertals French Chinese PNG Yoruba San
-10
-8
-6
S
-4
-2
0
0 0
C
SNPs (ND)
SNPs
2
For
many
regions
of
the
genome,
the
variaJon
within
current
humans
is
old
enough
to
include
(ND,s,e) / E(ND,s,e))
Fig. 4. Selective sweep screen. (A) Schematic illustration of
Neandertals
(leN).
Thus,
for
SNPs
in
present-‐day
humans,
Neandertals
oNen
carry
1 the
derived
-‐1
the rationale for the selective sweep screen. For many
allele
(blue).
regions of the genome, the variation within current humans 0
However,
in
isgenomic
r egions
w here
a n
a dvantageous
old enough to include Neandertals (left). Thus, for SNPs m utaJon
a rises
( right,
r ed
star)
and
sweeps
to
high
frequency
or
fixaJon
ihumans,
in present-day n
present-‐day
Neandertals humans,
oftenNeandertals
carry thewderived
ill
be
devoid
of
-1derived
alleles.
allele (blue). However, in genomic regions where an
modern humans (81). The authors o
estimated the fraction of non-Africa
Han- Four
possible
scenarios
of
gene9c
mixture
involving
French Chinese PNG Yoruba San Neandertals.
affected by “archaic” gene flow to
almost an order of magnitude great
estimates,
Scenario
1
represents
gene
suggesting that theirfrom
flow
into
Neandertal
observ
other
archaic
not
hominins,
be entirely here
collecJvely
explained referred
byto
gene
as
Homo
erectus.
This
would
manifest
itself
as
segments
of
Neandertals.
the
Neandertal
genome
with
unexpectedly
high
divergence
from
pImplications
resent-‐day
humans.
for
modern huma
One model
Scenario
2
represents
gene
for modern
flow
between
humanlate
origins s
Neandertals
aall nd
epresent-day
arly
modern
h umans
in
trace
humans Europe
allatheir
nd/or
an
western
Asia.
to a small African population that ex
Scenario
3
represents
gene
flow
between
Neandertals
replaced
and
the
ancestors
of
all
narchaic forms
on-‐Africans.
ofis
humans
This
the
most
with
Neandertals
ture.
par-‐
simonious
Our analysis
explanaJon
of
our
oof the Neandertal
bservaJon.
Although
g
we
detect
gene
notflow
beonly
from
Neandertals
compatible with this into
view
modern
bec
humans,
gene
flow
in
the
reverse
direcJon
may
also
dertals are on average closer to ind
have
occurred.
Homo erectus Eurasia othan
Scenario
4
represents
to individuals
ld
substructure
in tAfrica.
in
Africa
hat
F
individuals
persisted
from
the
origin
of
in Eurasia today
Neandertals
carry regi
unJl
the
ancestors
of
ngenome
on-‐Africans
thatleN
are
Africa.
This
scenario
closely relatedis
atolso
th
compaJble
with
the
current
data.
andertals and distant from other pres
6. Four possible scenarios of genetic mixture mans. The data suggest that between 1
Vol 464 | 8 April 2010 | doi:10.1038/nature08976
LETTERS
The complete mitochondrial DNA genome of an
unknown hominin from southern Siberia
Johannes Krause1, Qiaomei Fu1, Jeffrey M. Good2, Bence Viola1,3, Michael V. Shunkov4, Anatoli P. Derevianko4
& Svante Pääbo1
With
the
excepJon
of
Neanderthals,
from
which
DNA
sequences
of
numerous
With the exception of Neanderthals, from which DNA sequences containing both Upper and Middle Palaeolithic elements has been
individuals
of numerous individualshave
nhave ow
nowbeen
been d etermined
determined 1 1 ,
the
number
, the number reported (see and
geneJc
rInformation).
Supplementary elaJonships
of
and genetic relationships of other hominin lineages are largely We extracted DNA from 30 mg of bone powder and converted it
other
hominin
unknown. lineages
Here we report a complete are
mitochondrial
largely
unknown.
(mt) DNA Hinto ere
anwIllumina
e
report
a
complete
sequencing library using DNA adaptors that carry
mitochondrial
sequence retrieved from (mt)
a bone DNA
sequence
excavated in 2008 rin etrieved
Denisova from
a
bone
project-specific excavated
barcodes. in
2a 008
We next used recentlyin
published protocol
Cave in the Altai Mountains in southern Siberia. It represents a for targeted sequence retrieval called primer extension capture (PEC)1
Denisova
hitherto unknown Cave
typeiof n
hominin
the
Altai
mtDNAMountains
that shares a commonin
southern
to isolate Siberia.
It
represents
mtDNA fragments from the entirea
hmitochondrial
itherto
genome.
unknown
ancestor with tanatomically
ype
of
hominin
modern m human tDNA
mtDNAs about 1.0 million years ago. This indicates that it derives
andthat
shares
aThe
Neanderthal
GAcommon
a
isolated fragments ncestor
w
were sequenced ith
a
from natomically
both ends on the Illumina
II platform, using 76 cycles for each read. Fragments were included
modern
from a hominin human
migration and
outN ofeanderthal
Africa distinct from mtDNAs
that of the about
in further1.0
analyses
million
years
if their forwardago.
This
reads
and reverse indicates
overlapped by at
that
it
ofderives
from
theabone
hominin
ancestors of Neanderthals and of modern humans. The stra-
tigraphy the cave where was found msuggests
igra9on
thato ut
oThis
the f
Africa
removesdall is9nct
fragments from
that
bpo f
the
least 11 base pairs (bp) and thus could be merged into single sequences.
over ,134 from analysis, but reduces
ancestors
of
Neanderthals
Denisova hominin lived close in time
2–4
and and
of
modern
humans.
The
straJgraphy
of
the
cave
space with Neanderthals errors at the 39 ends of Illumina reads where error rates are highest21,22.
as well as with modern humans . We generated 1,178,300 merged sequences, of which 93,349 (7.9%)
where
The firstthominin
he
bone
group was
found
to leave Africaswasuggests
Homo erectus that
about
the
Daligned
enisova
to thehrevised
ominin
Cambridgelived
close
Sequence
Reference in
9me
(rCRS)23 using
and
space
years w ith
N eanderthals
. Archaeological asawell s
wasell
a s
w ith
manodern
iterative h umans 2–4.
MIA . This alignment program is
5 1
1.9 million (Myr) ago genetic data mapping assembler,
indicate that at least two groups of hominins left Africa after this event: particularly suitable for aligning ancient DNA sequences because it
first, the ancestors of the Neanderthals between 500,000 and 300,000 takes into account the frequent sequence errors associated with base
years ago (500 and 300 kyr ago, respectively), presumably Homo damage in ancient DNA sequences24,25. Owing to the library amplifica-
PhylogeneJc
tree
of
complete
mtDNAs.
35 caus
30 prob
Neanderthal–human even
25
20 and
15
(see
Denisova–human A
10
the r
5
and
0 the
0 50 100 150 200 250 300 350 400
Pairwise nucleotide distance initi
perh
Pairwise
nucleoJde
differences
from
all
pairs
of
complete
mtDNAs
from
54
present-‐
day
and
2 | Distribution direc
one
Figure
Pleistocene
of pairwise
modern
human,
nucleotide
six
Neanderthals
and
tdifferences. Pairwise
he
Denisova
hominin
are
shown.
nucleotide differences from all pairs of complete mtDNAs from 54 present- the
day and one Pleistocene modern human, six Neanderthals and the Denisova awai
Thank
you