0% found this document useful (0 votes)
75 views53 pages

01 (Primera)

informacion de cultivo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
75 views53 pages

01 (Primera)

informacion de cultivo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 53

Cover Page

The handle http://hdl.handle.net/1887/28328 holds various files of this Leiden University


dissertation.

Author: Vosse, Esther van de


Title: Positional Cloning in Xp22 : towards the isolation of the gene involved in X-linked
retinoschisis
Issue Date: 1998-01-07
CHAPTER!

INTRODUCTION
Introduction

CHAPTER 1 INTRODUCTION

1.1 X chromosome
The X chromosome is about 160Mb in length (1) and contains an estimated 2500-5000 genes.
The X chromosome has many special features that distinguishes it from the autosomes. The most
obvious is that it is one of the sex determining chromosomes; XX individuals are female and XY
individuals are male. All other chromosomes (the autosomes) are always present in two identical
copies but the sex chromosomes differ greatly from each other, not only in size and morphology
but also in gene content. Homologies and differences between the sex chromosomes are
discussed in 1.1.1. Since the X chromosome is present in either one or two copies, unequal
dosage of transcripts of X chromosomal genes in males and females would occur if not X
inactivation would compensate for this. Dosage compensation in XX individuals is provided by
transcriptional inactivation of a large fraction of the genes on one X chromosome; this is
discussed in 1.1.2.

1.1.1 XIY homology


TheY chromosome is 50Mb (2) in length and based on its size could contain an estimated 750-
1500 genes, however, this amount is an overestimation since theY chromosome is gene poor (see
below). TheY chromosome has two main functions: it is required for the male phenotype and
provides a pairing partner for the X chromosome during male meiosis. The gene (or genes)
required for initiating male development is called the testis determining factor (TDF). Only one
gene has been identified that is a candidate for TDF; sex-determining region Y (SRY) (3). Very
few other Y-specific genes have been isolated thus far (2) (the Deleted in Azoospermia gene
cluster (DAZ) gene family (4,5), Spermatogenesis gene on theY (SPGY) (6), testis-specific
protein, Y-encoded (TSPY) gene family (7), Ribonucleic acid binding motifs (RBM) gene family
(8)).
Only a small portion of the genes on the X and Y chromosome are shared, consequently
most genes on the X chromosome are haploid in males. The shared genes are transcribed from
both X and Y chromosome and are located in the pseudoautosomal regions (PAR) which
effectively behave as autosomal genes. The PAR1 region on the Xp and Yp telomeres is a region
of 2.6 Mb (2), delimited by the pseudoautosomal boundary (PAB). The PAR2 region (9) on
subtelomeric Xq and Yq is much smaller (320 kb ). So far only two genes have been cloned from

11
Cha ter I

this region: the Interleukin 9 Receptor (IL9R) (10) and a Synaptobrevin~like gene (SYBLl) (I I).
The genes identified in PAR I and P AR2 are indicated in Figure 1. In contrast to the genes in
PARI, SYBLl in PAR2 is subject to X inactivation. Even more remarkable is that theY copy
is also inactive. The (in)activation state of IL9R is not known and if this gene shows the same
pattern as SYBLl, a position effect caused by the heterochromatin on Yq may play a role in this.
Another explanation is that these -although located in the PAR- are pseudo-genes.

PAR1
.... ....
Xp22.3 Yp11.3

CSF2RA
IL3A
ANT3
Yq12
ASMT
XE?

MIC2R.:
MIC2 '

V
Xq13

Figure 1: The pseudoautosomal regions.


PARI has a size of2.6 Mb, PAR2 has a size of

.·· ··~ :.
320 kb. The pseudoautosomal regions are
........ SYBL1 / identical in X and Y. The border of PARl is an
Xq28 :.. .. .. .. .. . IL9R •
Alu repeat, the border of PAR2 is a LINE
X PAR2
repeat. XIC localised in Xq 13 is discussed
in 1.1.2.

Pairing between the X and Y chromosome during male meiosis seems to involve only
part of the short arm of the X and Y and includes an obligatory cross-over in PARI. However,
recent reports have been published in which pairing at PAR2 is described (9,12,13). The rate of
recombination in PAR2 is 168 kb/cM (14), in PARI it is 55 kb/cM (15). These rates are very
high compared to the average for the genome (1 Mb/cM), due to the fact that these regions, while

I2
Introduction

small, have an obligate cross-over (at least in PAR1).


Other regions of homology between X and Y mainly consist of pseudo-genes on the Y,
and are believed to have arisen through non-homologous pairing between the X and Y (16)
followed by inversions (2). These regions are hotspots for illegitimate recombination and are
located in Xp22.3/Yq 11.21, involving recombination between the KAL-X and KAL-Y gene (17),
in Xp22.3/Ypll, involving recombination between PKXl and PKY1 (18), and in Xp22.3/Yp11
in a region just proximal from the PAB, involving recombination between sequences (1/3 in
repeats) that have a high homology (96-98%) between X and Y (19,20).

1.1.2 X inactivation
To compensate for the unequal dose of X genes in males and females, one of the X chromosomes
is inactivated at an early stage of embryogenesis in all somatic tissues in the female. This
phenomenon is called X inactivation or lyonisation after Mary Lyon who first described it in
1961 (21). X inactivation takes place at the time of uterine implantation (22), occurs in all cells
except the germ cells and is random and maintained during all further cell divisions. This
mechanism of dosage compensation is unique to mammals (23).
Basically, all but very few genes on the inactivated chromosome (Xi) are thought to be
inactive except for the genes in PARI and the gene(s) at the X inactivation center (XIC) itself.
On the active chromosome (Xa) all genes are active except for the gene(s) of the X inactivation
center. Amongst the few genes outside PARl that escape X inactivation are for example; SMCX
(24), SB1.8 (25) and ubiquitin-activating enzyme (UBEl) (26). These genes do not have
detectable Y homologues, so their transcription levels are likely to be different in male and
female, the functional significance of this difft;:rence is not yet known. The existence of
functional homologues on the Y chromosome with widely divergent or different sequence can
not be ruled out.
The X inactivation center (XIC) has been localised to Xql3 (Figure 1) (27). X
inactivation spreads from the XIC across the chromosome. The XIST gene (specifying an X-
inactive specific transcript) maps to the XIC and is only transcribed from the Xi (28). X
inactivation is preceded by XIST expressio'l (29) and as no other genes have been found in the
XIC it is assumed that XIST is the gene causing X inactivation. XIST does not code for a protein
and only inactivates the X it is expressed from (in cis). Recent studies using XIST knockout
mice have proven that XIST is essential for X inactivation (30).

13
Chapter 1

The Xi (also known as the Barr body) replicates late in S phase and is visible as
condensed heterochromatin in the nucleus, even in G 1. Although the mechanism is not
completely clear, inactivation seems to be maintained by methylation of the 5' -region of genes
(31). Experiments with patient-derived cell lines with supernumery X chromosomes have shown
that XIC also is involved as part of a counting mechanism to ensure the appropriate activity state
of X-linked genes by allowing only one active X per two sets of autosomes (32).
When one of the X chromosomes harbours a gene that through a mutation is deleterious
to cells in a specific tissue, a skewed X inactivation is observed. This results in the presence of
-in all or most of the cells in this tissue- the non-mutant X as the active chromosome. This is not
caused by a change in the activity state of the X chromosomes but by a selection against the cells
that contain the X chromosome with the mutant gene as the Xa (for example in Incontinentia
Pigmenti (IP) (33)). The opposite effect is often found when larger deletions of the X
chromosome or X/ autosome translocations are present, in those cases the normal X chromosome
is inactivated. X/autosome translocations (34) have been found in, for example, patients with
Duchenne muscular dystrophy (DMD) (35), magnesium-dependent hypocalcemia (HSH) (36),
and Hunter disease (37).

1.1.3 Evolutionary origin of the sex chromosomes


In mammals XX individuals are female and XY individuals are male, but in birds this is the other
way around (38). In the much more distantly related D. melanogaster XX are females and XY
males, while in C. elegans XX are hermaphrodites and XO are males (39). Three different forms
of sex chromosomes are found in fish; some fish have almost identical X and Y chromosomes,
others have an X and Y which hardly recombine and finally there are fish that have lost the Y
completely (38). Obviously, many species have developed sex chromosomes independently
during evolution so there must be a strong evolutionary force pushing all these species to
solutions of similar nature but different in endpoint.
The most likely reason for the initial cosexual species (hermaphrodites) to have favoured
the evolution of separate sexes, is that self-fertilisation is more likely to produce unfit progeny
than sexual reproduction (40). Cosexual species have all genes required for both 'male' and
'female' reproductive organs located on their autosomes. A simple form of acquiring a difference
between sexes is seen in C. elegans, where missing one of these chromosomes causes male
development instead of hermaphroditic. This X-monosomy however causes non-disjunction

14
Introduction

during meiosis, resulting in non viable embryos in part of the progeny.


In general, X and Y chromosome are believed to have evolved from an autosomal pair
of chromosomes (two 'pre-XY' chromosomes). The first difference between these two
chromosomes may have been the occurrence of a large deletion or inversion on one of them,
disturbing homologous recombination locally. Once homologous recombination was disturbed,
more and more of this mutated (Y) chromosome was lost because it became prone to
rearrangements and steady loss and inactivation of genes (41). In addition, a range of pseudo-
genes originating from autosomal genes have accumulated on the human Y chromosome,
probably through retrotransposition (16). These processes have generated a chromosome which
can only have retained and/or accumulated genes that would enhance male fitness, and will
otherwise only have been selected for appropriate size for efficient meiotic segregation.
Whether the evolution of a dosage compensation system was required before the
degeneration of theY chromosome could start (42) or whether it evolved as a consequence has
not been proven. In C. elegans, transcription of both X's in hermaphrodites is reduced to -50%,
regulated by at least 8 genes and depending on X:A ratio (43). This suggests that dosage
compensation was already available, independent of the presence of a Y chromosome. Dosage
compensation in Drosophila is mainly obtained through increased expression of genes on the
male X (regulated by male-specific lethal genes on the Y), although there is also evidence for a
parallel dosage compensation pathway thit~down regulates some genes on the X in females. In
mammals dosage compensation involves inactivation of most genes on one X in females (see
1.1.2). In many species unique dosage compensation systems have evolved that allowed
development of separate sexes and thereby opened the way to evolution into a 'higher' order of
species.

The mammalian sex chromosomes The size of the X chromosome has been strongly
conserved amongst eutherian ('placental') mammals, being 5% of the haploid genome, and it is
also strongly conserved in gene content (44). Many genes on the human X chromosome have also
been localised to the X chromosome in a wide variety of other eutherian mammals. Comparison
with other therian mammals, the metatheria (marsupials) and prototheria (monotremes), shows
that Xq is part of the X in all therians. Human Xp genes, in contrast, are located autosomally in
both marsupials and monotremes. For instance, in monotremes the human Xp-linked genes
Synapsin 1 (SYN1), DNA polymerase a (POLA) and Ornithine transcarbamylase (OTC) are

15
Chapter 1

located in one block on chromosome 1, while the Xp-linked genes Dystrophin (DMD),
Synapsin1 (SYN1), Cytochrome b heavy chain (CYBB) and Monoamine Oxidase A (MAOA)
are located in one block on chromosome 2 (45) (Figure 2). In marsupials, MAOA, ZFY, OTC,
DMD, STS, POLA, SYN1 and OATLl have also been shown to be autosomal (46). This means
either that this region was lost from an ancestral X chromosome in the marsupial and monotreme
lineages or was acquired by an ancestral X in the eutherian lineage. Not much is known yet about
the gene content of the Y chromosome in other therians.

~~~A·.::.· ....................... .
ZFX
CYBB
~~~B ·.:·.~·>.;.·:·.::::::::::""""'":::::: ....... . DMD
OTC ·.. \
~~~1A··.·.··:,,.,·.·.:>·.:::,.~:
MAOA

........ ................
··········
:·.: · : .;:~.:.... . OTC
1
SYN1
POLA

2
Figure 2. Assembly of the therian X
············ chromosome. Human Xq genes are found on X
Human X ······ . . . . . . . .
in prototheria as well, human Xp genes are
X present in two blocks on prototheria
Platypus chromosomes 1 and 2.

Not only the gene content of the X chromosome is highly conserved among eutherian
mammals, so is the order. Although blocks of genes have been rearranged, the order of genes
within these blocks is conserved. These rearrangements (through several inversions) are typical
for each subclass. As an example the mouse X is compared to the human X in Figure 3.

16
Introduction

Xp22.3
Xp22.2
Xp22.1
Xp21.3
Xp21.2
Xp21.1
Xp11.4
Xp11.3

Xp11.2
Xp11.1
Xq11
Xq12
Xq13

Xq21.1
Xq21.2
8
Xq21.3

Xq22 Figure 3: Comparison of the human


Xq23 and mouse X chromosomes. Through
Xq24
Xq25
several inversions in both mouse and
Xq26 human X the order of the genes
Xq27 became different but the 9 conserved
Xq28
blocks can still be recognised. Note:
mouse chromosomes do not show
Human X Mouse X
banding

1.1.4 Deletions in Xp, contiguous deletion syndromes


Deletions and rearrangements of chromosomal regions can greatly facilitate the mapping of
disease genes. Comparison of the deletions or phenotypes in patients with contiguous deletion
syndromes can be used to assign disease genes to a distinct region. For the X chromosome,
deletion mapping has been very useful for the characterisation of several genomic regions, for
example Xp22.3 (47-49) and Xp21 (50-52) (Figure 4). In Xp22.3 amongst others, the
identification of the genes for Kallmann syndrome (53,54) and X-linked ichthyosis (STS)(55)
have been facilitated by the available patients with deletions and contiguous deletion syndromes.
Many of these deletions are thought to be a result of aberrant recombination between the X and
Y chromosome (see 1.2.1). In Xp21, for instance the genes for Duchenne muscular dystrophy
(DMD) (56), McLeod syndrome (XK) (57), and X-linked chronic granulomatous disease
(CYBB) (58) have been identified using deletion mapping.

17
Chapter 1

Xp
··
/
.· ~5Px
MRX
KAL
STS
I interstitial and terminal
deletions found in
males and females
Xp22.3 MLS, AIC, FDH - male lethal
Xp22.2

I
Xp22.1
..... AHC
Xp21.3
interstitial deletions
Xp21.2 DMD
GK found in males and
XK
Xp21.1 CYBB females
RP3
Xp11.4 ..... AIED

Xpi 1.3
Xp11.23
Xp11.22
Xp11.21
Xp11.1

Figure 4. Contiguous deletion syndromes on Xp.


Contiguous deletion syndromes on Xp have been found in Xp22.3 and in Xp2l (extending into Xp11.4).
In Xp22.3 contiguous deletion syndromes are either interstitial or terminal deletions that can involve
combinations of short stature (SS), chondrodysplasia punctata (CDPX), Kallmann syndrome (KAL),
mental retardation (MRX) and X-linked ichthyosis (STS) in both males and females. Microphthalmia with
linear skin defects (MLS), Aicardi syndrome (AIC) and focal dermal hypoplasia (FDH, also known as
Goltz syndrome) are male lethal and therefore almost exclusively found in females. The phenotypes of
these three syndromes overlap so they probably result from a defect in the same gene (64) or are due to a
contiguous deletion syndrome (65). In Xp2l contiguous deletion syndromes are interstitial deletions that
can involve combinations of Duchenne muscular dystrophy (DMD), chronic granulomatous disease
(CYBB), McLeod syndrome (XK), retinitis pigmentosa (RP), mental retardation (MRX, not indicated in
figure since location is still unclear), glycerol kinase deficiency (GK), adrenal hypoplasia congenita (AHC)
and Aland island eye disease (AIED). No contiguous deletion syndromes have been found in Xp22.1-
p22.2.

18
Introduction

In contrast, in the stretch of DNA between these two regions (Xp22.1-p22.31), deletions
are rare. Deletions found in this region are in general due to inheritance of an X/autosome
translocation and are only found in females where the phenotypic effect is either generated by
spreading of X inactivation onto the autosome, nullisomy of the missing autosomal region, or by
inactivation of the normal X, causing functional nullisomy of the deleted region (35,59). No large
terminal or interstitial deletions (other than through a translocation event) of this region have
been found. The apparent lack of large deletions in the Xp22.1-p22.31 suggests that one or more
genes may be present in this region that, when present in single copy in female, or absent in male,
would be lethal. Consistently, three syndromes in Xp22.31 have been found, microphthalmia
with linear skin defects (MLS), Aicardi syndrome (AIC) and focal dermal hypoplasia (FDH, also
known as Goltz syndrome), that appear to be male lethal.
Mutations in the genes that have been isolated so far from the Xp22.1-p22.2 region are
seldomly due to deletions and when deletions are detected these are small. In the PEX gene,
mutated in X-linked hypophosphatemic rickets (HYP), in only 4 patients out of 150 families
tested, deletions were detected that ranged in size from less than 1 kb to over 55 kb (60). In
PHKA2 (phosphorylase kinase liver a-subunit), the gene mutated in X-linked liver glycogenosis
type I and II (XLG) initial studies showed 1 deletion (of 3 bp) out of 2 XLG I families studied
and 1 deletion (of 3 bp) out of 4 XLG II families studied (61,62). In RSK2, the gene mutated in
Coffin-Lowry syndrome (CLS) an initial screen of76 families revealed two deletions of 118 bp
and -2kb respectively (63).

19
Chapter 1

1.1.5 Disease genes in Xp22.1-p22.2


Several disease genes have been localised in the Xp22.1-p22.2 region (see Figure 5), some of
which were recently found. X-linked glycogenosis type I and II (XLGI and II, MIM 306000) are
caused by mutations in PHKA2 (61,62,66), X-linked hypophosphaternic rickets (HYP, MIM
307800) by mutations in PEX (60), Coffin-Lowry syndrome (CLS, MIM 303600) by mutations
in RSK2 (63). We have focused on RS and KFSD which are discussed below.

X-Iinked juvenile retinoschisis


X-linkedjuvenile retinoschisis (RS, MIM 31270) is an eye disease that causes acuity reduction
and peripheral visual field loss, typically beginning early in life. The first report of what is now
known as RS was by Haas in 1898 (77) who reported the simultaneous findings of changes in
retina and choroid and already suggested hereditary degeneration as possible cause. The term
retinoschisis was introduced by Wilczek in 1935 (78). The first suggestion of sex-linked
inheritance however was by Sorsby in 1951 (79). The frequency is about 1:10.000 (80). Most
patients are diagnosed at school age, although pathological changes are probably already present
at birth and progression is in general slow. Folding and splitting of the macula (simulating cysts)
cause the visual acuity loss (81), intra retinal splitting through the nerve fiber layer causes the
peripheral visual field loss. Severity can range from mild acuity reduction to total blindness at
an early age due to complete retinal detachment (82,83).The RS disease gene has a high
penetrance, with variable expression between families but little variation within a family, this
phenotypic variation may be due to different mutations in one gene. Other explanations for the
phenotypic variation are differences in expression, modifying genes, or environmental factors.
No evidence for genetic heterogeneity has been found (84,85).

Figure 5. Disease gene regions in Xp22.1-p22.2.


The markers and scale (in Mb) are according to the 6'h X Chromosome Workshop (1). Markers are
indicated above the bar, known genes are indicated under the bar. SEDL= spondylo-epiphyseal dysplasia
(MIM 313400) (67), NHS= Nance-Horan syndrome (MIM 302350) (68), RP15= X-linked cone-rod
degeneration (MlM 300029) (69), DFN6= sensorineural deafness (DFN6, MIM 300066) (70), PRTS=
X-linked mental retardation with dystonic movements of the hands (MIM 309510) (71), MRX= non-
specific X-linked mental retardation (MIM 309540) (72-74), RS= X-linkedjuvenile retinoschisis (MIM
312700) (75), KFSD= keratosis fqllicularis spinulosa decalvans (KFSD, MIM 308800) (76), HSH=
hypomagnesemia with hypocalcemia (HSH, MIM 307600)(36).

20

I
Introduction

TEL
KALIG1 1O
OA1
APXL
CLCN4
PRPS2

Xp22.3

GLRA2
15 ISEDLIHSH
PIGA
MRX19
DFN6
GRPR NHS
Xp22.2 CALB3

IRS
PPEF 20
PHKA2
RSK-2
PDHA1
KFSD
PEX

DXS1226
DXS989
Xp22.1 DXS451 MRX2
SAT RP15
ZFX
PRTS
POLA

DXS1048

MRX38

DXS28

DXS1065

Xp21.3
AHC

GK

DMD
DXS1238

CEN
21
Chapter 1

Except in one case of a female with RS, who was probably a homozygote due to a
consanguineous marriage (86), all reported patients are male. Vision in female carriers is usually
normal. Although publications have stated for a long time that female carriers do not have any
symptoms of the disease (81 ,87 -90) closer examination showed abnormal cone-rod interactions
in some of the carriers (91) and peripheral lesions of the retina in 4 of 5 carriers (92). Features
as reported in these carriers however can also be found in the normal population (93).
The most characteristic clinical finding in RS are macular changes, consisting of any of
the following: splitting, radial folds, pigment dissemination and development of macular scars
(81 ). Other findings include white areas in the peripheral retina, hyperopia, liquefaction of the
vitreous body, vitreous strands, peripheral retinoschisis (in 50% of cases (94)), constricted nasal
r<visual field, subnormal ERG, and a range of rarer fmdings (81).
The biochemical defect of RS is unknown, but histopathologic and electrophysiologic
studies suggest a defect in the Muller cell (93-96) possibly an inability of these cells to remove
the extracellular potassium ions resulting from exposure to light (93,97). In normal eye
development the Miiller cell has a function as a migration determinant for retinal development.
Another theory proposes that the retinoschisis arises from delayed development of the retinal and
choroidal vasculature, causing the retina to outgrow its blood supply (98) but this does not
explain the fovea! schisis. No treatment is available, although surgical intervention is sometimes
performed with varying success rates and often leading to complete retinal detachment and other
complications (83,99,100).

Table I. RS candidate region


Genetic analyses in the RS disease gene region. A= Wieacker et al. 1983 (102), B= Alitalo et al. 1987
(103), C= Gellert et al. 1988 (90), D= Dahl et al. 1988 (90), E= Alitalo et al. 1988 (89), F= Sieving et
al. 1990 (84), G= Alitalo et al. 1991 (104), H= Kaplan et al. 1991(92), I= Oudet et al. 1992(105), K=
Bergen et al. 1993 (106), L= Biancalana et al. 1994 (107), M= George et al. 1994 (85), N= Bergen et
al. 1995 (80), 0= Pawar et al. 1995 (82), P= Shastry et al. 1996 (108), Q= Van de Vosse et al. 1996
(I 09). Marker order is according to the 6th international workshop on X chromosome mapping (110).
* indicates a marker used in lilikage analysis, e indicates the marker with the highest lod score . .A. and
T indicate a recombination between the marker and the RS disease gene. Based on the recombinations
in column M and Q the candidate region for RS is located between DXS418 and DXS999 (hatched
region). The recombinants identified in earlier studies may provide a valuable further refinement of the
region when analysed with markers that have become available more recently between DXS418 and
DXS999.

22
Introduction

A B c D E F G H I K L M N 0 p Q
Marker
Xg
*
DXS89
* *
DXS143
*
DXS85
* *• * *•• :~

DXS16
* * * * *
>i:T
* * * *
DXS9
*• *•• * *• *
DXS987
* *
DXS207
*•• * *• :f:
*• *•• *•
DXS1053
* •
DXS197
* *•
DXS43
* *• r.*• *•• .''>" .:·· *• .• ,,I;;·* * ·>·,*
. . ·/ .. ·. •. ~
''~;.:'T ;,
··~
~~;,;;.,
"F:'
·~
DXS4;fs/
;: ' ... '. ... ...,,,
I'" :'
,:

1Dxs257 ·,: .,;.. :.· •J: ··;~::~··s 1;'::}\!:! .•. I ~~~: ·i llf' ;. ' <'ti 7
B,xsQ99
: :~:{' :1.:.,. ,:.: . . ·,· ,.. I*
'·i''
~.·..~ l'f' li~ ,: *'~ ~;' '..~~
1..

DXS7161 ·•···.·
* ... ...
DXS443
* * *
DXS1229
*
DXS365
* * ... *
DXS1052
*
DXS274
*••• *••• *•• *
DXS92

DXS1226
* ...
;

DXS41
*• * * *•• *•• *•• * *• .,:k

DXS451
*
ZFX
*
DXS208
*
DXS207
*
DXS28
*
DXS164 *A* *•••
DXS206
*•
DXS84
*•
OTC
*•
23
Chapter I

The first linkage studies and recombination events using RFLP markers placed RS in
Xp22 between DXS43 and DXS41. Further studies using microsatellite markers placed the RS
locus in decreasing intervals (see Table 1), until the present localisation ofRS in a 1Mb interval
between DXS418 and DXS999 (see also Chapter 2).

Candidate genes Recently, several candidate genes for retinoschisis have been cloned by
members of the Retinoschisis Consortium (see note). The frrst two, PPEF (111) and Txp3 (112)
have been excluded as the genes mutated in RS (see Chapter 5). Two others; SCMLI (113) and
Txp7 (114) are still being tested. Recently, the complete RS candidate region has been sequenced
by the Sanger Centre based on clones provided by the Retinoschisis Consortium. Analysis of this
sequence will reveal many novel genes present in the region that can be tested as candidates for
RS.

Note: The Retinoschisis Consortium consists of the following groups:


B.Franco, A. Ballabio in Milan, Italy.
T.Alitalo, A. De la Chapelle in Helsinki, Finland.
D.Trump, J.R.W.Yates in Cambridge, United Kingdom.
W. Berger, H.H. Ropers in Berlin, Germany.
A.A.B. Bergen in Amsterdam, the Netherlands.
T.E. Darga, P.A. Sieving, Michigan, U.S.A.
E. Van de Vosse, J.T. Den Dunnen ln Leiden, the Netherlands.

Note: other forms of hereditary retinoschisis are; autosomal recessive, autosomal dominant and
some unclear hereditary forms of retinoschisis·. Acquired forms of retinoschisis; degenerative
retinoschisis, also called senile retinoschisis and secondary retinoschisis associated with various
diseases, of which diabetic retinopathy is the most common (101).

Keratosis follicularis spinulosa decalvans


Keratosis follicularis spinulosa decalvans (KFSD, MIM 308800) is an extremely rare disorder
affecting skin and eyes. Patients show hyperkeratosis (thickening) of the skin of the neck, ears,
palms and soles, loss of eyebrows, eyelashes and beard, thickening of the eyelids with blepharitis
and ectropion, corneal degeneration, photophobia and baldness (alopecia) in winding streaks. The
symptoms diminish with age. KFSD was first described by Lameris in 1905 (115). The name

24

I
Introduction

KFSD was given by Siemens (116) and the disorder has also been called Siemens syndrome.
Siemens (117) described KFSD in 1925 as the first dominant sex-linked disease, however, only
about half of the carriers show (mild) clinical symptoms, which is more suggestive of skewed
X inactivation than of KFSD being a dominant sex-linked disease.
Essentially five families have been described thus far, located in Germany (117), the
Netherlands (76), France (118), Finland (119) and the UK (120). Linkage analysis using RFLPs
placed the genefor KFSD in Xp22 between DXS 16 and DXS269 (121 ), analysis of recombinants
using microsatellite markers further refined the region to between DXS7161 and DXS 1226 (76)
(see also Chapter 2). KFSD has been reported to show genetic heterogeneity (122,123) but since
few families are available for research this may just reflect a variation in phenotype between
families.
Since there are so few families with KFSD available, no systematic efforts had been
undertaken yet to specifically clone the KFSD gene prior to this study. However, because the
KFSD candidate region overlaps with other disease candidate regions, several genes have been
cloned that can be tested as KFSD candidate genes purely based on location. The biochemical
defect in KFSD is still unknown.

1.2 Positional cloning of disease genes


To identify the molecular mechanism underlying a hereditary disease, the mutant gene needs to
be identified. If a cellular defect resulting from the mutation is identified, cloning by functional
complementation is possible. If the (defective) protein is known, its identification can lead to the
cloning of the corresponding gene. In most hereditary diseases however, neither protein nor
cellular function are known and in those cases positional cloning is used to identify the disease
gene. In the early days of gene identification, functional cloning was the only way of gene
identification. Since the mid-SO's positional cloning has rapidly taken over simply because
techniques became available that allowed the analysis of larger regions. The ideal approach is
when both functional and positional information can be used to identify a gene.
Positional cloning is usually done following a strategy (illustrated in Fig. 6) that narrows
the search from the complete genome to a small region, preferably a single gene. The first step
is genetic mapping: the region on a chromosome where the disease gene is localised is defined
by linkage and recombinant analysis (discussed in 1.2.1 ). The second step is physical mapping;
identification and isolation of genoinic clones (e.g. YAC, Pl, BAC, cosmid) that are located in

25
Chapter 1

the disease gene region, construction of contigs and assembly of a restriction map (discussed in
1.2.2). The final stage is isolation of transcripts for candidate genes amongst which the disease
gene may be present (discussed in 1.3).

Linkage analysis Isolation of clones, Digestions, Gene


and recombinants contig construction hybridisations identification

Xp22.2 Noli
Xp22.1

BssHll

Eagl

Sfil

Nrul

X Genetic map Physical map


Slit
~~/..
..
.
······•

Restriction map Transcript map

Figure 6: From chromosome to gene.

1.2.1 Genetic mapping


In order to isolate a gene through positional cloning, the genetic location of the gene needs to be
known. Cytogenetically visible chromosomal aberrations, such as translocations or large
deletions, may give a direct indication of the region. When these are not present in the patients,
systematical scanning of the genome with polymorphic markers (linkage analysis) in a subset of
families is sufficient to acquire an approximate chromosomal location. Once an approximate

26
Introduction

chromosomal localisation has been established, more refined linkage analysis is used to detect
markers close to a disease gene by measuring whether certain marker alleles are statistically more
often inherited together with the disease than the frequency of the allele in the general population
would suggest. The further two markers are apart, the more likely it becomes that a cross-over
between the two markers occurs during meiosis. The general rule for such cross-overs is that
when, amongst every 100 meioses, typically 1 recombination occurs (1% recombination): this
is an interval of 1 centiMorgan (cM). This 1 cM interval on average represents a physical region
of around 1 Mb, but this can differ greatly between regions, due to local differences in
recombination rate along our chromosomes.
In recombinant analysis, usually employed for finer localisation, individual meioses are
analysed to see whether a recombination has taken place between any of the markers and the
disease gene. Identification of recombinations located either distal or proximal to the gene is used
to reduce the candidate region.
The interval to which a disease gene can be localised using genetic mapping is limited
(124). In principle, these limitations are set by the number and distribution of the available
polymorphic markers, the distance (in cM) between these markers (affecting the chance of
detecting cross-overs), the heterozygosity of the markers and the number of available patients.
In practice, however, through the large abundance of genetic markers currently available, the
genetic mapping is mainly limited by the number of patients and hence the number of cross-overs
in the candidate region. The point at which the genetic interval in which the disease gene is
localised is small enough to start physical mapping is hard to define. Starting physical mapping
with a too large genetic interval is a waste of time and energy, while continuing genetic mapping
for too long may not provide the increased refinement of localisation due to lack of informative
recombinants. However, one should always remain alert to new patients and family extension,
i.e. classical advances, since, if a new recombinant appears, this often greatly limits the scan
region.

1.2.2 Physical mapping


Isolation of clones Once a sufficiently small interval has been established by genetic mapping
or by chromosomal aberrations, physical mapping is initiated. One physical method is the
isolation of YAC clones as these contain inserts of -1 Mb, are easy to handle and are readily
available. Markers located in the region can be used to isolate YAC clones either by hybridisation

27
Chapter 1

of gridded YAC libraries (125), PCR screening of YAC pools (126), or -since about 1993- by
screening databases (127). In regions where markers are sparse, chromosome walking can be
done using end-clones (128), jumping and linking libraries (129), Alu-PCR products of
previously isolated YACs (130) or of radiation-hybrids (131). Many whole-genome or
chromosome-specific YAC libraries are available (125,132-134).
Positive genomic clones must be rescreened and their marker content established.
Because a relative high percentage of YAC clones are derived from ligation of DNA fragments
from different genomic regions, chimerism should be checked. This is done either by fluorescent
in situ hybridisation (FISH) of the whole YAC -which will at the same time confirm its
chromosomal localisation-, by mapping YAC end clones using FISH, or by hybridisation to
panels of hybrid cell-lines. The disadvantage of using entire YACs in a FISH experiment is that
small chimeric regions may not be detected. The additional advantage of generating end clones
is that these can be used as markers in subsequent experiments. The length of the YACs is
determined by pulsed-field gel electrophoresis (PFGE).

Contig assembly A contig is assembled based on marker content, on fingerprinting of shared


restriction or PCR fragments or on a combination of the two. The oldest fingerprinting method
is based on comparison of the hybridisation patterns after restriction digestion of the clones and
hybridisation with a repetitive element (e.g. Alu, Line-1, THE) (135-137). The PCR based
methods are based on radioactive PCR using Alu specific or random primers on the clones and
analysing the PCR products after electrophoresis on a sequencing gel (138,139). All approaches
generate a unique pattern of bands for each clone that can be analysed using computer programs.
Because YAC clones frequently show qeletions, rearrangements and chimerism (132, 140)
it is important to analyse several coverages of the whole contig rather than a minimum tiling path
ofYACs. Better still is to have a contig cloned in different cloning systems (e.g. PI-clones and
BACs) to analyse clones from independent sources. An additional advantage of constructing a
contig from different cloning systems is that when certain genomic regions are unclonable or
unstable in one system they may be obtained from another system.

Restriction mapping YACs in general have inserts too large to construct detailed restriction and
transcript maps. A frequently used step to improve the resolution of a contig and to allow the
construction of a restriction map is the isolation of clones that are an order of magnitude smaller

28
Introduction

than the YAC clones in the original contig. The isolation of smaller clones can be performed by
screening of Pl- (up to 100 kb), BAC- (up to 100 kb) or cosmid (up to 40 kb) libraries (or
subcloning of the YACs into one of these vectors), or by an alternative approach, YAC
fragmentation.

···
A~-,,,,
... ··· . . . insert . ··· ... ... TRP1
llllilllilllll
-11111-----i~f--'';.;.'.;.,'-olllll-olll---~~-~~~~--~~--·..,.·11il0. ..._••__,C::-J!l!?lKI8J"IT!cy~-o;;;;ric_..:;a;.:,:m"-p-1qlllllll5'
TEL URA3 ACEN4 ars1 TEL

pBP108/ADE2
...... amp ori ,~,A ~''""'"'~
~r
TEL HIS3 ADE2 Alu

• 1111 11 c::n;;;$

• 11 Ill r· ~

• Ill Ill ,-~


~~---·--____.,
~~~•P-••~•--~m.---~-~
11 11 L_~

~'i>-ltlll-tllll.--lr_~~------1211l4lllll!lllllt

Figure 7: Principle of Y AC fragmentation.


Upon transformation of yeast containing a YAC with plasmid pBP108/ADE2, homologous recombination
between an Alu in the YAC and the Alu in pBP108/ADE2 will occur in part of the yeast cells. Growing
of the yeast on medium lacking tryptophan and adenine allows selection of fragmented YACs (which
contain both ADE2 and TRP 1). A panel of fragmented YACs with various insert sizes is thus generated.

YAC fragmentation Since YAC fragmentation was first described by Pavan et al. in 1990
(141), several improvements of the YAC fragmentation vectors (142,143) made it a rapid and
simple way of generating a panel of clones of decreasing size that can be used for clustering of
markers and clones to defined, consecutive regions ('binning') and restriction mapping. YAC

29
Chapter 1

fragmentation is based on the homologous recombination between a repeat in the YAC insert and
a repeat in a YAC-vector arm containing a selectable marker not present in the original vector
arms. After the recombination the replaced vector arm plus part of the insert is lost and a smaller
YAC is obtained (Figure 7). A panel of fragmented YACs can be used to generate a restriction
map without having to use partial digestions of YACs that are usually difficult to interpret (144 ).
Panels of fragmented YACs have also been used to delimit a duplicated chromosomal region
(145) and to refine translocation breakpoints (146).

1.3 Identification of transcripts


Methods to identify transcripts can roughly be divided in transcript dependent (cDNA based) and
independent (genomic DNA based) techniques. Not one technique is capable of identifying all
genes in a region, so two or more complementary techniques are required to construct a complete
transcription map.
The advantage of cDNA based techniques is that when a cDNA is identified this will
immediately tell something about the tissue and stage it is expressed in and it is proof that the
region is transcribed. The quality of the cDNA is very important, the presence of genomic DNA,
incompletely processed RNA and rRNA should be avoided (by polyA+selection).
The advantage of genomic DNA based techniques is that they are independent of the time
and tissue of transcription, thus enabling the isolation of genes expressed only transiently, in a
specific subset of cells, or at extremely low levels just as well as genes that are expressed
ubiquitously and/or at high levels.

1.3.1 cDNA based gene identification


To identify a gene based on cDNA can be done following three different approaches; screening
of cDNA libraries, cDNA selection and transcript sequencing. The choice of approach mainly
depends on the goal.

Screening cDNA libraries Screening of cDNA libraries is used to identify a specific gene. The
screening is usually done with a specific probe, for instance a genomic fragment deleted in
patients or evolutionary conserved. The technique is simple, as it requires only hybridisation of
a probe to cDNA filters. Many (gridded) cDNA libraries are available and these have been
generated from a range of different tissues and developmental stages. Alternatively, one may

30
Introduction

generate a new cDNA library from mRNA of any desired tissue. A complementary approach can
be used to identify genes from a more complex source. This involves the hybridisation of
radiolabeled cDNAs (from oligo(dT) primed RNA) to arrays of genomic clones to identify the
clones that contain genes (147) and use those for further analysis.

cDNA selection cDNA selection is the more common approach to isolate transcripts from a
large region and is often used to generate a transcript map from a contig. The first two articles
on cDNA selection were published simultaneously by Lovett et al. ( 148) and Parirnoo et al. ( 149)
in 1991. Several variations have been published since (150,151) but all are based on
hybridisation of cDNA to immobilised DNA, elution, amplification and subsequent rounds of
hybridisation to emich for specifically binding cDNA (Figure 8). The genomic target DNA can
be derived from YAC, P1, BAC and cosmid clones. Clones propagated in bacteria have the

Genomic DNA cDNA

~~
~::;
Immobilise I .
on filter t ' Block
repeats
~~
Hybcidise / ~-~':::::::
'
Elute and amplify
Remove non-specific
cDNAs by washing

specific cDNAs

More amplification rounds Clone cDNA

Figure 8: cDNA selection

31
Chapter 1

advantage of generating less background than YAC clones. cDNA selection has led to the
· identification of a variety of novel genes amongst which the disease genes for glycerol kinase
deficiency (GKD) (150), hereditary breast and ovarian cancer (BRCAl) (152) and Wiskott-
Aldrich syndrome (WAS) (153).
The disadvantage of cDNA selection is that during the selection not only genuine
transcripts but also pseudo-genes and homologous genes will be isolated that are located in a
different region (usually on another chromosome). On the other hand, these 'artefacts' can be
used to specifically isolate members of a gene family (or the 'parental' gene to a pseudo-gene).
cDNA selection is further limited by the abundance of a transcript in a cDNA library, transcripts
that are present at less than 0.01% are unlikely to be selected for (154).

Transcript sequencing Sequencing random transcripts is not an approach to isolate genes in


a specific region but an approach to isolate all genes present in the genome and one of the major
goals of the Human Genome Project (155). From both the 5' and 3' ends of each transcript a
sequence (200-400 bp) is generated, called an expressed sequence tag (EST). All ESTs are
deposited in a specific database; dbEST, and can thus be screened using sequences from the
region of interest. ESTs that have already been assembled into contigs are present in a separate
database called Unigene. These in silica cloned genes of course need to be verified as to whether
they are derived from the region of interest (and not a homologue on another chromosome) and
whether they are not constructed from two separate genes that happen to share a domain and are
thus 'software-merged' into an overlapping transcript. Recently, many genes have been
identified by in silica cloning, i.e. defined by comparative software analysis, based on homology
to a gene in another species, like the human,thymic shared Ag-1/stem cell Ag-2 gene (TSA-
1/SCA-2) that was identified based on the mouse homologue (156), and two human peroxisome
biogenesis disorder genes (PXR1 and PXAAA1) as the yeast PAS8 and PASS (peroxisome
assembly genes) homologs (157). At least one group has started to systematically compare all
known phenotype-causing genes in one species (D. melanogaster) to human ESTs, in order to
define in silica all homologues and thus to identify potential candidate diseases for these genes
based on their genomic localisation and a potential correspondence of association between
phenotypes of -in this case- Drosophila and human (158).

32
Introduction

1.3.2 Genomic DNA based gene identification


The four DNA based techniques that can be used to identify genes are evolutionary conservation,
isolation of CpG islands, exon trapping and genomic sequencing.

Evolutionary conservation Functionally important regions in the genome (for instance exons
and regulatory sequences) are conserved through evolution. Thus evolutionary conserved
sequences are likely to be an element of a gene. Evolutionary conservation is detected by
hybridisation of DNA of different species to one another.
Analysis of evolutionary conservation is frequently used to test the presence of a gene
by hybridisation to so-called 'zoo-blots'. Zoo-blots are blots containing DNA from a range of
species, typically DNA of mammals (for instance human, ape, bovine, rodent), birds, fish, etc.
Hybridisation of a genomic fragment to such a blot gives an indication of the extent to which the
fragment is conserved. Although a gene like the DMD-gene was discovered using this approach,
it is a laborious method and is less suitable for analysis of large regions.
A suitable approach for larger scale analyses is the comparison with only one other
species using a protocol similar to cDNA selection. Several rounds of hybridisation and
amplification of genomic DNA from another species to immobilised or biotinylated genomic
DNA of the region of interest will enrich for the conserved sequences (159). Unlike the zoo-blot
method this does not only give an indication of conservation but also provides the homologous
region as an actual clone for further analysis.

[solation of CpG islands A CpG island is a relative short stretch of a G+C rich region (up to
~ kb) in which the frequency of (unmethylated) <;pG nucleotides is significantly higher than
~lsewhere in genomic DNA. About 60% of human genes are associated with CpG islands. They
~re typically located at the 5' -ends of genes in, or close to the promoter region and often include
he first exon of a gene. CpG islands can be identified by restriction enzymes that recognise
tretches of C and G nucleotides and at least one CpG (these restriction enzymes are known as
are cutter enzymes). A CpG island contains a cluster of these restriction sites while normally in
enomic DNA these restriction sites are widely spaced (10s or lOOs ofkb apart). CpG islands are
!so called HTF islands because the enzyme which revealed these sequences, HpaTJ., produced
pall tiny fragments (HTF) (160). There are an estimated 45,000 CpG islands present in the
human genome (161).

33
Chapter 1

Isolation of CpG islands is applicable to both large and small regions and different
techniques can be used. The easiest involves the subcloning of any source DNA using one rare
cutter enzyme and one frequent cutter enzyme and ligating these in a plasmid vector. In this way
a transcription map has been succesfully generated from the Huntington disease gene region
(162) where 24 out of 42 clones contained putative exons and three novel genes were isolated.
A slightly different method involves digestion of the source DNA with a rare cutter enzyme and
ligating linkers to the digested DNA. PCR using one primer directed at the linker and one primer
directed atAlu-repeats will allow the amplification of the CpG islands (163).
A second, more elaborate, approach involves denaturing gradient gel electrophoresis
(DGGE) and is based on the difference in melting temperature between regions with a normal
and a high G+C content. In a denaturing gradient gel, fragments that are G+C rich will melt later
than G+C poor fragments and will therefore have a higher electrophoretic mobility. These faster
fragments can than be isolated from the gel and Cloned. The source DNA needs to be digested
using several enzymes (in this case: Msel, Tsp509I, Nlaiil and Bfal) to provide fragments of
appropriate size prior to DGGE (164).
A third approach involves the digestion of the source DNA with a frequent cutter that
leaves CpG islands in tact (Msel) after which the CpG islands are isolated using a column that
specifically binds methylated DNA (165).

Exon trapping In vivo identification of splice acceptor and splice donor sites, or 'exon trapping'
as it was first described by Auch and Reth in 1990 (166), has been used for both small and large
scale gene identification. Genomic fragments are cloned into a vector containing an exon trap
cassette (a gene preceded by a strong promoter and with a multiple cloning site introduced in one
of its introns) and subsequently transfected into a cell line. Transcription of the ex on trap cassette
gene will incorporate exons present in the genomic fragment, which will then (in principle) be
included in the subsequent splicing (Figure 9). Reverse transcription (RT) and PCR on the RNA
isolated from the cell-line will reveal the trapped exons which can be used for further analysis.
Depending on the vector chosen and RT-PCR protocol applied, one can isolate internal, 3'-
terminal or 5' -terminal exons. Internal exons can be trapped using several vectors (166-171). 3 '-
terminal exons can be trapped using pTAG4 (172). One system allows both internal and 3'-exon
trapping; pETV-SD2 (173,174).
The major disadvantage of exon trapping is that it is sensitive to artefacts: spliced

34
Introduction

products, involving cryptic splice sites and sequences with fortuitous homology to splice sites,
as well as non-specific polyT/polyT primed products (in 3 '-terminal exon trapping). The exon
trap vectors mentioned above can only contain plasmid size inserts, thereby allowing at best one
or only a few exons to be trapped in one product. Moreover, the loss of the genomic context
through the small insert size gives rise to the isolation of sequences which are recognised as
exons although in nature they are intronic or never even transcribed. Furthermore, the order in
which the trapped exons are present in the genome is lost, which makes further analysis tedious.
Since an average internal exon has a length of 137 bp (175) the products tend to be small and not
especially suitable for screening cDNA libraries or databases.
To overcome these prQblems, two vectors have recently been developed that can contain
larger inserts and will thus allow the simultaneous trapping of multiple exons, leaving the order
intact. The exon trap products generated with these vectors make further analysis easier. These
vectors are the sCOGH-vectors ( 176) allowing the isolation of internal, 3 '-terminal and 5'-
terminal exons and pTAG5 (177) suitable for 3'-terminal exon trapping. The most well known
gene that has been identified using exon trapping is IT15, the Huntington disease gene (178) .

. insert ,insert.
mMT1

r ••..·····~
/ ·· ..

ru ~··...••• r. . . . . . . .
mMT1 / ·· ..
· · · · ·...• ••
GH1 GH2 GH3 GH4GH5 GH1 GH2 ~ ~ ~ GH3 GH4GH5

l Transfection into cells


In vivo transcription
RNA isolation
l Transfection into cells
In vivo transcription
RNA isolation

GH1GH2 GH3toGH5 GH1 to GH5

Exon trap product Empty product

Figure 9. Exon trapping using sCOGH2.


Inserts are cloned in a multiple cloning site in intron 2 of the human growth hormone gene (GH). After
transfection of DNA from the clone into a cell line, in vivo transcription of the GH gene will incorporate
exons from the insert present in the same orientation as the growth hormone gene into the exon trap
product. When no exons (or exons in the wrong orientation) are present in the insert an empty product will
result. mMTl= mouse metallothioneine gene promoter. GHl-5= human growth hormone exons.

35
Chapter 1

Genomic sequencing The most detailed information of any region is obtained by sequencing
it completely. First one can perform database searches to find homologies with known genes or
ESTs and in addition one can use computer programs to predict the location of genes. Large scale
genomic sequencing has been undertaken to analyse large regions of DNA that are covered with
contigs ( 179) and ultimately the complete human genome will be sequenced as part of the Human
Genome Project.
Before searching databases with long genomic sequences, repeat masking is essential to
prevent the analysis of large series of repetitive sequences. Subsequent database searches can
involve comparison with all known nucleotide or protein sequences, or subsets that are for
instance species-specific, contain only functional motifs, or contain only new sequences (180).
A selection of these database search programs and their application is presented in Table 2. A
range of programs is available to further analyse the output of these searches, many of which
have been adapted for specific projects. Most programs are accessible through email or via the
World Wide Web (WWW) (181).

Further analysis of the sequence can involve a number of computer programs. Programs
predicting exons, open reading frames (ORFs), promotors, and assembling potential genes are
called gene structure prediction programs. An overview of available gene structure prediction
programs is given in Table 3.

Table 2: Database search programs


" = sequences are translated into 6 reading frames before searching.
b =Blitz is also known as SSEARCH or as the Smith-Waterman method.
' = Nucleotide sequence databases: Genbank, dbEST, Unigene, EMBL, DBBJ, HTGS, dbSTS, or a
locally generated database.
d =Protein sequence databases: PIR, SWISS-Prot, GenPept, PDB.
'=Pattern databases: EC pattern, PIMA, PROBMIN, BLOCKS, PRINTS, PIR-ALN, FSSP, PROSITE,
PRODOM, Sbase.
f = Many options are available to search species-specifiC sequences or only new entries.

36
. -·
Funcf ·-~·-·. .p ''-~ ......... • • 'r-' ... ~ .............................. ..... ...........................................................
Oatab ...Oatab IN
...... " ....................................................

repeat masking XBLAST nucleotide nucleotide REPBASE database

REPEATMASKER nucleotide nucleotide REPEATMASKER database

general database search' FAST A nucleotide nucleotide c. improvement of FASTP


c
BLASTN nucleotide nucleotide

BLASTX translated nucleotide• protein d, post-processing: BEAUTY

BLASTP protein protein d, post-processing: BEAUTY


c
TBLASTN protein translated nucleotide•
c
TBLASTX nucleotide• translated nucleotide•
d
Blitzb protein protein
c
Automat nucleotide/protein nucleotide/protein
c,d
Staden nucleotide/protein nucleotide/protein

search new entries in XREFdb nucleotide nucleotidec monthly automatic search

databases FastAiert nucleotide nucleotidec regular automatic search


e
search pattern database FASTA-PAT protein protein motif
e
for protein motifs FASTASWAP protein protein motif

PROTOMAT protein/nucleotide• protein motif BLOCKS & PROSITE database


e
ProfileScan protein protein motif
e
MOTIFINO protein protein motif

MacPattern protein protein motif PROSITE database

generate 20/30 model of Swiss-Model protein proteins of known structure ExPOB,ExNRL-30 databases ~
protein structure post-processing: ProMod g-
ProteinPredict protein protein SWISS-Prot database f2"
w
--
":::t.
-..]
~

~!'#t:c:tY~3ffr)fw;•J;.:-~-'?-:!;p;,,§&iWWi:"A94M&hi1¥!1£!\1!4k4$W,9iiiiit,,i$W!lk# 1MMM&MM99L.\M;:;;;;::;;;;; &t© 42 .M _;;::;;,; Ji :; .WM& answ; 4#it ;;u;;


~
w training set: predictions: sequence length:
00
t'rogram year spec1es genes Spl. UHt" gene aos· max LSnownJ no1es
Testcode 1982 diverse 570d - + - - n.s. .........~
GeneModeler 1990 C. elegans 5 + + + - no (50 kb) integrated approach"·b

Gelfand method 1990 mammalian 9 + + + - n.s. (6.5 kb)

NetGene 1991 human 95 + + - - n.s. i

GRAILI 1991 human 18 - + - - 100 kb better on long (>100 bp) exons !

GeneiD 1992 vertebrate 169 + + + - 20 kb integrated approacha.b,r I

SORFIND 1992 human 116 + + - - 32 kb predicts internal exons only"·' I

Geneparser 1993 human 56 + + - - no predicts internal exons only"·'

Genviewer 1993 vertebrate

GeneMark 1993 prokaryote .. - + - -

SITEVIDEO 1993 human n.s. + - - - n.s. (22 kb)

GREAT 1993 vertebrate ,,


GenLang 1994 diverse 32 + + + - n.s. (20 kb) integrated approach•.b.f

GRAILII 1994 human n.s. + + - - 100 kb output used in GAP31

GAP3 1994 human n.s. - + + - uses output of GRAILII

FGENEH 1994 human 461 + + + - n.s. output is assembled gene onlyb·'


f
Xpound 1994 human + + - -
Geneparser 3 1995 human 59 + + + + no integrated approacha,b,f

PromoterScan 1995 primate 167 - - - - n.s. predicts only polymerasell promoters


f
Gene ID+ 1996 vertebrate + + + + 8 kbe
Introduction

Table 3: Gene structure prediction programs


"=Integrated approach includes analysis of initiation signal, stopcodon, polyA signal (AATAAA) and
promoter (TATA box).
b = This approach is not especially suitable when more than one gene is present, when overlapping genes
are present or to detect alternative·splicing.
' = Approach especially suitable for partial sequences or when more than one gene is present in the
sequence.
ct =Short sequences, not genes (321 coding, 249 non-coding).
e = May have improved.
r =Used in comparison by Burset et al.(!82) see text.
g = Database searches are used to compare ORF with existing proteins, output not shown.

Extensive comparison of a subset of the gene structure prediction programs (indicated in


Table 3) by Burset and Guig6 (182), showed that 33- 51% of exons are predicted perfect (with
exact splice boundaries), 22- 36% of exons are totally missed, 13 - 27% of predicted exons are
completely wrong. A slightly different evaluation method looks at overlap between actual and
predicted exons, this ranges from 62- 71%. Programs that also predict an amino acid sequence,
generate proteins that show 52- 62% similarity to the actual protein sequence.
The accuracies of the predictions were lower using only new sequences than when using
sequences that were partly available in the databases at the time the programs were trained.
Furthermore, the programs seem to perform worse on long stretches than on short stretches which
will be a problem when large-scale sequence analysis is needed (183).
It is important to realise that the programs bave different, complementary strengths and
the choice of programs depends on emphasis (sensitivity or specificity) and desired features.
However, these programs generate predictions which must be verified as no prediction program
so far is capable of predicting all exons of a gene accurately, and all have a significant false
positive rate. The programs develop rapidly however, and are likely to improve constantly, but
will always stay one or more steps behind of the evolving needs of genomic research.

39
Chapter 1

1.4 Testing candidate genes

Once a gene has been identified it can be tested as a candidate gene for a specific disorder. The
techniques that are available to test a candidate gene are once again complementary, no single
technique can identify all mutations in a given gene. These techniques can be either DNA based
or RNA based. Once it has been proven that mutations in the candidate gene cause the disease,
the same techniques can be used for further mutation analysis. The choice of technique for
mutation analysis depends on the mutation spectrum of the gene, the size of the gene and the
number and size of the exons. The most common techniques are described below, other less
frequently used techniques have been described by R.q.H. Cotton, 1997 (184).

Hybridisation Hybridisation of a gene- or, in an earlier stage of the identification of the gene,
of a genomic fragment- to Southern blots with digested DNA of patients, will reveal genetic
rearrangements caused by deletions, duplications, inversions, or nucleotide changes that alter
restriction sites. Aberrant fragments involving larger genomic regions can best be identified using
pulsed-field gel electrophoresis (185).

Sequencing Sequencing is sometimes chosen, especially for smaller genes, in an initial stage
-when no information is available about the type of mutations to expect- to identify the first
mutations. To reduce the work load, sequencing of a candidate gene in a few patients is usually
performed on RNA-derived material but can also be performed on genomic DNA when a small
gene is involved.

Single Strand Conformation Analysis (SSCA) SSCA is based on the conformational change
of denatured DNA induced by a mutation. In short, DNA fragments (typically 200-400 bp) are
amplified using specific primers in a PCR and these fragments are denatured and run on a
polyacrylamide gel. Missing or smaller products may indicate a deletion of (part of) the fragment
or a mutation at one of the primer sites. The mobility changes are caused by single nucleotide
changes or small deletions. Since analysis of the full length of the genomic DNA of a given gene
is laborious, SSCA primers are usually designed in intron sequences flanking exons to amplify
specifically the latter. A major disadvantage of this is that many mutations in introns that alter
splice sites will be missed (186). Also, rearrangements will be missed in which the exons are

40
Introduction

present, but no longer in the right place or orientation. This was found for the factor VIII gene,
which is inverted in a large fraction(> 40%) of severe hemophilia A patients (187).

Denaturing Gradient Gel Electrophoresis (DGGE) In short, (double-stranded) PCR products


are separated on a polyacrylamide gel with an increasing temperature or an increasing
concentration of denaturant (urea/formarnide). When the temperature or concentration of
denaturant in the gel has been reached at which the low-temperature melting domain will become
single-stranded, the electrophoretic mobility of the product is greatly reduced. The precise
conditions in which this happens and thus the precise position in the gel are highly dependent on
the specific nucleotide sequence. Any change in this by a mutation is likely to cause an altered
migration. Due to the PCR-process, when generating the fragments in heterozygous samples,
besides the normal and mutant strands also heteroduplexes are generated. These are even less
stable and their gel position tends to differ even between the heteroduplexes with normal and
mutant sequence in the two different strands. However, only mutations in the low-temperature
melting domain of a PCR product can be detected. In order to analyse the original high-
temperature domain as a low-temperature domain, a new high-temperature domain is created
by addition of a 'GC-clamp' (by a GC-rich tail on one of the primers) that will alter the melting
characteristics of the product (194). Single base mutations and small deletions will cause an
increase or decrease in the melting temperature that can be detected as a product that runs higher
or lower in the gel than the wild type product.

RT-PCR and protein truncation test The Protein Truncation Test (PTT) (188) is a technique
based on the analysis of the encoded protein. cDNA is generated by reverse transcription (RT)
of patient-derived RNA The cDNA is amplified using PCR to generate stretches of 1-2 kb. One
of the primers used for the PCR contains a transcription initiation signal and a T7 promotor ( 189)
to facilitate transcription and the subsequent translation (Figure 10). The first check for
aberrations is done by running the PCR products on an agarose gel.
In vitro translated products, analysed on SDS/PAGE gel, reveal mutations that directly
or indirectly (through a frame-shift) cause a premature translation termination. This technique
has been successfully used to identify a large number of mutations in hereditary cancers, e.g. the
gene for hereditary breast and ovarian cancer (BRCA1) (190) and adenomatous polyposis coli
(APC) (191). Also, PTT has allowed to find the first protein truncating mutation in CBP in

41
Cha ter 1

Rubinstein-Taybi syndrome (RTS) patients and thus to unambiguously implicate CBP in causing
RTS (192).
One of the requirements for the application of this technique is that the gene is transcribed
in the tissue that is used for RNA isolation (usually lymphocytes). However, illegitimate
transcription (193) of many genes normally not expressed in lymphocytes often provide enough
transcripts to produce a PCR product.

A B
RNA

• •
RNA ---·····~······
Reverse
transcription
In frame deletion
DNA Protein-.. . _. . .

~ • PCR (one primer


with T7-tail)
RNA

~
7;,_

• Protein • Stop mutation

• Transcription


mRNA RNA


AUG

Translation Frame shift


Protein Protein

Figure 10. PTT principle.


A. RNA is reverse transcribed into cDNA. The cDNA is then amplified using a primer containing a T7 -tail
and a translation initiation signal. In vitro transcription and translation results in a protein that can be
analysed on a SDS/PAGE gel. B. In frame deletions in the RNA will result in a decreased protein size,
the decrease proportional to the size of the deletion. Stop mutations will result in a smaller protein, the size
of the protein depending on the position of the stop mutation. Frame shift mutations usually result in a
premature stop downstream thus also generating a smaller protein.

42
Discussion

1.5 DISCUSSION

Since the beginning of the 90s, major improvements have been made to existing gene
identification techniques but no revolutionary new techniques have been developed to identify
genes in the human genome. Most groups are using a combination of the available techniques,
usually including either ex on trapping or cDNA selection and gratefully make use of the rapidly
expanding EST database, which is of enormous assistance in the isolation of disease genes (195).
New genes are found and published every week. In 1996, in the monthly journal Nature Genetics
alone, up to 12 new genes were published per month (in total 105), on average 6-7 monthly.
The Human Genome Project (HGP), started in February 1988 with the National Research
Council (NRC) report: 'Mapping and Sequencing of the Human Genome' (155). The three main
objectives for the years 1990-2005 of the HGP were: 1) to improve the research infrastructure
of human genetics, 2) to help establish DNA sequence as the primary interface between
knowledge of human biology and knowledge of the biology of model organisms 3) to launch an
open-ended effort to improve the analytical biochemistry of DNA.
To reach these goals the HGP aimed to develop genetic and physical maps of the human
genome and to sequence the human genome by the year 2005. In addition genetic and physical
maps of mice, worm, flies and yeast would be developed as these are valuable model organisms
for studying development, diseases and treatments. In the early years there was considerable
skepticism about whether the available technology would be adequate. Technical advances
however, especially in PCR, FISH and YAC cloning, have been so great that the speed of genetic
and physical mapping has rapidly gone up (196).
Since the start of the HGP, many human genome maps have been published based on
genetic or physical data or an integration of both (127,137,197-204). Systematic sequencing of
contigs has already generated many megabases of human sequence (44 Mb at 28/2/97, 170 Mb
predicted at 28/2/98 (205)) and it will not be long before the complete sequence of the human
genome will be available. That is not however, the end of the human genome analysis, but merely
a step along the way toward understanding of the genes and their functions.
In conclusion, the time when the genome sequence of many organisms will be publicly
available to boost biological research, is not far away. In a few years therefore, in what tends to
be called the 'post-genome era', the focus of many groups will transcend from building contigs
and transcript maps to the (large scale) functional analysis of the genes that have been identified

43
Chapter I

in silica. Developmental expression patterns, differentially spliced products, protein folding and
processing, protein-protein interactions, biochemical pathways and regulatory networks need to
be analysed.

The impact of these developments is already fundamentally altering positional cloning.


Genome sequence based approaches will more and more replace parallel and serial transcript
mapping on constructed contigs. Thus, the elucidation of the gene causing RS seems not far off.
While two novel RS candidate genes have recently been cloned 'the old way', meanwhile, using
a clone contig provided by the Retinoschisis Consortium, the candidate region for RS has
recently been rapidly sequenced by the Sanger Centre (Hinxton, UK) and made available on the
internet, thus providing the kick-off for using novel in silica approaches. So it seems only a
matter of time to prove which of the genes cloned or predicted in the region is mutated in RS
patients.
The candidate region for KFSD has not been sequenced yet. Several genes in the region
have been cloned that have not even been tested yet as candidates. The identification of the
KFSD gene will take longer primarily because it will be harder to prove which gene causes the
disorder since only so few families are available .
Once found, the functional analysis of the RS and KFSD genes will lead to an
understanding of the mechanisms underlying these diseases. As RS only involves the eye, there
is good hope for a potential therapy once developed, as the retina may be more easily accessible
for delivery of normal genes through viral vectors (206). KFSD is mainly a skin disorder, which
also makes it more easily accessible than the organs affected in several other severe genetic
diseases, like skeletal muscle in muscul¥ dystrophies or the immune system in
immunodeficiencies. However, much biological research is still required and it is even possible
that the outcome of the genome-based research will allow the development of pharmacological
rather than genetic means of intervention.

44
Outline

1.6 OUTLINE OF THE THESIS

In this study we aimed at the positional cloning of disease genes in Xp22.1-p22.2. To this end
we constructed a YAC contig covering this region including the markers from DXS414 to
DXS451. The contig enabled us to order new markers in the region and to refine the localisation
of the genes for X-linked juvenile retinoschisis (RS) and for keratosis follicularis spinulosa
decalvans (KFSD) (Chapter 2.1). One of the key YACs in the RS candidate region was used in
YAC fragmentation experiments to generate a panel of fragmented YACs used for 'binning'
clones and to construct a 2.5 Mb restriction map ofthis region (Chapter 2.2).
To identify candidate genes for the diseases localised in this region we have applied ex on
trapping. Initially, exon trapping was performed with cosmids subcloned in the vector pSPL3,
isolating -among others- exons from a known gene: the liver a-subunit of phosphorylase kinase
(PHKA2). To analyse larger genomic regions more efficiently, we have designed a new ex on trap
vector (sCOGH2) which allows the direct analysis of cosmid-size clones (Chapter 3). We have
subcloned key YACs from the region into the sCOGH vectors and used these in ex on trap
experiments. We have isolated several novel transcripts from the region, of which one was
analysed in detail (Chapter 4.1).
Two genes, PPEF and Txp3, which have been isolated by members of the Retinoschisis
Consortium were tested as candidate genes for RS (Chapter 5) but no mutations could be detected
thus far.

45
Chapter 1

1.7 REFERENCES

Nelson, D.L., Ballabio, A., Cremers, F., Monaco, A.P., and Schlessinger, D. Report of the sixth
international workshop on X chromosome mapping 1995. Cytogenet. Cell Genet. 71 :308-342 (1995).
2 Affara, N., Bishop, C., Brown, W., Cooke, H., Davey, P., Ellis, N., Graves, J.M., Jones, M.,
Mitchell, M., Rappold, G., Tyler-Smith, C., Yen, P., and Lau, Y.P. Report of the second
international workshop on Y chromosome mapping 1995. Cytogenet.Cell Genet. 73:33-76 (1996).
3 Sinclair, A.H., Berta, P., Palmer, M.S., Hawkins, J.R., Griffiths, B.L., Smith, M.J., Foster, J.W.,
Frischauf, A.M., Lovell-Badge, R., and Goodfellow, P.N. A gene from the human sex-determining
region encodes a protein with homology to a conserved DNA-binding motif. Nature 346:240-244
(1990).
4 Reijo, R., Lee, T.Y., Salo, P., Alagappan, R., Brown, L.G., Rosenberg, M., Rozen, S., Jaffe, T.,
Straus, D., Hovatta, 0., De la Chapelle, A., Silber, S., and Page, D.C. Diverse spermatogenic defects
in humans caused by Y chromosome deletions encompassing a novel RNA-binding protein gene.
Nat.Genet. 10:383-393 (1995).
5 Saxena, R., Brown, L.G., Hawkins, T., Alagappan, R.K., Skaletsky, H., Reeve, M.P., Reijo, R.,
Rozen, S., Dinulos, M.B., Disteche, C.M., and Page, D.C. The DAZ gene cluster on the human Y
chromosome arose from an autosomal gene that was transposed, repeatedly amplified and pruned.
Nat.Genet. 14:292-299 (1996).
6 Henegariu, 0., Hirschmann, P., Kilian, K., Kirsch, S., Lengauer, C., Maiwald, R., Mielke, K., and
Vogt, P. Rapid screening of the Y chromosome in idiopathic sterile men, diagnostic for deletions
in AZF, a genetic Y factor expressed during spermatogenesis. Andrologia 26:97-106 (1994).
7 Arnemann, J., Jakubiczka, S., Thuring, S., and Schmidtke, J. Cloning and sequence analysis of a
human Y-chromosome derived, testicular cDNA, TSPY. Genomics 11: 108-114 ( 1991).
8 Chandley, A.C. and Cooke, H.J. Human male infertility--Y-linked genes and spermatogenesis.
Hum.Mol.Genet. 3:1449-1452 (1994).
9 Freije, D., Helms, C., Watson, M.S., and Donis-Keller, H. Identification of a second
pseudoautosomal region near the Xq and Yq telomeres. Science 258:1784-1787 (1992).
10 Renauld, J.C., Duez, C., Kermouni, A., Houssiau, F., Uyttenhove, C., Van Roost, E., and Van Snick,
J. Expression cloning of the murine and human interleukin 9 receptor cDNAs.
Proc.Natl.Acad.Sci. USA 89:5690-5694 (1992).
11 D'Esposito, M., Ciccodicola, A., Gianfrancesco, F., Esposito, T., Flagiello, L., Mazzarella, R.,
Schlessinger, D., and D'Urso, M. A synaptobrevin-like gene in the Xq28 pseudoautosomal region
undergoes X inactivation. Nat.Genet. 13:227-229 (1996).
12 Speed, R.M. and Chandley, A.C. Prophase of meisosis in human spermatocytes analysed by EM
microspreading in infertile men and their controls and comparisons with human oocytes.
Hum.Genet. 84:547(1990).
13 Schmitt, K., Lazzeroni, L.C., Foote, S., Vollrath, D., Fisher, E.M., Goriada, T.M., Lange, K., Page,
D.C., and Arnheim, N. Multipoint linkage map of the human pseudoautosomal region, based on
single-sperm typing: do double crossovers occur during male meisosis. Am.J.Hum. Genet.
55:423-430 (1994).

46
References

14 Li, L. and Hamer, D.H. Recombination and allelic association in the Xq/Yq homology region.
Hum.Mol.Genet. 4:2013-2016 (1995).
15 Rouyer, F., Simm1er, M.C., Johnsson, C., Vergnaud, G., Cooke, H.J., and Weissenbach, J. A
gradient of sex linkage in the pseudoautosomal region of the human sex chromosomes. Nature
319:291-295 (1986).
16 O'Reilly, A.J., Affara, N.A., Simpson, E., Chandler, P., Goulmy, E., and Ferguson-Smith, M.A. A
molecular deletion map of the Y chromosome long arm defining X and autosomal homologous
regions and the localisation of the HYA locus to the proximal region of the Yq euchromatin.
Hum.Mol.Genet. 1:379-385 (1992).
17 Guioli, S., Incerti, B., Zanaria, E., Bardoni, B.,,Franco, B., Taylor, K., Ballabio, A., and Camerino,
G. Kallmann syndrome due to a translocation resulting in a X/Y fusion gene. Nat. Genet. 1:337-340
(1992).
18 Klink, A., Schiebel, K., Winkelmann, M., Rao, E., Horsthemke, B., Ludecke, H.J., Claussen, U.,
Scherer, G., and Rappold, G. The human protein kinase gene PKX1 on Xp22.3 displays Xp/Yp
homology and is a site of chromosomal instability. Hum.Mol.Genet. 4:869-878 (1995).
19 Weil, D., Wang, I., Dietrich, A., Poustka, A., Weissenbach, J., and Petit, C. Highly homologous loci
on the X and Y chromosomes are hot-spots for ectopic recombinations leading to XX maleness.
Nat.Genet. 7:414-419 (1994).
20 Wang, I., Weil, D., Levilliers, J., Affara, N.A., De la Chapelle, A., and Petit, C. Prevalence and
molecular analysis of two hot spots for ectopic recombination leading to XX maleness. Genomics
28:52-58 (1995).
21 Lyon, M.F. Gene action in the X-chromosome of the mouse (mus musculus 1.): Nature 190:372-373
(1961).
22 Riggs, A.D. and Pfeifer, G.P. X-chromosome inactivation and cell memory. Trends Genet.
8:169-174 (1992).
23 Latham, K.E. X chromosome imprinting and inactivation in the early mammalian embryo. Trends
Genet. 12:134-138 (1996).
24 Wu, J., Ellison, J., Salido, E., Yen, P., Mohandas, T., and Shapiro, L.J. Isolation and
characterization of XE169, a novel human gene that escapes X-inactivation. Hum.Mol.Genet.
3:153-160 (1994).
25 Brown, C.J., Miller, A.P., Carrel, L., Rupert, J.L., Davies, K.E., and Willard, H.F. The DXS423E
gene in Xp11.21 escapes X chromosome inactivation. Hum.Mol.Genet. 4:251-255 (1995).
26 Coleman, M.P., Ambrose, H.J., Carrel, L., Nemeth, A.H., Willard, H.F., and Davies, K.E. A novel
gene, DXS8237E, lies within 20 kb upstream of UBEl in Xp11.23 and has a different X inactivation
status. Genomics 31:135-138 (1996).
27 Brown, C.J., Lafreniere, R.G., Powers, V.E., Sebastio, G., Ballabio, A., Pettigrew, A.L., Ledbetter,
D.H., Levy, E., Craig, I.W., and Willard, H.F. Localization of the X inactivation centre on the
human X chromosome in Xq13. Nature 349:82-84 (1991).
28 Lyon, M.F. X-chromosome inactivation. Pinpointing the centre. Nature. 379:116-117 (1996).
29 Willard, H.F. X chromosome inactivation, XIST, and pursuit of the X-inactivation center. Cell
86:5-7 (1996).

47
Chapter 1

30 Marahrens, Y., Panning, B., Dausman, J., Strauss, W., and Jaenisch, R. Xist-deficient mice are
defective in dosage compensation but not spermatogenesis. Genes & Dev. 11:156-166 (1997).
31 Lyon, M.F. The X inactivation centre and X chromosome imprinting. Eur.J.Hum.Genet. 2:255-261
(1994).
32 Brown, C.J., Ballabio, A., Rupert, J.L., Lafreniere, R.G., Grompe, M., Tonlorenzi, R., and Willard,
H.F. A gene from the region of the human X inactivation centre is expressed exclusively from the
inactive X chromosome. Nature 349:38-44 (1991).
33 Harris, A., Collins, J., Vetrie, D., Cole, C., and Bobrow, M. X inactivation as a mechanism of
selection against lethal alleles: further investigation of incontinentia pigmenti and X linked
lymphoproliferative disease. J.Med.Genet. 29:608-614 (1992).
34 Mattei, M.G., Mattei, J.F., Ayme, S., and Giraud, F. X-autosome translocations: cytogenetic
characteristics and their consequences. Hum Genet 61 :295-309 (1982).
35 Bodrug, S.E., Holden, J.J., Ray, P.N., and Worton, R.G. Molecular analysis of X-autosome
translocations in females with Duchenne muscular dystrophy. EMBO J. 10:3931-3939 (1991).
36 Chery, M., Biancalana, V., Philippe, C., Malpuech, G., Carla, H., Gilgenkrantz, S., Mandel, J.-L.,
and Hanauer, A. Hypomagnesemia with secondary hypocalcemia in a female with balanced X;9
translocation: Mapping of the Xp22 chromosome breakpoint. Hum.Genet. 93:587-591 (1994).
37 Clarke, J.T.R., Wilson, P.J., Morris, C.P., Hopwood, J.J., Richards, R.I., Sutherland, G.R., and Ray,
P.N. Characterization of a deletion at Xq27-q28 associated with unbalanced inactivation of the
nonmutant X chromosome. Am.J.Hum.Genet. 51:316-322 (1992).
38 Morell, V. Rise and fall of theY chromosome. Science 263:171-172 (1994).
39 Wilkins, A.S. Moving up the hierarchy: a hypothesis on the evolution of a genetic sex determination
pathway. Bioessays 17:71-77 (1995).
40 Charlesworth, B. The evolution of chromosomal sex determination and dosage compensation.
Curr.Biol. 6:149-162 (1996).
41 Spurdle, A.B. and Jenkins, T. The Y chromosome as a tool for studying human evolution.
Curr.Opin.Genet.Dev. 2:487-491 (1992).
42 Bone, J.R. and Kuroda, M.I. Dosage compensation regulatory proteins and the evolution of sex
chromosomes in Drosophila. Genetics 144:705-713 (1996).
43 Kelley, R.L. and Kuroda, M .I. Equality for X chromosomes. Science 270:1607-1610 (1995).
44 McKay, L., Watson, J.M., and Marshal! Graves, J.A. Mapping human X-linked genes in the
phalangerid marsupial Trichosurus vulpecula. Genomics 14:302-308 (1992).
45 Watson, J.M., Spencer, J.A., Riggs, A.D., and Marshal! Graves, J.A. Sex chromosome evolution:
Platypus gene mapping suggests that part of the human X chromosome was originally autosomal.
Proc.Natl.Acad.Sci. USA 88:11256-11260 (1991).
46 Spencer, J.A., Sinclair, A.H., Watson, J.M., and Graves, J.A. Genes on the short arm of the human
X chromosome are not shared with the marsupial X. Genomics 11:339-345 (1991).
47 Ballabio, A., Bardoni, B., Carrozzo, R., Andria.G., Bick, D., Campbell, L., Hamel, B.,
Ferguson-Smith, M.A., Gimelli, G., Fraccaro, M., Maraschio, P., Zuffardi, 0., Guioli, S., and
Camerino, G. Contiguous gene syndromes due to deletions in the distal short arm of the human X
chromosome. Proc.Natl.Acad.Sci. USA 86:10001-10005 (1989).

48
References

48 Schaefer, L., Ferrero, G.B., Grillo, A., Bassi, M.T., Roth, E.J., Wapenaar, M.C., Van Ommen,
G.J.B., Mohandas, T.K., Rocchi, M., Zoghbi, H.Y., and Ballabio, A. A high resolution deletion map
of human chromosome Xp22. Nature Genet. 4:272-279 (1993).
49 Schnur, R.E., Wick, P.A., Sosnoski, D.N., Bick, D., and Nussbaum, R.L. Deletion mapping and a
highly reduced radiation hybrid in the Xp22.3-p22.2 region. Genomics 15:500-506 (1993).
50 Bertelson, C.J., Pogo, A.O., Chaudhuri, A., Marsh, W.L., Redman, C.M., Banerjee, D., Symmans,
W.A., Simon, T., Frey, D., and Kunkel, L.M. Localization of the McLeod locus (XK) within Xp21
by deletion analysis. Am.J.Hum.Genet. 42:703-711 (1988).
51 De Saint-Basile, G., Bohler, M.C., Fischer, A., Cartron, J., Dufier, J.L., Griscelli, C., and Orkin,
S.H. Xp21 DNA microdeletion in a patient with chronic granulomatous disease, retinitis
pigmentosa, and McLeod phenotype. Hum.Genet. 80:85-89 (1988).
52 Francke, U., Ochs, H.D., De Martinville, B., Giacalone, J., Lindgren, V., Disteche, C., Pagon, R.A.,
Hofker, M.H., Van Ommen, G.J., Pearson, P.L., and Wedgwood, R.J. Minor Xp21 chromosome
deletion in a male associated with expression of Duchenne muscular dystrophy, chronic
granulomatous disease, retinitis pigmentosa, and McLeod syndrome. Am.J.Hum.Genet. 37:250-267
(1985).
53 Franco, B., Guioli, S., Pragliola, A., Incerti, B., Bardoni, B., Tonlorenzi, R., Carrozzo, R., Maestrini,
E., Pieretti, M., Taillon-Miller, P., Brown, C.J., Willard, H.F., Lawrence, C., Persico, M.G.,
Camerino, G., and Ballabio, A. A gene deleted in Kallmann's syndrome shares homology with
neural cell adhesion and axonal path-finding molecules. Nature 353:529-536 (1991).
54 Legouis, R., Hardelin, J.-P., Levilliers, J., Claverie, J.-M., Compain, S., Wunderle, V., Millasseau,
P., Le Paslier, D., Cohen, D., Caterina, D., Bougueleret, L., Delemarre-Van de Waal, H., Lutfalla,
G., Weissenbach, J., and Petit, C. The candidate gene for the X-linked Kallmann syndrome encodes
a protein related to adhesion molecules. Cell67:423-435 (1991).
55 Ballabio, A., Parenti, G., Carrozzo, R., Sebastio, G., Andria, G., Buckle, V., Fraser, N., Craig, I.,
Rocchi, M., Romeo, G., Jobsis, A.C., and Persico, M. G. Isolation and characterization of a steroid
sulfatase cDNA clone: genomic deletions in patients with X-chromosome-linked ichthyosis.
Proc.Natl.Acad.Sci. USA 84:4519-4523 (1987).
56 Monaco, A.P., Neve, R.L., Coletti-Feener, C., Bertelson, C.J., Kurnit, D.M., and Kunkel, L.M.
Isolation of candidate cDNAs for portions of the Duchenne muscular dystrophy gene. Nature
323:646-650 (1986).
57 Ho, M., Chelly, J., Carter, N., Danek, A., Crocker, P., and Monaco, A.P. Isolation of the gene for
McLeod syndrome that encodes a novel membrane transport protein. Cell77:869-880 (1994).
58 Royer-Pokora, B., Kunkel, L.M., Monaco, A.P., Goff, S.C., Newburger, P.E., Baehner, R.L., Cole,
F.S., Curnutte, J.T., and Orkin, S.H. Cloning the gene for an inherited human disorder -chronic
granulomatous disease- on the basis of its chromosomal location. Nature 322:32-38 (1986).
59 Robinson, D.O., Boyd, Y., Cockburn, D., Collinson, M.N., Craig, I., and Jacobs, P.A. The parental
origin of de novo X-utosome translocations in females with Duchenne muscular dystrophy revealed
by M27 beta methylation analysis. Genet.Res. 56:135-140 (1990).
60 Francis, F., Hennig, S., Korn, B., Reinhardt, R., De Jong, P., Poustka, A., Lehrach, H., Rowe,
P.S.N., Goulding, J.N., Summerfield, T., Mountford, R., Read, A.P., Popowska, E., Pronicka, E.,

49
Chapter 1

Davies, K.E., O'Riordan, J.L.H., Econs, M.J., Nesbitt, T., Drezner, M.K., Oudet, C., Pannetier, S.,
Hanauer, A., Strom, T.M., and Meindl, A. A gene (PEX) with homologies to endopeptidases is
mutated in patients with X-linked hypophosphatemic rickets. Nature Genet. 11:130-136 (1995).
61 Van den Berg, I.E.T., Van Beurden, E.A.C.M., Malingre, H.E.M., Ploos Van Amstel, H.K.,
Poll-The, B.T., Smeitink, J.A.M., Lamers, W.H., and Berger, R. X-linked liver phosphorylase kinase
deficiency is associated with mutations in the human liver phosphorylase kinase a subunit.
Am.J.Hum.Genet. 56:381-387 (1995).
62 Hendrickx, J., Dams, E., Coucke, P., Fernandes, J., and Willems, P.J. X-linked liver glycogenosis
type II (XLG II) is caused by mutations in PHKA2, the gene encoding the liver a subunit of
phosphorylase kinase. Hum.Mol.Genet. 5:649-652 (1996).
63 Trivier, E., De Cesare, D., Jacquot, S., Pannetier, S., Zackai, E., Young, 1., Mandel, J.-L.,
Sassone-Corsi, P., and Hanauer, A. Mutations in the kinase Rsk-2 associated with Coffin-Lowry
syndrome. Nature 384:567-570 (1996).
64 Lindsay, E.A., Grillo, A., Ferrero, G.B., Roth, E.J., Magenis, E., Grompe, M., Hulten, M., Gould,
C., Baldini, A., Zoghbi, H.Y., and Ballabio, A. Microphthalmia with linear skin defects (MLS)
syndrome: clinicak, cytogenetic, and molecular characterization. Am.J.Med.Genet. 49:229-234
(1994).
65 Naritomi, K., Izumikawa, Y., Nagataki, S., Fukushima, Y., Wakui, K., Niikawa, N., and Hirayama,
K. Combined Goltz and Aicardi syndromes in a terminal Xp deletion: are they a contiguous
syndrome? Am.J.Med.Genet. 43:839-843 (1992).
66 Hendrickx, J., Coucke, P., Dams, E., Lee, P., Odievre, M., gorbeel, L., Fernandes, J.F., and
Willems, P.J. Mutations in the phosphorylase kinase gene PHKA2 are responsible for X-linked liver
glycogen storage disease. Hum.Mol.Genet. 4:77-83 (1995).
67 Heuertz, S., Smahi, A., Wilkie, A.O.M., Le Merrer, M., Maroteaux, P., and Hors-Cayla, M.C.
Genetic mapping of Xp22.12-p22.31, with a refined localization for spondyloepiphyseal dysplasia
(SEDL). Hum.Genet. 96:407-410 (1995).
68 Toutain, A., Ranee, N., Dessay, B., Robb, L., Francannet, C., Le Merrer, M., Briard, M.L., Kaplan,
J., and Moraine, C. Nance-Horan syndrome: linkage analysis in 4 families refines localization in
Xp22.31-p22.13 region. Hum.Genet. 99:256-261 (1997).
69 McGuire, R.E., Sullivan, L.S., Blanton, S.H., Church, M.W., Heckenlively, J.R., and Daiger, S.P.
X-linked dominant cone-rod degeneration: linkage mapping of a new locus for Retinitis Pigmentosa
(RP15) to Xp22.13-p22.11. Am.J.Hum.Genet. 57:87-94 (1995).
70 Del Castillo, I., Villamar, M., Sarduy, M., Romero, L., Herraiz, C., Hernandez, F.J., Rodriguez, M.,
Borras, 1., Montero, A., Bellon, J., Cruz Tapia, M., and Moreno, F. A novel locus for non-syndromic
sensorineural deafness (DFN6) maps to chromosome Xp22. H urn. Mol. Genet. 5: 1383-1387 ( 1996).
71 Gedeon, A., Kerr, B., Mulley, J., and Turner, G. Pericentromeric genes for non-specific X-linked
mental retardation (MRX). Am.J.Med.Genet. 51:553-564 (1994).
72 Donnelly, A.J., Andy Choo, K.H., Kozman, H.M., Gedeon, A.K., Danks, D.M., and Mulley, J.C.
Regional localisation of a non-specific X-linked mental retardation gene (MRX19) to Xp22.
Am.J.Med.Genet. 51:581-585 (1994).
73 Hu, L.-J., Blumenfeld-Heyberger, S., Hanauer, A., Weissenbach, J., and Mandel, J.-L. Non-specific

50
References

X-linked mental retardation: Linkage analysis in MRX2 and MRX4 families revisited.
Am.J.Med.Genet. 51:569-574 (1994).
74 Schutz, C.K., Ives, E.J., Chalifoux, M., MacLaren, L., Farrell, S., Robinson, P.D., White, B.N., and
Holden, J.J.A. Regional localization of an X-linked mental retardation gene to Xp21.1-Xp22.13
(MRX38). Am.J.Med.Genet. 64:89-96. (1996).
75 Van de Vosse, E., Bergen, A.A.B., Meershoek, E.J., Oosterwijk, J.C., Gregory, S., Bakker, B.,
Weissenbach, J., Coffey, A.J., Van Ommen, G.J.B., and Den Dunnen, J.T. A Xp22.1-p22.2 YAC
contig encompassing the disease loci for RS, KFSD, CLS, HYP and RP15; refined localization of
RS. Eur.J.Hum.Genet. 4:101-104 (1996).
76 Oosterwijk, J.C., Van der Wielen, M.J.R., Van de Vosse, E., Voorhoeve, E., and Bakker, E.
Refinement of the localisation of X-linked keratosis follicularis spinulosa decal vans (KFSD) gene
in Xp22.1-p22.2. J.Med.Genet. 32:736-739 (1995).
77 Haas, J. Ober das zusammenvorkommen von Verii.nderungen der Retina und Chorioidea.
Arch.Augenheilkd. 37:343-348 (1898).
78 Wilczek, M. Bin fall der netzhautspaltung (Retinoschisis) mit einer offnung. Ztschr.fA. 85: 108-116
(1996).
79 Sorsby, A., Klein, M., Humdall Gann, J., and Siggins, G. Unusual retinal detachment, possibly
sex-linked. Brit.J.Ophthalmol. 35:1-10 (1951).
80 Bergen, A.A.B., Ten Brink, J.B., and Van Schooneveld, M.J. Efficient DNA carrier detection in X
linked juvenile retinoschisis. Br.J.Ophthalmol. 79:683-686 (1995).
81 Lisch, W. Sex-linked Juvenile Retinoschisis. In: Straub, W. (ed) Hereditary Vitroretinal
Degenerations. Karger, Basel,New York: 19-32.(1983).
82 Pawar, H., Bingham, E.L., Lunetta, K.L., Segal, M., Richards, J.E., Boehnke, M., and Sieving, P.A.
Refined genetic mapping of juvenile X-linked retinoschisis. Hum.Hered. 45:206-210 (1995).
83 George, N.D., Yates, J.R., and Moore, A.T. Clinical features in affected males with X-linked
retinoschisis. Arch.Ophthalmol. 114:274-280 (1996).
84 Sieving, P.A., Bingham, E.L., Roth, M.S., Young, M.R., Boehnke, M., Kuo, C.-Y., and Ginsburg,
D. Linkage relationship of X-linked juvenile retinoschisis with Xp22.1-p22.3 probes.
Am.J.Hum.Genet. 47:616-621 (1990).
85 George, N.D.L., Payne, S.J., Barton, D.E., Moore, A.T., and Yates, J.R.W. Genetic mapping of
X-linked Retinoschisis. Cytogenet.Cell Genet. 67:354(1994).
86 Forsius, H., Eriksson, A., and Vainio-Mattila, B. Geshlechtsgebundene, erbliche Retinoschisis in
zwei Familien in Finnland. Klin.Mbl.Augenheilk. 143:806-816 (1963).
87 Condon, G.P., Brownstein, S., Wang, N.-S., Keams, A.F., and Ewing, C.E. Congenital hereditary
(juvenile X-linked) retinoschisis. Arch.Ophthalmol. 104:576-583 (1986).
88 Gellert, G., Petersen, J., Krawczak, M., and Zoll, B. Linkage relationship between retinoschisis and
four marker loci. Hum.Genet. 79:382-384 (1988).
89 Alitalo, T., Forsius, H., Kama, J., Frants, R.R., Eriksson, A.W., Wood, S., Kruse, T.A., and De la
Chapelle, A. Linkage relationships and gene order around the locus for X-linked Retinoschisis.
Am.J.Hum.Genet. 43:476-483 (1988).
90 Dahl, N., Goonewardena, P., Chotal, J., Anvret, M., and Petterson, U. DNA linkage analysis of

51
Chapter 1

X-linked retinoschisis. Hum.Genet. 78:228-232 (1988).


91 Arden, G.B., Gorin, M.B., Polkinghome, P.J., Jay, M., and Bird, A. C. Detection of the carrier state
of X-linked Retinoschisis. Am J Ophthalmoll 05:590-595 (1988).
92 Kaplan, J., Pelet, A., Hentati, H., Jeanpierre, M., Briard, M.L., Journel, H., Munnich, A., and Dufier,
J .L. Contribution to carrier detection and genetic counselling in X linked retinoschisis. J.Med. Genet.
28:383-388 (1991).
93 George, N.D.L., Yates, J.R.W., and Moore, A.T. X linked retinoschisis. Br.J.Ophthalmol.
79:697-702 (1995).
94 Manschot, W.A. Pathology of hereditary juvenile retinoschisis. Arch.Ophthalmol. 88:131-138
(1972).
95 Yanoff, M., Kertesz Rahn, E., and Zimmerman, L.E. Histopathology of juvenile retinoschisis.
Arch.Ophthalmol. 88:131-138 (1972).
96 Peachey, N.S., Fishman, G.A., Derlacki, D.J., and Brigell, M.G. Psychophysical and
electroretinographic findings in X-linked juvenile retinoschisis. Arch.Ophthalmol. 105:513-516
(1987).
97 De Jong, P.T.V.M., Zrenner, E., Van Meel, G.J., Keunen, J.E.E., and Van Norren, D. Mizuo
phenomenon in X-linked retinoschisis. Arch. Ophthalmol. 109: 1104-1108 ( 1991 ).
98 Ewing, C.C. and Cullen, A.P. Fluorescein angiography in X-chromosomal maculopathy with
retinoschisis (juvenile hereditary retinoschisis). Can.J.Ophthalmol. 7:19-28 (1972).
99 Brockhurst, R.J. Photocoagulation in congenital retinoschisis. Arch. Ophthalmol. 84:158-165 (1970).
lOO Regillo, C.D., Tasman, W.S., and Brown, G.C. Surgical management of complications associated
with X -linked retinoschisis. Arch. Ophthalmol. Ill : 1080-1086 ( 1993).
101 Madjarov, B., Hilton, G.F., Brinton, D.A., and Lee, S.S. A new classification of the retinoschises.
Retina. 15:282-285 (1995).
102 Wieacker, P., Wienker, T.F., Dallapiccola, B., Bender, K., Davies, K.E., and Ropers, H.H. Linkage
relationships between retinoschisis, Xg, and a cloned DNA sequence ftom the distal short arm of
the X chromosome. Hum.Genet. 64:143-145 (1983).
103 Alitalo, T., Karna, J., Forsius, H., and De la Chapelle, A. X-linked retinoschisis is closely linked to
DXS41 and DXS16 but not DXS85. Clin.Gene,t. 32:192-195 (1987).
104 Alitalo, T., Kruse, T.A., and De la Chapelle, A. Refined localization of the gene causing X-linked
juvenile retinoschisis. Genomics 9:505-510 (1991).
105 Oudet, C., Weber, C., Kaplan, J., Segues, B., Croquette, M.-F., Ollagnon Roman, E., and Hanauer,
A. Characterisation of a highly polymorphic microsatellite at the DXS207locus: confirmation of
very close linkage to the retinoschisis disease gene. J.Med.Genet. 30:300-303 (1992).
106 Bergen, A.A.B., Van Schooneveld, M.J., Orth, U., Bleeker-Wagemakers, E.M., and Gal, A.
Multi point linkage analysis in X -linked juvenile retinoschisis. Clin. Gen. 4 3: 113-116 ( 1993).
107 Biancalana, V., Trivier, E., Weber, C., Weissenbach, J., Rowe, P.S.N., O'Riordan, J.L.H.,
Partington, M.W., Heyberger, S., Oudet, C., and Hanauer, A. Construction of a high-resolution
linkage map for Xp22.1-p22.2 and refinement of the genetic localization of the Coffin-Lowry
syndrome gene. Genomics 22:617-625 (1994).
108 Shastry, B.S., Hejtmancik, F.J., Margherio, R.T., and Trese, M.T. Linkage mapping of new X-linked

52
References

juvenile retinoschisis kindreds using microsatellite markers. Biochem.Biophys.Res. Commun.


220:824-827 (1996).
I 09 Fickett, J.W. The gene identification problem: an overview for developers.
Computers.And.Chemistry. 20:103-118 (1996).
110 Francis, F., Benham, F., Gee See, C., Fox, M., Ishikawa-Brush, Y., Monaco, A.P., Weiss, B.,
Rappold, G., Hamvas, R.M.J., and Lehrach, H. Identification of YAC and cosmid clones
encompassing the ZFX-POLA region using irradiation hybrid cell lines. Genomics 20:75-83 (1994).
Ill Montini, E., Rugarli, E.I., Van de Vosse, E., Andolfi, G., Puca, A.A., Den Dunnen, J.T., Ballabio,
A., and Franco, B. A human homolog of the Drosophila retinal degeneration C (rdgc) gene encodes
a novel serine-threonine phosphatase selectively expressed in sensory neurons of neural crest origin.
Hum.Mol.Genet. 6:1137-1145 (1997).
112 Franco, B. et al. In preparation (1997).
113 Van de Vosse, E., Walpole, S.M., Nicolaou,A., Van der Bent, P., Cahn, A., Vaudin, M., Ross, M.T.,
Durham, J., Pavitt, R., Wilkinson, J., Grafham, D., Bergen, A.A.B., Van Ommen, G.J.B., Yates.;
J.R.W., Den Dunnen, J.T., and Trump, D. Characterization of a new developmental gene, SCMLJ,
in Xp22. Submitted (1997).
114 Franco, B. et al. in preparation (1997).
115 Lameris, H.J. Ichthyosis follicularia. Ned.Tijdschr.Geneeskd. 2:1524(1905).
116 Siemens, H.W. Keratosis follicularis spinulosa decal vans. Arch.Derm.Syph. 151:384-386 (1926).
117 Siemens, H.W. Uber einen, in der menschlichen Pathologie noch nicht beobachteten
Vererbungsmodus; dominant geschlechtsgebundene Vererbung. Arch.fRass. u. Ges.Biol. 17:47-61
(1925).
118 Guillet, G., Labouche, F., Cambazard, F., Plantin, P., Gall, Y., Le Jollec, A., Zagnoli, A., and Parent,
P. Keratosis follicularis spinulosa decalvans: nosological discussion of Siemens' disease. Apropos
of 3 cases. Pediatrie 42:437-440 (1987).
119 Kuokkanen, K. Keratosis follicularis spinulosa decal vans in a family from northern Finland. Acta
Derm. Venereal. 51:146-150 (1971).
120 Herd, R.M. and Benton, E.C. Keratosis follicularis spinulosa decalvans: Report of a new pedigree.
Br.J.Dermatol. 134:138-142 (1996).
121 Oosterwijk, J.C., Nelen, M., van Zandvoort, P.M., van Osch, L.D.M., Oranje, A.P., Wittebol-Post,
D., and Van Oast, B.A. Confirmation of X-linked inheritance and provisional mapping of the
keratosis follicularis spinulosa decalvans gene on Xp in a large Dutch pedigree.
Ophthalm.Paediatr.Genet. 13:27-30 (1991).
122 Oranje, A.P., Van Osch, L.D., and Oosterwijk, J.C. Keratosis Pilaris Atrophicans. One
heterogeneous disease or a symptom in different clinical entities? Arch.Dermatol. 130:500-502
(1994).
23 Oosterwijk, J.C., Richard, G., Van der Wielen, M.J.R., Van de Vosse, E., Harth, W., Sandkuyl,
L.A., Bakker, E. and Van Ommen, G.J.B. Molecular genetic analysis of two families with keratosis
follicularis spinulosa decal vans (KFSD): refinement of gene localization and evidence for genetic
heterogeneity. Hum.Genet. 100:520-524 (1997).
24 Boehnke, M. Limits of resolution of genetic linkage studies: implications for the positional cloning

53
Chapter 1

of human disease genes. Am.J.Hum.Genet. 55:379-390 (1994).


125 Brownstein, B.H., Silverman, G.A., Little, R.D., Burke, D.T., Korsmeyer, S.J., Schlessinger, D., and
Olson, M.V. Isolation of single-copy human genes from a library of yeast artificial chromosome
clones. Science 244:1348-1351 (1989).
126 Green, E.D. and Olson, M. V. Systematic screening of yeast artificial-chromosome libraries by use
of the polymerase chain reaction. Proc.Natl.Acad.Sci. USA 87:1213-1217 (1990).
127 Cohen, D., Chumakov, 1., and Weissenbach, J. A first generation physical map of the human
genome. Nature 366:698-701 (1993).
128 Nelson, D.L., Ledbetter, S.A., Corbo, L., Victoria, M.F., Ramirez-Solis, R., Webster, T.D.,
Ledbetter, D.H., and Caskey, C.T. Alu polymerase chain reaction: a method for rapid isolation of
human-specific sequences from complex DNA sources. Proc.Natl.Acad.Sci. USA 86:6686-6690
(1989).
129 Poustka, A. and Lehrach, H. Jumping libraries and linking libraries: the next generation molecular
tools in mammalian genetics. Trends Genet. 2:174-179 (1986).
130 Butler, R., Ogilvie, D.J., Elvin, P., Riley, J.H., Finniear, R.S., Slynn, G., Morten, J.E.N., Markham,
A.F., and Anand, R. Walking, cloning, and mapping with yeast artificial chromosomes: a contig
encompassing D21S13 and D21S16. Genomics 12:42-51 (1992).
131 Muscatelli, F., Monaco, A.P., Goodfellow, P.N., Hors-Cayla, M.C., Lehrach, H., and Fontes, M.
Isolation of new probes from Xq12-Q13: an example of the screening of reference libraries with
Alu-PCR products from radiation hybrids. Cytogenet.Cell Genet. 61:109-113 (1992).
132 Albertsen, H.M., Abderrahim, H., Cann, H.M., Dausset, J., Le Paslier, D., and Cohen, D.
Construction and characterization of a yeast artificial chromosome library containing seven haploid
human genome equivalents. Proc.Natl.Acad.Sci. USA 87:4256-4260 ( 1990).
133 Anand, R., Riley, J.H., Butler, R., Smith, J.C., and Markham, A.F. A 3.5 genome equivalent multi
access YAC library: construction, characterisation, screening and storage. Nucleic Acids Res.
18:1951-1956 (1990).
134 Larin, Z., Monaco, A.P., and Lehrach, H. Yeast artificial chromosome libraries containing large
inserts from mouse and human DNA. Proc.Natl.Acad.Sci. USA 88:4123-4127 (1991).
135 Driesen, M.S., Dauwerse, J.G., Wapenaar, M,.c., Meershoek, E.J., Mollevanger, P., Chen, K.L.,
Fischbeck, K.H., and Van Ommen, G.J.B. Generation and fluorescent in situ hybridization mapping
of yeast artificial chromosomes of 1p, 17p, 17q, and 19q from a hybrid cell line by high-density
screening of an amplified library. Genomics 11:1079-1087 (1991).
136 Bellanne-Chantelot, C., Lacroix, B., Ougen, P., Billault, A., Beaufils, S., Bertrand, S., Georges, 1.,
Glibert, F., Gros, 1., Lucotte, G., Susini, L., Codani, J.-J., Gesnouin, P., Pook, S., Vaysseix, G.,
Lu-Kuo, J., Ried, T., Ward, D., Chumakov, 1., Le Paslier, D., Barillot, E., and Cohen, D. Mapping
the whole human genome by fingerprinting yeast artificial chromosomes. Cell10: I 059-1068 (1992).
137 Chumakov, I.M., Rigault, P., Le Gall, 1., Bellanne-Chantelot, C., Billault, A., Guillou, S., Soularue,
P., Guasconi, G., Poullier, E., Gros, 1., Belova, M., Sambucy, J.L., Susini, L., Gervy, P., Glibert, F.,
Beaufils, S., Bui, H., Massart, C., De Tand, M.F., Dukasz, F., Lecoulant, S., Ougen, P., Perrot, V.,
and Saumler, M. A YAC contig map of the human genome. Nature 371 Suppl.l75-183 (1995).
138 Vos, P., Hogers, R., Bleeker, M., Reijans, M., Van de Lee, T., Homes, M., Frijters, A., Pot, J.,

54
References

Peleman, J., Kuiper, M., and Zabeau, M. AFLP: a new technique for DNA fingerprinting. Nucleic
Acids Res. 23:4407-4414 (1995).
139 Coffey, A., Gregory, S., and Cole, C.G. Alu-PCR fingerprinting of YACs. Methods.Mol.Biol.
54:97-114 (1996).
140 Nagaraja, R., Kere, J., MacMillan, S., Masisi, M.W.J., Johnson, D., Molini, B.J., Halley, G.R.,
Wein, K., Trusgnich, M., Eble, B., Rai1ey, B., Brownstein, B.H., and Schlessinger, D.
Characterization of four human Y AC libraries for clone size, chimerism and X chromosome
sequence representation. Nucleic Acids Res. 22:3406-3411 (1994).
141 Pavan, W.J., Hieter, P., and Reeves, R.H. Generation of deletion derivatives by targeted
transformation of human-derived yeast artificial chromosomes. Proc.Natl.Acad.Sci. USA
87:1300-1304 (1990).
142 Pavan, W.J., Hieter, P., Sears, D., Burkhoff, A., and Reeves, R.H. High-efficiency yeast artificial
chromosome fragmentation vectors. Gene 106:125-127 (1991).
143 Heus, J.J., De Winther, M.P.J., Van de Vosse, E., Van Ommen, G.J.B., and Den Dunnen, J.T.
Centromeric and non-centromeric ADE2-se1ectab1e fragmentation vectors for YACs in AB 1380.
Genome Res. 7:657-660 (1997).
144 Van de Vosse, E., Van der Bent, P., Heus, J.J., Van Ommen, G.J.B., and Den Dunnen, J.T. High
resolution mapping by YAC fragmentation of a 2.5 Mb Xp22 region containing the human RS,
KFSD and CLS disease genes. Mamm.Genome 8:497-501 (1997).
145 Potier, M.-C., Dutriaux, A., and Reeves, R. Use of YAC fragmentation to delimit a duplicated
region on human chromosome 21. Mamm.Genome 7:85-88 (1996).
146 Datson, N.A., Semina, E., Van Staalduinen, A.A.A., Dauwerse, H.G., Meershoek, E.J., Heus, J.J.,
Frants, R.R., Den Dtinnen, J.T., Murray, J.C., and Van Ommen, G.-J.B. Closing in on the Rieger
syndrome gene on 4q25: mapping translocation breakpoints within a 50 kb region. Am.J.Hum. Genet.
59:1297-1305 (1996).
147 Hochgeschwender, U., Sutcliffe, J.G., and Brennan, M.B. Construction and screening of a genomic
library specific for mouse chromosome 16. Proc.Natl.Acad.Sci. USA 86:8482-8486 (1989).
148 Lovett, M., Kere, J., and Hinton, L.M. Direct selection: a method for the isolation of cDNAs
encoded by large genomic regions. Proc.Natl.Acad.Sci. USA 88:9628-9632 (1991).
149 Parimoo, S., Patanjali, S.R., Shukla, H., Chaplin, D.D., and Weissman, S.M. cDNA selection:
efficient PCR approach for the selection of cDNAs encoded in large chromosomal DNA fragments.
Proc.Natl.Acad.Sci. USA 88:9623-9627 (1991).
150 Guo, W., Worley, K., Adams, V., Mason, J., Sylvester-Jackson, D., Zhang, Y.-H., Towbin, J.A.,
Fogt, D.D., Madu, S., Wheeler, D.A., and McCabe, E.R.B. Genomic scanning for expressed
sequences in Xp21 identifies the glycerol kinase gene. Nature Genet. 4:367-371 (1993).
151 Rommens, J.M., Lin, B., Hutchinson, G.B., Andrew, S.E., Goldberg, Y.P., Glaves, M.L., Graham,
R., Lai, V., McArthur, J., Nasir, J., Theilmann, J., McDonald, H., Kalchmann, M., Clarke, L.A.,
Schappert, K., and Hayden, M.R. A transcription map of the region containing the Huntington
disease gene. Hum.Mol.Genet. 2:901-907 (1993).
152 Miki, Y., Swensen, J., Shattuck-Eidens, D., Futreal, P.A., Harshman, K., Tavtigian, S., Liu, Q.,
Cochran, C., Bennett, L.M., Ding, W., Bell, R., Rosenthal, J., Hussey, C., Tran, T., McCJure, M.,

55
Chapter 1

Frye, C., Rattier, T., Phelps, R., Haugen-Strano, A., Katcher, H., Yakumo, K., Gholami, Z., Shaffer,
D., Stone, S., Bayer, S., Wray, C., Bogden, R., Dayananth, P., Ward, J., Tonin, P., Narod, S.,
Bristow, P.K., Norris, F.H., Helvering, L., Morrison, P., Rosteck, P., Lai, M., Barrett, J.C., Lewis,
C., Neuhausen, S., Cannon-Albright, L., Goldgar, D., Wiseman, R., Kamb, A., and Skolnick, M.H.
A strong candidate for the breast and ovarian cancer susceptibility gene BRCAl. Science 266:66-71
( 1994).
153 Derry, J.M., Ochs, H.D., and Francke, U. Isolation of a novel gene mutated in Wiskott-Aldrich
syndrome. Cel/78:635-644 (1994).
154 Brennan, M.B. and Hochgeschwender, U. So many needles, so much hay. Hum.Mol.Genet.
4:153-156 (1995).
155 National Research Council. Mapping and sequencing the Human Genome. Natl.Acad.Press,
Washington,DC (1988).
156 Capone, M.C., Gorman, D.M., Ching, E.P., and Zlotnik, A. Identification through bioinformatics
of cDNAs encoding human thymic shared Ag-1/stem cell Ag-2. A new member of the human Ly-6
family. Journal of Immunology 157:969-973 (1996).
157 Dodt, G., Braverman, N., Valle, D., and Gould, S.I .. From expressed sequence tags to peroxisome
biogenesis disorder genes. Ann.N.Y.Acad.Sci. 804:516-523 (1996).
158 Banfi, S., Borsani, G., Rossi, E., Bemard, L., Guffanti, A., Rubboli, F., Marchitiello, A., Giglio, S.,
Coluccia, E., Zollo, M., Zuffardi, 0., and Ballabio, A. Identification and mapping of human cDNAs
homologous to Drosophila mutant genes through EST database searching. Nat. Genet. 13: 167-174
(1996).
159 Sedlacek, Z., Konecki, D.S., Siebenhaar, R., Kioschis, P., and Poustka, A. Direct selection of DNA
sequences conserved between species. Nucleic Acids Res. 21 :3419-3425 ( 1993).
160 Bird, A.P. CpG-rich islands and the function of DNA methylation. Nature 321:209-213 (1986).
161 Antequera, F. and Bird, A. Number of CpG islands and genes in human and mouse.
Proc.Natl.Acad.Sci. USA 90:11995-11999 (1993).
162 John, R.M., Robbins, C.A., and Myers, R.M. Identification of genes within CpG-enriched DNA
from human chromosome 4pl6.3. Hum.Mol.Genet. 3:1611-1616 (1994).
163 Pate!, K., Cox, R., Shipley, J., Kiely, F., Frazer, K., Cox, D.R., Lehrach, H., and Sheer, D. A novel
and rapid method for isolating sequences adjacent to rare cutting sites and their use in physical
mapping. Nucl.Acids Res. 19:4371-4375 (1991).
164 Shiraishi, M., Lerman, L.S., and Sekiya, T. Preferential isolation of DNA fragments associated with
CpG islands. Proc.Natl.Acad.Sci.USA 92:4229-4233 (1995).
165 Cross, S.H., Charlton, J.A., Nan, X., and Bird, A.P. Purification of CpG islands using a methylated
DNA binding column. Nat.Genet. 6:236-244 (1994).
166 Auch, D. and Reth, M. Exon trap cloning: using PCR to rapidly detect and clone exons from
genomic DNA fragments. Nucleic.Acids.Res. 18:6743-6744 (1990).
167 Buckler, A.J., Chang, D.D., Graw, S.L., Brook, J.D., Haber, D.A., Sharp, P.A., and Housman, D.E.
Exon amplification: A strategy to isolate mammalian genes based on RNA splicing.
Proc.Natl.Acad.Sci. USA 88:4005-4009 ( 1991).
168 Church, D.M., Stotler, C.J., Rutter, J.L., Murrell, J.R., Trofatter, J.A., and Buckler, A.J. Isolation

56

;
1
References

of genes from complex sources of mammalian genomic DNA using exon amplification. Nature
Genet. 6:98-105 (1994).
169 Nehls, M., Pfeifer, D., and Boehm, T. Ex on amplification from complete libraries of genomic DNA
using a novel phage vector with automatic plasmid excision facility: application to themouse
neurofibromatosis-1locus. Oncogene 9:2169-2175 (1994).
170 Kreissig, S., Schuddekopf, K., Dear, N., and Boehm, T. Expression of peptides encoded by exons
in cloned mammalian DNA. Nucl.Acids Res. 24:4358-4359 (1996).
171 Batshake, B. and Sundelin, J. The mouse genes for the EP 1 prostanoid receptor and the PKN protein
kinase overlap. Biochem.Biophys.Res. Commun. 227:70-76 (1996).
172 Krizman, D.B. and Berget, S.M. Efficient selection of 3'-terminal exons from vertebrate DNA.
Nucleic Acids Res. 21:5198-5202 (1993).
173 Datson, N.A., Duyk, G.M., Blonden, L.A.J., Van Ommen, G.-J.B., and Den Dunnen, J.T. An exon
trapping system providing size selection of spliced clones and facilitating direct cloning. In:
Hochgeschwender, U., Gardiner, K. (eds) Identification of Transcribed Sequences. Plenum Press,
New York: 169-181.(1994).
174 Datson, N.A., Duyk, G.M., Van Ommen, G.J.B., and Den Dunnen, J.T. Specific isolation of
3'-terminal exons of human genes by ex on trapping. Nucleic Acids Res. 22:4148-4153 (1994).
175 Hawkins, J.D. A survey on intron and exon lengths. Nucl.Acids Res. 16:9893-9908 (1988).
176 Datson, N.A., Van de Vosse, E., Dauwerse, H.G., Bout, M., Van Ommen, G.J.B., and Den Dunnen,
J.T. Scanning for genes in large genomic regions: cosmid-based exon trapping of multiple exons in
a single product. Nucleic Acids Res. 24:1105-1111 (1996).
177 Lau, C. International Workshop on the Identification of Transcribed Sequences 6:(Abstract) (1996).
178 The Huntington's Disease Collaborative Research Group A novel gene containing a trinucleotide
repeat that is expanded and unstable on Huntington's disease chromosomes. Cell72:971-983 (1993).
179 Chen, E.Y., Zollo, M., Mazzarella, R., Ciccodicola, A., Chen, C., Zuo, L., Heiner, C., Burough, F.,
Repetto, M., Schlessinger, D., and D'Urso, M. Long-range sequence analysis in Xq28: thirteen
known and six candidate genes in 219.4kb of high GC DNA between the RCP/GCP and G6PD loci.
Hum.Mol.Genet. 5:659-668 (1996).
180 Fickett, J.W. Finding genes by computer: the state of the art. Trends.Genet. 12:316-320 (1996).
181 Brenner, S.E. BLAST, Blitz, BLOCKS and BEAUTY: sequence comparison on the net. Trends
Genet. 11:330-331 (1995).
182 Burset, M. and Guigo, R. Evaluation of gene structure prediction programs. Genomics. 34:353-367
(1996).
183 Lopez, R.S., Larsen, F., and Prydz, H. Evaluation of the exons predictions of the GRAIL software.
Genomics 24:133-136 (1994).
184 Cotton, R.G.H. Mutation detection. Oxford University Press, Oxford: (1997).
185 Den Dunnen, J.T., Liang, P., Van Ommen, G.J.B., and Van Broeckhoven, C. Mutation detection and
diagnosis using PFGE. In: Davies, K.E. (ed) Human genetic disease analysis: a practical approach.
3rd edn. Oxford University Press, Oxford: 45-67.(1993).
186 Datson, N.A., Schaap, C., Bakker, E., Semina, E., Murray, J.C., Van Ommen, G.-J.B., Frants, R.R.,
and Den Dunnen, J.T. De novo splice mutation in the homeobox gene RIEG involved in Rieger

57
Chapter 1

syndrome. In preparation (1997).


187 Antonarakis, S.E., Rossiter, J.P., Young, M., Horst, J., De Moerloose, P., Somrner, S.S., Ketterling,
R.P., Kazazian, H.H.j., Negrier, C., Vinciguerra, C., Gitschier, J., Goossens, M., Girodon, E.,
Ghanem, N., Plassa, F., Lavergne, J.M., Vidaud, M., Costa, J.M., Laurian, Y., Lin, S.-W., Lin, S.-R.,
Shen, M.-C., Lillicrap, D., Taylor, S.A.M., Windsor, S., Valleix, S.V., Nafa, K., Sultan, Y.,
Delpech, M., Vnencak-Jones, C.L., Phillips, J.A.I., Ljung, R.C.R., Koumbarelis, E., Gileraki, A.,
Mandalaki, T., Jenkins, P.V., Collins, P.W., Pasi, K.J., Goodeve, A., Peake, I., Preston, F.E.,
Schwartz, M., Scheibel, E., Ingerslev, J., Cooper, D.N., Millar, D.S., Kakkar, V.V., Giannelli, F.,
Naylor, J.A., Tizzano, E.F., Baiget, M., Domenech, M., Altisent, C., Tusell, J., Beneyto, M.,
Lorenzo, J.I., Gaucher, C., Mazurier, C., Peerlinck, K., Matthijs, G., Cassiman, J.J., Vermylen, J.,
Mori, P.G., Acquila, M., Caprino, D., and Inaba, H. Factor VIll gene inversions in severe
hemophilia A: results of an international consortium study. Blood 86:2206-2212 (1995).
188 Roest, P.A.M., Roberts, R.G., Sugino, S., Van Omrnen, G.J.B., and Den Dunnen, J.T. Protein
truncation test (PTT) for rapid detection of translation-terminating mutations. Hum.Mol.Genet.
10:1719-1721 (1993).
189 Sarkar, G. and Somrner, S.S. Access to a messenger RNA sequence or its protein product is not
limited by tissue or species specificity. Science 244:331-334 (1989).
190 Hogervorst, F.B.L., Comelis, R.S., Bout, M., Van Vliet, M., Oosterwijk, J.C., Olmer, R., Bakker,
B., Klijn, J.G.M., Vasen, H.F.A., Meijers-Heijboer, H., Menko, F.H., Comelisse, C.J., Den Dunnen,
J.T., Devilee, P., and Van Omrnen, G.-J.B. Rapid detection ofBRCA1 mutations by the protein
truncation test. Nature Genet. 10:208-212 (1995).
191 Van der Luijt, R., Khan, P.M., Vasen, H., Van Leeuwen, C., Tops, C., Roest, P., Den Dunnen, J.,
and Fodde, R. Rapid detection of translation-terminating mutations at the adenomatous polyposis
coli (APC) gene by direct protein truncation test. Genomics 20:1-4 (1994).
192 Petrij, F., Giles, R.H., Dauwerse, H.G., Saris, J.J., Hennekam, R.C., Masuno, M., Tommerup, N.,
Van Ommen, G.J., Goodman, R.H., Peters, DJ., and Breuning, M.H. Rubinstein-Taybi syndrome
caused by mutations in the transcriptional eo-activator CBP. Nature 376:348-351 (1995).
193 Kaplan, J.C., Kahn, A., and Chelly, J. illegitimate transcription: its use in the study of inherited
disease. Hum.Mutat. 1:357-360 (1992).
194 Cariello, N.F. and Skopek, T.R. Mutational analysis using denaturing gradient gel electrophoresis
and PCR. Mutat.Res. 288:103-112 (1993).
195 Mural, R. and Gardiner, K. Toward understanding genome function: workshop in France focuses
on transcribed sequences. Hum. Genome News 7(5):(1996).
196 Olson, M. V. The Human Genome Project. Proc.Natl.Acad.Sci. USA 90:4338-4344 (1993).
197 Cann, H.M. CEPH maps. Curr.Opin.Genet.Dev. 2:393-399 (1992).
198 Weissenbach, J., Gyapay, G., Dib, C., Vignal, A., Morisette, J., Millasseau, P., Vaysseix, G., and
Lathrop, M. A second-generation linkage map of the human genome. Nature 359:794-801 (1992).
199 Gyapay, G., Morisette, J., Vignal, A., Dib, C., Fizames, C., Millasseau, P., Marc, S., Bernardi, G.,
Lathrop, M., and Weissenbach, J. The 1993-94 Genethon human genetic linkage map. Nature Genet.
7:246-249 (1994).
200 Adamson, D., Albertsen, H., Ballard, L., Bradley, P., Carlson, M., Cartwright, P., Council, C.,

58
References

Elsner, T., Fuhrman, D., Gerken, S., Harris, L., Holik, P.R., K.imball, A., Knell, J., Lawrence, E.,
Lu, J., Marks, A., Matsunami, N., Melis, R., Milner, B., Moore, M., Nelson, L., Odelberg, S., and
Peters, G. A collection of ordered tetranucleotide-repeat markers from the human genome.
Am.J.Hum.Genet. 57:619-628 (1995).
201 Hudson, T.J., Stein, L.D., Gerety, S.S., Ma, J., Castle, A.B., Si1va, J., Slonim, D.K., Baptista, R.,
Kruglyak, L., Xu, S.-H., Hu, X., Colbert, A.M.E., Rosenberg, C., Reeve-Daly, M.P., Rozen, S., Hui,
L., Wu, X., Vestergaard, C., Wilson, K.M., Bae, J.S., Maitra, S., Ganiatsas, S., Evans, C.A.,
DeAngelis, M.M., lngalls, K.A., Nahf, R.W., Horton, L.T.j., Oskin Anderson, M., Collymore, A.J.,
Ye, W., Kouyoumjian, V., Zemsteva, I.S., Tarn, J., Devine, R., Courtney, D.F., Turner Renaud, M.,
Nguyen, H., O'Connor, T.J., Fizames, C., Faun~, S., Gyapay, G., Dib, C., Morissette, J., Orlin, J.B.,
Birren, B.W., Goodman, N., Weissenbach, J., Hawkins, T.L., Foote, S., Page, D.C., and Lander, E.S.
An STS-based map of the human genome. Science 270:1945-1954 (1995).
202 Schuler, G.D., Boguski, M.S., Hudson, T.J., Hui, L., Castle, AB., Wu, X., Silva, J., Nusbaum, H. C.,
Birren, B.B., Slonim, D.K., Rozen, S., Stein, L.D., Page, D., Lander, E.S., Stewart, E.A., Aggarwal,
A., Bajorek, E., Brady, S., Chu, S., Fang, N., Hadley, D., Harris, M., Hussain, S., and Hudson, J.R.j.
Genome maps 7. The human transcript map. Wall Chart. Science 274:547-562 (1996).
203 Gyapay, G., Schmitt, K., Fizames, C., Jones, H., Vega-Czamy, N., Spillett, D., Muselet, D.,
Prud'Homme, J.-F., Dib, C., Auffray, C., Morisette, J., Weissenbach, J., and Goodfellow, P. A
radiation hybrid map of the human genome. Hum.Mol.Genet. 5:339-346 (1996).
204 Dib, C., Faure, S., Fizames, C., Samson, D., Drouot, N., Vignal, A., Millasseau, P., Marc, S., Hazan,
J., Seboun, E., Gyapay, G., Morissette, J., and Weissenbach, J. A comprehensive genetic map of the
human genome based on 5,264 microsatellites. Nature 380:152-154 (1996).
205 Anonymous. Second International Strategy Meeting on Human Genome Sequencing. Unpublished
(1997).
206 Ali, R.R., Reichel, M.B., Thrasher, A.J., Levinsky, R.J., K.innon, C., Kanuga, N., Hunt, D.M., and
Bhattacharya, S.S. Gene transfer into the mouse retina mediated by an adeno-associated viral vector.
Hum.Mol.Genet. 5:591-594 (1996).

59

You might also like