

254 |
PHYSICAL |
MAPPING |
|
|
|
Comparison of this sequence with the |
Dpn |
I recognition site shown above indicates that a |
|||
Dpn |
I cleavage site has been generated. An eight-base sequence is required to specify this |
||||
site by the procedure we have used. Since it contains two CpG’s, it will be a very rare site |
|||||
in mammalian genomes. Thirty to 40 variations on this theme exist, some generating sites |
|||||
as large as 16 base pairs. Some of these sites are predicted to occur less than once, on av- |
|||||
erage, in most genomes. However, one difficulty with these schemes, is the lack of avail- |
|||||
ability of many of the necessary methylases in sufficiently nuclease-free form to generate |
|||||
the very large DNA fragments specified by such large recognition sites. At present the ex- |
|||||
ploitation of |
Dpn |
I remains an |
extremely attractive method that is not yet generally prac- |
||
ticed because of some of these unsolved experimental limitations. |
|
||||
LINKING |
CLONES |
|
|
|
|
In this and the next few sections we discuss several of the methods that allow the order of restriction fragments to be determined, de novo, without access to other mapping information. By far the most robust of these, in many respects, is the use of specialized clones
called |
linking clones. |
These clones contain the same rare-cutting |
sites that were used to |
||||
produce macrorestriction fragments from the sample to be mapped. As shown in Figure |
|
||||||
8.16, because these clones have an internal rare-cutting site, they must overlap two adja- |
|||||||
cent large DNA fragments. (The sequence |
properties of two authentic human linking |
||||||
clones were illustrated in Figs. 8.10 and 8.11.) When used as hybridization probes, the |
|||||||
linking clones should identify two fragments that are adjacent in the genome. This will |
|||||||
occur unless the rare-cutting site is so |
close to one edge |
of the linking |
clone |
that not |
|||
enough single-copy material remains beyond |
the site to hybridize effectively. To elimi- |
||||||
nate this possibility, it is useful to work with more than one set of linking clones, con- |
|||||||
structed in such a way as to have different distributions of DNA flanking the rare-cutting |
|||||||
site. A convenient way to do this is to start with separate total digest libraries made with |
|||||||
different restriction nucleases such as |
Eco |
R I and |
Hin d III. In principle, there is sufficient |
||||
information in linking clone hybridizations that if a complete linking library were avail- |
|||||||
able, its use in hybridizations would produce data that would allow the entire set of re- |
|||||||
striction fragments to be ordered. |
|
|
|
|
|
||
Two of the methods that have been used to prepare libraries of linking clones are sum- |
|||||||
marized in Figure 8.17. In each case one starts with a small insert genomic library con- |
|||||||
tained into a circular vector that is lacking any sites for the restriction enzyme of interest. |
|||||||
If such a library does not preexist, |
it is easily constructed by subcloning an existing li- |
||||||
brary into such a vector. |
DNA from the entire library is purified and digested |
with |
the |
||||
rare-cutting restriction nuclease. Only those clones containing a genomic insert that in- |
|||||||
cludes a site for this enzyme will be linearized; the rest remain circular. Two different |
|||||||
methods can be used to select out the clones that have been linearized. One approach is to |
|||||||
use trapping electrophoresis: ordinary or PFG electrophoresis at high fields such that lin- |
|||||||
ear molecules migrate into |
the gel effectively while circles remain trapped in |
the sample |
Figure 8.16 A linking clone spans two adjacent large DNA fragments.

LINKING CLONES |
255 |
Figure 8.17 Two procedures used to make libraries of linking clones. Both require starting with a library of the desired chromosome (or other target) in a circular vector.
well or sample plug. In this way the linking clones are selectively captured. They can be |
|
|
||||
religated at low concentration to avoid the production of chimeras, and then re-introduced |
|
|
||||
into a bacterial host. |
|
|
|
|
|
|
The alternative approach, after the rare-site-containing clones have been linearized, is |
|
|
||||
to ligate a selectable marker into the rare-cutting site. Both suppressor tRNA genes and |
|
|
||||
kanamycin resistance genes have been successfully used for this purpose. The library is |
|
|
|
|||
then used to transform a bacterial host in the presence of the |
selection. The only clones |
|
|
|||
that should survive are the linking clones. Both of the methods described work reasonably |
|
|
|
|||
well. However, the procedures are not perfect, and it is important, after a linking library |
|
|
||||
has been made, to screen out artifactual clones. These may include such annoyances as |
|
|
||||
chimeras of rare cutter fragments, clones that contain a rare cutter site that is not actually |
|
|
||||
cut in the parental genome because it is methylated, and clones containing small individ- |
|
|
||||
ual rare cutter fragments. One way to screen for useful clones is to use them as probes in |
|
|
||||
Southern hybridizations with genomic DNA singly and doubly digested with the restric- |
|
|
|
|||
tion enzyme used for constructing the library such as |
|
Eco |
R I or Hin |
d III, and the rare-cut- |
|
|
ting enzyme. Proper linking clones will show single bands or no bands in the absence of |
|
|
|
|||
rare cutter cleavage, and double bands once the rare site has been cut. |
|
|
|
|
||
Once linking clones are available, they have many other useful applications besides the |
|
|
||||
ordering of macrorestriction fragments. Linking clones serve as effective bridges between |
|
|
|
|||
two different kinds of restriction nuclease digests (or other |
types of DNA samples) be- |
|
|
|||
cause they come from a known position, the ends, of large DNA fragments. As a second |
|
|
|
|||
example, one can cut linking clones at their internal rare cutter site and isolate the two |
|
|
||||
fragments. The resulting samples are called |
half-linking clones. |
They are useful for the |
|
|||
analysis of the internal structure of large DNA fragments, as shown by the schematic ex- |
|
|
||||
periment illustrated in Figure 8.18. Here a |
Not |
I half-linking clone is used as an indirect |
|
|||
end label by hybridizating it to a blot of a PFG-fractionated partial |
|
|
Eco R I digest of a |
Not |
||
I fragment. The sizes of the DNA pieces seen in the partial digest provide the location of |
|
|
||||
the internal |
Eco R I sites. This type of analysis was originally developed for smaller DNA |
|
|
|||
pieces by Smith and Birnstiel. It is a very powerful approach. Half-linking clones are also |
|
|
||||
very useful for the analysis of partial digests with enzymes |
that cut |
infrequently, |
as we |
|
|
|
will demonstrate later. |
|
|
|
|
|

256 PHYSICAL MAPPING
Figure 8.18 Use of a half-linking clone as an indirect end label to reveal the pattern of internal restriction sites in a partial digest. This approach to restriction mapping, called the Smith-Birnstiel method after the researchers who first described it, remains the most powerful way to accumulate
large amounts of restriction map data rapidly.
|
Another potential use of linking |
clones will be to order complete digest libraries of |
||||
cloned large DNA fragments, once we are able to produce such libraries. The difficulty |
||||||
today is that too large a percentage of the fragments generated from mammalian DNA by |
||||||
an enzyme |
like |
Not |
I |
exceed the capacities of current large insert cloning vectors. |
||
However, an example of the potential power of this approach can be seen for an enzyme |
||||||
like |
Eag |
I |
that recognizes |
the |
sequence CCGCCG, which is the internal 6 base pairs of |
|
the |
Not |
I recognition sequence. |
Eag |
I cuts genomic DNA into fragments that average 200 |
||
to 300 kb in size. These fragments can be cloned into conventional YAC vectors once the |
||||||
vectors are equipped with the proper |
cloning sites. The result of probing such a library |
|||||
with |
Eag |
I |
linking clones |
is shown in Figure 8.19. In principle, there is enough informa- |
||
tion in the two sets of clones, linking clones and YACs, to completely order both sets of |
||||||
samples, and the method has two advantages. The materials used are only a tiny bit larger |
||||||
than the minimum possible tiling path |
for any ordered library, and distances along this |
|||||
path are known with great precision. In practice, there is no example of a library that has |
||||||
been fully ordered in this way. |
|
|
|
|||
|
One limitation with currently used linking clones must be realized. These clones tend |
|||||
to come from very G |
C rich regions because this is where the sites of most rare-cutting |
|||||
enzymes are preferentially located. These regions make subsequent PCR analyses fairly |
||||||
difficult because, in current PCR protocols, very G |
C rich primers and templates do not |
work especially well. Secondary structure in the single strands of these materials is presumably the major cause of the problems, but a generally effective solution to these problems is not yet in hand.
Figure 8.19 Linking clones and large DNA fragments generated by the same enzyme could be assembled, in principle, into an ordered library with almost the minimum possible tiling length.

JUMPING LIBRARIES |
257 |
Figure 8.20 |
Basic notion of a jumping clone: Two discontinuous pieces |
of the genome ( |
a ) and ( b ), but related by some map or fragment informa- |
tion, are assembled into the same clone.
JUMPING LIBRARIES
Jumping clones offer a way of dealing with two discontinuous pieces of a chromosome.
The basic notion of a jumping clone is shown in Figure 8.20. It is an ordinary small insert clone except that two distinct and distant pieces of the original DNA target are brought together and fused in the process of cloning. A simple case of a jumping clone would be
one that contained the two ends of a large DNA fragment, but all of the internal DNA of the fragment had been excised out. The way in which such clones can be made is shown
in Figure 8.21.
A genomic digest with a rare-cutting enzyme is diluted to very low concentration and ligated in the presence of a vector containing a selectable marker. The goal is to cyclize the large fragments around vector sequences. At sufficiently low concentrations of target, it becomes very unlikely that any intermolecular ligation of target fragments will occur. One can use excess vector, provided that it is dephosphorylated so that vector–vector ligation is not possible. When successful, ligations at low concentrations can produce very large DNA circles. These
are then digested with a restriction enzyme |
that |
has much more frequent sites in |
the |
target |
DNA but no sites in the vector. This can |
be |
arranged by suitable design of |
the |
vector. |
Figure 8.21 Method for producing a library of jumping clones derived from the ends of large
DNA fragments.

258 PHYSICAL MAPPING
Figure 8.22 Potential utility of jumping clones that span large DNA fragments.
The result is a mixture of relatively small DNA pieces; |
only those near the junction of the |
two ends of the original large DNA fragments contain vector sequences. A second ligation is |
|
now carried out, again at very low concentration. This circularizes all of the DNAs in the |
|
sample. When these are reintroduced into |
E. coli, under conditions where selectable markers |
on the vector are required for growth, a jumping library results. |
|
There are potentially powerful uses for jumping libraries that span adjacent sites of in- |
|
frequently cutting restriction enzymes. These are illustrated in Figure 8.22. Note that each |
|
of the jumping clones will overlap two linking clones. |
If one systematically examines |
which linking clones share DNA sequences with jumping clones, and vice versa, the re- |
|
sult will be to order all the linking clones and all the jumping clones. This could be done |
|
by hybridization, PCR, or by direct DNA sequencing near the rare-cutting sites. In the lat- |
|
ter case DNA sequence comparisons will reveal clone overlaps. Such an approach, called |
|
sequence-tagged rare restriction sites |
(STARs), becomes increasingly attractive as the |
ease and throughput of automated DNA sequencing improves. |
|
A more general form of jumping library is shown schematically in Figure 8.23. Here a |
|
genome is partially digested by a relatively frequent cutting restriction nuclease. The re- |
|
sulting, very complex, mixture is fractionated by PFG, and a very narrow size range of |
|
material is selected. If this is centered about 500 kb, then the sample contains all contigu- |
|
ous blocks of 500-kb DNA in the genome. This material is used to make a jumping li- |
|
brary in the same general manner as described above, by |
circularizing it around a vector |
at low concentration, excising internal material, and recircularizing. The resulting library |
|
is a size-selected jumping library, consisting, in |
principle, of all discontinuous sets of |
short DNA sequences spaced 500 kb apart in the genome. The major disadvantage of this |
|
library is that it is very complex. However, it is |
also very useful, as shown in Figure |
8.24 a. Suppose that one has a marker in a region of interest, and one would like another |
|
marker spaced approximately 500 kb away. The original |
marker is used to screen a 500- |
Figure 8.23 |
DNA preparation ( |
a ) used to generate a more general jumping library ( |
b ) consisting |
of a very dense sampling of genomic fragments of a fixed length.

PARTIAL DIGESTION |
259 |
Figure 8.24 |
Example of the use of a general |
jumping library to move rapidly from one site in a |
|
genome to distantly spaced sites. ( |
a ) |
Half-jumping clones provide new probes, at known distances |
|
from the starting probe. ( |
b ) Information about the direction of the jump can be preserved if the map |
orientation of the original probe is known.
kb jumping library. This will result in the identification of clones that can be cut into halfjumping probes that flank the original marker by 500 kb on either side. It is possible to tell which jump has occurred in which direction if the original marker is oriented with re-
spect to some other markers in the genome and if it contains a convenient internal restric-
tion site (Fig. 8.24 |
b ). Selection of jumping clones by using portions |
of the original |
marker will let information about its orientation be preserved after the jump. The major |
||
limitation in the use of |
this otherwise very powerful approach is that it is relatively |
hard |
to make long jumps because it is difficult to ligate such large DNA circles efficiently. |
|
PARTIAL DIGESTION
In many regions of the human genome, it is difficult to find any restriction enzyme that consistently gives DNA fragments larger than 200 kb. In most regions, genomic restriction fragments are rarely more than a few Mb in size. This is quite inefficient for lowresolution mapping, since PFG is capable of resolving DNA fragments up to 5 or 7 Mb in size. To take advantage of the very large size range of PFG, it is often extremely useful to do partial digests with enzymes that cut the genome infrequently. However, there is a basic problem in trying to analyze the result of such a digest to generate a unique map of the region. This problem is illustrated in Figure 8.25. If a single probe is used to examine the digest by hybridization after PFG fractionation, the probe will detect a mixture of DNA fragments that extends from its original location in both directions. It is not straightforward to analyze this mixture of products and deduce the map that generated it.

260 PHYSICAL MAPPING
Figure 8.25 Ambiguity in interpreting the fragment pattern seen in a partial restriction nuclease digest, when hybridized with only a single probe.
If the enzyme cut all sites with equal kinetics, one could probably use likelihood or
maximum entropy arguments to select the most likely fragment orderings consistent |
with |
||
a given digestion pattern. The difficulty is that many restriction enzymes appear to have |
|||
preferred cutting sites. In the case of |
Sfi |
I these are generated by the peculiarities of the |
|
interrupted recognition sequence. In the case of |
Not |
I and many other enzymes, hot spots |
|
for cutting appear to occur in regions where a |
number of sites |
are clustered very |
closely |
in the genome (Fig. 8.26). Since the possibility of a hot spot in a particular region is hard |
|||
to rule out, a priori, one plausible candidate for a restriction map that fits the partial diges- |
|||
tion data is a null map in which all partially |
cleaved sites lie to one side of the probe and |
||
a fully cut hot spot lies to the other side (Fig. 8.27). While such a map is statistically im- |
|||
plausible in a random genome, it becomes quite reasonable |
once the possibility of hot |
||
spots is allowed. |
|
|
|
There are three special cases of partial digest patterns that can be analyzed without the |
|||
complications we have just raised above. The first of these |
occurs when the region is |
||
flanked by a site that is cut in every molecule under examination. There are two ways to |
|||
generate this site (which is, in effect, a known hot spot). Suppose that the probe used to ana- |
|||
lyze the digest is very close to the end of a |
chromosome (Fig. |
8.28). Then the null map is |
the correct map. Using a telomeric probe is equivalent to the Smith-Birnstiel mapping approach we discussed earlier for smaller DNA fragments. It is worth noting that even if the probe is not at the very end of the mapped region, any prior knowledge that it is close to the end will greatly simplify the analysis of partial digest data. A second case of such a site oc-
curs if |
we can |
integrate |
into a |
chromosome |
the |
recognition |
sequence |
of |
a very |
rare |
cutting |
enzyme, |
such that |
this |
will be the |
only |
site of its |
kind in |
the |
region |
of interest. |
Figure 8.26 Example of a restriction enzyme hot spot that can confound attempts to use partial digest mapping methods.

PARTIAL DIGESTION |
261 |
Figure 8.27 Null ordering of fragments seen in a partial digest.
Figure 8.28 Assembling a restriction map from partial digest data when a totally digested site (or a chromosome end) is known to be present in the region.
Then we cleave at the very rare site completely |
and partially digest with an enzyme that |
||||
has somewhat more frequent cleavage sites. The power of this approach as a rapid map- |
|||||
ping method is considerable. However, it is dependent on the ability to drive the very rare |
|||||
cleavage to completion. As we have indicated before, in many of the existing schemes for |
|||||
very rare cleavage, total digestion cannot be guaranteed. Thierry and Dujon in Paris have |
|||||
recently shown that sites for the nuclease I- |
|
Sce |
I (Table 8.1) can be inserted at convenient |
||
densities in the yeast genome and that the enzyme cuts completely enough to make the |
|||||
strategy we have just described very effective. |
|
|
|
|
|
A second case occurs that allows analysis of partial digestion data if the region of in- |
|||||
terest is next to a very large DNA fragment. Then, as shown in Figure 8.29, all DNA frag- |
|||||
ments seen in the partial digest that have a size less than the very large fragment can be |
|||||
ordered so long as the digest is probed by hybridization with a probe that is located on the |
|||||
first small piece next to the very large fragment. Again, as in the case of telomeric probes, |
|||||
relaxing this constraint a bit still enables |
considerable map |
information |
to be |
inferred |
|
from the digest. |
|
|
|
|
|
The third case where partial digests can be analyzed occurs when one has two probes |
|||||
known, a priori, to lie on adjacent DNA fragments. This is precisely the case in hand, for |
|||||
example, when two half-linking clones are used as probes of a partial digest made by us- |
|||||
ing the same rare site present on the clones. In this case, as shown in Figure 8.30, those |
|||||
fragment sizes seen with one probe and not the other must extend in the direction of that |
|||||
probe. Bands seen with both probes are generally not that |
informative. |
Thus |
linking |
clones play a key role in the efficient analysis of partial digests.
Figure 8.29 Assembling a restriction map from partial digest data adjacent to a very large restriction fragment.

262 PHYSICAL MAPPING
Figure 8.30 Assembling a restriction map when two hybridization probes are available and are known to lie on adjacent DNA fragments. Such probes are available, for example, as the halves of a linking clone.
The power of the approaches that employ partial digests is that a probe in one location |
|
||
can be used to reach out and obtain map data across considerable distances, where no other |
|
||
probes may yet exist. In order to carry out these experiments, however, two requirements |
|
||
must be met. First, reliable length standards and good |
high-resolution PFG fractionations |
|
|
are essential. The data afforded by a partial digest is all inherent in the lengths of the DNA |
|
||
bands seen. For example, if one used a probe known to reside on a 400-kb band, and found |
|
||
in a partial 1000-kb band, this is evidence that a 600-kb band neighbors the 400-kb band. |
|
||
One could then try to isolate specific probes from the 600-kb region to try to prove this as- |
|
||
signment. However, if the 1000-kb band was mis-sized, and it was really only 900 kb, when |
|
||
one went to find probes in the 600-kb region of a |
gel, this would be the wrong region. |
|
|
The second constraint for effective partial digest analysis is very sensitive hybridization pro- |
|
||
tocols. The yields of DNA pieces in partials can easily be only 1 to 10%. Detecting these re- |
|
||
quires hybridizations that are 10 to 100 times as sensitive as those needed for ordinary sin- |
|
||
gle-copy DNA targets. Autoradiographic exposures of one to two weeks are not uncommon |
|
||
in the analysis of partial digests with infrequently cutting enzymes. |
|
||
EXPLOITING DNA POLYMORPHISMS TO ASSIST MAPPING |
|
|
|
Suppose that two different DNA probes detect a |
|
Not I fragment 800 kb long. How can we |
|
tell if they lie on the same fragment, or if it just a coincidence and they derive from two |
|
||
different fragments with sizes too similar to resolve? One approach is to cut the |
Not I di- |
||
gest with a second relatively rare-cutting enzyme. If the band seen by one probe shortens, |
|
||
and the band seen by the other does not, we know two different fragments are involved. If |
|
||
both bands shorten after the second digestion, the result is ambiguous, unless two differ- |
|
||
ent size bands are seen by the two probes and the sum |
of their sizes is greater than the |
|
|
size of the band originally seen in the |
Not |
I digest. |
|
A more reliable approach is to try the two different probes on DNA isolated from a se- |
|
||
ries of different cell lines. In practice, eight cell lines with very different characteristics |
|
||
usually suffices. When this is done with a single DNA probe, usually one or more of the |
|
||
lines shows a significant size difference or cutting difference from the others (Fig. 8.31). |
|
||
There are many potential origins for this polymorphism. Mammalian genomes are rampant |
|
||
with tandem repeats, and these differ in size substantially |
from individual to individual. |
|

EXPLOITING DNA POLYMORPHISMS TO ASSIST MAPPING |
263 |
Figure 8.31 Example of the polymorphism in large restriction fragments seen with a single-copy
DNA probe when a number of different cell lines are compared.
Most rare-cutting sites are methylation sensitive, and especially in |
tissue |
culture |
cell |
||
lines, a quite heterogeneous pattern of methylation frequently occurs. There are also fre- |
|
||||
quent genetic |
polymorphisms at rare-cutting enzyme sites: These RFLPs |
arise because |
|
||
the sites contain CpGs that are potential mutation hotspots. For the purposes of map con- |
|
||||
struction, the source of the polymorphism is almost irrelevant. The basic idea |
is that if |
||||
two different probes share the same pattern of polymorphisms across a series of cell lines, |
|
||||
whatever its cause, they almost certainly must derive from the same DNA fragment. One |
|
||||
caveat to this statement must be noted. Overloading a sample used for PFG analysis will |
|
||||
slow down all bands. If one cell line is overloaded, it will look like the apparent size of a |
|||||
particular |
Not |
I band has increased, but in practice, this is an |
artifact. To avoid this prob- |
||
lem, it is best to work at very low DNA concentrations, especially when PFG is used to |
|
||||
analyze polymorphisms. Further details are given in Chapter 5. |
|
|
|
||
Polymorphism patterns can also help to link up adjacent fragments. Consider the ex- |
|
||||
ample shown in Figure 8.32. Here one has probes for two DNA fragments that happen to |
|
||||
share a common polymorphic site. In a typical case the site is cut in some cell lines, par- |
|
||||
tially cut in |
others, and not cut at all in others. The pattern of bands seen by the |
two |
Figure 8.32 Effect of a polymorphic restriction site on the patterns of DNA fragments seen with two probes that flank this site (from Oliva et al., 1991). Hybridization of DNA from different cell lines (lanes 1–10) with two putatively linked probes (A and B) leads to the detection of different and common fragment sizes but identical polymorphism patterns.