Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:
Скачиваний:
57
Добавлен:
17.08.2013
Размер:
634.83 Кб
Скачать

254

PHYSICAL

MAPPING

 

 

Comparison of this sequence with the

Dpn

I recognition site shown above indicates that a

Dpn

I cleavage site has been generated. An eight-base sequence is required to specify this

site by the procedure we have used. Since it contains two CpG’s, it will be a very rare site

in mammalian genomes. Thirty to 40 variations on this theme exist, some generating sites

as large as 16 base pairs. Some of these sites are predicted to occur less than once, on av-

erage, in most genomes. However, one difficulty with these schemes, is the lack of avail-

ability of many of the necessary methylases in sufficiently nuclease-free form to generate

the very large DNA fragments specified by such large recognition sites. At present the ex-

ploitation of

Dpn

I remains an

extremely attractive method that is not yet generally prac-

ticed because of some of these unsolved experimental limitations.

 

LINKING

CLONES

 

 

 

 

In this and the next few sections we discuss several of the methods that allow the order of restriction fragments to be determined, de novo, without access to other mapping information. By far the most robust of these, in many respects, is the use of specialized clones

called

linking clones.

These clones contain the same rare-cutting

sites that were used to

produce macrorestriction fragments from the sample to be mapped. As shown in Figure

 

8.16, because these clones have an internal rare-cutting site, they must overlap two adja-

cent large DNA fragments. (The sequence

properties of two authentic human linking

clones were illustrated in Figs. 8.10 and 8.11.) When used as hybridization probes, the

linking clones should identify two fragments that are adjacent in the genome. This will

occur unless the rare-cutting site is so

close to one edge

of the linking

clone

that not

enough single-copy material remains beyond

the site to hybridize effectively. To elimi-

nate this possibility, it is useful to work with more than one set of linking clones, con-

structed in such a way as to have different distributions of DNA flanking the rare-cutting

site. A convenient way to do this is to start with separate total digest libraries made with

different restriction nucleases such as

Eco

R I and

Hin d III. In principle, there is sufficient

information in linking clone hybridizations that if a complete linking library were avail-

able, its use in hybridizations would produce data that would allow the entire set of re-

striction fragments to be ordered.

 

 

 

 

 

Two of the methods that have been used to prepare libraries of linking clones are sum-

marized in Figure 8.17. In each case one starts with a small insert genomic library con-

tained into a circular vector that is lacking any sites for the restriction enzyme of interest.

If such a library does not preexist,

it is easily constructed by subcloning an existing li-

brary into such a vector.

DNA from the entire library is purified and digested

with

the

rare-cutting restriction nuclease. Only those clones containing a genomic insert that in-

cludes a site for this enzyme will be linearized; the rest remain circular. Two different

methods can be used to select out the clones that have been linearized. One approach is to

use trapping electrophoresis: ordinary or PFG electrophoresis at high fields such that lin-

ear molecules migrate into

the gel effectively while circles remain trapped in

the sample

Figure 8.16 A linking clone spans two adjacent large DNA fragments.

LINKING CLONES

255

Figure 8.17 Two procedures used to make libraries of linking clones. Both require starting with a library of the desired chromosome (or other target) in a circular vector.

well or sample plug. In this way the linking clones are selectively captured. They can be

 

 

religated at low concentration to avoid the production of chimeras, and then re-introduced

 

 

into a bacterial host.

 

 

 

 

 

The alternative approach, after the rare-site-containing clones have been linearized, is

 

 

to ligate a selectable marker into the rare-cutting site. Both suppressor tRNA genes and

 

 

kanamycin resistance genes have been successfully used for this purpose. The library is

 

 

 

then used to transform a bacterial host in the presence of the

selection. The only clones

 

 

that should survive are the linking clones. Both of the methods described work reasonably

 

 

 

well. However, the procedures are not perfect, and it is important, after a linking library

 

 

has been made, to screen out artifactual clones. These may include such annoyances as

 

 

chimeras of rare cutter fragments, clones that contain a rare cutter site that is not actually

 

 

cut in the parental genome because it is methylated, and clones containing small individ-

 

 

ual rare cutter fragments. One way to screen for useful clones is to use them as probes in

 

 

Southern hybridizations with genomic DNA singly and doubly digested with the restric-

 

 

 

tion enzyme used for constructing the library such as

 

Eco

R I or Hin

d III, and the rare-cut-

 

ting enzyme. Proper linking clones will show single bands or no bands in the absence of

 

 

 

rare cutter cleavage, and double bands once the rare site has been cut.

 

 

 

 

Once linking clones are available, they have many other useful applications besides the

 

 

ordering of macrorestriction fragments. Linking clones serve as effective bridges between

 

 

 

two different kinds of restriction nuclease digests (or other

types of DNA samples) be-

 

 

cause they come from a known position, the ends, of large DNA fragments. As a second

 

 

 

example, one can cut linking clones at their internal rare cutter site and isolate the two

 

 

fragments. The resulting samples are called

half-linking clones.

They are useful for the

 

analysis of the internal structure of large DNA fragments, as shown by the schematic ex-

 

 

periment illustrated in Figure 8.18. Here a

Not

I half-linking clone is used as an indirect

 

end label by hybridizating it to a blot of a PFG-fractionated partial

 

 

Eco R I digest of a

Not

I fragment. The sizes of the DNA pieces seen in the partial digest provide the location of

 

 

the internal

Eco R I sites. This type of analysis was originally developed for smaller DNA

 

 

pieces by Smith and Birnstiel. It is a very powerful approach. Half-linking clones are also

 

 

very useful for the analysis of partial digests with enzymes

that cut

infrequently,

as we

 

 

will demonstrate later.

 

 

 

 

 

256 PHYSICAL MAPPING

Figure 8.18 Use of a half-linking clone as an indirect end label to reveal the pattern of internal restriction sites in a partial digest. This approach to restriction mapping, called the Smith-Birnstiel method after the researchers who first described it, remains the most powerful way to accumulate

large amounts of restriction map data rapidly.

 

Another potential use of linking

clones will be to order complete digest libraries of

cloned large DNA fragments, once we are able to produce such libraries. The difficulty

today is that too large a percentage of the fragments generated from mammalian DNA by

an enzyme

like

Not

I

exceed the capacities of current large insert cloning vectors.

However, an example of the potential power of this approach can be seen for an enzyme

like

Eag

I

that recognizes

the

sequence CCGCCG, which is the internal 6 base pairs of

the

Not

I recognition sequence.

Eag

I cuts genomic DNA into fragments that average 200

to 300 kb in size. These fragments can be cloned into conventional YAC vectors once the

vectors are equipped with the proper

cloning sites. The result of probing such a library

with

Eag

I

linking clones

is shown in Figure 8.19. In principle, there is enough informa-

tion in the two sets of clones, linking clones and YACs, to completely order both sets of

samples, and the method has two advantages. The materials used are only a tiny bit larger

than the minimum possible tiling path

for any ordered library, and distances along this

path are known with great precision. In practice, there is no example of a library that has

been fully ordered in this way.

 

 

 

 

One limitation with currently used linking clones must be realized. These clones tend

to come from very G

C rich regions because this is where the sites of most rare-cutting

enzymes are preferentially located. These regions make subsequent PCR analyses fairly

difficult because, in current PCR protocols, very G

C rich primers and templates do not

work especially well. Secondary structure in the single strands of these materials is presumably the major cause of the problems, but a generally effective solution to these problems is not yet in hand.

Figure 8.19 Linking clones and large DNA fragments generated by the same enzyme could be assembled, in principle, into an ordered library with almost the minimum possible tiling length.

JUMPING LIBRARIES

257

Figure 8.20

Basic notion of a jumping clone: Two discontinuous pieces

of the genome (

a ) and ( b ), but related by some map or fragment informa-

tion, are assembled into the same clone.

JUMPING LIBRARIES

Jumping clones offer a way of dealing with two discontinuous pieces of a chromosome.

The basic notion of a jumping clone is shown in Figure 8.20. It is an ordinary small insert clone except that two distinct and distant pieces of the original DNA target are brought together and fused in the process of cloning. A simple case of a jumping clone would be

one that contained the two ends of a large DNA fragment, but all of the internal DNA of the fragment had been excised out. The way in which such clones can be made is shown

in Figure 8.21.

A genomic digest with a rare-cutting enzyme is diluted to very low concentration and ligated in the presence of a vector containing a selectable marker. The goal is to cyclize the large fragments around vector sequences. At sufficiently low concentrations of target, it becomes very unlikely that any intermolecular ligation of target fragments will occur. One can use excess vector, provided that it is dephosphorylated so that vector–vector ligation is not possible. When successful, ligations at low concentrations can produce very large DNA circles. These

are then digested with a restriction enzyme

that

has much more frequent sites in

the

target

DNA but no sites in the vector. This can

be

arranged by suitable design of

the

vector.

Figure 8.21 Method for producing a library of jumping clones derived from the ends of large

DNA fragments.

258 PHYSICAL MAPPING

Figure 8.22 Potential utility of jumping clones that span large DNA fragments.

The result is a mixture of relatively small DNA pieces;

only those near the junction of the

two ends of the original large DNA fragments contain vector sequences. A second ligation is

now carried out, again at very low concentration. This circularizes all of the DNAs in the

sample. When these are reintroduced into

E. coli, under conditions where selectable markers

on the vector are required for growth, a jumping library results.

There are potentially powerful uses for jumping libraries that span adjacent sites of in-

frequently cutting restriction enzymes. These are illustrated in Figure 8.22. Note that each

of the jumping clones will overlap two linking clones.

If one systematically examines

which linking clones share DNA sequences with jumping clones, and vice versa, the re-

sult will be to order all the linking clones and all the jumping clones. This could be done

by hybridization, PCR, or by direct DNA sequencing near the rare-cutting sites. In the lat-

ter case DNA sequence comparisons will reveal clone overlaps. Such an approach, called

sequence-tagged rare restriction sites

(STARs), becomes increasingly attractive as the

ease and throughput of automated DNA sequencing improves.

 

A more general form of jumping library is shown schematically in Figure 8.23. Here a

genome is partially digested by a relatively frequent cutting restriction nuclease. The re-

sulting, very complex, mixture is fractionated by PFG, and a very narrow size range of

material is selected. If this is centered about 500 kb, then the sample contains all contigu-

ous blocks of 500-kb DNA in the genome. This material is used to make a jumping li-

brary in the same general manner as described above, by

circularizing it around a vector

at low concentration, excising internal material, and recircularizing. The resulting library

is a size-selected jumping library, consisting, in

principle, of all discontinuous sets of

short DNA sequences spaced 500 kb apart in the genome. The major disadvantage of this

library is that it is very complex. However, it is

also very useful, as shown in Figure

8.24 a. Suppose that one has a marker in a region of interest, and one would like another

marker spaced approximately 500 kb away. The original

marker is used to screen a 500-

Figure 8.23

DNA preparation (

a ) used to generate a more general jumping library (

b ) consisting

of a very dense sampling of genomic fragments of a fixed length.

PARTIAL DIGESTION

259

Figure 8.24

Example of the use of a general

jumping library to move rapidly from one site in a

genome to distantly spaced sites. (

a )

Half-jumping clones provide new probes, at known distances

from the starting probe. (

b ) Information about the direction of the jump can be preserved if the map

orientation of the original probe is known.

kb jumping library. This will result in the identification of clones that can be cut into halfjumping probes that flank the original marker by 500 kb on either side. It is possible to tell which jump has occurred in which direction if the original marker is oriented with re-

spect to some other markers in the genome and if it contains a convenient internal restric-

tion site (Fig. 8.24

b ). Selection of jumping clones by using portions

of the original

marker will let information about its orientation be preserved after the jump. The major

limitation in the use of

this otherwise very powerful approach is that it is relatively

hard

to make long jumps because it is difficult to ligate such large DNA circles efficiently.

 

PARTIAL DIGESTION

In many regions of the human genome, it is difficult to find any restriction enzyme that consistently gives DNA fragments larger than 200 kb. In most regions, genomic restriction fragments are rarely more than a few Mb in size. This is quite inefficient for lowresolution mapping, since PFG is capable of resolving DNA fragments up to 5 or 7 Mb in size. To take advantage of the very large size range of PFG, it is often extremely useful to do partial digests with enzymes that cut the genome infrequently. However, there is a basic problem in trying to analyze the result of such a digest to generate a unique map of the region. This problem is illustrated in Figure 8.25. If a single probe is used to examine the digest by hybridization after PFG fractionation, the probe will detect a mixture of DNA fragments that extends from its original location in both directions. It is not straightforward to analyze this mixture of products and deduce the map that generated it.

260 PHYSICAL MAPPING

Figure 8.25 Ambiguity in interpreting the fragment pattern seen in a partial restriction nuclease digest, when hybridized with only a single probe.

If the enzyme cut all sites with equal kinetics, one could probably use likelihood or

maximum entropy arguments to select the most likely fragment orderings consistent

with

a given digestion pattern. The difficulty is that many restriction enzymes appear to have

preferred cutting sites. In the case of

Sfi

I these are generated by the peculiarities of the

interrupted recognition sequence. In the case of

Not

I and many other enzymes, hot spots

for cutting appear to occur in regions where a

number of sites

are clustered very

closely

in the genome (Fig. 8.26). Since the possibility of a hot spot in a particular region is hard

to rule out, a priori, one plausible candidate for a restriction map that fits the partial diges-

tion data is a null map in which all partially

cleaved sites lie to one side of the probe and

a fully cut hot spot lies to the other side (Fig. 8.27). While such a map is statistically im-

plausible in a random genome, it becomes quite reasonable

once the possibility of hot

spots is allowed.

 

 

 

There are three special cases of partial digest patterns that can be analyzed without the

complications we have just raised above. The first of these

occurs when the region is

flanked by a site that is cut in every molecule under examination. There are two ways to

generate this site (which is, in effect, a known hot spot). Suppose that the probe used to ana-

lyze the digest is very close to the end of a

chromosome (Fig.

8.28). Then the null map is

the correct map. Using a telomeric probe is equivalent to the Smith-Birnstiel mapping approach we discussed earlier for smaller DNA fragments. It is worth noting that even if the probe is not at the very end of the mapped region, any prior knowledge that it is close to the end will greatly simplify the analysis of partial digest data. A second case of such a site oc-

curs if

we can

integrate

into a

chromosome

the

recognition

sequence

of

a very

rare

cutting

enzyme,

such that

this

will be the

only

site of its

kind in

the

region

of interest.

Figure 8.26 Example of a restriction enzyme hot spot that can confound attempts to use partial digest mapping methods.

PARTIAL DIGESTION

261

Figure 8.27 Null ordering of fragments seen in a partial digest.

Figure 8.28 Assembling a restriction map from partial digest data when a totally digested site (or a chromosome end) is known to be present in the region.

Then we cleave at the very rare site completely

and partially digest with an enzyme that

has somewhat more frequent cleavage sites. The power of this approach as a rapid map-

ping method is considerable. However, it is dependent on the ability to drive the very rare

cleavage to completion. As we have indicated before, in many of the existing schemes for

very rare cleavage, total digestion cannot be guaranteed. Thierry and Dujon in Paris have

recently shown that sites for the nuclease I-

 

Sce

I (Table 8.1) can be inserted at convenient

densities in the yeast genome and that the enzyme cuts completely enough to make the

strategy we have just described very effective.

 

 

 

 

 

A second case occurs that allows analysis of partial digestion data if the region of in-

terest is next to a very large DNA fragment. Then, as shown in Figure 8.29, all DNA frag-

ments seen in the partial digest that have a size less than the very large fragment can be

ordered so long as the digest is probed by hybridization with a probe that is located on the

first small piece next to the very large fragment. Again, as in the case of telomeric probes,

relaxing this constraint a bit still enables

considerable map

information

to be

inferred

from the digest.

 

 

 

 

 

The third case where partial digests can be analyzed occurs when one has two probes

known, a priori, to lie on adjacent DNA fragments. This is precisely the case in hand, for

example, when two half-linking clones are used as probes of a partial digest made by us-

ing the same rare site present on the clones. In this case, as shown in Figure 8.30, those

fragment sizes seen with one probe and not the other must extend in the direction of that

probe. Bands seen with both probes are generally not that

informative.

Thus

linking

clones play a key role in the efficient analysis of partial digests.

Figure 8.29 Assembling a restriction map from partial digest data adjacent to a very large restriction fragment.

262 PHYSICAL MAPPING

Figure 8.30 Assembling a restriction map when two hybridization probes are available and are known to lie on adjacent DNA fragments. Such probes are available, for example, as the halves of a linking clone.

The power of the approaches that employ partial digests is that a probe in one location

 

can be used to reach out and obtain map data across considerable distances, where no other

 

probes may yet exist. In order to carry out these experiments, however, two requirements

 

must be met. First, reliable length standards and good

high-resolution PFG fractionations

 

are essential. The data afforded by a partial digest is all inherent in the lengths of the DNA

 

bands seen. For example, if one used a probe known to reside on a 400-kb band, and found

 

in a partial 1000-kb band, this is evidence that a 600-kb band neighbors the 400-kb band.

 

One could then try to isolate specific probes from the 600-kb region to try to prove this as-

 

signment. However, if the 1000-kb band was mis-sized, and it was really only 900 kb, when

 

one went to find probes in the 600-kb region of a

gel, this would be the wrong region.

 

The second constraint for effective partial digest analysis is very sensitive hybridization pro-

 

tocols. The yields of DNA pieces in partials can easily be only 1 to 10%. Detecting these re-

 

quires hybridizations that are 10 to 100 times as sensitive as those needed for ordinary sin-

 

gle-copy DNA targets. Autoradiographic exposures of one to two weeks are not uncommon

 

in the analysis of partial digests with infrequently cutting enzymes.

 

EXPLOITING DNA POLYMORPHISMS TO ASSIST MAPPING

 

 

 

Suppose that two different DNA probes detect a

 

Not I fragment 800 kb long. How can we

 

tell if they lie on the same fragment, or if it just a coincidence and they derive from two

 

different fragments with sizes too similar to resolve? One approach is to cut the

Not I di-

gest with a second relatively rare-cutting enzyme. If the band seen by one probe shortens,

 

and the band seen by the other does not, we know two different fragments are involved. If

 

both bands shorten after the second digestion, the result is ambiguous, unless two differ-

 

ent size bands are seen by the two probes and the sum

of their sizes is greater than the

 

size of the band originally seen in the

Not

I digest.

 

A more reliable approach is to try the two different probes on DNA isolated from a se-

 

ries of different cell lines. In practice, eight cell lines with very different characteristics

 

usually suffices. When this is done with a single DNA probe, usually one or more of the

 

lines shows a significant size difference or cutting difference from the others (Fig. 8.31).

 

There are many potential origins for this polymorphism. Mammalian genomes are rampant

 

with tandem repeats, and these differ in size substantially

from individual to individual.

 

EXPLOITING DNA POLYMORPHISMS TO ASSIST MAPPING

263

Figure 8.31 Example of the polymorphism in large restriction fragments seen with a single-copy

DNA probe when a number of different cell lines are compared.

Most rare-cutting sites are methylation sensitive, and especially in

tissue

culture

cell

lines, a quite heterogeneous pattern of methylation frequently occurs. There are also fre-

 

quent genetic

polymorphisms at rare-cutting enzyme sites: These RFLPs

arise because

 

the sites contain CpGs that are potential mutation hotspots. For the purposes of map con-

 

struction, the source of the polymorphism is almost irrelevant. The basic idea

is that if

two different probes share the same pattern of polymorphisms across a series of cell lines,

 

whatever its cause, they almost certainly must derive from the same DNA fragment. One

 

caveat to this statement must be noted. Overloading a sample used for PFG analysis will

 

slow down all bands. If one cell line is overloaded, it will look like the apparent size of a

particular

Not

I band has increased, but in practice, this is an

artifact. To avoid this prob-

lem, it is best to work at very low DNA concentrations, especially when PFG is used to

 

analyze polymorphisms. Further details are given in Chapter 5.

 

 

 

Polymorphism patterns can also help to link up adjacent fragments. Consider the ex-

 

ample shown in Figure 8.32. Here one has probes for two DNA fragments that happen to

 

share a common polymorphic site. In a typical case the site is cut in some cell lines, par-

 

tially cut in

others, and not cut at all in others. The pattern of bands seen by the

two

Figure 8.32 Effect of a polymorphic restriction site on the patterns of DNA fragments seen with two probes that flank this site (from Oliva et al., 1991). Hybridization of DNA from different cell lines (lanes 1–10) with two putatively linked probes (A and B) leads to the detection of different and common fragment sizes but identical polymorphism patterns.

Соседние файлы в папке genomics1-10