Добавил:

fench Опубликованный материал нарушает ваши авторские права? Сообщите нам.

Вуз:

Сумский государственный университет

Предмет:

Генетика

Файл:

Genomics- The Science and Technology Behind the Human Genome Project. Charles R. Cantor, Cassandra L / genomics1-10 / 8

.pdf

Скачиваний:

Добавлен:

17.08.2013

Размер:

634.83 Кб

Скачать

☆

<<< < Предыдущая 1 23 / 63 4 5 6 > Следующая >>>

254	PHYSICAL		MAPPING
Comparison of this sequence with the				Dpn	I recognition site shown above indicates that a
Dpn	I cleavage site has been generated. An eight-base sequence is required to specify this
site by the procedure we have used. Since it contains two CpG’s, it will be a very rare site
in mammalian genomes. Thirty to 40 variations on this theme exist, some generating sites
as large as 16 base pairs. Some of these sites are predicted to occur less than once, on av-
erage, in most genomes. However, one difﬁculty with these schemes, is the lack of avail-
ability of many of the necessary methylases in sufﬁciently nuclease-free form to generate
the very large DNA fragments speciﬁed by such large recognition sites. At present the ex-
ploitation of		Dpn	I remains an	extremely attractive method that is not yet generally prac-
ticed because of some of these unsolved experimental limitations.
LINKING	CLONES

In this and the next few sections we discuss several of the methods that allow the order of restriction fragments to be determined, de novo, without access to other mapping information. By far the most robust of these, in many respects, is the use of specialized clones


called	linking clones.	These clones contain the same rare-cutting			sites that were used to
produce macrorestriction fragments from the sample to be mapped. As shown in Figure
8.16, because these clones have an internal rare-cutting site, they must overlap two adja-
cent large DNA fragments. (The sequence			properties of two authentic human linking
clones were illustrated in Figs. 8.10 and 8.11.) When used as hybridization probes, the
linking clones should identify two fragments that are adjacent in the genome. This will
occur unless the rare-cutting site is so			close to one edge	of the linking		clone	that not
enough single-copy material remains beyond			the site to hybridize effectively. To elimi-
nate this possibility, it is useful to work with more than one set of linking clones, con-
structed in such a way as to have different distributions of DNA ﬂanking the rare-cutting
site. A convenient way to do this is to start with separate total digest libraries made with
different restriction nucleases such as			Eco	R I and	Hin d III. In principle, there is sufﬁcient
information in linking clone hybridizations that if a complete linking library were avail-
able, its use in hybridizations would produce data that would allow the entire set of re-
striction fragments to be ordered.
Two of the methods that have been used to prepare libraries of linking clones are sum-
marized in Figure 8.17. In each case one starts with a small insert genomic library con-
tained into a circular vector that is lacking any sites for the restriction enzyme of interest.
If such a library does not preexist,			it is easily constructed by subcloning an existing li-
brary into such a vector.		DNA from the entire library is puriﬁed and digested				with	the
rare-cutting restriction nuclease. Only those clones containing a genomic insert that in-
cludes a site for this enzyme will be linearized; the rest remain circular. Two different
methods can be used to select out the clones that have been linearized. One approach is to
use trapping electrophoresis: ordinary or PFG electrophoresis at high ﬁelds such that lin-
ear molecules migrate into		the gel effectively while circles remain trapped in			the sample

Figure 8.16 A linking clone spans two adjacent large DNA fragments.

LINKING CLONES

255

Figure 8.17 Two procedures used to make libraries of linking clones. Both require starting with a library of the desired chromosome (or other target) in a circular vector.


well or sample plug. In this way the linking clones are selectively captured. They can be
religated at low concentration to avoid the production of chimeras, and then re-introduced
into a bacterial host.
The alternative approach, after the rare-site-containing clones have been linearized, is
to ligate a selectable marker into the rare-cutting site. Both suppressor tRNA genes and
kanamycin resistance genes have been successfully used for this purpose. The library is
then used to transform a bacterial host in the presence of the		selection. The only clones
that should survive are the linking clones. Both of the methods described work reasonably
well. However, the procedures are not perfect, and it is important, after a linking library
has been made, to screen out artifactual clones. These may include such annoyances as
chimeras of rare cutter fragments, clones that contain a rare cutter site that is not actually
cut in the parental genome because it is methylated, and clones containing small individ-
ual rare cutter fragments. One way to screen for useful clones is to use them as probes in
Southern hybridizations with genomic DNA singly and doubly digested with the restric-
tion enzyme used for constructing the library such as			Eco	R I or Hin	d III, and the rare-cut-
ting enzyme. Proper linking clones will show single bands or no bands in the absence of
rare cutter cleavage, and double bands once the rare site has been cut.
Once linking clones are available, they have many other useful applications besides the
ordering of macrorestriction fragments. Linking clones serve as effective bridges between
two different kinds of restriction nuclease digests (or other		types of DNA samples) be-
cause they come from a known position, the ends, of large DNA fragments. As a second
example, one can cut linking clones at their internal rare cutter site and isolate the two
fragments. The resulting samples are called		half-linking clones.			They are useful for the
analysis of the internal structure of large DNA fragments, as shown by the schematic ex-
periment illustrated in Figure 8.18. Here a		Not	I half-linking clone is used as an indirect
end label by hybridizating it to a blot of a PFG-fractionated partial					Eco R I digest of a	Not
I fragment. The sizes of the DNA pieces seen in the partial digest provide the location of
the internal	Eco R I sites. This type of analysis was originally developed for smaller DNA
pieces by Smith and Birnstiel. It is a very powerful approach. Half-linking clones are also
very useful for the analysis of partial digests with enzymes		that cut	infrequently,	as we
will demonstrate later.

256 PHYSICAL MAPPING

Figure 8.18 Use of a half-linking clone as an indirect end label to reveal the pattern of internal restriction sites in a partial digest. This approach to restriction mapping, called the Smith-Birnstiel method after the researchers who ﬁrst described it, remains the most powerful way to accumulate

large amounts of restriction map data rapidly.

	Another potential use of linking				clones will be to order complete digest libraries of
cloned large DNA fragments, once we are able to produce such libraries. The difﬁculty
today is that too large a percentage of the fragments generated from mammalian DNA by
an enzyme		like	Not	I	exceed the capacities of current large insert cloning vectors.
However, an example of the potential power of this approach can be seen for an enzyme
like	Eag	I	that recognizes	the	sequence CCGCCG, which is the internal 6 base pairs of
the	Not	I recognition sequence.			Eag	I cuts genomic DNA into fragments that average 200
to 300 kb in size. These fragments can be cloned into conventional YAC vectors once the
vectors are equipped with the proper					cloning sites. The result of probing such a library
with	Eag	I	linking clones	is shown in Figure 8.19. In principle, there is enough informa-
tion in the two sets of clones, linking clones and YACs, to completely order both sets of
samples, and the method has two advantages. The materials used are only a tiny bit larger
than the minimum possible tiling path					for any ordered library, and distances along this
path are known with great precision. In practice, there is no example of a library that has
been fully ordered in this way.
	One limitation with currently used linking clones must be realized. These clones tend
to come from very G				C rich regions because this is where the sites of most rare-cutting
enzymes are preferentially located. These regions make subsequent PCR analyses fairly
difﬁcult because, in current PCR protocols, very G						C rich primers and templates do not

work especially well. Secondary structure in the single strands of these materials is presumably the major cause of the problems, but a generally effective solution to these problems is not yet in hand.

Figure 8.19 Linking clones and large DNA fragments generated by the same enzyme could be assembled, in principle, into an ordered library with almost the minimum possible tiling length.

JUMPING LIBRARIES

257

Figure 8.20	Basic notion of a jumping clone: Two discontinuous pieces
of the genome (	a ) and ( b ), but related by some map or fragment informa-

tion, are assembled into the same clone.

JUMPING LIBRARIES

Jumping clones offer a way of dealing with two discontinuous pieces of a chromosome.

The basic notion of a jumping clone is shown in Figure 8.20. It is an ordinary small insert clone except that two distinct and distant pieces of the original DNA target are brought together and fused in the process of cloning. A simple case of a jumping clone would be

one that contained the two ends of a large DNA fragment, but all of the internal DNA of the fragment had been excised out. The way in which such clones can be made is shown

in Figure 8.21.

A genomic digest with a rare-cutting enzyme is diluted to very low concentration and ligated in the presence of a vector containing a selectable marker. The goal is to cyclize the large fragments around vector sequences. At sufﬁciently low concentrations of target, it becomes very unlikely that any intermolecular ligation of target fragments will occur. One can use excess vector, provided that it is dephosphorylated so that vector–vector ligation is not possible. When successful, ligations at low concentrations can produce very large DNA circles. These

are then digested with a restriction enzyme	that	has much more frequent sites in	the	target
DNA but no sites in the vector. This can	be	arranged by suitable design of	the	vector.

Figure 8.21 Method for producing a library of jumping clones derived from the ends of large

DNA fragments.

258 PHYSICAL MAPPING

Figure 8.22 Potential utility of jumping clones that span large DNA fragments.

The result is a mixture of relatively small DNA pieces;	only those near the junction of the
two ends of the original large DNA fragments contain vector sequences. A second ligation is
now carried out, again at very low concentration. This circularizes all of the DNAs in the
sample. When these are reintroduced into	E. coli, under conditions where selectable markers
on the vector are required for growth, a jumping library results.
There are potentially powerful uses for jumping libraries that span adjacent sites of in-
frequently cutting restriction enzymes. These are illustrated in Figure 8.22. Note that each
of the jumping clones will overlap two linking clones.	If one systematically examines
which linking clones share DNA sequences with jumping clones, and vice versa, the re-
sult will be to order all the linking clones and all the jumping clones. This could be done
by hybridization, PCR, or by direct DNA sequencing near the rare-cutting sites. In the lat-
ter case DNA sequence comparisons will reveal clone overlaps. Such an approach, called
sequence-tagged rare restriction sites	(STARs), becomes increasingly attractive as the
ease and throughput of automated DNA sequencing improves.
A more general form of jumping library is shown schematically in Figure 8.23. Here a
genome is partially digested by a relatively frequent cutting restriction nuclease. The re-
sulting, very complex, mixture is fractionated by PFG, and a very narrow size range of
material is selected. If this is centered about 500 kb, then the sample contains all contigu-
ous blocks of 500-kb DNA in the genome. This material is used to make a jumping li-
brary in the same general manner as described above, by	circularizing it around a vector
at low concentration, excising internal material, and recircularizing. The resulting library
is a size-selected jumping library, consisting, in	principle, of all discontinuous sets of
short DNA sequences spaced 500 kb apart in the genome. The major disadvantage of this
library is that it is very complex. However, it is	also very useful, as shown in Figure
8.24 a. Suppose that one has a marker in a region of interest, and one would like another
marker spaced approximately 500 kb away. The original	marker is used to screen a 500-

Figure 8.23

DNA preparation (

a ) used to generate a more general jumping library (

b ) consisting

of a very dense sampling of genomic fragments of a ﬁxed length.

PARTIAL DIGESTION

259


Figure 8.24	Example of the use of a general		jumping library to move rapidly from one site in a
genome to distantly spaced sites. (		a )	Half-jumping clones provide new probes, at known distances
from the starting probe. (		b ) Information about the direction of the jump can be preserved if the map

orientation of the original probe is known.

kb jumping library. This will result in the identiﬁcation of clones that can be cut into halfjumping probes that ﬂank the original marker by 500 kb on either side. It is possible to tell which jump has occurred in which direction if the original marker is oriented with re-

spect to some other markers in the genome and if it contains a convenient internal restric-

tion site (Fig. 8.24	b ). Selection of jumping clones by using portions	of the original
marker will let information about its orientation be preserved after the jump. The major
limitation in the use of	this otherwise very powerful approach is that it is relatively	hard
to make long jumps because it is difﬁcult to ligate such large DNA circles efﬁciently.

PARTIAL DIGESTION

In many regions of the human genome, it is difﬁcult to ﬁnd any restriction enzyme that consistently gives DNA fragments larger than 200 kb. In most regions, genomic restriction fragments are rarely more than a few Mb in size. This is quite inefﬁcient for lowresolution mapping, since PFG is capable of resolving DNA fragments up to 5 or 7 Mb in size. To take advantage of the very large size range of PFG, it is often extremely useful to do partial digests with enzymes that cut the genome infrequently. However, there is a basic problem in trying to analyze the result of such a digest to generate a unique map of the region. This problem is illustrated in Figure 8.25. If a single probe is used to examine the digest by hybridization after PFG fractionation, the probe will detect a mixture of DNA fragments that extends from its original location in both directions. It is not straightforward to analyze this mixture of products and deduce the map that generated it.

260 PHYSICAL MAPPING

Figure 8.25 Ambiguity in interpreting the fragment pattern seen in a partial restriction nuclease digest, when hybridized with only a single probe.

If the enzyme cut all sites with equal kinetics, one could probably use likelihood or

maximum entropy arguments to select the most likely fragment orderings consistent			with
a given digestion pattern. The difﬁculty is that many restriction enzymes appear to have
preferred cutting sites. In the case of	Sﬁ	I these are generated by the peculiarities of the
interrupted recognition sequence. In the case of		Not	I and many other enzymes, hot spots
for cutting appear to occur in regions where a	number of sites	are clustered very	closely
in the genome (Fig. 8.26). Since the possibility of a hot spot in a particular region is hard
to rule out, a priori, one plausible candidate for a restriction map that ﬁts the partial diges-
tion data is a null map in which all partially	cleaved sites lie to one side of the probe and
a fully cut hot spot lies to the other side (Fig. 8.27). While such a map is statistically im-
plausible in a random genome, it becomes quite reasonable		once the possibility of hot
spots is allowed.
There are three special cases of partial digest patterns that can be analyzed without the
complications we have just raised above. The ﬁrst of these		occurs when the region is
ﬂanked by a site that is cut in every molecule under examination. There are two ways to
generate this site (which is, in effect, a known hot spot). Suppose that the probe used to ana-
lyze the digest is very close to the end of a	chromosome (Fig.	8.28). Then the null map is

the correct map. Using a telomeric probe is equivalent to the Smith-Birnstiel mapping approach we discussed earlier for smaller DNA fragments. It is worth noting that even if the probe is not at the very end of the mapped region, any prior knowledge that it is close to the end will greatly simplify the analysis of partial digest data. A second case of such a site oc-

curs if	we can	integrate	into a	chromosome	the	recognition	sequence	of	a very	rare
cutting	enzyme,	such that	this	will be the	only	site of its	kind in	the	region	of interest.

Figure 8.26 Example of a restriction enzyme hot spot that can confound attempts to use partial digest mapping methods.

PARTIAL DIGESTION

261

Figure 8.27 Null ordering of fragments seen in a partial digest.

Figure 8.28 Assembling a restriction map from partial digest data when a totally digested site (or a chromosome end) is known to be present in the region.

Then we cleave at the very rare site completely	and partially digest with an enzyme that
has somewhat more frequent cleavage sites. The power of this approach as a rapid map-
ping method is considerable. However, it is dependent on the ability to drive the very rare
cleavage to completion. As we have indicated before, in many of the existing schemes for
very rare cleavage, total digestion cannot be guaranteed. Thierry and Dujon in Paris have
recently shown that sites for the nuclease I-		Sce	I (Table 8.1) can be inserted at convenient
densities in the yeast genome and that the enzyme cuts completely enough to make the
strategy we have just described very effective.
A second case occurs that allows analysis of partial digestion data if the region of in-
terest is next to a very large DNA fragment. Then, as shown in Figure 8.29, all DNA frag-
ments seen in the partial digest that have a size less than the very large fragment can be
ordered so long as the digest is probed by hybridization with a probe that is located on the
ﬁrst small piece next to the very large fragment. Again, as in the case of telomeric probes,
relaxing this constraint a bit still enables	considerable map	information		to be	inferred
from the digest.
The third case where partial digests can be analyzed occurs when one has two probes
known, a priori, to lie on adjacent DNA fragments. This is precisely the case in hand, for
example, when two half-linking clones are used as probes of a partial digest made by us-
ing the same rare site present on the clones. In this case, as shown in Figure 8.30, those
fragment sizes seen with one probe and not the other must extend in the direction of that
probe. Bands seen with both probes are generally not that		informative.		Thus	linking

clones play a key role in the efﬁcient analysis of partial digests.

Figure 8.29 Assembling a restriction map from partial digest data adjacent to a very large restriction fragment.

262 PHYSICAL MAPPING

Figure 8.30 Assembling a restriction map when two hybridization probes are available and are known to lie on adjacent DNA fragments. Such probes are available, for example, as the halves of a linking clone.

The power of the approaches that employ partial digests is that a probe in one location
can be used to reach out and obtain map data across considerable distances, where no other
probes may yet exist. In order to carry out these experiments, however, two requirements
must be met. First, reliable length standards and good	high-resolution PFG fractionations
are essential. The data afforded by a partial digest is all inherent in the lengths of the DNA
bands seen. For example, if one used a probe known to reside on a 400-kb band, and found
in a partial 1000-kb band, this is evidence that a 600-kb band neighbors the 400-kb band.
One could then try to isolate speciﬁc probes from the 600-kb region to try to prove this as-
signment. However, if the 1000-kb band was mis-sized, and it was really only 900 kb, when
one went to ﬁnd probes in the 600-kb region of a	gel, this would be the wrong region.
The second constraint for effective partial digest analysis is very sensitive hybridization pro-
tocols. The yields of DNA pieces in partials can easily be only 1 to 10%. Detecting these re-
quires hybridizations that are 10 to 100 times as sensitive as those needed for ordinary sin-
gle-copy DNA targets. Autoradiographic exposures of one to two weeks are not uncommon
in the analysis of partial digests with infrequently cutting enzymes.
EXPLOITING DNA POLYMORPHISMS TO ASSIST MAPPING
Suppose that two different DNA probes detect a		Not I fragment 800 kb long. How can we
tell if they lie on the same fragment, or if it just a coincidence and they derive from two
different fragments with sizes too similar to resolve? One approach is to cut the			Not I di-
gest with a second relatively rare-cutting enzyme. If the band seen by one probe shortens,
and the band seen by the other does not, we know two different fragments are involved. If
both bands shorten after the second digestion, the result is ambiguous, unless two differ-
ent size bands are seen by the two probes and the sum	of their sizes is greater than the
size of the band originally seen in the	Not	I digest.
A more reliable approach is to try the two different probes on DNA isolated from a se-
ries of different cell lines. In practice, eight cell lines with very different characteristics
usually sufﬁces. When this is done with a single DNA probe, usually one or more of the
lines shows a signiﬁcant size difference or cutting difference from the others (Fig. 8.31).
There are many potential origins for this polymorphism. Mammalian genomes are rampant
with tandem repeats, and these differ in size substantially		from individual to individual.

EXPLOITING DNA POLYMORPHISMS TO ASSIST MAPPING

263

Figure 8.31 Example of the polymorphism in large restriction fragments seen with a single-copy

DNA probe when a number of different cell lines are compared.

Most rare-cutting sites are methylation sensitive, and especially in			tissue	culture	cell
lines, a quite heterogeneous pattern of methylation frequently occurs. There are also fre-
quent genetic	polymorphisms at rare-cutting enzyme sites: These RFLPs		arise because
the sites contain CpGs that are potential mutation hotspots. For the purposes of map con-
struction, the source of the polymorphism is almost irrelevant. The basic idea				is that if
two different probes share the same pattern of polymorphisms across a series of cell lines,
whatever its cause, they almost certainly must derive from the same DNA fragment. One
caveat to this statement must be noted. Overloading a sample used for PFG analysis will
slow down all bands. If one cell line is overloaded, it will look like the apparent size of a
particular	Not	I band has increased, but in practice, this is an	artifact. To avoid this prob-
lem, it is best to work at very low DNA concentrations, especially when PFG is used to
analyze polymorphisms. Further details are given in Chapter 5.
Polymorphism patterns can also help to link up adjacent fragments. Consider the ex-
ample shown in Figure 8.32. Here one has probes for two DNA fragments that happen to
share a common polymorphic site. In a typical case the site is cut in some cell lines, par-
tially cut in	others, and not cut at all in others. The pattern of bands seen by the				two

Figure 8.32 Effect of a polymorphic restriction site on the patterns of DNA fragments seen with two probes that ﬂank this site (from Oliva et al., 1991). Hybridization of DNA from different cell lines (lanes 1–10) with two putatively linked probes (A and B) leads to the detection of different and common fragment sizes but identical polymorphism patterns.

<<< < Предыдущая 1 23 / 63 4 5 6 > Следующая >>>

Соседние файлы в папке genomics1-10

#
17.08.2013343.56 Кб583.pdf
#
17.08.2013296.13 Кб584.pdf
#
17.08.2013326.85 Кб585.pdf
#
17.08.2013406.31 Кб586.pdf
#
17.08.2013277.57 Кб587.pdf
#
17.08.2013634.83 Кб608.pdf
#
17.08.2013475.69 Кб599.pdf
#
17.08.2013192.47 Кб60booktext[1].pdf