Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:
Скачиваний:
46
Добавлен:
17.08.2013
Размер:
475.69 Кб
Скачать

 

HYBRID MAPPING STRATEGIES: CROSS-CONNECTIONS BETWEEN

LIBRARIES

295

BOX

9.1

(Continued)

 

 

 

 

Liquid-handling robots such as the Beckman Biomec, the Hewlett-Packard Orca

 

chemical robot, among others, can use microtitre plates singly or several

at

a time.

 

More complex

sets of plates can be handled by storage and retrieval

from

vertical

 

racks called microtitre plate hotels. A number of custom-made robots have also been

 

built to handle specialized aspects of microtitre plate manipulation efficiently, such as

 

plate duplication, custom sample pooling, and array making from plates. All of these

 

instruments share a number of common design features. Plate wells can be filled or

 

sampled singly

with individual pipetting devices, addressable in the

 

 

x-yplane. Rows

and columns can be sampled or fed by multiple-headed pipetors. Entire plates can be

 

filled in a single step by 96-head pipetors. This is done, for instance, when all the wells

 

must

be filled with the same sample medium for cell growth or the same buffer solu-

 

tion for PCR.

 

Most standard biochemical and microbiological manipulations can be carried out in

 

the wells of the microtitre plates. A sterile atmosphere can be provided in various ways

 

to allow colony inoculation, growth, and monitoring by absorbance. Temperature con-

 

 

trol allows incubation or PCR. Solid state DNA preparations such as agarose plug

preparations of large DNA, or immobilized magnetic microbead-based preparations of

 

 

plasmid DNA or PCR samples are all easily adapted to a microtitre plate format. The

result is that hundreds to thousands of samples can be prepared at once. Standard liq-

 

uid phase preparations of DNA are much more difficult to automate in the microtitre

 

plate format because they usually require centrifugation. While centrifuges have been

 

built that handle microtitre plates, loading and unloading them is tedious.

 

 

 

 

 

 

For

microbiological

samples,

automated

colony

pickers

have

been

built

that

start

with

a

conventional

array

of

clones

or

plaques

 

in

an

ordinary petri

dish

(or

a rectangular dish for more accurate mechanical positioning), optically

detect the

colonies, pick them one at a time by poking with a sharp object, and transfer them to a

 

rectilinear array in microtitre plates. The rate-limiting step in most automated handling

 

of

bacteria

or

yeast

colonies

appears

to

be

sterilization

of

the

sharp

object

used

for

picking,

 

which

must

be

done

after

each

step

 

to

avoid

cross-contamination.

With liquid handling, cross-contamination can also be a problem in many applications.

 

Here one has the choice of extensive rinsing of

the pipet tips between each sample,

 

which is time-consuming, or

the

use

of

disposable

pipet

tips,

which

is

very

costly.

As we gain more experience with this type of automation, more clever designs are sure

 

to emerge that improve the throughput by parallelizing some of the steps. A typical ex-

 

ample would be to have multiple sample tips or pipetors so that some are being steril-

 

ized or rinsed off line while others are being used. At present, most of the robots that

have been developed to aid mapping and sequencing are effective but often painfully

 

slow.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Dense sample arrays on filters are usually

made by using offset printing to com-

press microtitre plate arrays. An example is shown in Figure 9.7. Here a 96-pin tool is

used to sample a 96-well microtitre plate of DNA samples, and stamp its image onto a

filter, for subsequent hybridization. Because the pin tips are much smaller than the mi-

 

crotitre plate wells, it is possible to intersperse the impressions of many plates to place

all of these within the same area originally occupied by a single plate. With presently

available robots, usually the maximum compression

attained

is

16-fold

(a

4

 

 

4 array

of 96 well images). This leads to a 3

 

 

 

 

5 cm filter area with about 1600 samples. Thus

a 10

10

cm filter can hold about

10

 

 

 

 

 

4 samples,

more than

enough for most current

(continued)

296 ENHANCED METHODS FOR PHYSICAL MAPPING

BOX 9.1

(Continued)

 

 

 

Figure 9.7

 

 

Making a dense array of samples by offset spotting of a more dilute array.

applications. Most dense arrays that have been made thus far for mapping projects are

random. That is, no information is available about any of the DNA samples at the time

 

the

array

is made. As the map develops, the array becomes much more informative,

 

but the

 

x, yindexes of each sample have only historical significance.

 

 

 

 

 

Once

a map has been completed, it is convenient to reconfigure the array

of

sam-

ples so that they are placed in the actual order dictated by the map. While this is not

absolutely necessary, it does allow for visual inspection of a hybridization to be in-

stantly interpretable in many cases. There is no difficulty in instructing a robot to re-

configure an array. This procedure needs to be done only once, however

slow

the

 

process, and then replicas of the new configuration can be made rapidly. The great mo-

 

 

tivation

for achieving

more

compressed arrays is sample storage. Many

clones

and

most

DNA

samples are stored at low temperature. Large libraries of

samples

can

rapidly saturate all available temperature-controlled laboratory storage

space,

espe-

cially if a large number of replicas is made for subsequent distribution.

 

 

 

 

 

select out human material from hybrid cell lines or to reduce the complexity of a probe to

desirable levels. Some of these techniques were described in Chapter 4;

others

will be

dealt with in Chapter 14. It is worth noting that randomly chosen oligonucleotides often

make very effective probes for fingerprinting. For example, a randomly chosen 10-mer

should detect 1 out of every 25 cosmid clones. A key issue that we will

address

later in

this chapter is whether it is possible to increase the efficiency of handling probes by pool-

ing them instead of using them individually.

 

 

 

 

SCREENING

BY

PCR

VERSUS HYBRIDIZATION

 

 

 

 

A key variable in contemporary mapping efforts is whether the connections between probes

 

 

and samples are made by hybridization or by PCR. The two techniques have compensating

 

 

disadvantages

and

advantages,

and

the choice of which to use will depend on the nature of

 

the samples available and the complexity of the mapping project. We will consider these dif-

 

ferences at the level of the target, the probe, and the organization of the project.

 

 

 

 

The sensitivity of PCR at detecting small amounts of target is unquestionably greater

than hybridization. The advantage of being able to use smaller samples

is

that

with

presently

available methods,

it

is much easier to automate the preparation

of relatively

small DNA samples. It is also cheaper to make such samples. When pools of targets are used, the greater sensitivity of PCR allows more complex pools with larger numbers of samples to be used. What matters for detection is the concentration of the particular target

 

 

 

 

 

 

 

SCREENING

BY

PCR

VERSUS

HYBRIDIZATION

297

DNA that will be detected by one method or the other. With PCR this

concentration

can

 

be almost arbitrarily low, so long as there are not contaminants that will give an unaccept-

 

 

able PCR background.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Hybridization has the advantage when a large number of

physically

discrete

samples

 

 

(or pools) must be examined simultaneously. We have already described the use of dense

 

 

 

arrays of samples on filters to process in parallel large numbers of targets in hybridization

 

 

against a single probe. These arrays can easily contain 10

 

 

 

 

 

 

 

 

 

4 samples. In comparison, typi-

 

cal PCR reactions must be done in individually isolated liquid samples. Microtitre plates

 

handle

around 10

2

of these. Devices

have

been built

that

can

do

thousands of

simultane-

 

ous PCRs, but these are large; with objects on such a scale one could easily handle 10

5

 

samples at once by hybridization.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

At the level of the probe, PCR is more demanding than hybridization. Hybridization

 

probes can be made by PCR or by radiolabeling any DNA sample available. Typical PCR

 

 

 

analyses require a knowledge of the DNA sequence of the target. In contrast, hybridiza-

 

tion requires only a sample

of the DNA that corresponds to one element

of

the

target.

 

PCR primers require custom synthesis, which is still expensive and time-consuming al-

 

 

though recent progress in automated synthesis has lowered the unit cost of these materials

 

 

 

considerably. PCR with large numbers of different primers is not very convenient because

 

 

 

in most current protocols the PCR reactions must be done individually or at most in small

 

 

pools. Pooling of probes (or using very complex probes) is a powerful way to speed up

 

 

mapping, as we will illustrate later in this chapter. However, in PCR, primers are difficult

 

to pool. The reason is that with increasing numbers of PCR primers, the possible set of

 

reactions rises as the square of the number of primers: Each primer, in principle, could

 

amplify with any other primer if a suitable target were present. Thus the expected back-

 

 

ground will increase as the square of the number of primers. The desired signal will in-

 

 

crease only linearly with the number of primers. Clearly this rapidly becomes a losing

 

 

proposition. In contrast, it is relatively easier to use many hybridization probes simultane-

 

ously, since here the background will increase only linearly with the number of probes.

 

 

 

 

Both PCR and hybridization schemes lend themselves to

large

scale

organized

 

projects, but the implications and mechanics are very different. With hybridization, filter

 

replicas of an array can be

sent out to a set of distant users. However, the power

of

the

 

array increases, the more about it one knows. Therefore, for an array to have optimal im-

 

 

pact, it is highly desirable that all results of probe hybridizations against it be compiled in

 

one centrally accessible location. In practice, this lends itself

 

to schemes in place in

 

Europe where the array hybridizations are actually done in central locations, and data are

 

 

compiled there. Someone who wishes to interrogate an array with a probe, mails in that

 

 

probe to a central site.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

With

PCR screening,

the key

information

is the

sequences

of

the

DNA

primers.

 

This information can easily be compiled and stored on a centrally

accessible

database.

 

 

Users simply have to access this database, and either make the primers needed or obtain

 

 

them from others who have already made them. This allows a large number of individual

 

 

 

 

laboratories to use the map, as it develops, and to participate in the mapping without any

 

 

kind of elaborate distribution scheme for samples and without centralized experimental

 

 

facilities. The PCR-based screening approach has thus far been

 

more

popular

in

the

 

United States.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

In the long run it would

be nice to have a hybrid approach that

blends the

advantages

 

 

of both PCR and hybridization. One way to think about this

 

would be to use small

 

primers

or

repeated

sequence

primers

to reduce

the

complexity

of

complex

probes

by

 

 

 

298

 

ENHANCED METHODS FOR PHYSICAL MAPPING

 

 

 

 

 

 

sampling

either useful portions of pools of probes or

large insert

clones

such as YACs.

 

This would greatly increase the efficiency of PCR for handling complex pools and also

 

 

decrease the cost of making large numbers of custom, specific PCR primers. What is still

 

needed for an optimal method is spatially resolved PCR. The idea

is

to

have a method

 

that would allow one to probe a filter array of samples directly by PCR without having to

 

set up a separate PCR reaction for each sample. However, in order for this to be efficient,

 

the

PCR products from each element of the array have

to be kept from

mixing. While

 

some progress at developing in situ PCR has been reported, it is not yet clear that this

 

methodology is generally applicable for mapping. One nice approach is to do PCR inside

 

 

permeabilized cells, and then the PCR products

are

retained inside

the

cells. However,

 

this procedure, thus far, cannot be carried out to high levels of amplification (Teo and

 

Shaunak, 1995).

 

 

 

 

 

 

 

 

 

 

TIERED

SETS

OF

SAMPLES

 

 

 

 

 

 

 

 

 

In cross-connecting different sets of DNA samples

and

libraries,

it

is

helpful

to

have

 

dense sets of targets that span a range of sizes in an orderly way. The rationale behind this

 

statement is illustrated in Figure 9.8. The figure shows an attempt to connect two sets of

 

total digest fragments or clones by cross-hybridization or some other kind of complemen-

 

 

tary fingerprinting. When large fragment

 

 

J is used as a probe, it detects three smaller frag-

 

ments

a, b,and

c but does not indicate their order. When the smaller fragments

a, b,and

c

are

used

as probes,

b detects only

J,which means

that

 

b

is the central small fragment;

a

and

b

each detect additional flanking larger fragments, so they must be external. A gener-

 

alization of this argument indicates that it is more efficient if one has access to a progres-

 

sion of samples where average sizes diminish roughly by factors of three. Less sharp size

 

decreases will mean an unnecessarily large number

of different sets of samples. Larger

 

size decreases will mean that ordering of each smaller set will be too ambiguous.

 

 

 

 

The same kinds of arguments are applicable in more complex cases where overlapping

 

 

 

sets of fragments or clones exist. Consider the example shown in Figure 9.9. The objec-

 

tive is to subdivide a 500 kb YAC into cosmid clones so that these can be used as more

 

convenient sources of DNA for finding polymorphisms or for sequencing. The traditional

 

 

 

approach to this problem would have been to subclone the purified YAC DNA into cos-

 

 

mids. However, this involves a tremendous amount of

work. The more modern approach

 

 

 

is to start with a YAC contig flanking the clone of interest. This contig will automatically

 

arise in

the context of producing an ordered YAC

library. With a fivefold redundant

array

 

of YACs, the contig will have typically five members in this region. Therefore, if these

 

members

are used separately for hybridization or fingerprinting, they will divide the re-

 

gion into intervals that average about 100 kb in size. Each interval will serve as a bin to

 

assign cosmids

to the

region. Since the intervals

are

less than three times

the cosmid

in-

 

Figure 9.8 Clone ordering when tiered sets of clones are available.

 

 

 

 

TIERED SETS OF SAMPLES

299

sert size, the ordering should be reasonably effective. However, the key point in the over-

 

all strategy is that one need not make the YAC contig in advance. Cross-connecting the

 

various sets of samples in parallel will eventually provide all of the information needed to

 

order all of them.

 

 

 

The example shown in Figure 9.9 is a somewhat difficult case because the size step

 

taken, from 500 kb clones to 40 kb clones, is a big one. It would be better to have one or

 

two tiers of samples in between; say 250 kb YACs and 100 kb P1 clones. This will com-

 

pensate quite a bit for the inevitable irregularities in the distribution and coverage of each

 

of the libraries in particular regions. An analogy that may not be totally farfetched is that

 

the intermediate tiers of samples can help strengthen

the ordering process in the

same

 

manner as intermediate levels in a neural net can enhance its performance (see Chapter

 

15).

 

 

 

 

 

YAC and cosmid libraries involve large numbers of clones, and it would be very ineffi-

 

cient to handle such samples one at a time. We have already shown that cosmids can be

 

handled

very

efficiently as dense arrays of DNAs on filters. YACs are less easily handled

 

in this manner. The amount of specific sample DNA is much less, since typical YACs are

 

single-copy clones in a 13-Mb genome background,

while cosmids are multicopy

in a

 

4.8-Mb background. Thus hybridization screening of arrays of YAC clones has not always

 

been very successful. It would greatly help if there were an effective way to purify the

 

YAC DNA away from the yeast genomic DNA. Automated procedures already have been

 

 

developed to purify total DNA from many YAC-bearing strains at once (Box 9.1). Now

 

what is needed is a simple automatable method for plucking the YACs out of this mixture.

 

An alternative to purification would be YAC amplification. As described earlier in this

 

chapter, this is possible but not yet widely used, and

the amount of amplification is

still

 

only modest. Perhaps the most effective method currently available for increasing our

 

sensitivity of working with YAC DNA is PCR.

In Chapter 14 we will illustrate how

 

human-specific PCR, based on repeating sequence primers like

Alu ’s, can be used

to see

just the human DNA in a complex sample. Almost every YAC is expected to contain mul-

 

tiple

Alu

repeats. It is possible to do hundreds of PCR reactions simultaneously with com-

 

monly available thermal cyclers, and ten to perhaps a hundred times larger number

of

 

samples is manageable with equipment that has been designed and built specially for this

 

purpose.

 

 

 

 

 

Figure 9.9

Division of a YAC into a contig of cosmids by taking advantage of other YACs known

to be contiguous.

 

300

ENHANCED METHODS FOR PHYSICAL MAPPING

 

 

 

SIMPLE

POOLING

STRATEGIES

FOR

FINDING A

CLONE

 

 

 

OF INTEREST

 

 

 

 

 

 

 

 

 

A major problem in genome analysis is to find in a library the clones that correspond to a

 

probe of interest. This is typically the situation one faces in trying to find clones in a re-

 

gion marked by a specific probe that is suspected to be near a gene of interest. With cos-

 

mid clones, one screens a filter array, as illustrated earlier. With YAC clones, filter arrays

 

have not worked well in many hands, and instead, PCR is used. But with today’s methods

 

it is inconvenient and expensive to analyze the YACs in a large library, individually, by

 

PCR. For example, a single coverage 300-kb human genomic YAC library is 10

4 clones.

Fivefold coverage would require 5

 

 

104 clones. This number of PCR reactions is still

daunting. However, if the objective is to find a small number of clones in the library that

 

overlap a single probe, there are more efficient schemes for searching the library. These

 

involve pooling clones and doing PCR reactions

on

the

pools instead of

on individual

 

clones.

 

 

 

 

 

 

 

 

 

 

One of the simplest and most straightforward

YAC

pooling schemes

involves

three

 

tiers of samples (Fig. 9.10). The YAC library is normally distributed in 96-well microtitre

 

plates (Fig. 9.6 a ). Thus 100 plates would be required to hold 10

 

4 clones. Pools are made

from the clones on each plate. This can be done by sampling them individually or by us-

 

ing multiple headed pipetors or other tools as described in Box 9.1. Each plate pool con-

 

tains 96 samples. The plate pools are combined ten at a time to make super pools. First,

 

ten super pools are screened by PCR. This takes 10 PCR reactions (or 50 if a fivefold re-

 

dundant library is used). Each positive superpool is then screened by subsequent, separate

 

PCR reactions of each of the ten plate pools

it contains. In turn, each plate pool that

 

shows a positive PCR is divided into 12 column pools (of 9 YACs each) and, separately, 9

 

row pools (of 12 YACs each), and a separate PCR analysis is done on each of these sam-

 

ples. In

most cases

this should

result

in a unique

row-column positive combination

that

 

serves to identify the single positive YAC that has been responsible for the entire tier of PCR amplifications. Each positive clone found in this manner will require around 41 PCR

Figure 9.10

Three-tier pooling strategy for finding a clone of interest in a YAC library.

 

 

 

SEQUENCE-SPECIFIC

TAGS

301

reactions. This is a vast improvement over the 10

4 reactions

required if YACs are exam-

 

ined one at a time. For a fivefold redundant library, if the super pools are kept the same

 

 

size, then

the total number of PCRs needed will be 81 per

positive clone. This is

still

 

 

quite reasonable.

 

 

 

 

 

SEQUENCE-SPECIFIC TAGS

 

 

 

 

 

The power of the simple pooling approach just described had a strong effect on early

 

 

 

strategies developed for genome mapping. In reality the power is greatest when individual

 

 

 

clones are sought, and it diminishes considerably when the goal

is to order a whole li-

 

 

 

brary. However, the attractiveness of PCR-based screening has led to much consideration

 

 

 

about the kinds of PCR primers that would be suitable for genome analysis. Since PCR

 

 

ordinarily requires known sequences, the issue really becomes what kinds of DNA se-

 

 

 

quences should be used for finding genes or for genome mapping. A number of different

 

 

 

types of approaches are currently being used; these are described below.

 

 

 

 

STS.

This is short for a sequence tagged site. The original notion was that any

arbi-

 

 

trary bit of known DNA sequence could be used as a probe if it successfully gener-

 

 

ated useful PCR primers. One early notion was to take all of the existing polymor-

 

 

phic genetic probes and retro-fit these as STSs by

determining

a partial

DNA

 

 

sequence, and then developing useful PCR primers based on this sequence. This ap-

 

 

 

proach was never executed on a large scale, which is probably good in retrospect

 

 

because recently developed genetic markers are far more useful for mapping than

 

 

 

earlier ones, and the process for finding these automatically includes

the develop-

 

 

ment of unique sequence tags suitable for PCR.

 

 

 

 

 

STAR.

This stands for sequence tagged rare restriction site. The great utility

of such

 

 

probes has already been described in considering strategies for efficient restriction

 

 

mapping or ordering total digest libraries. The appeal of STARs is that they allow

 

 

precise placement of probes on physical maps, even at rather early stages in map

 

 

construction. The disadvantage of STARs, mentioned earlier, is that many cleavage

 

 

sites for rare restriction nucleases turn out to be very

G

 

C rich,

and thus PCR

in

these regions is more difficult to perform. For any kind of probe, clone ordering is most efficient if the probes come from the very ends of DNA fragments or clone in-

serts. As shown in Figure 9.11, this allows PCR to be done by using primers in the vector arms in addition to primers in the insert. Thus the number of unique primers

that must be made for each probe is halved. STARs in total digest libraries naturally come from the ends of DNA fragments, and thus they promote efficient mapping strategies.

Figure 9.11

Use of vector primers in PCR to amplify the ends of clones like YACs.

302

ENHANCED METHODS FOR PHYSICAL MAPPING

 

 

 

 

 

 

STP

or STRP.

 

 

 

These abbreviations refer to sequence tagged polymorphism or poly-

 

 

morphic sequence tag. A very simple notion is involved here. If the PCR tag is a

 

 

polymorphic

sequence, then

the genetic and physical maps can be

directly

aligned

 

 

 

at the position of this tag. This allows the genetic and physical maps to be built in

 

 

parallel. A possible limitation

here is that some of the most

useful

genetic probes

 

 

are tandemly repeating sequences, and a certain subset of these, usually very simple

 

 

 

repeats

like

(AC)

 

n , tend to give extra unwanted amplification products in typical

 

PCR protocols. However, it seems possible to find slightly more complex repeats,

 

 

like

(AAAC)

n , that are

equally useful as genetic probes but

show fewer

PCR

arti-

 

facts.

 

 

 

 

 

 

 

 

 

 

 

 

 

EST.

This

stands for

expressed sequence tag. It could really refer to any piece of cod-

 

 

ing DNA sequence for which PCR primers have been established. However, in

 

 

practice, EST almost always refers to a segment of the DNA sequence of a cDNA.

 

 

These

samples

are

usually obtained by starting with an existing cDNA library,

 

 

choosing clones at random, sequencing as much of them as can be done in a single

 

 

 

pass, and then using this sequence information to place the clone on a physical map

 

 

 

(through somatic cell genetics or FISH). There are many advantages to such se-

 

 

quences as probes. One knows that a gene is involved. Therefore the region of the

 

 

chromosome is of potential interest. The bit of DNA sequence obtained may be in-

 

 

 

teresting:

It

may

match something already in the database or

be

interpretable

in

 

 

some way (see Chapter 15). In general, the kinds of PCR artifacts observed with

 

 

STPs, STRPs, and STARs are much less likely to occur with ESTs.

 

 

 

Despite

their

considerable

appeal

there

are a number of potential problems

in

dealing

 

 

with ESTs as mapping reagents. As shown in Figure 9.12, cDNAs are discontinuous sam-

 

 

ples of genomic DNA. They will typically span many exons. This can be very confusing.

 

 

If the EST crosses a large

intron, the probe will show PCR amplification, but genomic

 

 

DNA or a YAC clone will not. A common strategy for EST production uses largely un-

 

 

translated DNA sequence at the 3

 

-ends of the cDNA clones. It is relatively easy to clone

these regions,

and

they

are more polymorphic than the internal coding region.

 

Furthermore cDNAs from gene families will tend to have rather different

3

 

 

 

 

-untranslated

regions, and thus one will avoid some of the problems otherwise encountered with multi-

 

 

ple positive PCR reactions

from members of a gene family. These 3

 

 

 

 

 

-end sequences will

also tend to contain only a single exon in front of the untranslated region. However, all of

 

 

these advantages carry a price: The 3

-end sequence

is

less

interesting and

interpretable

than the remainder of the cDNA.

 

 

 

 

 

 

 

 

Figure 9.12 A potential problem with ESTs as mapping reagents is that an EST can cross one or more introns.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

POOLING IN MAPPING STRATEGIES

303

Eugene Sverdlov and his colleagues in Moscow have developed an efficient procedure

 

 

 

for preparing chromosome-specific cDNA libraries. Their three-step procedure is an ex-

 

 

 

tension of simpler procedures that were tried earlier by others. This procedure uses an ini-

 

 

 

tial

Alu

-primed PCR reaction to make a cDNA copy of the hnRNA produced in a hybrid

 

 

 

cell containing just the chromosome of interest. The resulting DNA is equipped with an

 

 

 

oligo-G tail, and then a first round PCR is carried out using an oligo-C containing primer

 

 

 

and an

Alu

primer. Then a second

round of PCR is done

with a nested

 

 

 

 

Alu

primer. The

PCR primers are also designed so that

the first round introduces one restriction site and

 

 

 

the second round another. The resulting products are then directionally cloned into a vec-

 

 

 

tor requiring both sites. In studies to

date, Sverdlov and his coworkers have found that

 

 

 

this scheme produces a diverse

 

set

of

highly enriched human cDNAs. Because these

 

 

 

come from

 

 

Alu s in hnRNA, they will contain introns, and this gives them potential advan-

 

 

 

tages

as

mapping

probes

when

compared

with conventional cDNAs. As with the 3

 

 

-

cDNAs discussed above, cDNA from hnRNAs will be

more

effective

than

ordinary

 

 

 

cDNAs in dealing with gene families and in avoiding cross-hybridization with conserved

 

 

 

exonic sequences in rodent-human cell hybrids.

 

 

 

 

 

 

 

 

 

 

 

 

 

POOLING

IN MAPPING STRATEGIES

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

All of the above methods for screening

individual samples in a complex library are fine

 

 

 

for finding genes. However, they are very inefficient in ordering a whole library. The

 

 

 

schematic result of a successful screen for a gene-containing clone in an arrayed library is

 

 

 

shown

in Figure

9.13

 

 

a.

A single positive clone is detected, presumably containing the

 

 

 

sample

of

interest. However, from the viewpoint of information

retrieval,

this

result is

 

 

 

very weak. A sample array is potentially an extremely informative source of information

 

 

 

about

the

order of the samples it contains. A single positive hybridization extracts the

 

 

 

minimum possible amount of

information

from the array and

requires a time-consuming

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(a)

 

(b)

Figure 9.13

Hybridization to arrays. (

a ) Typical pattern seen when screening a

library for a par-

ticular gene. (

b ) Ideal pattern of hybridization in interrogating a dense sample array.

 

304

 

ENHANCED METHODS FOR PHYSICAL MAPPING

 

 

 

 

experiment. From the viewpoint of information theory, a much better designed interroga-

 

tion of the array would produce the result shown in Figure 9.13

 

 

 

b. In this ideal black-and-

white case, roughly half the clones would be scored positive and half negative if we are

 

considering only positive and negative hybridization results. When the amount

of

signal

 

can

be

quantitated, much more

information is potentially available from a single

test of

the array through all the gray levels seen for each individual clone.

 

 

 

 

 

The

practical issue is how

to take advantage of the power of arrays, or pools

of

samples,

in a maximally efficient way so that all of the clones are ordered accurately with a minimum

 

number of experiments. The answer will depend critically on the nature

of the

errors in-

 

volved in interrogating arrays or pools. We start by considering fairly ideal cases. A detailed

analysis

of the effects of errors in real applications is

still not available, but

the power of

these approaches is so great that it appears that a reasonable error rate can be tolerated.

 

 

The goal is to design probe or sample pools and arrays that allow roughly half of the tar-

 

gets to be scored positive in each hybridization or PCR. There are a number of different ways

 

to try to accomplish this. One general approach is to increase the amount of different DNA

 

sequences in the probes or the targets. The other general approach is to increase the fraction

 

of potential target DNA that will be complementary to probe sequences. In principle, both

 

approaches can be combined. A very simple strategy is to use complex probes. For example,

 

purified large DNA fragments can be used as hybridization probes. DNA from hybrid cell

 

lines or radiation hybrids can be used as probes. In some or all of these cases, it is helpful to

employ human-specific PCR amplification so that the probe contains sufficient concentra-

 

tions of the sequences that are actually serving to hybridize with specific samples in the tar-

 

get.

 

 

 

 

 

 

 

 

 

 

 

The logic behind the use of large DNA probes is that they automatically contain conti-

nuity information, and sets of target clones detected in such a hybridization should lie in

the same region of the genome. It is far more efficient, for example, to assign an array of

clones

to

chromosome regions

by hybridizing with

DNA

purified

from

those

regions,

 

than it

is to assign the regional location of clones one at a time by hybridizing

to a panel

of

cell

lines. The key technical advances that makes these new strategies

possible

are

PCR amplification of desired sequences and suppression hybridization of undesired re-

 

peated sequences.

 

 

 

 

 

 

 

 

 

An alternative to large DNA probes is simple sequence probes. Oligonucleotides of

lengths 10 to 12 will hybridize with a significant fraction of arrayed cosmids or YACs.

Alternatively, one could use simple sequence PCR primers, as we discussed in Chapter 4.

There is no reason why probe

molecules must be used individually. Instead, one

could

 

make pools of probes and use

this pool directly in hybridization. By constructing differ-

ent sets of pools, it is possible, after the fact, to sort out which members of

which pools

were responsible for positive hybridization. This turns out to be more efficient in princi-

ple. Any sequences can be used in pools. One approach is to select arbitrary, nonoverlap-

ping sets of single-copy sequences. Another approach

is to build

up pools from

mixtures

 

of short oligonucleotides.

 

 

 

 

 

 

 

 

 

Another basic strategy for increasing the efficiency of mapping is to use pools of sam-

ples. This

is a necessary part of PCR screening

methods, but

there

is no

reason why

it

also could not be used for hybridization analyses. We will describe some of the principles

that

are involved in pooling strategies in the next few sections. These principles apply

equally whether pools

of probes or pools of samples are

involved.

Basically it would

seem

that one should be

able to combine simultaneously sample

pooling

and probe pool-

Соседние файлы в папке genomics1-10