Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:

Solid-Phase Synthesis and Combinatorial Technologies

.pdf
Скачиваний:
20
Добавлен:
15.08.2013
Размер:
7.21 Mб
Скачать

9.1 PHARMACEUTICAL APPLICATIONS 423

increased competition, changes in the patients’ needs in terms of acceptable drug profiles (including costs), and an increase in regulatory requirements to obtain new drug approvals. The macroscopic effects of these and other factors are summarized in two major phenomena. First, the cost to launch a new drug on the market has increased eight times since 1976, going from around $50 million to an impressive average of $400 million, which is sustainable only by major pharmaceutical companies. Second, development time has almost doubled since 1960, reaching the current average figures of 14–15 years and reducing the time window to capitalize on sales before patent expiration of a successful drug and insurgence of generic competition.

The need to counteract this tendency has led to the search for faster and/or less expensive processes for the discovery and development of drug candidates. Biological and chemical techniques to synthesize and screen early or late drug candidates in a high-throughput fashion (HTS, combinatorial technologies) are impacting positively on these factors, as already mentioned (see Section 4.1.2). In the following sections we cover the so-called drug discovery process, which is schematized in Fig 9.1, describing and discussing the impact of combinatorial technologies on each of the steps involved. A recent review has covered exhaustively the economic impact of combinatorial technologies in drug discovery (1).

9.1.2 From Gene to Function

Disease selection is the first step of drug discovery. The choice of one or more disease areas in which to research for new targets is based on the so-called opportunitymapping process, which ranks the disease areas according to the unmet medical need and the risk involved. Ideally efforts are concentrated in areas of high unmet medical need, and the disease portfolio of a pharmaceutical company will include some risky areas, where little, or nothing, is known, as well as more assessed diseases for which significant knowledge is available. Combinatorial technologies are not involved at this level of the drug discovery process.

FROM GENE TO FUNCTION

FROM FUNCTION TO TARGET

FROM TARGET TO HIT

FROM HIT TO LEAD

FROM LEAD TO CLINICAL CANDIDATE

Figure 9.1 Key steps of the drug discovery process.

424 APPLICATIONS OF SYNTHETIC LIBRARIES

Target identification is the next, crucial phase. Currently marketed drugs interact with about 400 known targets (genes or gene products), which are mostly receptors, enzymes, or ion channels. The completion of the Human Genome Project, which is expected in the next five years, will unravel all of the approximately 100,000 human genes and will provide their detailed genetic and physical maps. Of these, the estimated number of important genes to cure diseases ranges between 3000 and 10,000. All these potential targets will be known to everybody, and the competitive advantage will be to identify, faster than the competition, the pharmaceutically relevant targets and to associate the significant ones to a given disease. The conventional sources for target identification (literature, in-house research, competitor information) are already and will be more heavily overwhelmed in the future by susceptibility genetics (2, 3), which establishes an association between a gene and a disease by studying a patient population. The link may be determined by differential gene expression (DGE) analysis (4, 5), which analyzes the alterations of gene transcription comparing healthy versus diseased tissues, by proteomics (6, 7), which profile the proteins from a cellular or tissue source, again comparing normal versus diseased samples, or by bioinformatic data mining (8, 9), which browses the genetic databases and identifies potential new targets by comparison of their sequence with known genes of interest. The identification of these targets may take advantage, at some level, of biological libraries (see Chapter 10) as tools, but synthetic libraries are not significantly involved in this phase.

9.1.3 From Function to Target

The identified target is first validated, discriminating between relevant and nonrelevant targets for a given disease. Even if a target is related to a disease, it may not be essential, and to interfere with it may not lead to novel, useful drugs. Functional genomics, or functional gene analysis (FGA) (10, 11), is fundamental in linking genomics research with the discovery of disease-relevant targets, addressing the question of their function or dysfunction in disease states. Current technologies in FGA rely on gene under- (12) or overexpression (13), on knock-out and interaction studies in cell cultures or in transgenic animals (14, 15), and on tissue distribution of new genes or on in vivo pharmacological studies in animals. These technologies use a wide variety of techniques, including combinatorial technology–related tools such as ribozymes, antisense oligonucleotides, aptamers, and antibodies. Synthetic libraries, though, are not involved in this phase of the drug discovery process; the only exception is a novel approach, termed chemical genetics and described in the next subsection (see also Sections 7.5.1 and 7.5.2), which will provide significant assistance to elucidate biological pathways relevant to diseases and to validate novel targets via their interaction with small molecules.

The consequence of validation is the selection of a target that is promising; that is, affecting it will likely result in curing the disease of interest. An important criterion is target tractability, related to the ease of screening chemical diversity for activity on the target. In fact, there is no therapeutic use for a target where one or preferably more screening assays cannot be designed and realized (see also next section).

9.1 PHARMACEUTICAL APPLICATIONS 425

9.1.4 Chemical Genetics: From Chemical Entities to Valuable Targets

The use of chemical tools to elucidate complex biological pathways is not a recent discovery; many past examples determined the identity and the function of a target through a chemical entity interacting with the macromolecule. This pharmacological approach has been largely obscured by the advent of genetics and genomics (see Sections 9.1.2 and 9.1.3), but has still the power to unravel, or to provide information about complex pathways for which genetic manipulation protocols are not yet assessed.

A higher throughput version of this approach takes advantage of combinatorial technologies and has been recently developed and exploited with the name of chemical genetics (16–18). More specifically, two complementary approaches have been defined and will be exemplified:

forward chemical genetics (FCG), where a large collection of drug-like compounds is tested for its modulation of a complex biological pathway. If one, or more, modulators are found, their molecular target (either novel, or not known to interact with the pathway) is identified and its role in the pathway is elucidated. This provides both useful chemical entities to be progressed and unvaluable proprietary information regarding novel targets for relevant diseases;

reverse chemical genetics (RCG), where a small molecule which targets a specific pathway or which causes a specific phenomenon is incubated with a large genetic collection, to spot if the expression of any of the gene products is affected. Knowing that the whole human genome will be soon available for such experiments in microarrays, and that most of the sequenced genes will not be immediately linked to a function, this approach will be important for the ambitious goal “to identify a small molecule partner for every gene product” (16).

Mayer et al. (19) reported the use of FCG to identify compounds that affect the cellular mitotic process through a screening cascade reported in Fig. 9.2. A commercially available, diverse collection of 16,320 compounds was screened on a whole cell primary cytoblot assay (20) measuring the phosphorylation level of a nucleolar protein called nucleolin (step a). This protein is phosphorylated when cells enter mitosis, and inhibitors of the mitotic process are expected to increase the level of phosphonucleolin. 139 positive compounds were able both to penetrate the cell and to increase the level of phosphorylated nucleolin.

As tubulin is a major target for antimitotics, the positives were screened for their in vitro effects on tubulin polymerization (step b). 52 compounds inhibited tubulin polymerization, and one stimulated it; these structures (see a few representatives, 9.1–9.5, in Fig. 9.3), which could represent a starting point for an optimization program, were archived. The remaining 86 positives not interacting with tubulin were progressed and tested in mammalian epithelial kidney cells (BS-C-1) stained with various fluorescent reagents to visualize microtubules, actin and chromatin, i.e. the essential structural/mechanochemical mitiotic spindle (step c). 27 compounds, although increasing the number of normal mitotic cells and thus confirming the result of primary screening, did not have any immediate effect and were deemed to act on

426 APPLICATIONS OF SYNTHETIC LIBRARIES

screening set: 16,320 compounds

a

primary screening:

nucleolin phosphorylation

 

 

 

139 positives

secondary screening: b tubulin polymerization

later

 

 

52 inhibitors of

 

1 stimulator of

 

later

progression(?)

 

 

tubulin polymerization

 

tubulin polymerization

 

 

 

progression(?)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

86 compounds with no

 

 

 

 

 

 

effects on tubulin polymerization

 

 

 

 

 

 

 

 

tertiary screening:

 

 

 

 

 

 

c

 

fluorescent staining of

 

 

 

 

 

 

12 aspecific

 

mitotic spindle components

 

 

discarded

 

 

 

 

 

 

 

 

 

 

compounds

 

42 compounds acting

 

later

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

both on mitotic and on

 

 

later

 

27 compounds with no

 

 

 

progression(?)

 

 

interphase cells

 

progression(?)

 

 

effects on mitotic spindle

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

5 compounds specifically acting on mitotic spindle

Figure 9.2 Forward Chemical Genetics (FCG): screening aimed to compounds interacting with the mitotic spindle components during mitosis.

other components of the mitotic process. These compounds were archived. 42 compounds affected the mitotic spindle (chromatin and microtubules, no effect on actin cytoskeleton) but affected microtubules also in interphase cells not yet entered into the mitotic phase. These compounds were also archived. 12 compounds showed multiple, aspecific effects and were thus discarded (Fig. 9.2).

The five remaining compounds were specifically acting on the mitotic spindle, and did not alter its components in interphase cells. One of them in particular prevented the formation of the spindle in most mitotic cells, replacing it with a monoastral microtubule formation surrounded by chromosomes; the compound (9.6, Fig. 9.3) was thus named monastrol. The authors compared monastrol effects with several published effects (21–23) related to inhibition of Eg5, a member of the BimC kinesin family, and showed monastrol to be the α selective Eg5 inhibitor (Eg5-driven microtubule motility inhibition = 14 µM). This compound is both the first permeable and selective inhibitor of a specific kinesin, and may have many possible applications as a tool or as the starting point for a chemical optimization program.

The same compound collection was screened by Stockwell et al. (24) on a primary FCG assay directed towards activators of a regulatory reporter gene (p3TPLux)

 

 

9.1 PHARMACEUTICAL APPLICATIONS 427

Cl

 

 

 

O

 

 

 

O

 

HN

S

 

 

 

 

 

O

 

O

N

O

 

 

 

 

 

 

9.1

 

 

9.2

 

 

 

MeO

O

 

 

N

 

 

O2N

N

 

 

 

 

 

O

9.4

 

 

 

9.3

OMe

 

 

 

 

 

 

 

 

OH

S

O

O

O

 

 

 

N

N

HN

O

9.5

H

 

 

 

S N

9.6

 

 

H monastrol

Figure 9.3 Structures of inhibitors of tubulin polymerization (9.1–9.5) and of monastrol 9.6, an inhibitor of mitotic spindle assembly, found through FCG.

sensitive to TGF-β (25) (step a, Fig. 9.4). The determination of the mechanism of action for the activators of p3TPLux should have provided insights into the TGF-β signalling pathway. Four positive compounds (9.7–9.10, Fig. 9.4) were identified and submitted to a secondary cytoblot screen for 5-bromodeoxyuridine incorporation (20) to determine their inhibition of DNA synthesis (step b, Fig. 9.4). All of them inhibited DNA synthesis, as happens for TGF-β signalling.

Compounds 9.8 and 9.10 were submitted to an RCG screen using the vast majority of Saccaromyces cerevisiae genome (around 5800 genes) on a microarray format (step a, Fig. 9.5). The high similarity between the human and fungal genome was expected to ensure useful indications for the molecular targets of the two molecules. The gene expression of the whole microarray, checked with the use of fluorescent dyes, did not change in the presence of 9.10. The expression of five genes (Fig. 9.5) was significantly increased by 9.8. Two out of the five genes are involved in metal transport, respectively

428 APPLICATIONS OF SYNTHETIC LIBRARIES

screening set: 16,320 compounds

a

primary screening:

p3TPLux activation

 

4 positives

b

secondary screening:

5-bromodeoxyuridine incorporation

4 positives 9.7-9.10: p3TPLux acivators DNA synthesis inhibitors

 

 

 

O

 

 

 

N

N

N

N

N

 

 

 

H

OH

R

OH

OH

 

 

 

HO

9.10

9.7R = H

9.8R = Me

9.9R = n-Bu

Figure 9.4 Forward Chemical Genetics (FCG): screening aimed to compounds activating the reporter gene p3TPLux and interfering with TGF-β signalling, and structures of activators

9.7–9.10.

for Zn (ZRT1) (26) and Cu-depended Fe transport (FET3) (27); HSP26 is a heat shock protein activated by osmotic stress (28), and the two uncharacterized genes are likely to be involved in similar, metal-related processes.

Compounds 9.8 and 9.10 were further characterized for their metal binding capacity; 9.8 confirmed a high and quite selective affinity for Zn2+, Fe3+ and Cu2+, and its activation of TGF-β signaling was reversed (Fe3+) or largely reduced (Zn2+, Cu2+) by addition of metal chloride salts. RCG thus provided insight on TGF-β signalling (several potential genes involved) and on the relevance of metal ions for this process.

9.1.5 From Target to Hit

This is the drug discovery phase where the accumulated biological knowledge is converted into relevant, novel chemical entities that will be progressed along the line to hopefully become new drugs on the market. The interdependent progression of implied biological and chemical activities is thoroughly described below, with the assistance of several examples.

This phase requires the initial development of an assay able to measure significant biological interactions of a molecule with the target. This is the foundation of pharmaceutical research, and traditionally there was no real need for close interaction

9.1 PHARMACEUTICAL APPLICATIONS 429

screening set: around 5800 S. cerevisiae genes

 

primary screening:

a

effect of 9.8 and 9.10

 

on gene expression

9.8

9.10

increase of

no changes in

gene expression

gene expression

for 5 genes:

 

ZRT1, FET3, HSP26,

 

YDR534C and YOL155C

 

N N N

OH

Me

OH

9.8 metal binder

with high affinity for Zn2+, Fe3+ and Cu2+

Figure 9.5 Reverse Chemical Genetics (RCG): screening aimed to elucidate the mode of action and the main features of compounds 9.8 and 9.10.

at the assay development stage between chemists and biologists. Medicinal chemists, in fact, prepared large amounts of molecules, which were then tested by the biologists on a variety of assays and were fully characterized. Eventually, the acquired structure– activity information was used to orient further chemical efforts. While this classical approach was accurate and successful on many occasions, the low throughput of both synthesis and screening was not appropriate for a quick and economical drug discovery process. This is the engine that gave rise first to high-throughput screening (HTS) and then to its chemical counterpart, high-throughput chemical synthesis or, as it is better known, combinatorial chemistry (see Section 4.1.2 for the historical background).

The significant change is represented by a quick and inexpensive biological filter to rapidly discard nonactive molecules and to focus further efforts on a few confirmed and attractive positives. The assay, thus, is automated to test even hundreds of thousands of compounds in a relatively short time, and this introduces some limitations. The following are among the most relevant:

Only simple assays can be automated, while complex protocols will cause problems and will not produce reliable results.

Automation must not affect the robustness of the assay, because automated protocols cannot afford last-minute adjustments or modifications.

430APPLICATIONS OF SYNTHETIC LIBRARIES

Automated assays must be miniaturized, partly because they will typically receive small quantities of compound from collections or chemical libraries, but also to reduce the costs of biological reagents.

Automated assays must be developed using several formats, on an assay-specific base, and it is essential to possess the necessary expertise to select and automate the best choice for a given target.

Automated instrumentation and data management software are necessary to analyze and archive the large number of data produced by a screening campaign.

In principle, any type of biological assay can be automated, and several excellent reviews have covered this topic (29–38). Assay development and automation are not covered further here, but their influence on chemistry at the early stages of drug discovery will be examined.

At the very start of the so-called hit identification phase, the project is driven by the available information. If the biological target is already well-characterized, either a pharmacophoric/structural model or even structures of known ligands/inhibitors are available. This structural information will determine the class(es) of compounds to be tested and the size of the screening set. This knowledge-based, or focused, approach calls for similarity with a model/structure, rather than for diversity, and the screening sets are typically smaller (see Sections 5.4.2 and 5.4.3). More focused approaches are dealt with in Section 9.1.7. A more frequent situation, due to the input of novel genomic-derived targets, deals with unknown or, at best, poorly characterized targets, which are less tractable but also highly rewarding if novel, active structures are identified. The goal here is to test significant chemical diversity on a single, automated primary assay to fish out positives. These are then profiled through a screening cascade of secondary assays, and one or more structures are eventually selected as starting points for an exploratory chemical project. Ideally, large sets (hundreds of thousands) of compounds from collections and from synthetic libraries are screened, providing that a reliable and robust primary screening assay has been set up. Pooling strategies to reduce the number of wells per screening can also be adopted (39).

The screening of a large, diversity-based set of compounds/libraries implies the selection of candidates. Selected libraries should be built around a relevant (possibly proprietary) common core scaffold, maximize their chemical diversity, and provide novel, druglike individuals according to well-known computational filters (see Section 5.3). Each pharmaceutically biased-targeted primary library is tested on many biological targets; thus a large number of library equivalents are prepared and stored to be used when needed. High-quality SP pool libraries are usually the preferred format, but also medium–large discrete libraries are becoming very popular as hit-seeking libraries. Each screening set contains library individuals or compounds from collections in very small amounts to fully exploit the synthetic efforts and maximize the compound availability for multiple screening.

The positive compounds from the primary screening are further profiled to check their usefulness. This characterization includes preliminary physicochemical property determination, toxicity data, and specificity/selectivity data when possible. The secondary screening cascade will restrict the screening outcome to a small number of

9.1 PHARMACEUTICAL APPLICATIONS 431

confirmed hits, to be used as starting points for chemical modification projects (see Section 9.1.8).

9.1.6 Management of Combinatorial/Screening Data and Samples

Combinatorial technologies and HTS in drug discovery generate a huge amount of biological and chemical information. This section follows the flow of generated data from the design and synthesis of a library to its analytical characterization and its screening, ending with the extraction of activity data, the assessment of SAR, and the storage of the information.

The design of a primary or biased-targeted, hit-seeking library is aided by computational tools in selecting the most diverse set of library individuals according to the desired library size (see Section 5.4). The designed library is then prepared, and the individual structures are registered. The structures of hundreds of thousands of compounds, though, would require enormous efforts for their input into a database and a huge computing space for them to be stored and searched. The structures are thus stored via the reaction scheme and the building blocks (BBs) used to prepare the library. The example shown in Fig. 9.6 (40) shows how a 64,000,000-member library of hexapeptides, where 20 α-amino acids are randomized in each position, can be

H

 

AA2AA

AA4AA

AA6CONH

N

AA

 

1

3

5

2

O

64,000,000 N-acetyl hexapeptide amides

H

N A A A

A A A CONH2 O structure nº 1

+

...........

+

...........

+

...........

+

H

N Y Y Y

Y Y Y CONH2

O

structure nº 64,000,000

 

PRODUCT

REPRESENTATION

AA1 = A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y

structures nº 1-20

+

...........

+

...........

+

AA6 = A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y

structures nº 101-120

H

 

AA2AA

AA4AA

AA6CONH

N

AA

 

1

3

5

2

O

PROCESS/BBs

REPRESENTATION

Figure 9.6 Product representation versus process/building block representation of a hypothetical 64,000,000-member peptide library.

432 APPLICATIONS OF SYNTHETIC LIBRARIES

condensed from 64,000,000 represented structures (product representation) to 120 monomers and a reaction scheme (process/BB representation). The combination of building blocks can be used at any time to extract and visualize each library component, providing that an unequivocal match can be made from the pathway and the building block structures. The example in Fig. 9.7 (41) shows how several alternatives in terms of regioand stereochemistry may prevent the assignment of an unequivocal structure from a process/BB representation. The structures of amino acids (M1), aldehydes (M2), olefins (M3), and acyl chlorides (M4) do not allow the generation of the correct reaction products related to the olefin cycloaddition step (regiochemistry and stereochemical preference; see Fig. 9.7) via software programs.

The structural database must be linked to a relational database (42), which contains all the generated information for a discrete or a pool library sample. The relational database includes structural information (MW, molecular formula), physicochemical properties (log P, solubility, pKa), analytical results (peak values or spectra IDs from which the spectra can be found), and activity data from primary and secondary

H

COOH

 

 

 

 

N

CHO

R3

 

COCl

Fmoc

 

 

 

R

 

 

R4

 

 

2

 

 

 

R1

 

M2

 

M3

M4

M1

 

 

 

 

 

 

O

 

 

 

 

N

 

 

+

 

 

R2

O

P

R3

 

 

R1

 

 

 

 

regiochemistry of cycloaddition? stereochemistry of proline substituents?

 

R1

O

 

 

 

O

 

 

 

 

 

 

R1

 

R3

 

O

P

R3

 

O

P

 

 

 

 

 

N

O

 

+

R2

N O

 

R2

R4

 

 

 

R4

 

 

 

 

 

 

 

library made by two diastereomeric couples

in different ratios according to the monomer structures

Figure 9.7 Process/BB representation of a hypothetical pyrrolidine library: regioand stereochemical issues.