Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:
Тексты / 3 / Math.doc
Скачиваний:
171
Добавлен:
02.05.2014
Размер:
287.74 Кб
Скачать

9.3 Discussion

It is worth noting that the aspects of subsumption deletion we've just discussed are closely related to the phenomena discussed in the first part of the paper. Discovering a new area of the environment in which the evolved classifiers are overly general is similar to the case in which an animat learns locally because it cannot explore all the areas of the environment frequently. Therefore, we might observe that specify and teletransportation are complementary. However, the two problems are quite distinct in that we may have an environment where every area can be visited frequently in which, suddenly, a door opens and the animat faces an unexplored area. Most important, teletransportation acts through evolution and therefore it is slower than specify, which is a heuristic and thus acts much faster than genetic evolution.

10 Summary

We presented a study of the generalization mechanism of XCS to explain some of the previously reported results. We analyzed Wilson's generalization hypothesis which explains how generalization in XCS works. Then, we stated a hypothesis which suggests that XCS may not converge to the optimum when, due to the structure of the environment and to the exploration strategy, the system is not able to visit all the areas of the environment frequently. We verified our hypothesis by introducing a meta-exploration strategy, teletransportation, which was used as a theoretical tool during the validation phase. Subsequently, we suggested how the ideas underlying teletransportation might be implemented in a real application integrating XCS with a model of the environment in a Dyna architecture. A possible implementation of such an architecture was then discussed.

Finally, we analyzed the conditions under which XCS may fail to evolve a compact representation of the learned task. We showed this is likely to happen in environments where there is no direct relation between the number of don't care symbols a classifier condition has and the number of environmental configurations the condition matches. In such cases, there is no pressure for further generalization beyond what may be permitted by available states. The system rapidly reaches the maximum generalization that the environmental states permit. Then it can start evolving a redundant representation of the learned task. Accordingly, we showed how subsumption deletion can be effective in evolving a more compact solution.

Acknowledgments

I wish to thank all the people who helped me during the course of this work. Stewart W. Wilson, who was always available for discussing the generalization issue and for reviewing the early versions of this paper. Marco Dorigo, for the many discussions on learning classifier systems when I was in Brussels. Marco Colombetti, who supports my work and is always available for discussions. Trevor Collins, for his invaluable effort in trying to improve my English. Finally, Gabriella, for her never ending patience.

This research was partially funded by a grant to Marco Colombetti for the year 1996 from the Fondo di Ricerca d'Ateneo (University Research Fund) of the Politecnico di Milano.

(n1) General parameters for the four algorithms are set as follows: Beta=0.2, Gamma=0.71, Theta=25, Epsilon[sub 0]=0.01, Alpha=0.1, Chi=0.8, Mu=0.01, Delta=0.1, Phi=0.5, P[sub #]=0.3, P[sub I]=10.0, Epsilon[sub I]=0.0, F[sub I]=10.0.

Specific parameters are set as follows: for algorithm (ii), P[sub #]=0.0 and mutation does not insert any # symbol; in (iii) specify parameters are set as N[sub Sp]=20 and P[sub Sp]=0.5; finally, biased exploration chooses an action randomly with a probability P[sub s]=0.3. We refer the interested reader to Wilson (1995) for a detailed description of each of the parameters.

(n2) Due to the significant difference of the results it is not possible to use the same scale on both plots.

(n3) XCS's parameters are set as in the previous experiment, except for the specify probability parameter, P[sub Sp], which is set to 0.8.

(n4) This happens because the classifier parameters are always updated according to the payoff level of the first niche, but never according to the payoff level of the second niche.

(n5) This phenomenon was already noticed by Wilson (1995) where, discussing the generalization produced by XCS in Woods2, Wilson observed that XCS produced classifiers which matched the same niches but contained different numbers of # symbols.

SOME APPLICATIONS OF GRAPH THEORY TO OTHER PARTS OF MATHEMATICS

Source: Mathematical Intelligencer, Summer99, Vol. 21 Issue 3, p6, 6p, 7 diagrams, 1 graph, 2bw

Author(s): Bertram, Edward; Horak, Peter

Many mathematicians are now generally aware of the significance of graph theory as it is applied to other areas of science and even to societal problems. These areas include organic chemistry, solid state physics and statistical mechanics, electrical engineering (communications networks and coding theory), computer science (algorithms and computation), optimization theory, and operations research. The wide scope of these and other applications has been well documented (e.g., [4, 11]).

However, not everyone realizes that the powerful combinatorial methods found in graph theory have also been used to prove significant and well-known results in a variety of areas of pure mathematics. Perhaps the best known of these methods are related to a part of graph theory called matching theory. For example, results from this area can be used to prove Dilworth's chain decomposition theorem for finite partially ordered sets. A well-known application of matching in group theory shows that there is a common set of left and right coset representatives of a subgroup in a finite group. Also, the existence of matchings in certain infinite bipartite graphs played an important role in Laczkovich's affirmative answer to Tarski's 1925 problem of whether a circle is piecewise congruent to a square. Other applications of graph theory to pure mathematics may be found scattered throughout the literature.

Recently, a collection of examples [10] showing the application of a variety of combinatorial ideas to other areas has appeared. There, for example, matching theory is applied to give a very simple constructive proof of the existence of Haar measure on compact topological groups, but the other combinatorial applications do not focus on graph theory. The graph-theoretic applications presented here do not overlap with those in [10], and no attempt has been made at a survey. Rather, we present five examples, from set theory, number theory, algebra, and analysis, whose statements are well known or are easily understood by mathematicians who are not experts in the area.

Additional criteria for choosing these five examples were that the statement can be formulated using few definitions and that the proof can be explained in a relatively short space, without too much technical detail. The proof should exhibit the strength and elegance of graph-theoretic methods, although, in some cases, one must consult the literature in order to complete the proof.

Preliminaries

For the convenience of our readers, we recall the necessary definitions from graph theory.

An (undirected) graph G = (V, E) is a pair in which V is a set, the vertices of G, and E is a set of 2-element subsets of V, the edges of G. An edge e is an element of E is denoted by e = xy, x and y being the end vertices of e. Here, e is incident with x (and e is incident with y). The degree of a vertex v, deg(v), is the number of edges incident with v. In a directed graph, or simply digraph, G = (V, E), the (directed) edges are ordered pairs of vertices of V and are denoted by e = (x, y).

A trail of length n in a graph G (digraph G) is a sequence of vertices x[sub 0], x[sub 1], x[sub 2], ..., x[sub n] (x[sub i] is an element of V), such that for i = 0, 1, ..., n - 1, x[sub i]x[sub i + 1] is an edge of G ((x[sub i]x[sub i + 1]) is an oriented edge of G). If x[sub 0] = x[sub n], then the trail is said to be closed. When all the vertices in the sequence are distinct, the trail is called a path. A closed trail, all of whose vertices are distinct except for x[sub 0] and x[sub n], is called a cycle.

A graph G is connected if any two vertices of G are joined by a path in G. Otherwise, G is said to be disconnected. The components of G are the maximal connected subgraphs of G. A tree is a connected graph without cycles. A graph G = (V, E) is said to be bipartite if V can be partitioned into two nonempty subsets A and B such that each edge of G has one end vertex in A and one end vertex in B. Then, G is also denoted by G = (A, B; E).

If (H, Center Dot) is a group and S a set of generators of H, not necessarily minimal, the Cayley graph G(H, S), of (H, Center Dot) with respect to S, has vertices x, y, ... is an element of H, and xy is an edge if and only if either x = y Center Dot a or y = x Center Dot a for some a is an element of S.

If G is any graph and e = xy an edge of G, then by a contraction along e, we mean the graph G' which arises from G by identifying the vertices x and y (see Fig. 1).

We say that a graph G[sub 1] is contractible onto a graph G[sub 2] if there is a sequence of contractions along edges which transforms G[sub 1] to G[sub 2].

The automorphism group of a graph G is the group of all permutations p of the vertices of G with the property that p(x)p(y) is an edge of G iff xy is an edge of G. A group H of permutations acting on a set V is called semiregular if for each x is an element of V, the stabilizer H[sub x] : = {h is an element of H | x[sup h] = x} consists of the identity only, where x[sup h] denotes the image of x under h. If H is transitive and semiregular, then it is regular.

Cantor-Schroder-Bernstein Theorem

Our first example is a graph-theoretical proof of the classical result of Schroder and Bernstein. Actually, the theorem was stated by Cantor, who did not give a proof. The theorem was proved independently by Schroder (1896) and Bernstein (1905). The idea behind the proof presented here can be found in [8].

Theorem (Cantor-Schroder-Bernstein): Let A and B be sets. If there is an injective mapping f: A right arrow B and an injective mapping g: B right arrow A, then there is a bijection from A onto B, that is, A and B have the same cardinality.

Proof. Without loss of generality, we may assume that A and B are disjoint. Define a bipartite graph G = (A, B; E), where xy is an element of E if and only if either f(x) = y or g(y) = x, x is an element of A, y is an element of B. By our hypothesis, 1 </= deg v </= 2 for each vertex v of G. Therefore, each component of G is either a one-way infinite path (i.e., a path of the form x[sub 0], X[sub 1], ..., x[sub n], ...), or a two-way infinite path (of the form ... X[sub -n], x[sub -n + 1], ..., x[sub -1], x[sub 0], X[sub 1], ..., X[sub n], ...), or a cycle of even length with more than two vertices, or an edge. Note that a finite path of length >/= 2 cannot be a component of G. Hence, there is in each component a set of edges such that each vertex in the component is incident with precisely one of these edges. Hence, in each component, the subset of vertices from A is of the same cardinality as the subset of vertices from B.

Fermat's (Little) Theorem

There are many proofs of Fermat's Little Theorem, even short algebraic or number-theoretic proofs. The first known proof of the theorem was given by Euler, in his letter of 6 March 1742 to Goldbach. The idea of the graphtheoretic one presented below can be found in [5] where this method, together with some number-theoretic results, was used to prove Euler's generalization to nonprime modulus.

Theorem (Fermat): Let p be a prime such that a is not divisible by p. Then, a[sup p] - a is divisible by p.

Proof Consider the graph G = (V, E), where V is the set of all sequences (a[sub 1], a[sub 2], ..., a[sub p]) of natural numbers between 1 and a (inclusive), with a[sub i] Is not equal to a[sub j] for some i Is not equal to j. Clearly, V has a[sup p] - a elements. For any u is an element of V, u = (u[sub 1], ..., u[sub p-1], u[sub p]), let us say that uv is an element of E just in case v = (u[sub p], u[sub 1], ..., u[sub p-1]). Clearly, each vertex of G is of degree 2, so each component of G is a cycle, of length p. But then, the number of components must be (a[sup p] - a)/p, so p|a[sup p] - a.

Nielson-Schreier Theorem

Let H be a group and S be a set of generators of H. Then, a product of generators and their inverses which equals (the identity) 1 is called a trivial relation among the generators in S if 1 can be obtained from that product by repeatedly replacing xx[sup -1] or x[sup -1]x by 1, otherwise such a product is called a nontrivial relation. A group H is free if H has a set of generators such that all relations among the generators are trivial. In [1] Babai proved the Nielson-Schreier Theorem on subgroups of free groups, as well as other results in diverse areas, from his "Contraction Lemma." The particular case of this lemma when G is a tree, and its use in proving the Nielson-Schreier Theorem, was also observed by Serre [12, Chap. 1, Sec. 3]. The proof of the Contraction Lemma below is somewhat technical, although it uses only the ideas from group theory and graph theory we have already recalled, and is omitted here.

Contraction Lemma. Let H be a semiregular subgroup of the automorphism group ora connected graph G. Then, G is contractible onto some Cayley graph of H.

If H is a group and h is an element of H, consider the permutation h[sub R] of H obtained by multiplying all the elements of H on the right by h. The collection H[sub R] = {h[sub R]: h is an element of H} is a regular group of permutations (under composition) and is called the (right) regular permutation representation of H.

It is known [1] that G is a Cayley graph of the group H if and only if G is connected and H[sub R] is a subgroup of the automorphism group of G.

Corollary. If J is a subgroup of a group H, then any G(H, S) is contractible onto G(J, T) for some set T of generators of J.

Proof H[sub R], the regular representation of H, acts naturally as a regular permutation group on G(H, S), which is connected. Thus, the subgroup of H[sub R] corresponding to the elements of J is a semiregular subgroup of the automorphism group of G(H, S). Now apply the Contraction Lemma.

Theorem (Nielson-Schreier): Any subgroup of a free qroup is free.

Proof. We first show that in any group H and for any set S of generators of H, the Cayley graph G(H, S) contains a cycle of length > 2 if and only if there is a nontrivial relation among the generators in S. To show this, suppose x[sub 0], x[sub 1], ..., x[sub n] = x[sub 0] is a cycle of G(H, S). Then, there are a[sub i] is an element of S, 1 </= i </= n, such that x[subi -1]a[sup epsilon][sub i] = x[sub i], where epsilon[sub i] is an element of {1, -1}. Hence, x[sub n] = x[sub n-1]a[sup epsilon][sub n] = x[sub n-2]a[sup epsilon][sub n-1] a[sup epsilon][sub n] = ... = x[sub 0]a[sup epsilon][sub 1]a[sup epsilon][sub 2] ... a [sup epsilon][sub n], i.e., the identity 1 = a[sup epsilon][sub 1] a[sup epsilon][sub 2] ... a[sup epsilon][sub n]. If this were a trivial relation, then there would exist an integer i, 1 </= i </= n, such that a[sub i] = a[sub i-1] and epsilon[sub i] = -epsilon[sub i + 1]. However, this implies that x[sub i-1] = x[sub i + 1], a contradiction. Similarly, if a[sup epsilon][sub 1] ... a[sup epsilon][sub n] = 1 is a nontrivial relation, then x[sub 0], x[sub 1], ..., x[sub n-1], x[sub n], where x[sub i] = x[sub i-1]a[sup epsilon][sub i] 1 </= i </= n, and x[sub 0] = x[sub n], is a closed trail in G(H, S), which must contain a cycle.

Suppose now that H is a free group, S a minimal set of generators of H, and J a subgroup of H. Since there is no nontrivial relation on the elements of S, G(H, S) does not contain a cycle. Also, from the corollary above, G(H, S) is contractible onto G(J, T) for some set T of generators of J. Because any contraction of a cycle-free graph is again cycle-free, G(J, T) must be cycle-free, and, thus, there is no nontrivial relation on the elements of T. Hence, J must be a free group, freely generated by T.

In [7] the interested reader may further pursue the substantial use of elementary graph theory in giving simplified proofs of important theorems in combinatorial group theory.

Existence of a Nonmeasurable Set

The following proof of the existence of a subset of the real numbers R which is non-measurable in the Lebesgue sense is due to Thomas [15]. He wrote his paper while an undergraduate student. We realize that many readers may still prefer Vitali's proof. However, it is quite unexpected that this theorem can be reduced to the theorem below, an easily proved result in measure theory, by using only discrete mathematics.

A simple, well-known result from graph theory says that a graph (V, E) is bipartite if and only if all its cycles are of even length. For a proof, it suffices to prove it for connected graphs only. Choose any x is an element of V and define V[sub 1] = {y is an element of V: any path connecting x and y is of odd length} and V[sub 2] = {y is an element of V: any path connecting x and y is of even length}. Since there are no odd cycles in (V, E), any two paths connecting x and y are of the same parity, so V[sub 1] and V[sub 2] yield a partition of V. From the definitions of V[sub 1] and V[sub 2], it follows that no edge has both vertices in the same V[sub i], so the graph is bipartite. The second implication is obvious.

Consider now the graph T = (R, E), where xy is an element of E if and only if |x - y| = 3[sup k], with k an integer. In order to show that T is bipartite, suppose that X[sub 0], X[sub 1], x[sub 2], ..., x[sub n-1], x[sub n] = X[sub 0] is a cycle of T of length n. Then, by definition of T, x[sub n] = x[sub n-1] +/- 3[sup k][sub n] = X[sub n-2> +/- 3[sup k][sub n] +/- 3[sup k][sub n] = ... = x[sub 0] +/- 3[sup k][sub 1] +/-3[sup k][sub 2] +/- ... +/- 3[sup k][sub 3] and, thus,

Multiple line equation(s) cannot be represented in ASCII text

where {k[sub i]}[sup n][sub 1] is a set of integers. Multiplying both sides by 3[sp N], where N is an integer such that N + k[sub i] > 0, 1 </=+ i </= n, yields

Multiple line equation(s) cannot be represented in ASCII text

which implies that n is even, since otherwise the left side of the above equation is odd, a contradiction. Thus, T is bipartite.

Hence, there are sets A and B with A intersection cap B = *[This character cannot be converted to ASCII text], A union cup B = R, such that each edge of T is incident with one vertex in A and the other vertex in B. If both A and B were measurable, then at least one of them, say A, would have positive measure. Furthermore, for each integer k, A + 3[sup k] subset or is equal to B, which yields A intersection cup (A + 3[sup k]) = *[This character cannot be converted to ASCII text]. Since 3[sup k] right arrow 0 as k right arrow -infinity, this contradicts the following theorem, which is a standard result in measure theory. For the convenience of the reader, we include the proof from [15].

Theorem. Let M be a set of real numbers with positive Lebesgue measure. Then, there exists a delta > 0 such that for every x is an element of R, |x| < delta, M union cup (M + x) Is not equal to *[This character cannot be converted to ASCII text].

Proof. Find a closed set F and an open set G with F subset or is equal to M and F subset G such that 3lambda(G) < 4lambda(F) (where lambda is Lebesgue measure). Since G is a countable union of disjoint open intervals, there is one among them, say I, such that 3lambda(I) < 4lambda(F intersection cup I). Let delta = 1/2lambda(I) and suppose that |x| < lambda. Then, I union cup (x + I) is an interval of length less than 3/2lambda(I) which contains both F intersection cup I and x + (F intersection cup I). The last two sets cannot be disjoint, since otherwise

Multiple line equation(s) cannot be represented in ASCII text

which is a contradiction. Hence, *[This character cannot be converted to ASCII text] Is not equal to (F intersection cup I) union cup (x + (F intersection cup I)) subset or is equal to M intersection cup (x + M), completing the proof.

Remark. It is well known that a nonmeasurable set cannot be constructed without using the axiom of choice. Our graph T is not connected, and, in fact, each component of T has only a countable number of vertices. Thus, to define A and B, we need to make use of this axiom.

Sharkovsky's Theorem

Let f: R right arrow R be a continuous function. A point x is an element of R is called a k-periodic point off if f[sup k](x) = x and f[sup i](x) Is not equal to x for i = 1, 2, ..., k - 1. Here, f[sup n] is the nth iterate off, i.e., f[sup n] = f*[This character cannot be converted to ASCII text]f[sup n-1].

If f has a k-periodic point, is it necessary that f have an m-periodic point for some m Is not equal to k?

In 1964, Sharkovsky [13] gave a complete and amazing answer to this question with the following

Theorem. Let f: R right arrow R be a continuous function with a k-periodic point. Then, f has an m-periodic point if k precedes m in the following ordering (S) of all the natural numbers:

Multiple line equation(s) cannot be represented in ASCII text

This is best possible, since whenever k and m are natural numbers and m precedes k, there exists a continuous function f: R right arrow R with a k-periodic point, but no m-periodic point.

The original proof by Sharkovsky is very complicated, and, later, several mathematicians presented much simpler proofs. In some of them, graph theory was used, with the most important step being made by Straffin [14]. He defined a digraph associated with a periodic point of a function and proved the crucial result.

For this purpose, let x be a k-periodic point of a function f. Then, the distinct values [x, f(x), f[sup 2](x), ..., f[sup k-1] (x)} determine k - 1 finite intervals I[sub 1], I[sub 2], ..., I[sub k-1], labeled from left to right, after locating these numbers in their natural order on the x (and y) axis (see, for example, Fig. 2). Define a digraph G = (V, E) by V = {I[sub 1], ..., I[sub k-1]} with (I[sub i], I[sub j]) is an element of E whenever f(I[sub i]) contains or is equal to I[sub j]. For example the digraph corresponding to the 4-periodic point x = 0 of f, seen in Fig. 2, is the graph given in Fig. 3.

A closed trail in a digraph is said to be nonrepetitive if it does not consist entirely of a cycle of smaller length traced several times. For example, the digraph in Fig. 3 has nonrepetitive trails of lengths 1 and 2 only. Now, we are able to state Straffin's theorem, which turns the problem of the existence of a periodic point into a problem about the corresponding digraph.

Theorem [14]. If the digraph associated with a k-periodic point of a function f has a nonrepetitive closed trail of length m, then f has an m-periodic point.

Figure 4 shows the digraph associated with any 3-periodic point of a function. Clearly, this digraph contains a nonrepetitive closed trail of arbitrary length, showing that the existence of a 3-periodic point off implies that f has periodic points of all orders. This special case, and other results on systems with 3-periodic points, were proved in 1975 by Li and Yorke [9], when Sharkovsky's theorem was still little noticed.

The reader is referred to Straffin's one-page proof of his theorem above, which is modeled after Li and Yorke's. Straffin's proof makes essential use of two lemmas which are standard in analysis courses:

Lemma 1. Suppose I and J are closed intervals, f continuous, and J Subset f(I). Then there is a closed interval Q Subset I such that f(Q) = J.

Lemma 2. Suppose I is a closed interval, f continuous, and I Subset f(I). Then, f has a fixed point in I.

Using his theorem above, Straffin proved some parts of Sharkovsky's Theorem, and his approach subsequently allowed several authors to complete the proof (see [3, 6]). In the proof of Sharkovsky's Theorem presented in [2] graphs were used without applying Straffin's result. To give some of the flavor of the proofs in [3, 6] we sketch the proof of a partial result, showing that in the ordering S, all even integers lie after all the odd integers (see [6]).

Theorem. If a continuous function f: R right arrow R has a point of odd period 2n + 1 (n >/ = 1), then it has periodic points of all even periods.

Proof (sketch). For n = 1, the proof was given above. Now, suppose n > 1 and assume by way of induction that the theorem is true whenever f has a point of odd period 2m + 1, where 3 </ = 2m + 1 < 2n + 1. Straffin proved generally that the digraph corresponding to a periodic point of period k contains a closed trail of length k in which some vertex is repeated exactly twice. In our case, k = 2n + 1, and this closed trail can, therefore, be decomposed into two closed nonrepetitive trails, one of which has odd length, say 2m + 1 < 2n + 1. If this closed trail is of length greater than one, the assertion follows by our induction assumption and the previous theorem. If not, then Straffin proved that our digraph must contain the directed subgraph given in Fig. 5. This subgraph has a cycle of length 2, and one of length 4. For any even number t > 4, we may begin a nonrepetitive closed trail of length t at the bottom right-hand vertex, traverse the 4-cycle once, and follow this by traversing the 2-cycle exactly (t - 4)/2 times. By the previous theorem, the existence of all even periods follows.

WHAT IS ANCIENT MATHEMATICS?

Source: Mathematical Intelligencer, Summer99, Vol. 21 Issue 3, p38, 10p, 10 diagrams, 1bw Author(s): Vardi, Ilan

In my opinion, it is not only the serious accomplishments of great and good men which are worthy of being recorded, but also their amusements. XENOPHON, SYMPOSlUM

The title of this paper is a result of comments on earlier drafts by mathematicians: "This is not mathematics, this is history!" and by historians of mathematics: "This is not history, this is mathematics!" After some reflection, I came to the conclusion that the historians were right and the mathematicians were wrong--for example, I have found little difference between reading papers of Atle Selberg (1917-, Fields Medal 1950) and Archimedes (287-212 BC) (who both lived in Syracuse!). I believe that the mathematicians I spoke to were expressing a generally held belief that reading mathematical papers that are over a hundred years old is history of mathematics, not mathematics. Thus, the reconstruction of Heegner's solution to the class-number-one problem (1952) appeared in a mathematics journal [52], while a reconstruction of the missing portions of Archimedes's The Method (250 BC) appeared in a history journal [29].

To me, reading and proving results about a mathematical paper, whether it was written in 1950 or 250 BC, is always mathematics, though the latter case might be called "ancient mathematics." At least as to Greece, this is accepted by some eminent mathematicians [30, p. 21]:

Oriental mathematics may be an interesting curiosity, but Greek mathematics is the real thing ... The Greeks, as Littlewood said to me once, are not clever schoolboys o r "scholarship candidates, " but "Fellows of another college." So Greek mathematics is "permanent, " more perhaps even than Greek literature. Archimedes will be remembered when Aeschylus is forgotten, because languages die and mathematical ideas do not.

I am saying that ancient Greek mathematicians were in every essential way similar to modern mathematicians. In fact, some mathematicians might find more in common with Archimedes and Euclid than with many colleagues of their departments, and even reading the original Greek--a subject traditionally taught in High School [9]--seems easier than understanding, say, the proof that every semistable elliptic curve is modular [59].

Nineteenth-century mathematicians dedicated much of their research to elementary Euclidean geometry. It is possible that some mathematicians of that era felt that the influence of the past was too great, as Felix Klein wrote [38, Vol. 2, p. 189]:

Although the Greeks worked fruitfully, not only in geometry but also in the most varied fields of mathematics, nevertheless we today have gone beyond them everywhere and certainly also in geometry.

For whatever reason, geometers recently tend to distance themselves from Euclidean geometry. For example, the book Unsolved Problems in Geometry [16], part, of a series on "unsolved problems in intuitive mathematics, " does not have a section devoted to classical Euclidean geometry, and with few exceptions, such as [10], articles on this subject are relegated to "lowbrow" publications. Yet earlier in this century, Bieberbach, Hadamard, and Lebesgue all wrote books on elementary Euclidean geometry [13] [27] [44], and excellent books and articles on ancient mathematics are still being written [31] [55]. See [17] for further analysis of these issues.

In this paper, I will give an example of ancient mathematics by using techniques that Archimedes developed in his paper The Method to derive results that he proved in his paper On Spirals. I will try to present these in a way that Archimedes might understand [57], in particular, the diagrams are intended to conform to ancient Greek standards [46]. I will also indicate how ideas in these papers can lead to some surprising results (e.g., Exercise 4 below). The paper will include such exercises as may challenge the reader to understand concepts of Archimedes as he expressed them.

(I have concentrated on the works of Archimedes because these are most similar to modern mathematical research papers, sharply focused on problems and their solution. By comparison, the works of Euclid read like a generic textbook; and so little is known about Euclid that it cannot be ruled out that he was actually a "consortium." Moreover, it seems likely that the works of Euclid are based on the efforts of earlier mathematicians [24] [39].[1])

The balance of the paper shows how a precise knowledge of ancient mathematics allows one to navigate in the sea of inaccuracies and misconceptions written about the history of mathematics. This also gives one perspective on cultural aspects of mathematics, as it forces one to understand ideas of first-rate mathematicians whose cultural background is very different from the present one. For example, it can help you read The New York Times [37]:

"Alien intelligences may be so far advanced that their math would simply be too hard for us to grasp, " [Paul] Davies said. "The calculus would have baffled Pythagoras, but with suitable tuition he would have accepted it."

Reading this paper should make it clear that Archimedes could have been Pythagoras's calculus tutor, thus refuting any implication that calculus was an unknown concept to ancient Greeks.

It is my hope that I can convince mathematicians that there are many interesting and relevant ideas to be uncovered in ancient Greek mathematics, and that it might be worthwhile to take a first-hand look, being wary of popular accounts and secondary sources, this one included!

Extending Archimedes's Method

In 1906 the Danish philologist J.L. Heiberg went to Constantinople to examine a manuscript containing mathematical writing which had been discovered seven years earlier in the monastery of the Holy Sepulchre at Jerusalem. What he found was a 10th-century palimpsest--a parchment containing works of Archimedes that, sometime between the 12th and 14th centuries, had been partially erased and overwritten by religious text. Heiberg managed to decipher the manuscript [33] and found that it included a text of The Method, a work of Archimedes previously thought lost. (The story of the transmission of Archimedean manuscripts given in [18] reads like a chapter from The Maltese Falcon. Late bulletin: Heiberg's palimpsest was sold by Christie's for $2, 000, 000--see Jeremy Grey's article in this issue.)

This discovery had a significant impact on the understanding of ancient Greek mathematics, for two reasons. The first is the aim of the paper, summarized by Archimedes[2] [54, Vol. 2, p. 221]:

Moreover, seeing in you, as I say, a zealous student and a man of considerable eminence in philosophy, who gives due honour to mathematical inquiries when they arise, I have thought fit to write out for you and explain in detail in the same book the peculiarity of a certain method, with which furnished you will be able to make a beginning in the investigation by mechanics of some of the problems in mathematics. I am persuaded that this method is no less useful even for the proofs the theorems themselves. For some things first became clear to me by mechanics, though they had later to be proved geometrically owing to the fact that investigation by this method does not amount to actual proof; but it is, of course, easier to provide the proof when some knowledge of the things sought has been acquired by this method rather than to seek it with no prior knowledge.

This is a radical divergence from all other extant Greek works, as T.L. Heath explains [6, Supplement, p. 6]:

Nothing is more characteristic of the classical works of the great geometers of Greece, or more tantalising, than the absolve of any indication of the steps by which they worked their ways to the discovery of their great theorems. As they have come down to us, these theorems are finished masterpieces which leave no traces of any rough-hewn stage, no hint of the method by which they were evolved ... A partial exception is now furnished by The Method; for here we have a sort of lifting of the veil, a glimpse of the interior of Archimedes' workshop as it were.

The other surprising aspect of The Method is the revelation that Archimedes worked with infinitesimals, for example, "The triangle GammaZA is composed of the straight lines drawn in GammaZA" [54, Vol. 2, p. 227], "The cylinder, the sphere and the cone being filled by circles thus taken." [8, vol. 3, p. 91], see [1] [42]. As every mathematician knows, infinitesimals were reinvented by mathematicians such as Cavalieri (1598-1647) and Leibniz (1646-1716), see [2] [21]. Archimedes used them to compute the area and volumes of various geometrical figures including what he considered his greatest achievement:[3]

Any cylinder having for its base the greatest of the circles in the sphere, and having its height equal to the diameter of the sphere, is one-and-a-half times the sphere, a result he subsequently proved, On the Sphere and Cylinder, I, Corollary to Proposition 34 [54, Vol. 2, p. 125]. Archimedes understood that his method does not produce valid proofs due to its use of infinitesimals, [4] though it is unclear if the same is true of his successors. In any case, it is easy to make the arguments rigorous, given present knowledge. The basic ideas of The Method are still presented in contemporary calculus courses [35] [51, p. 709], and a physical model of Archimedes's argument has been built [25].

On the other hand, Archimedes's On Spirals is a masterpiece of rigorous mathematics. In this paper, Archimedes computes the area and tangent of a spiral, and, in doing so, derives much of the Calculus I curriculum, including related rates, limits, tangents, and the evaluation of Riemann sums. This is reflected by the fact that a number of contemporary Calculus texts outline the basic idea of Archimedes's computation of the area of a spiral [3, p. 3] [12, p. 75], though both these works avoid technical difficulties by substituting a parabola, but then incorrectly imply that Archimedes used such an approach for the parabola [3, p. 8] [12, p. 75]. The considerable length of the paper is a consequence of proving these results from basic principles. Unfortunately, it does not yet have a faithful English translation [56]; Heath's intent in [6] was to capture the modern flavor of Archimedes's works in order to make them more accessible. A generally faithful French translation, including the Greek text, is available [8].

The mechanical method does not seem to produce directly the area of a spiral, or even the area of a circle (also computed by Archimedes), so one might wonder how he first derived them. W.R. Knorr [40] has suggested that the writings of Pappus of Alexandria (fourth century AD) indicate that Archimedes wrote an earlier version of On Spirals which used a different argument to compute the area of the spiral, but then rejected it as inelegant (this approach is developed in the solution to Exercise 1). The object of the next section is to show how a natural extension of the mechanical method easily produces these results.

Weighing a Spiral

The Method relies on a mechanical analogy by using a balance to compare objects. This requires a few simple assumptions and facts about the properties of a lever, which are developed (sometimes implicitly) in Archimedes's On the Equilibrium of Planes I, [6] [18, Chapter IX]. These can be summarized by

Assumption 1. Two objects will balance each other if the distances of their centers of gravity to the fulcrum are inversely proportional to their weights. The center of gravity of an object lies on an axis of symmetry.

When only the weight of an object is relevant to an argument, I will place it on a pan suspended from the balance. The object and any of its sections will then be assumed to have their centers of gravity at the point where the pan is suspended. I will also make extra assumptions not seen in Archimedes's works (however, see the solution to Exercise 4).

Assumption 2. A plane figure is composed of circular arcs with common center, and each circular arc weighs the same as a line segment of equal length.

Exercise 1. What happens if you instead decompose plane figures into radii with common center?

I will first show how the mechanical method can be used to derive Archimedes's formula for the area of a circle given in Measurement of the Circle, Proposition 1 [54, Vol. 1, p. 317].[5]

Proposition 1. Any circle is equal to a right-angled triangle in which one of the sides about the right angle is equal to the radius, and the base is equal to the circumference.

Exercise 2. Explain why Proposition 1 is equivalent to the familiar formula: Area of a circle = piR[sup 2].

Suspend two pans on opposite sides of a balance and at equal distances to the fulcrum. On one pan, place a circle with center at A and radius AB, on the other place a line segment CD of length AB. By Assumption 2, the circle is composed of circumferences with center A and radius AE for any E lying on AB. For each such circumference, place a line segment FG perpendicular to CD, of length the circumference through E, such that its endpoint F lies on CD and CF is equal to AE. By Assumption 2, the line segment FG is in equilibrium with the circumference through E. The resulting figure is a right triangle of height AB, base the circumference through B, and it balances a circle of radius AB, which is the statement of Proposition 1.

Exercise 3. Why is the resulting figure in this construction a triangle?

Exercise 4. Generalize the following heuristic from The Method [6, Supplement]: "... judging from the fact that any circle is equal to a triangle with base equal to the circumference and height equal to the radius of the circle, I apprehended that, in like manner, any sphere is equal to a cone with base equal to the surface of the sphere and height equal to the radius."

Archimedes's definition of a spiral and its relevant components is given by [54, Vol. 2, p. 183]:

1. If a straight line drawn in a plane revolve uniformly

any number of times about a fixed extremity until it

return to its original position, and if, at the same

time as the line revolves, a point move uniformly along

the straight line, beginning at the fixed extremity, the

point will describe a spiral in the plane.

2. Let the extremity of the straight line which remains

fixed while the straight line revolves be called the

origin of the spiral.

3. Let the position of the line, from which the straight

line began to revolve, be called the initial line of the

revolution.

Proposition 2. The area inside a spiral anywhere within its first revolution is one third the sector of a circle with center at the origin of the spiral, radius equal to the distance of the point describing the spiral to the origin, and angle equal to the angle between the line and the initial line. (Archimedes gave areas for complete revolutions only, but his proof also applies to this case.)

Consider a spiral with origin A, initial line AB, and C the position of the point describing the spiral. Consider also a balance arm DE of length twice AC and let the midpoint F of DE be the fulcrum. On this balance suspend a pan from D and place the spiral region in the pan.

By Assumption 2, the spiral region is composed of arcs GH for each G lying on AC, where H is the intersection of the circle with center A and radius AG and the spiral. Extend AH to intersect the circle of center A and radius AC at I. Consider a line segment JK of length equal to the arc CI and crossing DE at L such that JK and DE are perpendicular, L is the midpoint of JK, and FL is equal to AG. I claim that JK and the arc GH are in equilibrium. To see this note that, by Exercise 3, the length of an arc is proportional to its radius, so that

arc GH : arc CI :: AG : AC,

and the result follows from the assumption that the arc CI has its center of gravity at D and from Assumption 1. Now extend the arc CI to intersect AB at M; then the arc CI is equal to the arc CIM minus the arc IM, and by the definition of spiral, IM is proportional to AH. Since the arc CIM remains constant in this argument, the second part of Exercise 3 shows that the arc CIM minus JK is proportional to FL, which means that the resulting figure is an isosceles triangle which balances the inside of the spiral.

The exact same argument shows that the area between the spiral and the initial line that lies within the same sector balances the same isosceles triangle, but reversed so that, its vertex lies on the fulcrum. The crucial step is to recall the following

Fact: The center of gravity of a triangle lies at the intersection of the medians, and the medians of a triangle intersect each other in a ratio of 2:1.

The first part is suggested by the observation that a median divides a triangle into two triangles of equal weight. and its proof is one of the main results of On the Equilibrium of Planes L The second part is an easy exercise [15, Section/Sections1.4] and follows from On the Equilibrium of Planes I, Proposition 15, generalized to trapezoids.

This shows that reversing the first triangle places the center of gravity twice as far from the fulcrum, so the second triangle will balance twice the first. One concludes that the inside of the spiral weight one half the outside of the spiral and thus one third of the sector of the circle, which is the statement of Proposition 2.

Exercise 5. Evaluate the area of the spiral using the same procedure as for the circle, i.e., by only comparing weights placed on pans.

Exercise 6. Use the mechanical method to compute the Center of gravity of a spiral region.

A Modern Translation

The basic observation is that Assumption 2 extends Archimedes's method to polar coordinates. Consider a curve r = f(theta) in polar coordinates, where, for simplicity, f(r) is an increasing function, so there is an inverse function theta = g(r) (this notation is more convenient given the difficulties of Exercise 1). To compute the area of a region A lying inside the curve and having 0 </= theta </= Theta, one partitions A into thin circular shells of width h > 0, as in Figure 6. Using the formula thetar[sup 2]/2 for the area of a sector of angle theta and radius r, each shell has area (Theta - theta)rh + R(r, h), where the error R(r, h) is less than the area of the small shell element of area (r + h)h(g(r + h) - g(r)), see Figure 6, and this is less than Crh[sup 2], for some constant C, assuming that g(r) is well behaved. It follows that, ignoring terms of order h[sup 2], the area of each shell is (Theta - theta)rh, which is the length of the bottom arc of the shell multiplied by h. This shows why the first part of Assumption 2 holds. All these shells have area a linear function of h up to an error term of lower order, and form a disjoint union of A, which shows why the second part of Assumption 2 holds. Letting h arrow right 0, it follows that the area of A is

Multiple line equation(s) cannot be represented in ASCII text

The standard derivation of this formula uses the formula rdrd theta for the area element in polar coordinates

Multiple line equation(s) cannot be represented in ASCII text

A circle is simply g(r) = 0, which yields 2pi "[Greek text cannot be converted in ASCI text]"[sup R, sub 0] rdr = piR[sup 2].

A spiral, in polar coordinates, is given by the equation r = atheta, for some constant a, which can be written as theta = kr, where k = 1/a. By the above, the area of the spiral is

Multiple line equation(s) cannot be represented in ASCII text

where the term on the right is seen to be 1/3 the area of the sector of the circle of radius R and angle Theta, yielding Proposition 2.

Any proof of this formula is equivalent to evaluating such integrals. Archimedes evaluated "[Greek text cannot be converted in ASCI text]"[sup R, sub 0]r[sup 2]dr by decomposing it into Riemann sums and obtaining a closed form for the sum 1[sup 2] + ... + n[sup 2]. In Proposition 2 this integral is computed by realizing it as the moment of a triangle and evaluating this as its weight multiplied by the distance of its center of gravity from the fulcrum.

The Way of Archimedes

The Calculus Reform movement has emphasized experimentation over rigor in calculus education and has been criticized as a result [53]. To defend its position that physical problems should be used to discover mathematical results, Harvard Calculus appeals to Archimedes and The Method [35, p. vii]:

The Way of Archimedes: Formal definitions and procedures evolve from the investigation of practical problems.

This principle accurately represents the works of Archimedes, but a disparity arises in that Harvard Calculus postpones mathematical rigor indefinitely; Archimedes's name should not be associated with such an endeavor. For example, the method of exhaustion used by Archimedes is essentially the epsilon-delta argument abandoned by Harvard Calculus, as B.L. van der Waerden writes [58, p. 220]:

... the estimations, which occur in the summing of infinite series and in limit operations, the `epsilontics', as the calculation with an arbitrary small epsilon is sometimes called, were for Archimedes an open book. In this respect, his thinking is entirely modern.

Moreover, Archimedes held in contempt those who did not furnish proofs of their results. In the introduction to On Spirals, Archimedes reveals that he intentionally announced false theorems in order to expose some of his contemporaries [6]:

... I wish now to put them in review one by one, particularly as it happens that there are two among them which [are wrong and which may serve as a warning to] those who claim to discover everything but produce no proofs of the same may be confuted as having actually pretended to discover the impossible.

Harvard Calculus fails miserably when measured against this Way of Archimedes. Apart from the passage quoted above, the word "theorem" appears in [35] only in the name "Fundamental Theorem of Calculus." Compare this with a standard calculus text [22], which lists 130 theorems in its index. Even more revealing, the only instance of the word "proof" I located in [35] was in Archimedes's introduction to the method quoted above and used in [35] to justify "The Way of Archimedes." In fact, this quote emphasizes that discovery of the answer to a problem leads to a theorem whose proof is facilitated by knowledge of the answer. My interpretation is not Calculus Reform but

Problem-Solving: When faced with a problem, use any method that allows you to conjecture the answer, then find a rigorous proof.

A recent development: The second edition of [35] has taken a more moderate approach to Calculus Reform and now includes some complete proofs [35, 2nd Edition, p. 78] and the epsilon-delta definition of a limit [35, 2nd Edition, p. 128]. However, this new edition no longer includes "The Way of Archimedes."

Popular Misconceptions

It must be noted that the penultimate remark of the previous section paraphrases E.T. Bell [11, p. 31]: "In short he used mechanics to advance his mathematics. This is one of his titles to a modern mind: he used anything and everything that suggested itself as a weapon to attack his problems." However, strong opinions such as those expressed in [11] are fraught with danger, and it is instructive to include the continuation of this passage:

To a modern all is fair in war, love, and mathematics; to many of the ancients, mathematics was a stultified game to be played according to the prim rules imposed by the philosophically-minded Plato. According to Plato only a straightedge and a pair of compasses were to be permitted as the implements of construction in geometry. No wonder the classical geometers hammered their heads for centuries against `the three problems of antiquity': to trisect an angle; to construct a cube having double the volume of a given cube; to construct a square equal to a circle.

This has since been discredited, see [24] [41] (better yet, look at original sources, e.g., as collected in [54, Vol. 1, Chapter 9]); and van der Waerden writes [58, p. 263],

The idea, sometimes expressed, that the Greeks only permitted constructions by means of compasses and straight edge, is inadmissible. It is contradicted by the numerous constructions, which have been handed down, for the duplication of the cube and the trisection of the angle.

In particular, Archimedes trisected the angle with ruler and compass in Proposition 8 of The Book of Lemmas [6, p. 309], see [20] [31, Section 31]. The history of this misconception might prove an interesting subject for further study.

Unfortunately, it is only one of a number of popular misconceptions about the limitations of Greek science [56]. For example, Isaac Asimov (1920-1992) has written [5],

To the Greeks, experimentation seemed irrelevant. It interfered with and detracted from th e beauty of pure deduction ... To test a perfect theory with imperfect instruments did not impress the Greek philosophers as a valid way to gain knowledge ... The Greek rationalization for the "cult of uselessness" may similarly have been based on a feeling that to allow mundane knowledge (such as the distance from Athens to Corinth) to intrude on abstract thought was to allow imperfection to enter the Eden of true philosophy. Whatever the rationalization, the Greek thinkers were severely limited by their attitude. Greece was not barren of practical contributions to civilization, but even its great engineer, Archimedes of Syracuse, refused to write about his inventions and discoveries ... to maintain his amateur status, he broadcast only his achievements in pure mathematics.

This passage is contradicted by numerous examples of Greek scientific experiments, for example, Eratosthenes's measurement of the earth [4]. Asimov may be excused for paraphrasing Plutarch's account of Archimedes in his Life of Marcellus, written circa 75 AD [49] [54, Vol. 2, p. 31]:

Yet Archimedes possessed so lofty a spirit, so profound a soul, and such a wealth of scientific inquiry, that although he had acquired through his inventions a name and reputation for divine rather than human intelligence, he would not deign to leave behind a single writing on such subjects. Regarding the business of mechanics and every utilitarian art as ignoble or vulgar, he gave his zealous devotion only to those subjects whose elegance and subtlety are untrammeled by the necessities of life ...

Despite Plutarch's ancient credentials, he had no better insight into Archimedes's scientific contribution, which contradict his story. The reader is already aware that The Method shows that physical considerations played an important role in Greek mathematics. But Asimov and Plutarch are completely refuted by Archimedes in The Sand Reckoner [6] [18]:

While examining this question I have, for my part tried in the following manner, to show with the aid of instruments, the angle subtended by the sun, having its vertex at the eye. Clearly, the exact evaluation of this angle is not easy since neither vision, hands, nor the instruments required to measure this angle are reliable enough to measure it precisely. But this does not seem to me to be the place to discuss this question at length, especially because observations of this type ha ye often been reported. For the purposes of my proposition, it suffices to find an angle that is not greater than the angle subtended at the sun with vertex at the eye and to then find another angle which is not less than the angle subtended by the sun with vertex at the eye.

A long ruler having been placed on a vertical stand placed in the direction the rising sun is seen, a little cylinder was put vertically on the ruler immediately after sunrise. The sun, being at the horizon, can be looked at directly, and the ruler is oriented towards the sun and the eye placed at the end of the ruler. The cylinder being placed between the sun and the eye, occludes the sun. The cylinder is then moved further away from the eye and as soon as a small piece of the sun begins to show itself from each side of the cylinder, it is fixed.

If the eye were really to see from one point, tangents to the cylinder produced from the end of the ruler where the eye was placed would make an angle less than the angle subtended by the sun with vertex at the eye. But since the eyes do not see from a unique point, but from a certain size, one takes a certain size, of round shape, not smaller than the eye and one places it at the extremity of the ruler where the eye was placed ... the width of cylinders producing this effect is not smaller than the dimensions of the eye.

... It is therefore clear that the angle subtended by the sun with vertex at the eye is also smaller than the one hundred and sixty fourth part of a right angle, and greater than the two hundredth part of a right angle.

The correct value of the angular diameter of the sun is now known to average about 34' [26, p. 95], i.e., the 159th part of a right angle. It is important to note that this shows not only that ancient Greeks frequently performed experiments, but that Archimedes dealt with experimental error and also compensated for the fact that the human eye is part of the observational instrument, thus anticipating scientists such as Hermann von Helmoltz (1821-1894) [34]. A translation and analysis of The Sand Reckoner is given in [56].

Answers to Exercises

Exercise 1. A naive approach leads to incorrect results, evidence of the dangers of using infinitesimals, and indicating why Archimedes did not consider his method to be rigorous. For example, taking the radii of a circle of radius R, with respect to the circumference, and reordering them to form a rectangle, yields area 2piR[sup 2]. For a general figure, it's not even clear how to pick the radii. To make sense of what is going on, one regards radii as limits of sectors, i.e., infinitesimal triangles. In the case of the circle, this means that the weight of a radius, with respect to the circumference, is equal to one half its length. This can be loosely interpreted as the argument Archimedes used to compute the area of the circle [1]. In the general case, the following is justified:

Assumption 3. The weight of a radius is proportional to the square of its length.

In modern notation, this is simply

Multiple line equation(s) cannot be represented in ASCII text

where the radii have been chosen with respect to the unit circle. Given Assumption 3, one can compute the area of the spiral by using Pappus's argument [48, Book 4, Proposition 21], see also [32, p. 377] [41, p. 162].

To compute the weight of a spiral region, take each radius of the spiral, starting from the final radius, and place a disk with diameter equal to this radius at height the current angle so the resulting figure is a cone. Similarly, for each radius of the sector place a disk with diameter equal to this radius at height the current angle, resulting in a cylinder with the same base and height as the cone.

Since Euclid's Proposition 2 of Book 12 proves that "circles are to one another as the squares on the diameter, " Assumption 3 shows that the ratio of the weight of the spiral region to the weight of the sector is the same as the ratio of the volume of the cone to the volume of the cylinder. But Euclid's Proposition 10 of Book 12 proved that the volume of a cone is one third the cylinder with the same base and height, so the spiral weighs one third of the sector, which is the statement of Proposition 2. (Note that equilateral triangles could have been used instead of circles resulting in a pyramid whose volume is easier to compute.)

Knorr [40] comments that this appeal to three-dimensional figures might have been considered inelegant by Archimedes as it uses volumes to compute areas. On the other hand, reversing this argument and using the evaluation above shows that the volume of a cone can be computed by the mechanical method, a result which does not appear in The Method.

Exercise 2. In modern notation, Archimedes's formulation of Proposition 1 is Area of circle of radius R = Integral[sup R, sub 0] 2pirdr, for the integral represents the area of a right triangle with base R and height 2piR.

Exercise 3. This is equivalent to the fact that the length of an arc of fixed angle is proportional to its radius. In particular, pi exists, see [45] [56]. The proof is similar to [23, Book 12, Proposition 2] cited in Exercise 1, and is implicit in Archimedes's Measurement of the Circle. Similarly, the length of an arc of fixed radius is proportional to its angle.

Exercise 4. By analogy with Assumption 2, consider a sphere as being composed of spherical shells centered at the center of the sphere, where each shell weighs the same as a circle of equal area. The justification follows exactly as in Proposition 2: Consider two pans suspended at equal distances from the fulcrum of a balance. On one pan, place a sphere of center A and radius AB and on the other a line CD of length equal to AB. For each E on AB there is a spherical shell passing through E, and consider a circle of area equal to this spherical shell with center at F lying on CD, where CF equals AE, and such that the circle is perpendicular to CD. The resulting figure is a cone with base the area of the sphere and height the radius of the sphere; since it balances the sphere, the claim is justified.

The similarity of this argument to the one of Proposition 1 suggests that Archimedes may have been implicitly aware of the ideas of this paper. Moreover, the reader may verify that the heuristic of this exercise and its justification directly generalize to higher dimensions (a different generalization is given in [19]):

Proposition 3. The volume of an n-dimensional ball is equal to the volume of a cone whose base has n - 1-dimensional volume equal to the (n - 1)-dimensional volume of the boundary of the ball and height equal to the radius of the ball.

Exercise 5. The procedure, when applied to the spiral, yields a section of a parabola. The general formula for such areas was computed by Archimedes in The Quadrature of the Parabola, and in this case it states that the resulting area is four-thirds the triangle with same base and height as the section of the parabola. Since the height and base are equal to the final radius and half the final radius, respectively, Proposition 2 follows.

Exercise 6. Further extensions of Archimedes's method could be a subject for investigation. As Archimedes wrote in The Method [6, Supplement, p.13],

I deem it necessary to expound the method partly because I have already spoken of it but equally because I am persuaded that it will be of no little service to mathematics; for I apprehend that some, either of my contemporaries or of my successors, will, by means of the method when once established, be able to discover other theorems in addition, which have not yet occurred to me.

Acknowledgment

I would like to thank Alain Herreman, Reviel Netz, and David Wilkins for helpful comments.

1 Hence I question the curriculum of St. John's College, which purports to educate its students by following an historical sequence of original sources. Its reading list also includes the ancient textbook [47].

2 Archimedes is addressing Eratosthenes of Cyrene (circa 284-194 B.C), director of the library of Alexandria, famous for his accurate measurement of the circumference of the earth [14] and his sieve to compute prime numbers [47].

3 Archimedes requested that a diagram of a sphere inscribed in a cylinder along with their proportion be placed on his grave, which Cicero reported finding in 75 B.C. when he was treasurer of Sicily [54, Vol. 2, p. 33].

4 In The Quadrature of the Parabola Archimedes gave what he considered to be a rigorous proof using the mechanical method of a result conjectured in a similar way in The Method, but using infinitesimals.

5 A similar method was used by Rabbi Abraham bar Hiyya (1070-1136), see V.J. Katz, review of "Force and Geometry in Newton's Principia" by F. de Gandt, American Math. Monthly 105 (1998), 386-392 and F. Sanchez-Faba, Abraham Bar Hiyya and his "Libro de Geometria, " (Spannish) Gac. Math., I 32 (1998), 101-115.

Соседние файлы в папке 3