The Impact of Mathematical Developments in the
Derivation of p
Daniel Y.
Chong
Contents
Abstract
Mathematicians
throughout history have been fascinated with p. For millennia, they have focused on finding as many
digits of p as possible. We have
come such a long way from the crude estimations given by ancient mathematicians
that it seems only natural to wonder – how have mathematical advances through
history affected mathematicians’ approach to calculating p? The origin of this constant, defined as the ratio of
a circle’s circumference to its diameter, lies in geometry. Mathematicians such
as Archimedes used geometric principles to work out rudimentary approximations
for p. As new branches of
mathematics have developed, formulae for p have become increasingly sophisticated and
accordingly, have become faster. Trigonometry extended geometric formulae by
allowing mathematicians to work with expressions involving angles, and algebra
allowed for easier manipulation of these expressions. But it was not until the
invention of calculus that the approximation of p truly began to see significant improvement. Newton,
Liebniz, and others came derived many infinite series and products for p. Next came Euler, whose identities and series
provided much better approximations than ever before. Mathematicians have since
continued to find more and more digits; as of 1995, the known digits of p exceeds six billion.
The Impact of Mathematical Developments in the Derivation of Pi
It is defined as the ratio of
the circumference of a circle to its diameter. Denoted by the Greek letter p and approximated by the ever-familiar 3.14159, its
knowledge dates back to pre-history; as early as 2000 BC the ancient civilizations
of Babylon and Egypt knew of its existence (CECM, 1996). However, as
mathematics progressed, the repeated occurrences of p in higher mathematics has both startled and fascinated
mathematicians for centuries. Man’s fascination for p has led him to convert it from an abstract concept to
a concrete string of numbers. Although early attempts at calculating it have
been understandably imperfect, hampered by a dearth of any mathematical
knowledge beyond rudimentary geometry and simple arithmetic, in more recent
years man has had greater success at calculating its digits, which leads one to
wonder: how have developments in the mathematics impacted man’s quest to
uncover the digits of p? With the understanding that as man’s inevitable quest for knowledge
and progress has led to the development of higher mathematics, which in turn
has led to more sophisticated and accurate derivations for p, we will examine, in roughly chronological order, the
impact of the developments of geometry, algebra, trigonometry, and calculus on
the derivation of p.
It can be said that geometry,
next to arithmetic, was the first branch of mathematics to be explored by early
man. Indeed, to the ancient Greeks, mathematics was entirely geometry. This is
apparent when one considers, among other things, the Pythagoreans’ practice of
representing numbers by arranging stones on the ground, the importance placed
in precise construction using a straight-edge and a compass, and Euclid, author
of the Elements, a work which so
profoundly impacted the development of geometry that its effects are evident
even today, nearly two-and-a-half millennia after it was written.
Ancient mathematicians, when
calculating p, relied almost
completely on an underdeveloped system of geometry to aid them, so early approximations
of p were coarse at best. For
example, the Ahmes Papyrus, which, originating from Egypt, is among the oldest
known writing on mathematics, makes the incorrect assumption that a circle with
a radius of 4.5 is equal in area to a square with side 8. (Beckmann, 1971).
Given this, the value Ahmes used for p is clear:

The method of deriving a value
for p used by Archimedes was
much more sophisticated than that used by Ahmes, who seems to have simply made
an invalid assumption. Archimedes reasoned that the perimeter of an n-sided polygon inscribed in a circle
approximates the circumference of the circle. This was nothing new; many
mathematicians before him utilized the same principle in calculating p themselves. But Archimedes took it one step further.
He realized that the approximation given by an inscribed polygon will always
fall short of the actual circumference. To compensate for this, he
circumscribed the same polygon about the circle as well. Thus, instead of
calculating a simple approximation for p, he obtained a range in which p lay (O’Connor and Robertson, 1996). Beginning with a
hexagon (n = 6), he continuously
doubled n until he obtained n = 96, with which he calculated
![]()
a
remarkable approximation, especially in light of the fact that Archimedes
accomplished this without the use of trigonometry, whose half-angle formulae
would have greatly simplified and aided his work.
The next branch of mathematics
to be developed was algebra. Although the ancient Greeks used geometric
construction to solve many problems which can be solved algebraically, the
first mathematician to make the transition from a geometric process to an
algebraic one is Diophantus (circa
275 AD). Following Diophantus was the Arab mathematician Mohammed ibn Musa
al-Khwarizmi, who authored a booked titled Hisab
al-jabr w’al-muqbala, or Al-jabr
for short, from which the word algebra
originates (Hollingdale, 1991). In his book, al-Khwarizmi formulated many of
the foundations of algebra, namely the equivalency postulates of arithmetic
(additive/multiplicative properties of equality, etc.), presenting their proofs
in geometric form. Through the next few centuries, improvements in algebra
abound with such discoveries as the quadratic equation, solutions of
higher-degree polynomials, and complex numbers. It was not until about the 17th
century that a universal set of symbols describing algebraic operations and processes
was fully developed and adopted.
The effects of algebra on
derivations of p are not as readily apparent as they were for other branches of
mathematics, such as trigonometry and calculus. This is due in part to the fact
that, around the time of its development, many of the equations and formulae
involving p were not yet complicated
enough to necessitate the use of symbolic representation; geometric or even
verbal representation was often sufficient. However, on a less direct level,
algebra has strongly impacted the derivation of p, at least in the sense that it has served as the
foundation of higher mathematics, namely analytic geometry. The latter is especially
important because it allowed mathematicians to view a figure not as a
collection of lines on a piece of paper, but rather as a relationship of points
on a coordinate plane. The mathematician and philosopher René Descartes is
often credited with satisfactorily blending algebra and geometry to form
analytic geometry (Beckmann, 1971). Indeed, it is from his name that the modern
term Cartesian geometry arises. The
development of analytic geometry has also played an integral role in the
development of calculus; the definitions of the derivative as the slope of a
curve and the integral as the area under a curve are meaningless unless the relation
between algebra and geometry is well understood.
It can be said that the modern
concept of trigonometry originated in the late 16th and early 17th
centuries, through the works of such mathematicians as Viète, Fincke, and
Pitiscus, who began to develop methods of calculations of right triangles using
the six trigonometric functions familiar to us today. Up until that time,
trigonometry was still largely treated as the study of the lengths of arcs and
chords of a given circle. A comprehensive table of chords, corresponding to a
modern table of sines, had been compiled as early the second century AD by
Ptolemy (Smith, 1958), but this approach was not as well suited for working
with angles in the manner required by Archimedes’ inscribed and circumscribed
polygons.
With the advent of trigonometry,
we can now carry Archimedes’ calculations of p even further. We start with a circle with a diameter
of 2 units, in which we insert n
congruent isosceles triangles with a common vertex at the center of the circle.
The following diagram illustrates this for n
= 6.

If we call the non-congruent leg of the
triangle x, the perimeter of the
polygon formed by the triangles is given by nx.
Taking a closer look at the triangles, we see that the two congruent legs, both
being radii of the circle, are each 1 unit long. Furthermore, the angle between
these two legs is equal to
.

To
avoid dealing with oblique triangles (and thus having to use the Law of
Cosines), we can split the triangle into two congruent right triangles.

From
this diagram, it follows that
. Solving for x
yields
. Therefore, the perimeter of the polygon (and hence our
approximation for the circumference) equals
. Substituting this expression in the equation
gives our approximation
of p to be
. Evaluating this expression for n = 6 gives us the simple approximation of
. From here, continuously doubling the value of n and applying the half-angle identities
(
and
) produces an interesting result. For n = 12,

with
a decimal approximation of 3.105829. For n
= 24,

a
monster expression which reduces to the relatively tame
, approximately 3.132629. It can be shown through proof by
mathematical induction that these repeated radicals for p will always appear in the form
, where the number of nested radicals (that is, the number of
radicals within the first radical) is equal to m. This number corresponds to the number n of sides of the polygon inscribed in the circle by the formula
. So for m = 6
(384-sided polygon), the approximation is
, or about 3.141558, which is accurate to 5 significant
figures.
Although the preceding
discussion presents a method which is strictly of the author’s invention,
several mathematicians since the development of trigonometry have used similar
approaches to derive an expression for p. François Viète (1540-1603), who among other things
was a pioneer in the development of trigonometry, dealt with the relationship
between an n-sided polygon and a 2n-sided polygon both inscribed in the
same circle. With the aid of trigonometric half-angle identities Viète was able
to derive the infinite product
(Beckmann, 1971)
Historically,
this derivation is significant in that it was the first infinite expression for
calculating p. During Viète’s time,
many mathematicians still believed that perhaps p was rational, that it could be expressed as the
quotient of two integers. As each new digit of p was discovered, these mathematicians’ flickering hope
diminished further. It seems reasonable to suspect that Viète’s expression, an
infinite product of nested radicals, all of which are themselves irrational,
may have seemed very near an actual proof of the irrationality of p, which was finally provided by Johann Heinrich
Lambert (1728-1777) some 150 years later (Beckmann, 1971).
After Viète came countless other
mathematicians who refined the known value of p by using polygons with an ever-increasing number of
sides (one Dutch mathematician’s polygon had
sides). These
mathematicians had little, other than more and more digits, to add to the
theory of p-derivation. The next
great breakthrough in this field would not come until the invention of
calculus, which would change forever man’s efforts at calculating p.
The invention of calculus is
most often ascribed to Isaac Newton and Gottfried Leibniz. To be sure, these
two men did much for the development of this branch of mathematics, and were two
of the first to realize that the integral and the derivative were inverse
operations, but to give them full credit is grossly unfair to the many groundbreaking
mathematicians that preceded them.
One such mathematician was John
Wallis (1616-1703), who, using what essentially amounted to Riemann sums,
derived, in modern notation, the following infinite summation for p to calculate the area of a quarter-circle:
. While Wallis lacked the insight to convert this expression
to an integral, he did manage to expand it to the infinite product
(Beckmann, 1971).
Today, by applying the limit definition of the definite integral, we can easily
see that the previous equation is equivalent to
. Furthermore, we can calculate this using trigonometric
substitution, but doing so yields the meaningless result
. Alternatively, we can apply Isaac Newton’s modified
binomial theorem to expand the integrand then integrate the resulting infinite
series term-by-term, yielding a decimal approximation.
One other mathematician to
precede Newton and Leibniz noteworthy in respect to the calculations of p is the Scottish mathematician James Gregory. He is
perhaps best remembered for discovering a power series for the arctangent
function, which he knew to be equal to the area under the curve
. By use of repeated long division coupled with the power
rule for integration discovered some time earlier by Blaise Pascal, Gregory derived
his namesake power series,
. Leibniz later substituted x = 1 to yield
(Beckmann, 1971). The
first infinite series derived for p, it is also one of the least useful since its rate of
convergence is significantly slower than many other series which were
discovered soon afterwards. Practicality aside, the Leibniz series for p is important in that it was the first infinite series
discovered for p, and its derivation marked the start of a totally different approach
to calculating p.
The publication of the Leibniz
series drew much attention from the math community of the time. Anyone
attempting to utilize the series immediately recognized its unforgivably slow
rate of convergence (matching Archimedes’ value would require over 300 terms).
With this in mind, a mathematician named Abraham Sharp (1651-1742) explored the
possibility of substituting a value into the Gregory series other than x = 1, the value for which the Gregory
series experiences the slowest rate of convergence. The value he chose was
, which yields
(Beckmann, 1971)
It
is easy to see that Sharp did indeed succeed in deriving a series with a
greater rate of convergence than the elegant but slow Leibniz series. In the
Leibniz series, the denominators increase in arithmetic procession (3, 5, 7, 9,
…), whereas successive denominators of Sharp’s series increase exponentially
(9, 45, 189, 729). The difference is striking; after just 1000 terms, Sharp’s
series evaluates to 3.141592653591, a value correct to 12 significant figures,
while evaluating the first 100,000 terms of the Leibniz series yields
3.141582653589, which is correct to a meager 5 significant figures.
Drawing from the vast array of
discoveries made by Wallis, Gregory, and others, Newton and Leibniz
independently set to work systematically organizing the laws of differential
and integral calculus during the mid to late 17th centuries. While
each made many significant accomplishments during their lifetimes, both to
calculus and to mathematics in general, the work of Newton, who placed greater
importance upon power series, applies far more to the calculation of p. Newton’s own approach to this task was similar to
Wallis in that he took an equation of a circle and integrated to find the area
underneath it. The specific curve he used was given by the equation
, which defines the upper half of the circle with center at
(0, ½) and radius ½. Newton integrated to find the area from x = 0 to x = ¼.
After factoring out a
, he was left with the following integral:
, which represents the area under the arc AD in the diagram
below.

This
area is also equivalent to the area of the sector ACD minus the area of
triangle BCD. Since
,
, and
. Using the formulae for the area of a sector,
, and the area of a triangle, Newton calculated the area
under arc AD to be
. Newton had by this time discovered, by interpolating coefficients
for fractional powers on Pascal’s Triangle, how to expand a binomial to any
rational power to a power series. Using his modified binomial theorem, he
expanded the integrand of his previous expression to
. Distributing the
and integrating,
Newton calculated the area to be
. After simplifying this expression and setting it equal to
the value obtained earlier, Newton derived the following identity:
(Beckmann, 1971). Although it is neither as simple nor
aesthetically pleasing as the derivations of Wallis and Leibniz, it is more
practical and useful since it converges much faster (note that successive
denominators of the series increase exponentially).
The next significant
mathematician to work extensively with p was Leonard Euler (1707-1783). Euler began his work
by evaluating the infinite summation of inverse squares (
). He managed this by solving the innocent-looking equation
. He replaced the sine function with its power series to
yield
. Dividing through by x
and making the substitution
, he reached the equation
. Since he knew the expression on the right side was
equivalent to
, y ¹ 0 because he had divided by x in the original power series, he knew that y must equal p2, 4p2, 9p2, ….
According to a higher-level theorem, the sum of the reciprocals of the zeroes
of an equation is equal to the opposite of the ratio of the linear coefficient
to the constant term. Applying this principle to this equation, Euler derived
the identity
. Multiplying through by p2 yielded
the result
. Encouraged by his success, Euler repeated the same process
for the cosine function and derived another series for p2: ![]()
Euler derived many, many
expressions for p, and to list them all would require an entire volume devoted to that
purpose. A few more examples will suffice before moving on. Euler discovered
infinite series for the reciprocals of all even powers up to 26, for which
. He also derived an infinite product for p2:
, as well as the following summation of arctangents:
. (Beckmann, 1971)
Another important contribution
of Euler was the derivation and evaluation (using an infinite series of his own
discovery) of a new arctangent series for p. Inspired by the success of English mathematician
John Machin, Euler adopted a method of deriving an arctangent identity similar
to the one Machin had used some three or four decades ago. To begin with, Euler
supposed that there be two angles, call them a and b, for which
. Next he arbitrarily picked
to be the value of
the tangent of a, since he realized that for the identity to converge
quickly when expanded into series the angles would have to be fairly small, and
the arctangent of
is very small indeed.
Using his arctangent series, he calculated a to be
approximately 0.1419, which divides into
five times. Thus,
Euler obtained the equation
and proceeded to
solve for b:

Applying
the sum/difference rule for tangent yields
![]()
To
calculate tan 5a Euler once
again used the sum/difference rule:

Back-substituting
into the previous equation,

At
this point, Euler had his two angles. He could have stopped here and left his
identity as
. But Euler did not relish the thought expanding
into series. He began
to wonder if perhaps b could be expressed as a multiple of a simpler,
smaller angle f. Setting b equal to 2f, Euler solved for f:

Algebraic
manipulation reduces this equation to a quadratic, which can simply be plugged
into the Quadratic Equation to solve for tan f (happily, the
discriminant turns out to be a perfect square):
![]()

Discarding
the negative root in favor of the positive one, Euler thus found the angle he
was looking for. With this newly acquired angle he obtained the equation
. With this identity and his power series for arctangent,
, where
, Euler provided us with one of the fastest converging series
for p ever discovered up to
that time (Beckmann, 1971).
As we have seen, many
mathematicians throughout history have tried their hand at calculating the
value of p. Through their work, we
now have a mass of formulae which converge at infinity. This poses an obvious
question: which formula is ‘best’, and why? Historically there have been two
main factors in determining the effectiveness of a series or a formula – its
rate of convergence and its convenience (does it involve messy radicals and
fractions, etc.?). With the advent of the computer and calculator, the latter
is less of a concern, but the former is as important as ever because a slower
rate of convergence generally means more computing time to calculate to a given
number of decimals.
Through repeated testing of
several of the formulae it can be shown that the fastest by far is Euler’s
arctangent identity paired with his power series for arctangent, which, after
just 1000 terms evaluates to 3.141592653586962, a value accurate to 12
significant figures. By contrast, some of the slowest methods, such as the
Leibniz series, requires over 100,000 terms to obtain just six-digit accuracy.
For a complete list comparing the rates of convergence of several different
methods, see appendix A.
It is interesting to see how the
rates of convergence for the various methods have increased throughout history.
This comes as no surprise; the more contemporary mathematicians had the
benefits of more fully developed mathematics. Euler, for example, drew upon the
work of countless mathematicians before him in deriving his multitude of series
for p – the calculus of Newton
and Leibniz, the trigonometry of Viète and Fincke, the algebra of al-Khwarizmi,
the geometry of the ancient Greeks. However, this increase in rates of
convergence have often come at the expense of elegance and simplicity. Consider
again Euler’s arctangent series. Its evaluation involves both repeated multiplication
and division. By contrast, the Leibniz series requires only a simple
reciprocation. Needless to say, it takes far less time to evaluate a single
term of the latter series. Euler’s series, however, contains many redundant
steps, and from a programming standpoint, could be made much more efficient.
Thus, the true rate of convergence of a series (accuracy as a function of time)
is as much a question of implementation as the complexity of the series.
What now? Formulae for p have been improving for centuries. How much better
could they possibly become? Not too recently, we broke the one billion digit
mark. Does it even matter anymore? Probably not. But that will not stop
mathematicians from searching for more and more digits. Historically, approximating
p has been one of the
quintessential examples of mathematics for its own sake. We will never need so
many digits, but in the process of deriving and applying formulae, we gather
new insights that we can apply to other fields. And that is what mathematics is
all about.
Appendix A – Convergence of p identities
|
Identity |
Page # |
n = 1 |
n = 5 |
n = 10 |
n = 50 |
n = 100 |
n = 1000 |
n = 10000 |
|
|
3-5 |
0 |
2.93892626 |
3.09016994 |
3.13952598 |
3.14107591 |
3.14158749 |
3.14159260 |
|
|
6 |
4 |
3.41333333 |
3.00217596 |
3.11094517 |
3.12607890 |
3.14002382 |
3.14143559 |
|
|
6-7 |
4 |
3.33968254 |
3.04183962 |
3.12159465 |
3.13159290 |
3.14059265 |
3.14149265 |
|
|
7 |
3.07920143 |
3.14130878 |
3.14159330 |
3.14159265 |
3.14159265 |
3.14159265 |
3.14159265 |
|
|
8-9 |
3.14903811 |
3.14159508 |
3.14159265 |
3.14159265 |
3.14159265 |
3.14159265 |
3.14159265 |
|
|
9 |
2.44948974 |
2.9633877 |
3.04936164 |
3.12262652 |
3.13207653 |
3.14063806 |
3.14149716 |
|
|
9 |
2.82842712 |
3.07748593 |
3.10962546 |