Imagine you live in a time with no computers. If you want to approximate the square root of a non-square number $N$, there is a logical way to do it: Look for the square number $a^2$ nearest to $N$. If $N$ is greater than $a^2$, then $N = a^2 + r$ (where we use $r$ for remainder); if $N$ is smaller than $a^2$, then (using $r$ again) $N = a^2 - r$. Now think of the formula $(a \pm b)^2 = a^2 \pm 2ab + b^2$, and assume that $b$ is so small that you may neglect its square $b^2$. This means $r \approx 2ab$ and, from this, $b \approx \frac{r}{2a}$.
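Thus the first approximation of the square root of $N$ is $\sqrt{N} = \sqrt{a^2 \pm r} \approx a \pm \frac{r}{2a}$. Take, for example, $N = 3 = 2^2 - 1$. That is, $a = 2$, $r = 1$, and $\sqrt{3} \approx 2 - \frac{1}{4} = 1\frac{3}{4} = 1.75$, a very good approximation because $\left(1\frac{3}{4}\right)^2 = \left(\frac{7}{4}\right)^2 = \frac{49}{16} = 3\frac{1}{16}$, with an excess of only $\frac{1}{16}$ or 0.0625.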
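The first approximation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `nearest_square_root` and `first_approximation` are hypothetical, the remainder `r` is stored with its sign (positive when $N$ lies above $a^2$, negative when it lies below), and exact fractions are used because the article works with common fractions rather than decimals.

```python
from fractions import Fraction
from math import isqrt


def nearest_square_root(n: int) -> int:
    """Return the integer a whose square a*a is nearest to n.

    Hypothetical helper; on a tie the smaller root is chosen.
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    low = isqrt(n)          # largest integer with low*low <= n
    high = low + 1
    return low if n - low * low <= high * high - n else high


def first_approximation(n: int) -> Fraction:
    """Approximate sqrt(n) by a + r/(2a), with r = n - a*a (signed)."""
    a = nearest_square_root(n)
    r = n - a * a           # negative when n is below the nearest square
    return a + Fraction(r, 2 * a)


if __name__ == "__main__":
    x = first_approximation(3)
    print(x, float(x))      # 7/4 1.75
    print(x * x - 3)        # 1/16, the excess of the square over 3
```

Using a signed remainder folds the two cases $N = a^2 + r$ and $N = a^2 - r$ into a single expression; this is a convenience of the sketch, not something found in the historical sources.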
This method (but not, of course, the symbolic formula) was used in Greek antiquity, especially by the first century A.D. mathematician Heron of Alexandria. It was more accurate the nearer $N$ was to $a^2$; that is, the smaller $r$ was. The square of $a \pm \frac{r}{2a}$ is $a^2 \pm r + \left(\frac{r}{2a}\right)^2$. This means that, in both the positive and the negative case, there is always a surplus or excess of $\left(\frac{r}{2a}\right)^2$.
Of course, some ancient Greek mathematicians were able to calculate better approximations. Archimedes (287–212 B.C.), for example, used $\sqrt{3} \approx 1\frac{571}{780} = \frac{1351}{780}$, or, in decimal notation, 1.7320512…. The true value of $\sqrt{3}$ is 1.7320508…. (Because decimal fractions began spreading only at the beginning of the 17th century, square roots of non-square numbers were represented only in the form of common fractions prior to that time.)
But the Greeks had no general method to improve the accuracy of the above approximations. We find the earliest known description of such a procedure in the five pages of the “Chapter on the extraction of irrational roots by approximation” of al-Hassar’s book (beginning on the reverse of leaf 125 of the Gotha copy). It is typical that al-Hassar does not give general instructions, rules, or formulas, but only numerical examples that show how to proceed. And he does not derive his method or give a general proof, but shows only that his method gives the right answers in his examples.
Al-Hassar knew the Greek procedure for both the positive and the negative case, and he calculated the example $\sqrt{5} = \sqrt{2^2 + 1} \approx 2\frac{1}{4}$. His “Chapter on the extraction of irrational roots by approximation” begins with the words (English translation of Suter's German translation):
When it is said: which is the square root of 5, so take the next square number to 5, this is equal to 4, subtract it from 5, the remainder is 1, divide this by 4, that gives $\frac{1}{4}$, and add this to the root of 4, which is equal to 2. That gives $2\frac{1}{4}$. And this is the approximate root of 5. Namely, when you multiply $2\frac{1}{4}$ [$= \frac{9}{4}$] by itself you get [$\frac{81}{16} =$] $5\frac{1}{16}$. The deviation is an excess of $\frac{1}{16}$.
(Later al-Hassar also obtained $\sqrt{10} = \sqrt{3^2 + 1} \approx 3\frac{1}{6}$, with the square $10\frac{1}{36}$, an excess of $\frac{1}{36}$.) And now comes the important instruction:
But if you want a closer approximation, then double $2\frac{1}{4}$, that gives $4\frac{1}{2}$. Divide $\frac{1}{16}$ by $4\frac{1}{2}$. This gives $\frac{1}{72}$ (an eighth of a ninth). Subtract this from $2\frac{1}{4}$ [$= 2\frac{18}{72}$]. This leaves a remainder of $2\frac{17}{72}$ [$= \frac{161}{72}$]. When you multiply this by itself you get [$\frac{25921}{5184} =$] $5\frac{1}{5184}$, and this is nearer [to 5] than $5\frac{1}{16}$.
This procedure, expressed in modern notation, is $\sqrt{a^2 + r} \approx \left(a + \frac{r}{2a}\right) - \frac{\left(\frac{r}{2a}\right)^2}{2\left(a + \frac{r}{2a}\right)}$.
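A short worked derivation may help show why the same step can be repeated. The symbols $x_1$, $e_1$, and $x_2$ are introduced here only for illustration and do not appear in al-Hassar's text: $x_1$ is the first approximation, $e_1$ its excess, and $x_2$ the second approximation.

```latex
\begin{align*}
x_1 &= a + \frac{r}{2a}, \qquad e_1 = x_1^2 - N = \left(\frac{r}{2a}\right)^2, \\
x_2 &= x_1 - \frac{e_1}{2x_1}, \\
x_2^2 - N &= x_1^2 - e_1 + \left(\frac{e_1}{2x_1}\right)^2 - N
           = \left(\frac{e_1}{2x_1}\right)^2 .
\end{align*}
```

So the excess of each new approximation is the square of the correction just subtracted. For $N = 5$ the correction is $\frac{1}{72}$, and the new excess is $\left(\frac{1}{72}\right)^2 = \frac{1}{5184}$, in agreement with the value $5\frac{1}{5184}$ in the quotation.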
Al-Hassar continues:
If you want a closer approximation yet, double $2\frac{17}{72}$, divide $\left(\frac{1}{72}\right)^2$ [$= \frac{1}{5184}$] by the result, and subtract what you get from $2\frac{17}{72}$. So the result will be an even closer approximation than the first and second roots. You may continue like this as far as you want.
In other words, apply this method several times. Of course, in the process the numerator and the denominator of the fractions get longer and longer, and the multiplications become more and more unwieldy. Today's calculator or computer spares us the multidigit calculations that had to be done by hand in the past.
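The repeated step can be sketched as follows, assuming a starting value obtained from the first approximation. The function name `refine` and its parameters are hypothetical, and the excess is computed here as `x*x - n`, which gives the same number as squaring the previous correction in the way al-Hassar does.

```python
from fractions import Fraction


def refine(n: int, x: Fraction, steps: int) -> list:
    """Repeat the step 'divide the excess by double the root, then subtract'.

    n     -- the number whose square root is wanted
    x     -- a starting approximation whose square exceeds n
    steps -- how many refinements to perform
    Returns the list of all approximations, starting value included.
    """
    approximations = [x]
    for _ in range(steps):
        excess = x * x - n            # how far the square overshoots n
        x = x - excess / (2 * x)      # subtract excess / (double the root)
        approximations.append(x)
    return approximations


if __name__ == "__main__":
    for value in refine(5, Fraction(9, 4), 2):
        # prints 9/4, 161/72, 51841/23184 with their decimal values
        print(value, float(value))
```

Starting from $2\frac{1}{4}$, the first refinement reproduces al-Hassar's $\frac{161}{72}$. The fractions printed by the sketch also show how quickly the numerators and denominators grow.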
A warning about calculating the first approximation: In both the positive case $N = a^2 + r$ and the negative case $N = a^2 - r$, $r$ must not exceed $a$. Therefore, $\sqrt{14}$ must not be calculated by $\sqrt{3^2 + 5}$, but only by $\sqrt{4^2 - 2}$. With respect to the square root of 20, al-Hassar remarked that you may take either 16 (positive $r$) or 25 (negative $r$) as the nearest square, that is, $\sqrt{4^2 + 4}$ or $\sqrt{5^2 - 5}$. In both cases you get the same first approximation, namely $4 + \frac{1}{2} = 5 - \frac{1}{2} = 4\frac{1}{2}$. This is true for all numbers of the form $a^2 + a = (a+1)^2 - (a+1) = a(a+1)$; that is, for 2, 6, 12, 20, 30, 42, 56, and so on. The first approximation is always $a + \frac{1}{2}$.
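Both remarks can be checked with a small sketch. The helper `approx_from` is hypothetical: it takes the root `a` of whichever square is chosen and uses a signed remainder, so one expression covers the positive and the negative case.

```python
from fractions import Fraction


def approx_from(a: int, n: int) -> Fraction:
    """First approximation of sqrt(n) built on the square a*a."""
    r = n - a * a                 # signed remainder; negative if a*a > n
    return a + Fraction(r, 2 * a)


if __name__ == "__main__":
    # sqrt(14): the remainder 5 exceeds a = 3, so the square 9 is a poor choice.
    for a in (3, 4):
        x = approx_from(a, 14)
        print(a, x, x * x - 14)   # root used, approximation, excess of its square

    # Numbers of the form a(a+1) lie midway between the two candidate remainders,
    # and both choices of square lead to a + 1/2.
    for a in range(1, 8):
        n = a * (a + 1)
        assert approx_from(a, n) == approx_from(a + 1, n) == a + Fraction(1, 2)
```

For $\sqrt{14}$ the sketch yields $\frac{23}{6}$ with an excess of $\frac{25}{36}$ when starting from 9, but $\frac{15}{4}$ with an excess of only $\frac{1}{16}$ when starting from 16.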