The linear algebraic view of Theon's method gives us a quick way to develop methods for approximating other square roots. Since the matrix A=[1121] works to find approximations of √2, let's naively guess that B=[1131] will allow us to approximate √3. Is that true? Well, the eigenvalues are λ1=1+√3 and λ2=1−√3 with corresponding eigenvectors →w1=[1√3] and →w2=[1−√3]. Since |λ1|>1 and |λ2|<1, any starting vector will tend to a multiple of [1√3], and thus yx will tend to √3. This convergence is demonstrated in general later in this section.
Note that we need to be careful with the phrase "any starting vector." If the starting vector is exactly a multiple of →w2, then the iterates will not tend to a multiple of →w1. But note that
- eigenvector →w2 has a negative component, so any starting vector with both components positive will work, and
- eigenvector →w2 has one rational and one irrational component, so any starting vector with both components integers (not both zero) will work as well (even if one or both components are negative).
The tables below show examples starting with x=1,y=3 and with x=−2,y=5.
nxnynyn/xn01331461.5210181.8328481.71434761321.736852083601.7308nxnynyn/xn0−25−2.513−1−0.33332284310141.4424441.83335681161.7059
Table 3. Approximating √3
So far so good, but there is an issue that arises. If we use B=[1151] to get an approximation of √5, the eigenvalues and eigenvectors are just what we expect. But with eigenvalues of λ1=1+√5≈3.236 and λ2=1−√5≈−1.236, both eigenvalues are larger than 1 in absolute value. So does our argument break down? It turns out that the answer is no. The eigenvalue that has the largest absolute value "dominates" in the long run, and vectors still tend to a multiple of the associated eigenvector, in this case →w1=[1√5]. See Table 4 for an example of approximating √5:
nxnynyn/xn01221372.3333210222.2332722.2541042322.2308
Table 4. Approximating √5
Here's an argument as to why the vector with the largest eigenvalue dominates. If we use the matrix B=[11k1] for k∈Z+, we get eigenvalues λ1=1+√k and λ2=1−√k. Note that |λ2|<|λ1|. The associated eigenvectors are →w1=[1√k] and →w2=[1−√k]. We showed above that if →v0=c1→w1+c2→w2, then
→vn=c1λ1n→w1+c2λ2n→w2. If we factor out λ1n on the right side, we get →vn=λ1n(c1→w1+c2λ2nλ1n→w2)=λ1n(c1→w1+c2(λ2λ1)n→w2). But since |λ2|<|λ1|, we have |λ2λ1|<1. So the second term tends to →0, and →vn tends to a multiple of →w1 as desired.
Thus, in general, using the matrix B=[11k1] for k∈Z+, the ratio of y over x for almost any starting vector tends to √k.