Loading [MathJax]/jax/output/HTML-CSS/jax.js

You are here

The Root of the Matter: Approximating Roots with the Greeks - Other Roots

Author(s): 
Matthew Haines and Jody Sorensen (Augsburg University)

The linear algebraic view of Theon's method gives us a quick way to develop methods for approximating other square roots. Since the matrix A=[1121] works to find approximations of 2, let's naively guess that B=[1131] will allow us to approximate 3. Is that true? Well, the eigenvalues are λ1=1+3 and λ2=13 with corresponding eigenvectors w1=[13] and w2=[13]. Since |λ1|>1 and |λ2|<1, any starting vector will tend to a multiple of [13], and thus yx will tend to 3. This convergence is demonstrated in general later in this section.

Note that we need to be careful with the phrase "any starting vector." If the starting vector is exactly a multiple of w2, then the iterates will not tend to a multiple of w1. But note that

  • eigenvector w2 has a negative component, so any starting vector with both components positive will work, and
  • eigenvector w2 has one rational and one irrational component, so any starting vector with both components integers (not both zero) will work as well (even if one or both components are negative).

The tables below show examples starting with x=1,y=3 and with x=2,y=5.

nxnynyn/xn01331461.5210181.8328481.71434761321.736852083601.7308nxnynyn/xn0252.51310.33332284310141.4424441.83335681161.7059

Table 3. Approximating 3

So far so good, but there is an issue that arises. If we use B=[1151] to get an approximation of 5, the eigenvalues and eigenvectors are just what we expect. But with eigenvalues of λ1=1+53.236 and λ2=151.236, both eigenvalues are larger than 1 in absolute value. So does our argument break down? It turns out that the answer is no. The eigenvalue that has the largest absolute value "dominates" in the long run, and vectors still tend to a multiple of the associated eigenvector, in this case w1=[15]. See Table 4 for an example of approximating 5:

nxnynyn/xn01221372.3333210222.2332722.2541042322.2308

Table 4. Approximating 5

Here's an argument as to why the vector with the largest eigenvalue dominates. If we use the matrix B=[11k1] for kZ+, we get eigenvalues λ1=1+k and λ2=1k. Note that |λ2|<|λ1|. The associated eigenvectors are w1=[1k] and w2=[1k]. We showed above that if v0=c1w1+c2w2, then

vn=c1λ1nw1+c2λ2nw2. If we factor out λ1n on the right side, we get vn=λ1n(c1w1+c2λ2nλ1nw2)=λ1n(c1w1+c2(λ2λ1)nw2). But since |λ2|<|λ1|, we have |λ2λ1|<1. So the second term tends to 0, and vn tends to a multiple of w1 as desired.

Thus, in general, using the matrix B=[11k1] for kZ+, the ratio of y over x for almost any starting vector tends to k.

Matthew Haines and Jody Sorensen (Augsburg University), "The Root of the Matter: Approximating Roots with the Greeks - Other Roots," Convergence (June 2018)