任意精度の整数平方根アルゴリズム？

nビット整数の平方根のフロアを計算するための既知のサブ二次アルゴリズムはありますか？

素朴なアルゴリズムは次のようなものです

def sqrt(x):
    r = 0
    i = x.bit_length() // 2
    while i >= 0:
        inc = (r << (i+1)) + (1 << (i*2))
        if inc <= x:
            x -= inc
            r += 1 << i
        i -= 1
    return r

これにはO(n)反復が必要であり、各反復にはO(n)時間である加算が含まれるため、O(n^2)全体として時間です。もっと速いものはありますか？乗算の場合、2次時間よりも優れた特別なアルゴリズムがあることは知っていますが、平方根については何も見つかりません。

algorithms numerical-algorithms

— アンチモン
ソース

関連するものに対する私の答えはcs.stackexchange.com/a/37338/12052を助けるかもしれません。唯一の問題は、その精度を微調整するために経験的に見つけなければならない必要な方程式の一部です。

— Francesco Gramano

@FrancescoGramano：すみません、それは役に立たないと思います。

— Aryabhata

ところで、この準二次要件はより大きな問題の一部ですか？単純な二次と複雑な二次の違いは、実際にはそれほど大きくない可能性があるためです。それとも理論的に興味があるだけですか？

— Aryabhata 2015年

@Aryabhata申し訳ありませんが、以前にあなたのコメントを見ていません。いいえ、それはより大きな問題の一部ではなく、単に好奇心です。

— アンチモン2015年

回答:

多項式根への近似を見つけるために、ニュートン法または他の多くの方法のいずれかを使用できます。 $p(x) = x^2 -c$

ニュートン法の収束率は2次式になります。つまり、正しいビット数が各反復で2倍になります。これは、ニュートン法の回の反復で十分であることを意味します。 $O(\lg n)$

ニュートン法の各反復は

{バツ}_{j + 1} = {バツ}_{j} - （ {バツ}_{j}^{2} - c ） / （ 2 {バツ}_{j} ） = 0.5 {バツ}_{j} + \frac{c}{2 {バツ}_{j}} 。

$x_{j+1} = x_j - (x_j^2 -c)/(2x_j) = 0.5 x_j + \frac{c}{2x_j}.$

乗算のビット複雑度は、2つのビット整数を乗算します（係数を無視）。（ビットの精度への）除算のビット複雑度は同じです。したがって、各反復は演算で計算できます。繰り返しを掛けると、平方根をビットの精度で計算する全体の実行時間は $\stackrel{~}{O}(b \lg b)$ $b$ $\lg \lg b$ $b$ $\stackrel{~}{O}(n \lg n)$ $O(\lg n)$ $n$ 。これは準二次式です。 $\stackrel{~}{O}(n (\lg n)^2)$

私はこれがに向上させることができる、より慎重な分析が示すと考える（我々は唯一の各知っておく必要があることを考慮することで、時間を実行しているがおよそ内に精度のビットではなく、精度のビット）。ただし、より基本的な分析でも、実行時間は明らかに2次以下です。 $\stackrel{~}{O}(n \lg n)$ $x_j$ $j$ $n$

— DW
ソース

バイナリ1も同一用いて大きな初期推定有し

。ログを計算する代わりに、

の桁数として

を概算でき

。例えば、

。

x^{1 / 2} = 2^{1 / 2 \log_{2} x}

$x^{1/2} = 2^{1/2 \log_2 x}$

\log_{2} x

$\log_2 x$

x

$x$

\log_{2} 101011 \approx 6

$\log_2 101011 \approx 6$

— Nick Alger、2015年

@DW：しかし、整数平方根を探していませんか？整数演算のみを使用してニュートン法の反復を行う場合は、

クレームを正当化する必要がありますね。それ以外の場合は、すでに十分に大きな精度を想定しています...明らかなものがない場合は申し訳ありません。

O (\log n)

$O(\log n)$

— Aryabhata 2015年

@DW：

$\;\;\;$ 「ニュートン法の収束率」は、

場合、2次式にはなりません。

、そして私は非負の実数ではない

値に対して何が起こるかわかりません。

c = 0

$c\hspace{-0.04 in}=\hspace{-0.04 in}0$

c

$c$

$\:$ 乗算のビットの複雑さの見積もりは、次の備考が示唆するよりも厳しいです。

$\:$ また、「各

を約内に知る必要がある」

x_{j}

$x_j$

「精度のビット」。

2^{j}

$2^{\hspace{.02 in}j}$

$\;\;\;\;\;\;\;$

@Aryabhata：

$\;\;\;$ 「整数平方根を探している」わけではありません。「平方根の床」を探しています。

$\:$ 同じビットの複雑さは浮動小数点演算にも当てはまりますが、整数演算の問題についてはあなたは正しいです。

$\;\;\;\;\;\;\;$

@RickyDemer、はい、

は特殊なケースです。これは、

根が多重度2を持っているためです。ただし、

とき、根は多重度1を持っているため、ニュートン法は 2次収束します。ニュートンの方法を使用して

平方根を計算する人はいないと思い

（ゼロの平方根は明らかにゼロであるため）。それで、あなたは何を言おうとしているのですか？あなたのコメントは、私の答えに「特別な場合はゼロの平方根」を追加することで対処される簡単なコメントですか、それとも私が欠けているもっと深い何かがありますか？

c = 0

$c=0$

p (x)

$p(x)$

c > 0

$c>0$

c = 0

$c=0$

— DW

ニュートン法の問題の1つは、反復ごとに除算演算が必要になることです。これは、最も遅い基本的な整数演算です。

しかし、逆平方根に対するニュートンの方法はそうではありません。が検索したい番号の場合 $x$ 、繰り返し： $\frac{1}{\sqrt x}$

r_{i + 1} = \frac{1}{2} r_{i} (3 - x r_{i}^{2})

$r_{i+1} = \frac{1}{2} r_i (3 - x r_i^2)$

これはしばしば次のように表現されます：

w_{i} = r_{i}^{2}

$w_i = r_i^2$

d_{i} = 1 - w_{i} x

$d_i = 1 - w_i x$

r_{i + 1} = r_{i} + \frac{r_{i} d_{i}}{2}

$r_{i+1} = r_i + \frac{r_i d_i}{2}$

これが3つの乗算演算です。2による除算は、右シフトとして実装できます。

ここで問題は、が整数ではないことです。ただし、浮動小数点を手動で実装し、必要に応じて一連のシフト操作を実行して補正することで、そのように操作できます。 $r$

まず、再スケーリングします。 $x$

x^{'} = 2^{- 2 e} x

$x' = 2^{-2e} x$

ここで、をより大きく、ただし近くしたいとします。私たちは、上記のアルゴリズムを実行した場合の代わりに、、我々は見つける $x'$ $1$ $x'$ $x$ 。次に、 $r = \frac{1}{\sqrt x'}$ 。 $\sqrt{x} = 2^e r x'$

次に、を仮数と指数に分割します。 $r$

r_{i} = 2^{- e_{i}} r_{i}^{'}

$r_i = 2^{-e_i} r'_i$

ここで、は整数です。直感的に、は回答の精度を表します。 $r'_i$ $e_i$

ニュートンの方法では、正確な有効桁数が約2倍になることがわかっています。だから私たちは選ぶことができます：

e_{i + 1} = 2 e_{i}

$e_{i+1} = 2e_i$

少し操作すると、次のことがわかります。

e_{i + 1} = 2 e_{i}

$e_{i+1} = 2e_i$

w_{i} = {r_{i}^{'}}^{2}

$w_i = {r'_i}^2$

x_{i}^{'} = \frac{x}{2^{2 e - e_{i + 1}}}

$x'_i = \frac{x}{2^{2e - e_{i+1}}}$

d_{i} = 2^{e_{i + 1}} - \frac{w_{i}^{'} x_{i}^{'}}{2^{e_{i + 1}}}

$d_i = 2^{e_{i+1}} - \frac{w_i' x'_i}{2^{e_{i+1}}}$

r_{i + 1}^{'} = 2^{e_{i}} r_{i}^{'} - \frac{r_{i}^{'} d_{i}}{2^{e_{i} + 1}}

$r'_{i+1} = 2^{e_i} r'_i - \frac{r'_i d_i}{2^{e_i + 1}}$

すべての反復で：

\sqrt{x} \approx \frac{r_{i}^{'} x}{2^{e + e_{i}}}

$\sqrt{x} \approx \frac{r'_i x}{2^{e + e_i}}$

$x = 2^{63}$ $2^{31}\sqrt{2}$ $\frac{1}{\sqrt{2}} 2^{-31}$ $e = 31$ $r'_0 = 3$ $e_0 = 2$ $\frac{3}{4}$ $\frac{1}{\sqrt{2}}$

次に：

e_{1} = 4, r_{1}^{'} = 11

$e_1 = 4, r'_1 = 11$

e_{2} = 8, r_{2}^{'} = 180

$e_2 = 8, r'_2 = 180$

e_{3} = 16, r_{3}^{'} = 46338

$e_3 = 16, r'_3 = 46338$

e_{4} = 32, r_{4}^{'} = 3037000481

$e_4 = 32, r'_4 = 3037000481$

$e_i$ $e$ $e_i > 2e$

\sqrt{2^{63}} \approx \frac{3037000481 \times 2^{63}}{2^{31 + 32}} = 3037000481

$\sqrt{2^{63}} \approx \frac{3037000481 \times 2^{63}}{2^{31+32}} = 3037000481$

$3037000499$ $e_i$

$b$ $O(b \log b)$ $r'_i < 2^{e_i}$ $w_i$ $e_i$ $e_{i+1}$ $e_{i+1}$ $2e_{i+1}$ -ビット数。

$O(e_i \log e_i)$ $O(\log e)$ $O(2e \log 2e)$ operations. So the overall complexity is $O(e \log^2 e)$ operations, which is sub-quadratic in the number of bits in $x$ . That ticks all the boxes.

However, this analysis hides an important principle which everyone working with large integers should keep in mind: because multiplication is superlinear in the number of bits, any multiplication operations should only be performed on integers which have the roughly the magnitude of the current precision (and, I might add, you should try to multiply numbers together which have a similar order of magnitude). Using integers larger than that is a waste of effort. Constant factors matter, and for large integers, they matter a lot.

As a final observation, two of the multiplications are of the form $\frac{ab}{2^c}$ . Clearly it's wasteful to compute the all the bits of $ab$ only to throw $c$ of them away with a right-shift. Implementing a smart multiplication method which takes this into account is also left as an exercise.

— Pseudonym
ソース

This is great stuff. One comment, though: Isn't the bit-complexity of division asymptotically approximately the same as the bit-complexity of multiplication? So you're talking about something that gives a constant factor improvement, not an asymptotic improvement, right? That wasn't entirely clear from your answer.

— D.W.

You say that multiplying two

b

$b$ -bit integers takes

O (b \lg b)

$O(b \lg b)$ bit operations. I think the correct answer is something like

O (b \lg b (\lg l g b)^{O (1)})

$O(b \lg b (\lg lg b)^{O(1)})$ (right?). You might want to indicate that you are ignoring poly-log-log factors (e.g., by putting a tilde over your big O, or something).

— D.W.

@D.W. :

$\;\;\;$ No, he says that "multiplying two

b

$b$ -bit integers takes

O (b \log b)

$O(b\log b)$ operations."

$\:$ The word "bit" only appears once in that; otherwise I would've already pointed that out.

$\;\;\;\;\;\;\;$

It is a matter of constant factors, yes. The best large integer division algorithms use a technique very similar to the whole algorithm, such as Newton-Raphson iteration and doubling the effective precision on each iteration. A Newton-Raphson loop within a Newton-Raphson loop piles on the constant factors! Ricky Demer is correct; I was thinking in the word RAM model. I probably should have mentioned this.

— Pseudonym