Berlekamp–Rabin algorithm

Method in number theory
Elwyn R. Berlekamp at conference on Combinatorial Game Theory at Banff International Research Station Elwyn Berlekamp

In number theory, Berlekamp's root finding algorithm, also called the Berlekamp–Rabin algorithm, is the probabilistic method of finding roots of polynomials over the field F p {\displaystyle \mathbb {F} _{p}} with p {\displaystyle p} elements. The method was discovered by Elwyn Berlekamp in 1970[1] as an auxiliary to the algorithm for polynomial factorization over finite fields. The algorithm was later modified by Rabin for arbitrary finite fields in 1979.[2] The method was also independently discovered before Berlekamp by other researchers.[3]

History

The method was proposed by Elwyn Berlekamp in his 1970 work[1] on polynomial factorization over finite fields. His original work lacked a formal correctness proof[2] and was later refined and modified for arbitrary finite fields by Michael Rabin.[2] In 1986 René Peralta proposed a similar algorithm[4] for finding square roots in F p {\displaystyle \mathbb {F} _{p}} .[5] In 2000 Peralta's method was generalized for cubic equations.[6]

Statement of problem

Let p {\displaystyle p} be an odd prime number. Consider the polynomial f ( x ) = a 0 + a 1 x + + a n x n {\textstyle f(x)=a_{0}+a_{1}x+\cdots +a_{n}x^{n}} over the field F p Z / p Z {\displaystyle \mathbb {F} _{p}\simeq \mathbb {Z} /p\mathbb {Z} } of remainders modulo p {\displaystyle p} . The algorithm should find all λ {\displaystyle \lambda } in F p {\displaystyle \mathbb {F} _{p}} such that f ( λ ) = 0 {\textstyle f(\lambda )=0} in F p {\displaystyle \mathbb {F} _{p}} .[2][7]

Algorithm

Randomization

Let f ( x ) = ( x λ 1 ) ( x λ 2 ) ( x λ n ) {\textstyle f(x)=(x-\lambda _{1})(x-\lambda _{2})\cdots (x-\lambda _{n})} . Finding all roots of this polynomial is equivalent to finding its factorization into linear factors. To find such factorization it is sufficient to split the polynomial into any two non-trivial divisors and factorize them recursively. To do this, consider the polynomial f z ( x ) = f ( x z ) = ( x λ 1 z ) ( x λ 2 z ) ( x λ n z ) {\textstyle f_{z}(x)=f(x-z)=(x-\lambda _{1}-z)(x-\lambda _{2}-z)\cdots (x-\lambda _{n}-z)} where z {\displaystyle z}  is some any element of F p {\displaystyle \mathbb {F} _{p}} . If one can represent this polynomial as the product f z ( x ) = p 0 ( x ) p 1 ( x ) {\displaystyle f_{z}(x)=p_{0}(x)p_{1}(x)} then in terms of the initial polynomial it means that f ( x ) = p 0 ( x + z ) p 1 ( x + z ) {\displaystyle f(x)=p_{0}(x+z)p_{1}(x+z)} , which provides needed factorization of f ( x ) {\displaystyle f(x)} .[1][7]

Classification of F p {\displaystyle \mathbb {F} _{p}} elements

Due to Euler's criterion, for every monomial ( x λ ) {\displaystyle (x-\lambda )} exactly one of following properties holds:[1]

  1. The monomial is equal to x {\displaystyle x} if λ = 0 {\displaystyle \lambda =0} ,
  2. The monomial divides g 0 ( x ) = ( x ( p 1 ) / 2 1 ) {\textstyle g_{0}(x)=(x^{(p-1)/2}-1)} if λ {\displaystyle \lambda }  is quadratic residue modulo p {\displaystyle p} ,
  3. The monomial divides g 1 ( x ) = ( x ( p 1 ) / 2 + 1 ) {\textstyle g_{1}(x)=(x^{(p-1)/2}+1)} if λ {\displaystyle \lambda }  is quadratic non-residual modulo p {\displaystyle p} .

Thus if f z ( x ) {\displaystyle f_{z}(x)} is not divisible by x {\displaystyle x} , which may be checked separately, then f z ( x ) {\displaystyle f_{z}(x)} is equal to the product of greatest common divisors gcd ( f z ( x ) ; g 0 ( x ) ) {\displaystyle \gcd(f_{z}(x);g_{0}(x))} and gcd ( f z ( x ) ; g 1 ( x ) ) {\displaystyle \gcd(f_{z}(x);g_{1}(x))} .[7]

Berlekamp's method

The property above leads to the following algorithm:[1]

  1. Explicitly calculate coefficients of f z ( x ) = f ( x z ) {\displaystyle f_{z}(x)=f(x-z)} ,
  2. Calculate remainders of x , x 2 , x 2 2 , x 2 3 , x 2 4 , , x 2 log 2 p {\textstyle x,x^{2},x^{2^{2}},x^{2^{3}},x^{2^{4}},\ldots ,x^{2^{\lfloor \log _{2}p\rfloor }}} modulo f z ( x ) {\displaystyle f_{z}(x)} by squaring the current polynomial and taking remainder modulo f z ( x ) {\displaystyle f_{z}(x)} ,
  3. Using exponentiation by squaring and polynomials calculated on the previous steps calculate the remainder of x ( p 1 ) / 2 {\textstyle x^{(p-1)/2}} modulo f z ( x ) {\textstyle f_{z}(x)} ,
  4. If x ( p 1 ) / 2 ± 1 ( mod f z ( x ) ) {\textstyle x^{(p-1)/2}\not \equiv \pm 1{\pmod {f_{z}(x)}}} then gcd {\displaystyle \gcd } mentioned below provide a non-trivial factorization of f z ( x ) {\displaystyle f_{z}(x)} ,
  5. Otherwise all roots of f z ( x ) {\displaystyle f_{z}(x)} are either residues or non-residues simultaneously and one has to choose another z {\displaystyle z} .

If f ( x ) {\displaystyle f(x)} is divisible by some non-linear primitive polynomial g ( x ) {\displaystyle g(x)} over F p {\displaystyle \mathbb {F} _{p}} then when calculating gcd {\displaystyle \gcd } with g 0 ( x ) {\displaystyle g_{0}(x)} and g 1 ( x ) {\displaystyle g_{1}(x)} one will obtain a non-trivial factorization of f z ( x ) / g z ( x ) {\displaystyle f_{z}(x)/g_{z}(x)} , thus algorithm allows to find all roots of arbitrary polynomials over F p {\displaystyle \mathbb {F} _{p}} .

Modular square root

Consider equation x 2 a ( mod p ) {\textstyle x^{2}\equiv a{\pmod {p}}} having elements β {\displaystyle \beta } and β {\displaystyle -\beta } as its roots. Solution of this equation is equivalent to factorization of polynomial f ( x ) = x 2 a = ( x β ) ( x + β ) {\textstyle f(x)=x^{2}-a=(x-\beta )(x+\beta )} over F p {\displaystyle \mathbb {F} _{p}} . In this particular case problem it is sufficient to calculate only gcd ( f z ( x ) ; g 0 ( x ) ) {\displaystyle \gcd(f_{z}(x);g_{0}(x))} . For this polynomial exactly one of the following properties will hold:

  1. GCD is equal to 1 {\displaystyle 1} which means that z + β {\displaystyle z+\beta } and z β {\displaystyle z-\beta } are both quadratic non-residues,
  2. GCD is equal to f z ( x ) {\displaystyle f_{z}(x)} which means that both numbers are quadratic residues,
  3. GCD is equal to ( x t ) {\displaystyle (x-t)} which means that exactly one of these numbers is quadratic residue.

In the third case GCD is equal to either ( x z β ) {\displaystyle (x-z-\beta )} or ( x z + β ) {\displaystyle (x-z+\beta )} . It allows to write the solution as β = ( t z ) ( mod p ) {\textstyle \beta =(t-z){\pmod {p}}} .[1]

Example

Assume we need to solve the equation x 2 5 ( mod 11 ) {\textstyle x^{2}\equiv 5{\pmod {11}}} . For this we need to factorize f ( x ) = x 2 5 = ( x β ) ( x + β ) {\displaystyle f(x)=x^{2}-5=(x-\beta )(x+\beta )} . Consider some possible values of z {\displaystyle z} :

  1. Let z = 3 {\displaystyle z=3} . Then f z ( x ) = ( x 3 ) 2 5 = x 2 6 x + 4 {\displaystyle f_{z}(x)=(x-3)^{2}-5=x^{2}-6x+4} , thus gcd ( x 2 6 x + 4 ; x 5 1 ) = 1 {\displaystyle \gcd(x^{2}-6x+4;x^{5}-1)=1} . Both numbers 3 ± β {\displaystyle 3\pm \beta } are quadratic non-residues, so we need to take some other z {\displaystyle z} .
  1. Let z = 2 {\displaystyle z=2} . Then f z ( x ) = ( x 2 ) 2 5 = x 2 4 x 1 {\displaystyle f_{z}(x)=(x-2)^{2}-5=x^{2}-4x-1} , thus gcd ( x 2 4 x 1 ; x 5 1 ) x 9 ( mod 11 ) {\textstyle \gcd(x^{2}-4x-1;x^{5}-1)\equiv x-9{\pmod {11}}} . From this follows x 9 = x 2 β {\textstyle x-9=x-2-\beta } , so β 7 ( mod 11 ) {\displaystyle \beta \equiv 7{\pmod {11}}} and β 7 4 ( mod 11 ) {\textstyle -\beta \equiv -7\equiv 4{\pmod {11}}} .

A manual check shows that, indeed, 7 2 49 5 ( mod 11 ) {\textstyle 7^{2}\equiv 49\equiv 5{\pmod {11}}} and 4 2 16 5 ( mod 11 ) {\textstyle 4^{2}\equiv 16\equiv 5{\pmod {11}}} .

Correctness proof

The algorithm finds factorization of f z ( x ) {\displaystyle f_{z}(x)} in all cases except for ones when all numbers z + λ 1 , z + λ 2 , , z + λ n {\displaystyle z+\lambda _{1},z+\lambda _{2},\ldots ,z+\lambda _{n}} are quadratic residues or non-residues simultaneously. According to theory of cyclotomy,[8] the probability of such an event for the case when λ 1 , , λ n {\displaystyle \lambda _{1},\ldots ,\lambda _{n}} are all residues or non-residues simultaneously (that is, when z = 0 {\displaystyle z=0} would fail) may be estimated as 2 k {\displaystyle 2^{-k}} where k {\displaystyle k}  is the number of distinct values in λ 1 , , λ n {\displaystyle \lambda _{1},\ldots ,\lambda _{n}} .[1] In this way even for the worst case of k = 1 {\displaystyle k=1} and f ( x ) = ( x λ ) n {\displaystyle f(x)=(x-\lambda )^{n}} , the probability of error may be estimated as 1 / 2 {\displaystyle 1/2} and for modular square root case error probability is at most 1 / 4 {\displaystyle 1/4} .

Complexity

Let a polynomial have degree n {\displaystyle n} . We derive the algorithm's complexity as follows:

  1. Due to the binomial theorem ( x z ) k = i = 0 k ( k i ) ( z ) k i x i {\textstyle (x-z)^{k}=\sum \limits _{i=0}^{k}{\binom {k}{i}}(-z)^{k-i}x^{i}} , we may transition from f ( x ) {\displaystyle f(x)} to f ( x z ) {\displaystyle f(x-z)} in O ( n 2 ) {\displaystyle O(n^{2})} time.
  2. Polynomial multiplication and taking remainder of one polynomial modulo another one may be done in O ( n 2 ) {\textstyle O(n^{2})} , thus calculation of x 2 k mod f z ( x ) {\textstyle x^{2^{k}}{\bmod {f}}_{z}(x)} is done in O ( n 2 log p ) {\textstyle O(n^{2}\log p)} .
  3. Binary exponentiation works in O ( n 2 log p ) {\displaystyle O(n^{2}\log p)} .
  4. Taking the gcd {\displaystyle \gcd } of two polynomials via Euclidean algorithm works in O ( n 2 ) {\displaystyle O(n^{2})} .

Thus the whole procedure may be done in O ( n 2 log p ) {\displaystyle O(n^{2}\log p)} . Using the fast Fourier transform and Half-GCD algorithm,[9] the algorithm's complexity may be improved to O ( n log n log p n ) {\displaystyle O(n\log n\log pn)} . For the modular square root case, the degree is n = 2 {\displaystyle n=2} , thus the whole complexity of algorithm in such case is bounded by O ( log p ) {\displaystyle O(\log p)} per iteration.[7]

References

  1. ^ a b c d e f g Berlekamp, E. R. (1970). "Factoring polynomials over large finite fields". Mathematics of Computation. 24 (111): 713–735. doi:10.1090/S0025-5718-1970-0276200-X. ISSN 0025-5718.
  2. ^ a b c d M. Rabin (1980). "Probabilistic Algorithms in Finite Fields". SIAM Journal on Computing. 9 (2): 273–280. CiteSeerX 10.1.1.17.5653. doi:10.1137/0209024. ISSN 0097-5397.
  3. ^ Donald E Knuth (1998). The art of computer programming. Vol. 2 Vol. 2. ISBN 978-0201896848. OCLC 900627019.
  4. ^ Tsz-Wo Sze (2011). "On taking square roots without quadratic nonresidues over finite fields". Mathematics of Computation. 80 (275): 1797–1811. arXiv:0812.2591. doi:10.1090/s0025-5718-2011-02419-1. ISSN 0025-5718. S2CID 10249895.
  5. ^ R. Peralta (November 1986). "A simple and fast probabilistic algorithm for computing square roots modulo a prime number (Corresp.)". IEEE Transactions on Information Theory. 32 (6): 846–847. doi:10.1109/TIT.1986.1057236. ISSN 0018-9448.
  6. ^ C Padró, G Sáez (August 2002). "Taking cube roots in Zm". Applied Mathematics Letters. 15 (6): 703–708. doi:10.1016/s0893-9659(02)00031-9. ISSN 0893-9659.
  7. ^ a b c d Alfred J. Menezes, Ian F. Blake, XuHong Gao, Ronald C. Mullin, Scott A. Vanstone (1993). Applications of Finite Fields. The Springer International Series in Engineering and Computer Science. Springer US. ISBN 9780792392828.{{cite book}}: CS1 maint: multiple names: authors list (link)
  8. ^ Marshall Hall (1998). Combinatorial Theory. John Wiley & Sons. ISBN 9780471315186.
  9. ^ Aho, Alfred V. (1974). The design and analysis of computer algorithms. Addison-Wesley Pub. Co. ISBN 0201000296.
  • v
  • t
  • e
Number-theoretic algorithms
Primality testsPrime-generatingInteger factorizationMultiplicationEuclidean divisionDiscrete logarithmGreatest common divisorModular square rootOther algorithms
  • Italics indicate that algorithm is for numbers of special forms