Finite Automata and Arithmetic: J.-P. Allouche

Finite automata and arithmetic
∗
J.-P. Allouche
1 Introduction
The notion of sequence generated by a finite automaton, (or more precisely
a finite automaton with output function, i. e. a “uniform tag system”) has
been introduced and studied by Cobham in 1972 (see [19]; see also [24]).
In 1980, Christol, Kamae, Mendès France and Rauzy, ([18]), proved that a
sequence with values in a finite field is automatic if and only if the related
formal power series is algebraic over the rational functions with coefficients
in this field: this was the starting point of numerous results linking automata
theory, combinatorics and number theory. Our aim is to survey some results
in this area, especially transcendence results, and to provide the reader with
examples of automatic sequences. We will also give a bibliography where
more detailed studies can be found. See in particular the survey of Dekking,
Mendès France and van der Poorten, [22], or the author’s, [2], where many
relations between finite automata and number theory (and between finite
automata and other mathematical fields) are described. For applications of
finite automata to physics see [6].
In the first part of this paper we will recall the basic definitions and give
the theorem of Christol, Kamae, Mendès France and Rauzy. We will also
give five typical examples of sequences generated by finite automata.
In the second part we will discuss transcendence results related to au-
tomata theory, giving in particular some results concerning the Carlitz zeta
function.
We will indicate in the third part of this paper the possible generaliza-
tions of these automatic sequences.
Finally in an appendix we will give an elementary “automatic” proof of
the transcendence of the Carlitz formal power series Π.
∗
C. N. R. S., Mathématiques, 351 cours de la Libération, F-33405 Talence Cedex,
(France).
1
2 Generalities, examples, the main theorem
2.1 Sequences generated by finite automata
Definition 1 Let q be an integer (q ≥ 2). A q-automaton consists of
- a finite set S = {a1 = i, a2 , · · · , ad }, which is called the set of states.
One of the states is denoted by i and called the initial state,
- q maps from S to itself, labelled 0, 1, · · · , q − 1. The image of the state
s by the map j is denoted by j.s,
- a map (the output function), say ϕ, from S to a set Y .
This “machine” generates a sequence with values in Y , say (un )n≥0 , as fol-
lows: to compute the term un , one expands n in base q, say n = `j=0 nj q j ,
P
with 0 ≤ nj ≤ q − 1. Then each nj is interpreted as one of the maps from S

to itself, and these maps are applied to i to obtain:
un = ϕ [n` (n`−1 (· · · (n1 (n0 .i)) · · ·))] .
Such a sequence is called a q-automatic sequence.
2.2 Examples
1) The Prouhet-Thue-Morse sequence
This sequence has been studied by Thue at the beginning of the century,
to give an example of a binary sequence without cubes (i. e. without three
consecutive identical blocks, see [40] and [41]), by Morse in the 20’s, (see
[31]), but also by Prouhet in 1851 ([32]). It can be defined by the following
2-automaton :
- the set of states is S = {i, a},
- the maps 0 and 1 from S to S are defined by,
0.i = i, 0.a = a,
1.i = a, 1.a = i,
- the output function is defined by,
ϕ(i) = 0, ϕ(a) = 1.
Hence this sequence begins by:
011010011001 ···
2
1
' - $

0 i a 0

& %
1
ϕ(i) = 0, ϕ(a) = 1
Figure 1: An automaton which generates the Prouhet-Thue-Morse sequence
2) The Rudin-Shapiro sequence

Let a = (an )n≥0 be any sequence of ±1. What can be said of the
asymptotic size of the supremum of its Fourier transform
N
X −1
FN (a) = sup | an e2iπnx |?
x∈[0,1] n=0
The following bounds are trivial:
√ N
X −1 N
X −1
N =k an e2iπnx kL2 ≤k an e2iπnx kL∞ = FN (a) ≤ N.
n=0 n=0
On the other hand, for almost all (in the sense of the Haar measure on
{−1, +1}IN ) sequences of ±1, one has
p
FN (a) ≤ N log N .
In
√ other words, for a “random” sequence a, FN (a) behaves roughly like
N . Shapiro in 1951 ([37])
√ and Rudin in 1959 ([33]) constructed a sequence
a for which FN (a) ≤ C N and which is deterministic for any reasonable
definition of this notion. Moreover this sequence is 2-automatic, and can be
generated by the following 2-automaton:
- set of states S = {i, a, b, c},
- maps from S to S,
0.i = i, 0.a = i, 0.b = c, 0.c = c,
1.i = a, 1.a = b, 1.b = a, 1.c = b,
3
- output function,
ϕ(i) = ϕ(a) = +1,
ϕ(b) = ϕ(c) = −1.
+1 + 1 + 1 − 1 + 1 + 1 − 1 + 1 + 1 · · ·
-1 -1
-0

i
0
a b c 0

0 1 1
ϕ(i) = ϕ(a) = +1, ϕ(b) = ϕ(c) = −1
Figure 2: An automaton which generates the Rudin-Shapiro sequence
3) The paperfolding sequence

Folding repeatedly a sheet of paper yields a sequence of “peaks” Λ and
“valleys” V which has been studied by many authors since the paper of Davis
and Knuth ([21]). This sequence can indeed be generated by the following
2-automaton:
- set of states S = {i, a, b, c},
- maps from S to S,
0.i = a, 0.a = b, 0.b = b, 0.c = c,
1.i = i, 1.a = c, 1.b = b, 1.c = c,

- output function,
ϕ(i) = ϕ(a) = ϕ(b) = V, ϕ(c) = Λ.
VVΛVVΛΛV ···
4

b
01

0
*

0

1 i - a
HH
H
HH1
jH
H
HH
H
H
c
01
ϕ(i) = ϕ(a) = ϕ(b) = V, ϕ(c) = Λ
Figure 3: An automaton which generates the paperfolding sequence
4) The Baum-Sweet sequence

It is well known that, if a real number is quadratic over the rationals,
then its continued fraction expansion is periodic or ultimately periodic. But
nothing is known for algebraic numbers of degree ≥ 3: no example is known
with bounded partial quotients, nor with unbounded quotients.
If one replaces the real numbers by the field of Laurent series IFq ((X −1 ))
over the finite field IFq , the field of rational numbers by IFq (X), and the ring
of integers ZZ by the ring of polynomials IFq [X], more is known. There is
indeed a theory of continued fractions, and the property of bounded partial
quotients has to be replaced by the property of quotients of bounded degree
(or equivalently quotients taking a finite number of values).
A first result has been given by Baum and Sweet in 1976 ([12]): there
exists a Laurent series in IF2 ((X −1 )), of degree 3 over IF2 (X), such that its
continued fraction has only finitely many partial quotients. By the Chris-
tol, Kamae, Mendès France and Rauzy theorem, (where the variable X is
replaced by X −1 ), the sequence of coefficients of the Baum-Sweet series is
2-automatic. Here is a 2-automaton which generates this sequence:
- set of states S = {i, a, b},
- maps from S to S,
0.i = a, 0.a = i, 0.b = b,
1.i = i, 1.a = b, 1.b = b,
- output function,
ϕ(i) = 1, ϕ(a) = ϕ(b) = 0.

110110010 ···
5
' -0 $

1

1 i a - b
0 1

& %
0
ϕ(i) = 1, ϕ(a) = ϕ(b) = 0
Figure 4: An automaton which generates the Baum-Sweet sequence
It can be shown that the term un of the Baum-Sweet sequence u is

equal to 1 if and only if there is no string of 0’s of odd length in the binary
expansion of n. Other examples have been given by Mills and Robbins for all
characteristics ([29]). A natural question due to Mendès France arises: given
an algebraic Laurent series whose partial quotients take only a finite number
of values, is the sequence of these partial quotients automatic? (remember
that the sequence of coefficients of the series is itself automatic from the
Christol, Kamae, Mendès France and Rauzy theorem). The answer is yes
for the example of Mills and Robbins in characteristic 3, (see [9]), and for
their examples in characteristic p ≥ 3, (see [3]). The sequence of partial
quotients of the Baum-Sweet series has recently been shown non-automatic
by Mkaouar, [30], but this sequence can be generated by a non-uniform
morphism, see below.
5) The Hanoi sequence

The well known towers of Hanoi game is the following: N disks of diam-
eter, say 1, 2, · · ·, N , are stacked on the first of three vertical pegs. At each
step one is allowed to pick the topmost disk on a peg and to put it on another
one, provided it is not stacked on a smaller disk. The game is over when all
the disks are on a (new) peg. A classical recursive algorithm gives a (finite)
sequence of moves of length 2N − 1, which is optimal, to transfer N disks
from the first peg to another one. If one chooses to transfer the disks to the
second peg if N is odd, and to the third one if N is even, all the sequences
of moves of length 2N − 1 given by the algorithm are prefixes of a unique
infinite sequence of moves on the six-letter alphabet of all possible moves.
It has been proved in [10] that this infinite sequence is indeed 2-automatic,
and that it can be generated by the 2-automaton given below. Note that in
6
the cyclic towers of Hanoi, (where the pegs are on a circle and where only
clockwise moves are allowed), the infinite sequence of moves resulting from
the classical cyclic algorithm is NOT automatic, but can be generated by a
non-uniform morphism, (see [8]).
-1
cm am
1
0 TTR 0
0 T
0T
IT
cm
T Tm
a

1 TTR 1 1

1
T
m 1T
IT 1
T Tm -0

bm

0

b
1

)
0 0
m

P
PP
-0
bm bm
PP
PP
PPqP
1 PP 0
1 TTR 1
PP 1 T
PP 1T
IT
P m T Tm
P
a c
TTR 0 0

T 0
0T
IT
T Tm -1
cm

a
1
(The - significant - states have been replaced by their images by ϕ)
Figure 5: An automaton which generates the Hanoi sequence
2.3 Sequences generated by uniform morphisms

Definition 2 A sequence u = (un )n≥0 with values in a finite set Y is said
to be the image of a fixed point of a uniform morphism of length q, (q being
an integer ≥ 2), if there exists:
7
- a set A,
- a uniform morphism σ of length q on A, i. e. a map which associates to
each letter in A a q-letter word on A. This map is extended by concatenation
to a morphism of the free monoid A∗ generated by A, and by continuity to
the infinite sequences with values in A,
- a sequence v = (vn )n≥0 with values in A, which is a fixed point of σ,
- a map ψ from A to Y such that ∀n ∈ IN, ψ(vn ) = un .
2.4 Examples
The patient reader can check (or prove) that the five examples given previ-
ously are images of fixed points of uniform morphisms of length 2, indeed:
1) The Prouhet-Thue-Morse sequence

This sequence is the fixed point of the 2-morphism on {0, 1} given by:
σ(0) = 01,
σ(1) = 10.
2) The Rudin-Shapiro sequence

Let A = {a, b, c, d}, define σ on A by:
σ(a) = ab,
σ(b) = ac,
σ(c) = db,
σ(d) = dc,
and let ψ be the map:
ψ(a) = ψ(b) = +1, ψ(c) = ψ(d) = −1.
Then the sequence v = (vn )n≥0 defined by v = limk→∞ σ k (a) is a fixed point
of σ, and the Rudin-Shapiro sequence is the pointwise image of the sequence
v by the map ψ.
8
3) The paperfolding sequence
σ(a) = ab,
σ(b) = cb,
σ(c) = ad,
σ(d) = cd,
ψ(a) = ψ(b) = V, ψ(c) = ψ(d) = Λ.
of σ, and the paperfolding sequence is the pointwise image of the sequence
v by the map ψ.
4) The Baum-Sweet sequence

σ(a) = ab,
σ(b) = cb,
σ(c) = bd,
σ(d) = dd,
ψ(a) = ψ(b) = 1, ψ(c) = ψ(d) = 0.
of σ, and the Baum-Sweet sequence is the pointwise image of the sequence
v by the map ψ.
5) The Hanoi sequence

This sequence is the fixed point of the 2-morphism σ defined on the
alphabet A = {a, b, c, a, b, c} by:
σ(a) = ac,
σ(b) = cb,
σ(c) = ba,
σ(a) = ac,
σ(b) = cb,
σ(c) = ba.
9
2.5 The main theorem
It is not by chance that our five examples are simultaneously 2-automatic
and images of fixed points of uniform morphisms of length 2. Indeed a
theorem due to Cobham, [19], asserts that this is general:
Theorem 1 ([19]). A sequence is q-automatic if and only if it is the image

of a fixed point of a q-substitution.
The proof of this theorem uses a combinatorial property of these se-

quences: both properties above are equivalent to saying that the set of
subsequences Nq (u) defined by:
n o
Nq (u) = n → uqk n+a , k ≥ 0, 0 ≤ a ≤ q k − 1J ,
(also called the q-kernel of the sequence u, see [35]), is finite.
Christol, Kamae, Mendès France and Rauzy, gave in 1980, [18], an arith-
metical condition which is equivalent to the theoretical-computer-science
condition and to the combinatorial condition given above:
Theorem 2 ([18]). Let u be a sequence with values in the finite field IFq ,
(q is a power of a prime number p). Then the sequence u is q-automatic if
and only if the formal power series +∞ n
P
n=0 un X is algebraic over the field
IFq (X) of rational functions with coefficients in IFq .
Remarks
- this theorem has, of course, nothing to do with the Chomsky-Schützen-
berger theorem, ([17]);
- to give the flavour of this theorem, let us consider again the Prouhet-
Thue-Morse sequence quoted above. Remember that this sequence is the
fixed point of the 2-morphism σ defined by:
σ(0) = 01,
σ(1) = 10.
We consider from now on 0 and 1 as the two elements of IF2 and we make all
computations modulo 2. The definition of our sequence u by the morphism
σ shows that:
∀n ∈ IN, u2n = un , u2n+1 = 1 + un .
10
Hence:
+∞
X +∞
X +∞
X
F (X) := un X n = u2n X 2n + u2n+1 X 2n+1
n=0 n=0 n=0
+∞
X +∞
X
= un X 2n + (1 + un )X 2n+1
n=0 n=0
+∞ +∞
X
n 2
X X
= ( un X ) + X( un X n )2 +
n=0 n=0
(1 + X)2
2 2 X
= F (X) + XF (X) + .
(1 + X)2
One sees that F satisfies the equation:
(1 + X)3 F 2 + (1 + X)2 F + X = 0,
which shows that F is algebraic (quadratic) on IF2 (X).
Another condition can be given for the automaticity of a sequence with

values in a finite field. This is a theorem of Furstenberg’s, which he proved
in 1967, [25]:
Theorem 3 [25]. Let u = (un )n≥0 be a sequence with values in the finite
un X n is algebraic over the field IFq (X) if and
P
field IFq . Then the series
only if there exists a double formal power series m,n≥0 am,n X m Y n such
P
that
- this series is a rational function, i. e. belongs to the field IFq (X, Y ),
- the sequence u is the diagonal of the sequence a, i. e. : ∀n ∈ IN,
un = an,n .
Putting all these conditions together one obtains the following funda-
mental theorem:
Fundamental Theorem Let u = (un )n≥0 be a sequence with values in the

finite field IFq . Then the following conditions are equivalent:
i) the q-kernel of the sequence u, i. e. the set of subsequences
n o
Nq (u) = n → uqk n+a , k ≥ 0, 0 ≤ a ≤ q k − 1J ,
is finite,
11
ii) the sequence u is q-automatic,
iii) the sequence u is the image of a fixed point of a uniform morphism
of length q,
iv) the formal power series +∞ n
P
n=0 un X is algebraic over the field IFq (X),
v) there exists a double sequence a = (am,n )m,n with values in IFq such
that the formal power series m,n≥0 am,n X m Y n is a rational function, (i.
P
e. an element of IFq (X, Y )), and such that u is the diagonal of a, (i. e.
∀n ∈ IN, un = an,n ).
3 Transcendence results and finite automata

In this chapter we will see two kinds of transcendence results:
- transcendence over IFq (X) of formal power series with coefficients in
IFq , using the Christol, Kamae, Mendès France and Rauzy theorem. In
particular we will devote a paragraph to the Carlitz zeta function.
- transcendence of real numbers over the rational numbers.
3.1 Miscellaneous transcendental formal power series

- An old question of Mahler’s asks whether a binary sequence (an )n such that
both numbers +∞ −n and P+∞ a 3−n are algebraic over the rational
P
n=0 an 2 n=0 n
numbers is necessary an ultimately periodic sequence, (i. e. whether both
numbers are “trivial”, indeed whether they are both rational). Actually,
although this question is still open, the result is true if one replaces the
usual operations by operations without carries:
Theorem 4 Let (an )n be a binary sequence such that the formal power se-
ries +∞ n
P
n=0 an X is algebraic over IF2 (X) when considered as an element
of IF2 [[X]], and algebraic over IF3 (X) when considered as an element of
IF3 [[X]]. Then this sequence is ultimately periodic, i. e. both formal power
series are indeed rational functions.
The proof of this result is an easy consequence of the theorem of Christol,

Kamae, Mendès France and Rauzy and of a (non-easy!) result of Cobham
which asserts that a sequence which is both q-automatic and q 0 -automatic,
with q and q 0 multiplicatively independent, is necessary ultimately periodic,
([20]).
12
- Let sq (n) be the residue modulo q of the sum of the digits of the integer n
in its q-ary expansion. It is not hard to see that the sequence (sq (an + b))n
is q-automatic; in particular for q = 2, a = 1, b = 0, one gets the Prouhet-
Thue-Morse sequence. But what can be said of (sq (n2 ))? A result of the
author ([1]), states that this sequence is NOT q-automatic.
Theorem 5 [1]. Let P be a polynomial of degree ≥ 2, such that P (IN) ⊂

IN. Then the sequence (sq (P (n))n is not q-automatic. Hence, if q is a
prime number, the formal power series sq (P (n))X n is transcendental over
P
IFq (X).
- As seen previously, the paperfolding sequence is 2-automatic, hence the

paperfolding series is algebraic over IF2 (X). Now suppose that at each step
you choose to fold either up or down arbitrarily; you thus obtain an un-
countable number of paperfolding sequences. Of course they cannot be all
automatic, as the set of automatic sequences is countable. It can be shown
that such a sequence is 2-automatic if and only if the sequence of its “folding
instructions” (i. e. the sequence of choices to fold one way or the other way)
is ultimately periodic: in other words any non-ultimately periodic sequence
of folding instructions yields a formal power series which is transcendental
over IF2 (X).
3.2 The Carlitz zeta function

In 1935 Carlitz introduced a function now known as the Carlitz zeta function
which ressembles the Riemann zeta function (see for instance [16]). This
function from IN∗ to IFq [[X −1 ]] is defined by:
1
∀n ∈ IN∗ , ζ(n) =
X
.
P monic ∈ IFq [X]
Pn
Moreover there exists a formal Laurent series denoted by Π such that:
∀n ≡ 0 mod (q − 1), n 6= 0, ∃rn ∈ IFq (X), ζ(n) = Πn rn .
The expression for Π is:

+∞ j
!
Y Xq − X
Π= 1 − qj+1 .
j=1
X −X
13
Note that this property can be compared to the classical result on the values
of the Riemann zeta function at the even integers.
One can ask whether this formal Laurent series Π, the values of this zeta
function, and the values ζ(n)
Πn are transcendental over IFq (X). Remember
that the real number π is transcendental over the field of rational numbers,
hence the values of the Riemann zeta function at the even integers are also
transcendental, as the numbers ζ(2n)/π 2n are rational. For the other values
of the Riemann zeta function, (divided by a suitable power of π or not), the
only thing which is known is the irrationality of ζ(3) proved by Apéry in
1978.
Four methods are available for the Carlitz zeta function and the related
series:
- the original method due to Wade in the 40’s, (he proved many tran-
scendence results, in particular the transcendence of the formal power series
Π), ressembles transcendence methods for the case of real numbers. This
method has been extended recently by Dammame and Hellegouarch, who
proved the transcendence of all the values ζ(n), ∀n ∈ IN∗ ;
- the method of diophantine approximation is worked out by de Mathan
and Chérif and gives irrationality measures for the values of the Carlitz zeta
function;
- the method of Yu uses Drinfeld modules and gives the most complete
results, indeed ζ(n) is transcendental ∀n ∈ IN∗ and ζ(n) Πn is transcendental
for every n 6≡ 0 mod (q − 1);
- the “automatic method”. This method has been proposed by the au-
thor to give an “elementary” proof of the transcendence of the formal series
Π, (see [5]). The reader will find in the appendix a different (but even sim-
pler) proof of the transcendence of this series Π. This “automatic” method
has been recently extended by Berthé: she gave an elementary automatic
proof of the transcendence of ζ(n), ∀n ≤ q − 2, (see [13]), as well as linear
independence results for these series, ([14]), and transcendence results for
the Carlitz logarithm, (see [15]).
3.3 Transcendence of real numbers and finite automata

The consequence of Cobham’s theorem quoted above (Theorem 4) can be
described, roughly speaking, by saying: “changing bases kills algebraicity”.
Hence a natural question posed in [18] asks whether every real number
an 2−n such that the sequence of coefficients in its base-2 expansion is
P
2-automatic and not ultimately periodic is indeed a transcendental number.
14
The answer is yes, it is due to Loxton and van der Poorten, (see also the
work of Nishioka):
Theorem 6 [27]. If the coefficients of the base-q expansion of a real number
form an automatic sequence, then this number is either rational or transcen-
dental.
√
In other words a number like 2 cannot have an automatic expansion
in any base. Note that this theorem gives the transcendence of a countable
set of “ad hoc” real numbers, and that one should not hope to get that way
the transcendence of classical numbers like the Euler constant (!), even for
numbers which are known to be transcendental: a reasonable but out of
reach conjecture is that the real numbers π and e are not automatic. Note
also that Mendès France and van der Poorten proved that a real number
whose base-2 expansion is any paperfolding sequence is transcendental, (see
[28]), this gives an uncountable (but “thin”) set of numbers, (which of course
are not all automatic numbers), for which the transcendence can be proved
using this kind of methods.
4 Generalizations
In this chapter we will survey quickly some possible generalizations of the
automatic sequences. The interested reader can find a survey with more
details in [7], in particular what is kept and what is lost in each of these
generalizations.
4.1 The multidimensional case

Instead of considering one-dimensional morphisms which consist of replacing
a letter by a word, one can imagine of a multidimensional morphism. Thus a
two-dimensional morphism associates to each letter a “square”, for instance:
0 1 1 0
0→ , 1→ .
1 0 0 1
This can be extended as previously, iterating this map gives:
0 1 1 0
0 1 1 0 0 1
0→ → → ···
1 0 1 0 0 1
0 1 1 0
15
The reader can find in [34] and [35] more details, in particular a theorem
analogous to the Christol, Kamae, Mendès France and Rauzy theorem holds
true.
4.2 Non-uniform morphisms

A non-uniform morphism maps each letter of a finite alphabet on a word
with letters in this alphabet, but all these words do not have necessarily
the same length. A classical example is the Fibonacci morphism defined on
{0, 1} by:
0 → 01, 1 → 0.
Iterating this morphism gives:
0 → 01 → 010 → 01001 → 01001010 → · · ·
The arithmetic properties of the infinite sequences which are (images of)
fixed points of these morphisms are not very simple as the numeration base
associated to them is not the base q for some integer q ≥ 2. For instance,
in the above example, the numeration base is the Fibonacci base: F0 = 1,
F1 = 2, F2 = 3, F3 = 5, · · ·
On this subject one can read the paper of Shallit ([36]), and also the
work of Fabre.
4.3 From finite fields to fields of positive characteristic

Remember that the theorem of Christol, Kamae, Mendès France and Rauzy
can be stated in the following way, (see theorem 2):
Let q be an integer ≥ 2. For a sequence u = (un )n define its q-kernel as
the set of subsequences
n o
Nq (u) = n → uqk n+a , k ≥ 0, 0 ≤ a ≤ q k − 1 .
un X n
P
Suppose that u takes its values in the finite field IFq . Then the series
is algebraic over IFq (X) if and only if its q-kernel Nq (u) is finite.
The main result obtained by Sharif and Woodcock in [38] and Harase in [26]
(see also the survey of the author [4]), can be stated as follows:
Theorem 7 [38], [26]. Let u be a sequence with values in a field K of

positive characteristic p. Let s be any integer ≥ 1, q = ps , and let K be a
perfect field containing K, (for instance its algebraic closure).
16
Then the series un X n is algebraic over K(X) if and only if the vector
P
space spanned over K by the “modified” q-kernel of u

1/q k
Nq0 (u) = n→ uqk n+a , k
k ≥ 0, 0 ≤ a ≤ q − 1J .
has finite dimension.
Note that this theorem contains the Christol, Kamae, Mendès France and
Rauzy theorem, and that it can be easily extended to the multidimensional
case. Note also that two interesting corollaries can been proved, using the
work of Salon for a finite field, or more generally the above theorem for a
field of positive characteristic, (these results have been first given by Deligne
by a non-elementary method in [23]):
- the Hadamard product of two algebraic formal power series with coef-
ficients in a field of positive characteristic, un X n and vn X n , i. e. the
P P
“naive” product un vn X n , is itself an algebraic formal power series.

P
um,n X m Y n be a double formal power series in K[[X, Y ]], alge-

P
- let
braic over K(X, Y ), (where K is a field of positive characteristic). Then its
diagonal un,n X n is algebraic over the field K(X).
P
4.4 q-regular sequences

Let s(n) be the sum of the digits of n in the binary expansion, then the
sequence (s(n))n mod 2 is the Prouhet-Thue-Morse sequence, hence is a 2-
automatic sequence. What can be said of the sequence (s(n))n not reduced
modulo 2?
The notion of q-regular sequence has been introduced by Shallit and the
author in [11] to answer this question inter alia.
Let q be an integer ≥ 2. Let u = (un )n be a sequence with values in a
Nœtherian ring R. This sequence is said to be q-regular if its kernel Nq (u)
generates a module of finite type.
(Remember
n that the q-kernel of theosequence u is defined as: Nq (u) =
n → uqk n+a , k ≥ 0, 0 ≤ a ≤ q k − 1J ).
The reader is referred to [11] for the properties of these sequences and for
numerous examples of such sequences, together with “their Sloane numbers”
for the sequences which are quoted in Sloane’s book [39].
17
5 Appendix: an easy “automatic” proof of the
transcendence of the formal power series Π
As said in chapter 3.2 the Carlitz formal power series Π, given by
+∞ j
!
Y Xq − X
Π= 1 − qj+1 .
j=1
X −X
has been proved transcendental by Wade in the 40’s. We gave an “auto-
matic” proof of this result in [5]. We want to present now another - still
simpler - “automatic” proof.
The first step consists of a remark due to Laurent Denis concerning an
0
expression for ΠΠ . Indeed taking the derivative of the expression of Π ∈
IFq ((X −1 )), one has:
j
0
X q −X
X 1−
Π0 +∞ Xq
j+1
−X
+∞
X 1
= j =
q j+1 − X
.
Π j=1 1 −
X q −X
j+1 j=1
X
Xq −X
j
A traditional notation is [j] = X q − X, hence the above equality can be
written
Π0 +∞X 1
= .
Π j=1
[j + 1]
If Π were algebraic, that would be the case also for Π0 , (the proof is left
to the reader who might use - for instance - the Christol, Kamae, Mendès
France an Rauzy theorem, where the variable X is replaced by X −1 ), hence
0
for ΠΠ .
Finally to prove the transcendence of Π it suffices to prove the transcen-
dence of the series +∞ 1
P
j=1 [j] . This series is known as the “bracket” series and
has been proved transcendental by Wade, but we gave in [5] an “automatic”
proof for the transcendence of slightly more general series. We rewrite here
this - easy - proof in the case of the bracket series. One has:
+∞ +∞ +∞ qj −1 !−1
1 1 X 1 1
X X
= j = 1−
j=1
q
[j] j=1 X − X j=1
X qj X
 
X 1 qj +m(qj −1) j
1 X 1 m(q −1) 1 X X  1
= = = 1 .
 
 Xn

X X X X n≥1 
j≥1 j≥1 j,m
m≥0 m≥1 m(q j −1)=n
18
This last expression can also be written
 
1 X  X
 1
1 .

 Xn

X n≥1 
j
(q j −1)|n
Using the theorem of Christol, Kamae, Mendès France and Rauzy one sees
that the series above is algebraic over IFq (X) if and only if the sequence
X
n→ 1
j
(q j −1)|n
is q-automatic. So we have to prove that it is not.

But if a sequence v is q-automatic, then the subsequence n → vqn −1 is
ultimately periodic, (hint: the base-q expansion of q n − 1 consists of n digits
all equal to q − 1). It thus suffices to show that the sequence:
X
n→ 1
j
(q j −1)|(q n −1)
is not ultimately periodic. But it is well known that (q j − 1) | (q n − 1) if and

only if j | n. Hence, using the classical notation τ (n) to denote the number
of divisors of the integer n, it suffices to show that the sequence
n → τ (n)
is not ultimately periodic. OF COURSE THIS SEQUENCE HAS TO BE

TAKEN modulo p, where p is the characteristic of IFq .
Now, suppose that (τ (n))n mod p is ultimately periodic. Then there
exist two integers T ≥ 1 and n0 ≥ 1 such that:
∀n ≥ n0 , ∀k ∈ IN, τ (n + kT ) ≡ τ (n) mod p.
This implies
∀n ≥ n0 , ∀k ∈ IN, τ (n(1 + kT )) = τ (n + knT ) ≡ τ (n) mod p.
Now choose k large enough such that (1 + kT ) ≥ n0 and (1 + kT ) is a prime

number, say ω: this is possible from the arithmetic progression theorem for
19
prime numbers, (note that this case, i. e. the existence of arbitrarily large
prime numbers in the progression 1+kT , can be proved in a very elementary
way, using cyclotomic polynomials). Taking n = (1 + kT ) = ω, one gets:
τ (ω 2 ) ≡ τ (ω) mod p,
i. e.
3 ≡ 2 mod p,
which yields the desired contradiction.
References
[1] J.-P. Allouche, Somme des chiffres et transcendance, Bull. Soc. math.
France, 110 (1982), 279–285.
[2] J.-P. Allouche, Automates finis en théorie des nombres, Expo. Math.,
5 (1987), 239–266.
[3] J.-P. Allouche, Sur le développement en fraction continue de certaines

séries formelles, C. R. Acad. Sci. Paris, Série I, 307 (1988), 631–633.
[4] J.-P. Allouche, Note sur un article de Sharif et Woodcock, Séminaire de

Théorie des Nombres de Bordeaux, Série II, 1 (1989), 163–187.
[5] J.-P. Allouche, Sur la transcendance de la série formelle Π, Séminaire

de Théorie des Nombres de Bordeaux, Série II, 2 (1990), 103–117.
[6] J.-P. Allouche, Finite automata in 1-dimensional and 2-dimensional

physics, Number theory and physics, J.-M. Luck, P. Moussa, M. Wald-
schmidt (Eds.), Proceedings in Physics, Springer, 47 (1990), 177–184.
[7] J.-P. Allouche, q-regular sequences and other generalizations of q-

automatic sequences, Lecture Notes in Computer Science, 583 (1992),
15–23.
[8] J.-P. Allouche, Note on the cyclic towers of Hanoi, Theoret. Comput.
Sci., (to appear).
[9] J.-P. Allouche, J. Bétréma and J. Shallit, Sur des points fixes de mor-
phismes d’un monoı̈de libre, Inform. Théor. Appl., 23 (1989), 235–249.
20
[10] J.-P. Allouche and F. Dress, Tours de Hanoı̈ et automates, Inform.
Théor. Appl., 24 (1990), 1–15.
[11] J.-P. Allouche and J. Shallit, The ring of k-regular sequences, Theoret.
Comput. Sci., 98 (1992), 163–197.
[12] L. E. Baum and M. M. Sweet, Continued fractions of algebraic power
series in characteristic 2, Ann. of Math., 103 (1976), 539–610.
[13] V. Berthé, De nouvelles preuves “automatiques” de transcendance
pour la fonction zéta de Carlitz, Journées arithmétiques de Genève,
Astérisque, 209 (1992), 159–168.
ζ(s)
[14] V. Berthé, Combinaisons linéaires de Πs sur IFq (x), pour 1 ≤ s ≤
(q − 2), submitted.
[15] V. Berthé, Automates et valeurs de transcendance du logarithme de
Carlitz, Acta Arithmetica, (to appear).
[16] L. Carlitz, On certain functions connected with polynomials in a Galois
field, Duke Math. J., 1 (1935), 137–168.
[17] N. Chomsky and M. P. Schützenberger, The algebraic theory of context-
free languages, in Computer programming and formal languages, 118–
161, North Holland, Amsterdam, 1963.
[18] G. Christol, T. Kamae, M. Mendès France, G. Rauzy, Suites algébri-
ques, automates et substitutions, Bull. Soc. math. France, 108 (1980),
401–419.
[19] A. Cobham, Uniform tag sequences, Math. Systems Theory, 6 (1972),
164–192.
[20] A. Cobham, On the base-dependence of sets of numbers recognizable by
finite automata, Math. Systems Theory, 3 (1969), 186–192.
[21] C. Davis and D. E. Knuth, Number representations and dragon curves,
I, II, J. Recr. Math. 3 (1970), 161–181 and 133–149.
[22] M. Dekking, M. Mendès France and A. van der Poorten, FOLDS!,
Math. Intell. 4 (1982), 130–138, 173–181 and 190–195.
[23] P. Deligne, Intégration sur un cycle évanescent, Invent. Math., 76
(1984), 129–143.
21
[24] S. Eilenberg, Automata, Languages and Machines, vol. A, Acad. Press,
New York, 1974.
[25] H. Furstenberg, Algebraic functions over finite fields, J. Algebra, 7
(1967), 271–277.
[26] T. Harase, Algebraic elements in formal power series rings, Israel J.
Math., 63 (1988), 281–288.
[27] J. H. Loxton and A. J. van der Poorten, Arithmetic properties of au-
tomata: regular sequences, J. Reine Angew. Math., 392 (1988), 57–69.
[28] M. Mendès France and A. van der Poorten, Arithmetic and analytic
properties of paperfolding sequences, dedicated to K. Mahler, Bull. Aus-
tral. Math. Soc., 24 (1981), 123–131.
[29] W. H. Mills and D. P. Robbins, Continued fractions for certain algebraic
power series, J. Number Theory, 23 (1986), 388–404.
[30] M. Mkaouar, Thèse, Lyon, 1993.
[31] M. Morse, Recurrent geodesics on a surface of negative curvature, Trans.
Amer. Math. Soc., 22 (1921), 84–100.
[32] E. Prouhet, Comptes-Rendus de l’Académie des Sciences, Paris, 33
(1851), 225.
[33] W. Rudin, Some theorems on Fourier coefficients, Proc. Amer. Math.
Soc., 10 (1959), 855–859.
[34] O. Salon, Suites automatiques à multi-indices et algébricité, C. R. Acad.
Sci. Paris, Série I, 305 (1987), 501–504.
[35] O. Salon, Suites automatiques à multi-indices, Séminaire de Théorie des
Nombres de Bordeaux, Exposé 4, 1986–1987.
[36] J. Shallit, A generalization of automatic sequences, Theoret. Comput.
Sci., 61 (1988), 1–16.
[37] H. S. Shapiro, Extremal problems for polynomial and power series, The-
sis, M.I.T., 1951.
[38] H. Sharif and C. F. Woodcock, Algebraic functions over a field of pos-
itive characteristic and Hadamard products, J. Lond. Math. Soc., 37
(1988), 395–403.
22
[39] N. J. A. Sloane, A Handbook of Integer Sequences, Academic Press,
New York, 1973.
[40] A. Thue, Über unendliche Zeichenreihen, Norske vid. Selsk. Skr. I. Mat.
Kl. Christiana, 7 (1906), 1–22.
[41] A. Thue, Lage gleicher Teile gewisse Zeichenreihen, Norske vid. Selsk.
Skr. I. Mat. Kl. Christiana, 1 (1912), 1–67.
23

Finite Automata and Arithmetic: J.-P. Allouche

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Finite Automata and Arithmetic: J.-P. Allouche

Uploaded by

Copyright:

Available Formats

Finite automata and arithmetic

with 0 ≤ nj ≤ q − 1. Then each nj is interpreted as one of the maps from S

Figure 1: An automaton which generates the Prouhet-Thue-Morse sequence

2) The Rudin-Shapiro sequence

The following bounds are trivial:

0.i = i, 0.a = i, 0.b = c, 0.c = c,

1.i = a, 1.a = b, 1.b = a, 1.c = b,

Hence this sequence begins by:

ϕ(i) = ϕ(a) = +1, ϕ(b) = ϕ(c) = −1

Figure 2: An automaton which generates the Rudin-Shapiro sequence

3) The paperfolding sequence

0.i = a, 0.a = b, 0.b = b, 0.c = c,

1.i = i, 1.a = c, 1.b = b, 1.c = c,

ϕ(i) = ϕ(a) = ϕ(b) = V, ϕ(c) = Λ.

Hence this sequence begins by:

Figure 3: An automaton which generates the paperfolding sequence

4) The Baum-Sweet sequence

Hence this sequence begins by:

ϕ(i) = 1, ϕ(a) = ϕ(b) = 0

Figure 4: An automaton which generates the Baum-Sweet sequence

It can be shown that the term un of the Baum-Sweet sequence u is

5) The Hanoi sequence

(The - significant - states have been replaced by their images by ϕ)

Figure 5: An automaton which generates the Hanoi sequence

2.3 Sequences generated by uniform morphisms

1) The Prouhet-Thue-Morse sequence

2) The Rudin-Shapiro sequence

and let ψ be the map:

ψ(a) = ψ(b) = +1, ψ(c) = ψ(d) = −1.

4) The Baum-Sweet sequence

5) The Hanoi sequence

Theorem 1 ([19]). A sequence is q-automatic if and only if it is the image

The proof of this theorem uses a combinatorial property of these se-

(also called the q-kernel of the sequence u, see [35]), is finite.

One sees that F satisfies the equation:

which shows that F is algebraic (quadratic) on IF2 (X).

Another condition can be given for the automaticity of a sequence with

Fundamental Theorem Let u = (un )n≥0 be a sequence with values in the

3 Transcendence results and finite automata

3.1 Miscellaneous transcendental formal power series

The proof of this result is an easy consequence of the theorem of Christol,

Theorem 5 [1]. Let P be a polynomial of degree ≥ 2, such that P (IN) ⊂

- As seen previously, the paperfolding sequence is 2-automatic, hence the

3.2 The Carlitz zeta function

Moreover there exists a formal Laurent series denoted by Π such that:

∀n ≡ 0 mod (q − 1), n 6= 0, ∃rn ∈ IFq (X), ζ(n) = Πn rn .

The expression for Π is:

3.3 Transcendence of real numbers and finite automata

2-automatic and not ultimately periodic is indeed a transcendental number.

4.1 The multidimensional case

This can be extended as previously, iterating this map gives:

4.2 Non-uniform morphisms

0 → 01 → 010 → 01001 → 01001010 → · · ·

4.3 From finite fields to fields of positive characteristic

Theorem 7 [38], [26]. Let u be a sequence with values in a field K of

space spanned over K by the “modified” q-kernel of u

has finite dimension.

“naive” product un vn X n , is itself an algebraic formal power series.

um,n X m Y n be a double formal power series in K[[X, Y ]], alge-

4.4 q-regular sequences

is q-automatic. So we have to prove that it is not.

is not ultimately periodic. But it is well known that (q j − 1) | (q n − 1) if and