The Polynomial method and cardinality of sum-sets in Zp
The Polynomial method and cardinality of sum-sets in Zp
The Polynomial method and cardinality of sum-sets in Zp
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>The</strong> <strong>Polynomial</strong> <strong>method</strong> <strong>and</strong> <strong>card<strong>in</strong>ality</strong> <strong>of</strong><br />
<strong>sum</strong>-<strong>sets</strong> <strong>in</strong> Z p<br />
Dusan Milijancevic<br />
Mathematical High School, Belgrade<br />
dmilijancevic@gmail.com<br />
Mihajlo Cekic<br />
Mathematical High School, Belgrade<br />
mcekic@sezampro.yu<br />
December 6, 2009<br />
Abstract<br />
In this paper we prove two well known theorems Cauchy-Davenport<br />
<strong>in</strong>equality <strong>and</strong> Heilbrunn-Erdos theorem <strong>and</strong> their extension. <strong>The</strong> pro<strong>of</strong><br />
is based on a recently developed polynomial <strong>method</strong>, which <strong>in</strong> a relatively<br />
simple way proves many facts that are considered to be very difficult to<br />
solve.<br />
Keywords: Caucy-Davenport, Erdos-Heilbrunn, polynomial, coefficient<br />
1 Introduction<br />
<strong>The</strong> first appearance <strong>of</strong> the Additive number theory is <strong>in</strong> the works <strong>of</strong> Cauchy<br />
<strong>and</strong> Warr<strong>in</strong>g who sowed a couple <strong>of</strong> fundamental results <strong>in</strong> this area. <strong>The</strong>ir work<br />
is cont<strong>in</strong>ued by Erdos, <strong>and</strong> later on by Alon, Ruzsa <strong>and</strong> others. Even today this<br />
area conta<strong>in</strong>s many open problems that are partially related to the <strong>card<strong>in</strong>ality</strong><br />
<strong>of</strong> a <strong>sum</strong> <strong>of</strong> specific <strong>sets</strong> <strong>in</strong> abelian groups. We will focus on the group <strong>of</strong><br />
residues <strong>of</strong> a prime module equipped with the st<strong>and</strong>ard operation <strong>of</strong> addition +.<br />
Let us def<strong>in</strong>e the <strong>sum</strong> <strong>of</strong> two <strong>sets</strong> <strong>in</strong> Z p as a set <strong>of</strong> all possible Sums <strong>of</strong> their<br />
elements.<br />
Def<strong>in</strong>ition 1 Denote by + the operation on sub<strong>sets</strong> <strong>of</strong> Z p , where p is a prime<br />
number, that is is def<strong>in</strong>ed as A + B = {a + b|a ∈ A, b ∈ B} for all sub<strong>sets</strong> A <strong>and</strong><br />
B <strong>of</strong> Z p .<br />
1
In a similar ve<strong>in</strong> we def<strong>in</strong>e other operations on sub<strong>sets</strong>. We will use only the<br />
difference <strong>of</strong> <strong>sets</strong> whose def<strong>in</strong>ition is bellow.<br />
Def<strong>in</strong>ition 2 Denote by − the operation on sub<strong>sets</strong> <strong>of</strong> Z p , where p is a prime<br />
number, that is def<strong>in</strong>ed as A − B = {a − b|a ∈ A, b ∈ B}, for all sub<strong>sets</strong> A <strong>and</strong><br />
B <strong>of</strong> Z p .<br />
<strong>The</strong> theorem below concerns with the simple estimates <strong>of</strong> <strong>sum</strong> <strong>of</strong> card<strong>in</strong>alities<br />
<strong>of</strong> two <strong>sets</strong>:<br />
<strong>The</strong>orem 1 If A <strong>and</strong> B are f<strong>in</strong>ite non empty sub<strong>sets</strong> <strong>of</strong> Z p the follow<strong>in</strong>g <strong>in</strong>equalities<br />
hold:<br />
a) |A + B| ≥ max{|A|, |B|}<br />
b) |A + B| ≤ |A||B|.<br />
Pro<strong>of</strong>: a) Translate a set A (add one element on all its elements) so that it<br />
conta<strong>in</strong>s 0 <strong>and</strong> keep its notation. Card<strong>in</strong>alities <strong>of</strong> <strong>sets</strong> A, B <strong>and</strong> A + B are same<br />
as before translation. Similarly we can translate B so that it conta<strong>in</strong>s 0. As<br />
0 ∈ A, B both A <strong>and</strong> B are sub<strong>sets</strong> <strong>of</strong> A + B, with |A|, |B| ≤ |A + B|, that is<br />
max{|A|, |B|} ≤ |A + B|.<br />
b) From the def<strong>in</strong>ition <strong>of</strong> A + B number |A + B| can not exceed |A||B|.<br />
Many results <strong>and</strong> conjectures <strong>in</strong> this area are due to Erdos. All <strong>of</strong> them<br />
were regarded as comb<strong>in</strong>atorial until the new polynomial was created which<br />
gave us a new po<strong>in</strong>t <strong>of</strong> view on these problems. In the follow<strong>in</strong>g section we are<br />
go<strong>in</strong>g the sow how this <strong>method</strong> applies to this k<strong>in</strong>d <strong>of</strong> problem <strong>and</strong> a natural<br />
generalization that follows from it.<br />
2 Cauchy-Davenport <strong>in</strong>equality<br />
Cauchy-Davenport <strong>in</strong>equality is probably the first result <strong>in</strong> this ve<strong>in</strong>. We will<br />
show two pro<strong>of</strong>s <strong>of</strong> it. <strong>The</strong> first one is due to Terence Tao (reference [1]) that<br />
uses <strong>method</strong>s <strong>of</strong> group theory while the other is quite elementary <strong>and</strong> easily<br />
generalized (it appeared <strong>in</strong> [2] due to Noga Alon, Melvyn B. Nathanson <strong>and</strong><br />
Imre Ruzsa). Us<strong>in</strong>g the idea <strong>in</strong> the second pro<strong>of</strong> we will derive two other results<br />
<strong>in</strong> later sections.<br />
<strong>The</strong>orem 2 (Cauchy-Davenport <strong>in</strong>equality) If A <strong>and</strong> B are two non empty sub<strong>sets</strong><br />
<strong>of</strong> Z p for some prime p then the <strong>in</strong>equality |A + B| ≥ m<strong>in</strong>|A| + |B| − 1, p<br />
is true.<br />
2
<strong>The</strong> First Pro<strong>of</strong>: If |A| + |B| − 1 ≥ p <strong>sets</strong> A <strong>and</strong> {x} − B have a non empty<br />
<strong>in</strong>tersection for every x ∈ Z p with A + B = Z p . <strong>The</strong>refore |A + B| = p <strong>and</strong> our<br />
<strong>in</strong>equality is true (it reduces to equality). Hence we may take that |A|+|B|1 < p.<br />
Suppose that the <strong>in</strong>equality is not true <strong>and</strong> from all <strong>of</strong> its counterexamples<br />
(A, B) select the one for which |A| is the least. Let it be (A, B). Now we can<br />
as<strong>sum</strong>e that |A| ≥ 2 as <strong>in</strong> the case <strong>of</strong> |A| = 1 the <strong>in</strong>equality is obviously valid.<br />
Note that by replac<strong>in</strong>g the set B with the set B + {x} for arbitrary x ∈ Z p<br />
(translation) both the left <strong>and</strong> right sides <strong>of</strong> <strong>in</strong>equality have the same value as<br />
before transformation. So translate B for an element such that |A ∩ B| ≥ 1.<br />
Let A ′ = A ∩ B <strong>and</strong> B ′ = A cupB. <strong>The</strong>refore |A| + |B| = |A ′ | + |B ′ |, <strong>and</strong> so<br />
is A ′ + B ′ ⊂ A + B. It implies |A ′ | + |B ′ | ≤ |A| + |B|. Hence (A ′ , B ′ ) is an additional<br />
counterexample to the given <strong>in</strong>equality. However |A| is m<strong>in</strong>imal with this<br />
property so we must have |A ′ | ≥ |A|, <strong>and</strong> consequently A ′ ⊂ A, that is A ′ = A.<br />
It follows that for every m<strong>in</strong>imal counterexample {A, B} such that |A ∩ B| ≥ 1<br />
it holds A ⊂ B. However, if (A, B) is a counterexample then (A + {x}, B) is<br />
counterexample too for each x ∈ Z p so x ∈ B −A. <strong>The</strong>refore for each x ∈ B −A<br />
is A + {x} ⊂ B, i.e. A + (B − A) ⊂ B <strong>and</strong> B + (A − A) ⊂ B, because <strong>of</strong><br />
associative <strong>and</strong> commutativity <strong>of</strong> addition. As set A − A conta<strong>in</strong>s 0 it must be<br />
B subsetB + AA, <strong>and</strong> B = B + (A − A), ie. |B + C| = |B|, for C = A − A. We<br />
will use the follow<strong>in</strong>g lemma that is essential for the characterization <strong>of</strong> B <strong>and</strong><br />
C :<br />
Lemma 1 Let A <strong>and</strong> B be two nonempty sub<strong>sets</strong> <strong>of</strong> G, where G = (G, +) is a<br />
f<strong>in</strong>ite abelian group. <strong>The</strong> equality |A + B| = |a| holds if <strong>and</strong> only if A is union<br />
<strong>of</strong> a f<strong>in</strong>ite number <strong>of</strong> translates <strong>of</strong> G ′ <strong>and</strong> B is a subset <strong>of</strong> one <strong>of</strong> translates <strong>of</strong><br />
G ′ where G ′ = (G, +) is a subgroup <strong>of</strong> the group G.<br />
Pro<strong>of</strong>: Let us prove the first direction <strong>of</strong> lemma. As<strong>sum</strong>e that A = ⊎ k i=1 G′ + {a i }<br />
<strong>and</strong> B ⊂ G ′ +b for some a i 1 ≤ i ≤ k <strong>and</strong> b from G. We want to prove |A+B| =<br />
|A|. Note that the <strong>in</strong>tersection <strong>of</strong> two translates <strong>of</strong> subgroup G ′ is the empty set<br />
or those the two translates co<strong>in</strong>cide. Hence A = {a 1 , a 2 , ..., a k } + G ′ = A ′ + G,<br />
<strong>and</strong><br />
A + B ⊂ A ′ + G ′ + b + G ′ = A ′ + b + G ′ = {a 1 + b, a 2 + b + b, ..., a k } + G ′ .<br />
If some <strong>of</strong> two translates <strong>of</strong> form {a i +b}+G ′ <strong>and</strong> {a j }+{b}+G ′ have some<br />
common element for i ≠ j, then two translates <strong>of</strong> the form {a i } + G <strong>and</strong> a j + G<br />
will have a common element too, which is impossible for i ≠ j. <strong>The</strong>refore any<br />
two <strong>of</strong> the translates are disjo<strong>in</strong>t <strong>and</strong> |A + B| ≤ k|G ′ |. However as |A| = k|G ′ |,<br />
<strong>and</strong> |A + B| ≤ |a| it is |A + B| = |A| what we wanted to prove.<br />
3
Evidence <strong>of</strong> a different direction <strong>of</strong> lemma. Let A, B ⊂ G <strong>and</strong> let |A + B| =<br />
|A|. Consider the set G ′ = {g ∈ G|{±g} + A = A}. Obviously, 0 ∈ G ′ . For G ′<br />
to be a group it is enough to prove that for all g 1 , g 2 ∈ G ′ is g 1 − g 2 ∈ G ′ . As<br />
{g 1 − g 2 } + G ′ = {g 1 } + {−g 2 } + G ′ = {g 1 } + G ′ = G ′ so it is g 1 − g 2 ∈ G ′ .<br />
It implies that G ′ is a group with respect to operation +. Hence A is a union<br />
<strong>of</strong> translates <strong>of</strong> G ′ as A = G ′ + A. Consider an element b from B. <strong>The</strong>n<br />
|A + b| = |A|, <strong>and</strong> so it is A + b ⊂ A + B. <strong>The</strong>refore it is A + B = A + b. As<br />
b is an arbitrary element <strong>of</strong> B the equality A + b = A + b ′ is true for any two<br />
elements b <strong>and</strong> b ′ from B, i.e. b ′ − b + A = A, which means that b ′ − b ∈ G ′ , i.e.<br />
B is a subset <strong>of</strong> the translates <strong>of</strong> B. Thus the pro<strong>of</strong> <strong>of</strong> lemma is completed.<br />
Us<strong>in</strong>g this lemma on the equality |B +C| = |B|, we derive that B is an union<br />
<strong>of</strong> translates <strong>of</strong> a subgroup <strong>of</strong> Z p <strong>and</strong> C is a subset <strong>of</strong> the translate <strong>of</strong> the same<br />
subgroup. As the only subgroups <strong>of</strong> Z p are the whole group <strong>and</strong> {0}, it is either<br />
B = Z p or |C| = 1 which means that |A| = 1. However, this is a contradiction<br />
because <strong>in</strong> both cases we have a valid <strong>in</strong>equality. Hence the pro<strong>of</strong> <strong>of</strong> the whole<br />
theorem is completed.<br />
<strong>The</strong> second pro<strong>of</strong>: As <strong>in</strong> the first pro<strong>of</strong> we can as<strong>sum</strong>e that |A| + |B| −<br />
1 < p. Let |A| = k, |B| = l <strong>and</strong> |A + B| = n. Suppose the opposite, i.e.<br />
n < m<strong>in</strong>p, k + l − 1 <strong>and</strong> n < p <strong>and</strong> n < k + l− 1. Consider the polynomial<br />
fx, y =<br />
∏<br />
∑<br />
(x + y − c) =<br />
f i,j x i y j ,<br />
c∈A+B<br />
i+j≤n,i,j∈N<br />
where f i,j are some <strong>in</strong>tegers. This polynomial can be written <strong>in</strong> the follow<strong>in</strong>g<br />
format:<br />
fx, y =<br />
∑<br />
f i,j x i y j +<br />
∑<br />
f i,j x i y j +<br />
∑<br />
f i,j x i y j +<br />
∑<br />
f i,j x i y j ,<br />
i
∏<br />
<strong>The</strong> determ<strong>in</strong>ant <strong>of</strong> this system is the V<strong>and</strong>ermonde i.e. it is equal to<br />
a i − a j . Hence it is different from 0, s<strong>in</strong>ce a i ≠ a j for i ≠ j. <strong>The</strong>refore<br />
1≤i
holds <strong>in</strong> this case. We can take that |A| + |B| − 2 < p. Let |A| = k, |B| = l <strong>and</strong><br />
|A + B| = n. As<strong>sum</strong>e the opposite i.e. that n < p <strong>and</strong> n < k + l − 2. Consider<br />
the polynomial<br />
f(x, y) = (x − y) ∏ c∈A+ 1B x + y − c = ∑ i+j≤n,i,j∈N 0<br />
f i,j x i y j ,<br />
for some <strong>in</strong>tegers f i,j . Hence f(x, y) = 0 for all (x, y) ∈ A × B. Apply<strong>in</strong>g<br />
the lemma shown <strong>in</strong> the second pro<strong>of</strong> <strong>of</strong> Cauchy-Davenport <strong>in</strong>equality we can<br />
transform f(x, y) <strong>in</strong>to polynomial p(x, y) whose degree <strong>in</strong> x is less than k <strong>and</strong><br />
degree <strong>in</strong> y is less than l, which has the same value as f(x, y) (<strong>of</strong> course <strong>in</strong> Z p ).<br />
Thus p(x, y) ≡ 0 . Consider the coefficient <strong>of</strong> x u y v where u + v = deg(f) <strong>and</strong><br />
u < k, v < l. For these numbers is deg(f) < k + l. <strong>The</strong>refore the coefficient <strong>of</strong><br />
x u y v <strong>in</strong> f(x, y) equals<br />
( ) ( )<br />
u + v − 1 u + v − 1<br />
−<br />
= u + v − 1! 1<br />
u − 1 v − 1 u − 1!v − 1! v − 1 u − vu + v − 1!<br />
=<br />
u uvu − 1!v − 1! ,<br />
which can not be divisible by p s<strong>in</strong>ce u + v − 1 < p. This is a contradiction<br />
because the coefficient <strong>of</strong> x u y v <strong>in</strong> the polynomial p equals 0.<br />
Us<strong>in</strong>g this claim we are able to pr<strong>of</strong>e Erdos-Heilbrunn theorem.<br />
<strong>The</strong>orem 4 (Erdos-Heilbrunn theorem) Let A be a non empty subset <strong>of</strong> Z p . It<br />
holds |A + 1 A| ≥ m<strong>in</strong>{p, 2|A| − 3}.<br />
Pro<strong>of</strong>: Apply<strong>in</strong>g the previous theorem to the <strong>sets</strong> A <strong>and</strong> B where B = A {x}<br />
<strong>and</strong> x is an arbitrary element <strong>of</strong> A we will derive this theorem. As A + 1 B =<br />
A + 1 A because all numbers <strong>of</strong> a form a + x (a ≠ x <strong>and</strong> a ∈ A) belong to A + 1 B<br />
for x ∈ A, a ∈ B <strong>and</strong> a ≠ x. <strong>The</strong>n |A+ 1 A| = |A+ 1 B| ≥ m<strong>in</strong>{p, |A|+|B|−2} =<br />
m<strong>in</strong>{p, 2|A| − 3} what we wanted to prove.<br />
4 Generalization Cauchy-Davenport <strong>in</strong>equality<br />
<strong>and</strong> Erdos-Heilbrunn theorem<br />
In this section we will show one way for generalization <strong>of</strong> the previous theorems<br />
cont<strong>in</strong>u<strong>in</strong>g with the use <strong>of</strong> the polynomial <strong>method</strong> <strong>and</strong> the lemma <strong>in</strong> the<br />
second prove <strong>of</strong> the Cauchy-Davenport <strong>in</strong>equality. This result is due to Noga<br />
Alon, Melvyn B. Nathanson <strong>and</strong> Imre Ruzsa <strong>and</strong> was published <strong>in</strong> [3]. We are<br />
start<strong>in</strong>g with a def<strong>in</strong>ition that will generalize <strong>sum</strong> <strong>of</strong> <strong>sets</strong>.<br />
Def<strong>in</strong>ition 3 For a polynomial h = h(x 0 , x 1 , ..., x k ) on Z p where p is a prime<br />
number <strong>and</strong> <strong>sets</strong> A 0 , A 1 , ..., A k def<strong>in</strong>e its addition by<br />
⊕ h<br />
i=0<br />
k∑<br />
A i = {a 0 + a 1 + ... + a k |a i ∈ A i , h(a 0 , a 1 , ..., a k ) ≠ 0}.<br />
6
As we will show later the former two theorems are special cassis <strong>of</strong> this<br />
result. We will perform a little change <strong>in</strong> the pro<strong>of</strong> concern<strong>in</strong>g the selection <strong>of</strong><br />
the charachteristic polynomial <strong>in</strong> order to be able to choose specific coefficients.<br />
<strong>The</strong>orem 5 Let p be a prime number <strong>and</strong> h = h(x 0 , x 1 , ..., x k ) a polynomial<br />
over Z p . Let A 0 , A 1 , .., A k be a non empty sub<strong>sets</strong> <strong>of</strong> Z p where |A i | = c i + 1.<br />
Moreover let m = ∑ k<br />
i=0 (c i)−deg(h) <strong>and</strong> m ≥ 0. If the coefficient <strong>of</strong> x c 0<br />
0 xc 1<br />
1 ...xc k<br />
k<br />
<strong>in</strong> polynomial<br />
is non zero then<br />
(x 0 + x 1 + ... + x k ) m h(x 0 , x 1 , ..., x k )<br />
k∑<br />
| ⊕ h A i | ≥ m + 1.<br />
i=0<br />
Pro<strong>of</strong>:<br />
Suppose that the statement is false, <strong>and</strong> let E be a set <strong>of</strong> m elements <strong>of</strong> Z p<br />
conta<strong>in</strong><strong>in</strong>g the set ⊕ h<br />
∑ k<br />
i=0 A i. Let Q = Q(x 0 , ..., x k ) be a polynomial def<strong>in</strong>ed<br />
as follows:<br />
Note that<br />
Q(x 0 , ..., x k ) = h(x 0 , ..., x k ) · ∏<br />
(x 0 + ... + x k − e).<br />
e∈E<br />
Q(x 0 , ..., x k ) = 0 for all (x 0 , ..., x k ) ∈ A 0 × A 1 × ... × A k .<br />
∑ k<br />
This is because for any (x 0 , ..., x k ) is h(x 0 , ..., x k ) = 0 or x 0 , ..., x k ∈ ⊕ h i=0 A i ⊂<br />
E, <strong>and</strong> x 0 + x 1 + ... + x n = e for the correspond<strong>in</strong>g e ∈ E. For the degree <strong>of</strong><br />
the polynomial Q we have deg(Q) = m + deg(h) = ∑ k<br />
i=0 c i <strong>and</strong> therefore the<br />
coefficient <strong>of</strong> monomial x c0 · x k ...x c k<br />
k<br />
<strong>in</strong> the polynomial Q is the same as <strong>in</strong> the<br />
polynomial (x 0 + ... + x k ) m h(x 0 , ..., x k ), which is a nonzero, accord<strong>in</strong>g to the<br />
as<strong>sum</strong>ption.<br />
By the lemma shown <strong>in</strong> the second pro<strong>of</strong> <strong>of</strong> Cauchy-Davenport theorem we can<br />
reduce the degree <strong>of</strong> every x i <strong>in</strong> Q to up to c i , so that it does not change its<br />
value on A i . <strong>The</strong> transformed polynomial for has degree <strong>in</strong> x i less than c i <strong>and</strong><br />
is 0 on A 0 × ...A 1 × A k then it is a zero polynomial <strong>in</strong> Z p .<br />
<strong>The</strong> coefficient <strong>of</strong> monomial ∏ k<br />
i=0 xc i<br />
i <strong>in</strong> the <strong>in</strong>itial polynomial Q is equal to<br />
the coefficients <strong>in</strong> the transformed polynomial <strong>and</strong> is equal to 0, because there<br />
were no changes <strong>in</strong> this coefficient, because otherwise the degree <strong>of</strong> polynomial Q<br />
would be larger than ∑ k<br />
i=0 c i, which is impossible. Hence the theorem is proved.<br />
Note that for m < 0 the same <strong>in</strong>equality is valid<br />
k∑<br />
| ⊕ h A i | ≥ 0 ≥ m + 1,<br />
i=0<br />
7
which means that a given <strong>in</strong>equality is valid for negative m.<br />
Under the given conditions <strong>of</strong> the theorem it is also true that<br />
so that m < p.<br />
k∑<br />
p ≥ | ⊕ h A i | ≥ m + 1,<br />
i=0<br />
Alternative pro<strong>of</strong> <strong>of</strong> Cauchy-Davenport <strong>in</strong>equality can be made if the <strong>in</strong>equality<br />
|A| + |B|1 < p is obta<strong>in</strong>ed <strong>and</strong> this theorem applied to the case k =<br />
1 <strong>and</strong> h ≡ 1. Similarly, an alternative pro<strong>of</strong> <strong>of</strong> <strong>The</strong>orem 3. can be made if<br />
after obta<strong>in</strong><strong>in</strong>g the <strong>in</strong>equality |A| + |B| − 2 < p <strong>and</strong> elim<strong>in</strong>at<strong>in</strong>g the trivial case<br />
|A| = |B| = 1 we apply this theorem to the case k = 1 <strong>and</strong> hx 0 , x 1 ≡ x 0 − x 1 .<br />
By choos<strong>in</strong>g different polynomials for h we can make additional assertions. <strong>The</strong><br />
problem with many variables occurs when the coefficient that should be zero<br />
needs to calculated, that <strong>in</strong> some cases may represent some <strong>of</strong> the open comb<strong>in</strong>atorial<br />
problems. One <strong>of</strong> the <strong>in</strong>terest<strong>in</strong>g results <strong>in</strong> this context was proved by<br />
Dias de Silva <strong>and</strong> Hamidoune:<br />
<strong>The</strong>orem 6 Let p be a prime number , A a subset <strong>of</strong> Z p <strong>and</strong> k ∈ N. If the<br />
polynomial h is def<strong>in</strong>ed with h =<br />
∏ (x i − x j ) i, j ∈ N 0 then<br />
0≤i
[5] Noga Alon, Comb<strong>in</strong>atorial Nullstellensatz, Comb<strong>in</strong>atorics, Probability <strong>and</strong><br />
Comput<strong>in</strong>g 8 1-2: 7-29<br />
9