Solving systems of polynomial equations

We have found out from the previous pages that the system of polynomial equations f₁(x₁,x₂,...,x_n) = f₂(x₁,x₂,...,x_n) = ... = f_s(x₁,x₂,...,x_n) = 0 is equivalent to g₁(x₁,x₂,...,x_n) = g₂(x₁,x₂,...,x_n) = ... = g_t(x₁,x₂,...,x_n) = 0 where {g₁,g₂,...,g_t}is a Groebner basis for <f₁,f₂,...,f_s>. This becomes useful when the Groebner basis is computed w.r.t. the lex order which ensures the elimination of the variables (see The Elimination Theorem in [3]). Moreover, the order of elimination corresponds to the ordering of the variables: if x₁>x₂>...>x_n, then x₁ is eliminated first, then x₂ is eliminated second, and so on. Anyway, if we want to eliminate only some variables from our equations, a much more efficient way is to use a monomial order of k-elimination type since lex order may lead to some very unpleasant Groebner bases.

Example[3].

Let us consider the system

f₁ = x²+ y + z - 1 = 0
f₂ = x + y² + z - 1 = 0
f₃ = x + y + z²- 1 = 0.

Then, a Groebner basis for I = <f₁,f₂,f₃> w.r.t. lex order with x>y>z is given by

g₁ = x + y + z² - 1
g₂ = y²- y - z² + z
g₃ =2yz² + z⁴- z²g₄ = z⁶- 4z⁴+ 4z³ - z².

Hence the systems f₁= f₂= f₃= 0 and g₁ = g₂ = g₃ = g₄ = 0 have the same solutions. But, in the second one, g₄ = z⁶- 4z⁴+ 4z³ - z²= z²(z - 1)²(z²+ 2z -1) = 0involves only z and it is easy to solve: we get 0, 1, -1-sqrt(2), -1+sqrt(2)as the possible z's. Substituting these values into g₂ = y²- y - z² + z = 0 and g₃ =2yz² + z⁴- z² = 0, we get the possible y's and, finally, g₁ = x + y + z² - 1 = 0gives the corresponding x's. So the system f₁ = f₂ = f₃ = 0 has exactly five solutions:

(1,0,0), (0,1,0), (0,0,1), (-1-sqrt(2),-1-sqrt(2),-1-sqrt(2)), (-1+sqrt(2),-1+sqrt(2),-1+sqrt(2)).

Note that changing the order of the variables will also change the order of their elimination. If we had used lex order with z>y>x we would have eliminated z first, instead of x.

In this example we have seen that each possible value of z could be extended to a complete solution of the system. This is not always possible, as shown by the following example.

Example[3].

Consider the system

f₁ = xy - 1 = 0
f₂ = xz - 1 = 0.

A Groebner basis for I = <f₁,f₂> w.r.t. lex order is given by

g₁ = xy - 1
g₂ = y - z.

Hence, from g₂ = y - z = 0 we get the partial solutions (a, a) which all extends to complete solutions (1/a, a, a) except for the partial solution (0, 0).

The Extension Theorem tells us when a partial solution can be extended to a complete solution in an algebraically closed field. Before stating it, we must premit the following definition.

Definition 1. Given I = <f₁,f₂,...,f_s> in K[x₁,x₂,...,x_n], the kth elimination ideal I_k is the ideal of K[x_k+1,...,x_n] defined by

I_k = I IK[x_k+1,...,x_n].

Thus, I_k consists of all consequences of f₁= ... = f_s= 0 which eliminates the variables x₁,...,x_k.
In the first example above we have I₁ = <g₂, g₃, g₄> and I₂ = <g₄>, whereas in the latest example I₁ = <g₂>.

Theorem 2 (The Extension Theorem)[3]. Let I = <f₁,f₂,...,f_s> be an ideal in C[x₁,x₂,...,x_n] and let I₁ be the first elimination ideal of I. Rearrange the terms of the generators f_i with respect to the decreasing powers of x₁. If the leading coefficients h_i(x₂,...,x_n) (with respect to this new order) do not vanish simultaneously at the partial solution (a₂,...,a_n), then there exists a₁ in C such that the partial solution (a₂,...,a_n) extends to the complete solution (a₁,a₂,...,a_n).

Example[3].

In C[x,y,z], consider the system

f₁ = x²- y = 0
f₂ = x²- z = 0.

A Groebner basis for I = <f₁,f₂> w.r.t. lex order is given by

g₁ = x²- y
g₂ = x²- z
g₃ = y - z.

All partial solutions y = z = a do extend in C to complete solutions since 1 never vanishes at y = z = a.
Note that over R, only the partial solutions with a >= 0 extend.

Although the Extension Theorem is stated only for the case of eliminating the first variable x₁, it can be used multiple times to eliminate any number of variables; this follows from the fact that I_k+1 C K[x_k+2,...,x_n] is the first elimination ideal of I_k C K[x_k+1,...,x_n].

Example[3].

In C[x,y,z], consider the system

f₁ = x²+ y² + z²- 1 = 0
f₂ = xyz - 1 = 0.

A Groebner basis for I = <f₁,f₂> w.r.t. lex order is given by

g₁ = y⁴z²+ y²z⁴- y²z²+ 1
g₂ = x + y³z + yz³- yz.

We have

I₁ = I I C[y,z] = <g₁>
I₂ = I I C[z] = I₁I C[z] = {0}.

Every z = c in C is a partial solution, but which partial solutions c extend to (a,b,c) in V(I)? We use the Extension Theorem one coordinate at a time. We first go from I₂ to I₁: since the coefficient of y⁴ in g₁ is z², c extends to (b,c) whenever c<>0. Now (b,c) can be extended to (a,b,c) because in f₁ the coefficient of x² is 1 which never vanishes.
Thus the Extension Theorem tells us that all partial solutions c<>0 extend to V(I) in C.

We will find an interesting application of the Extension Theorem when we talk about the implicitization problem.
Now we will conclude this section with two results about how to detect unsolvable systems and systems with finitely many solutions. Buchberger [2] proved the following two facts.

Theorem 3 (Unsolvable systems). Let I = <f₁,f₂,...,f_s> be an ideal in C[x₁,x₂,...,x_n]. Then the system of polynomial equations f₁(x₁,x₂,...,x_n) = f₂(x₁,x₂,...,x_n) = ... = f_s(x₁,x₂,...,x_n) = 0is unsolvable if and only if the reduced Groebner basis of I is {1}.

We must be careful when we work on R[x₁,x₂,...,x_n]: a system might be unsolvable even though the groebner basis of the ideal is other than {1} (consider the equation x²+ 1 = 0!). However, if {1}is the reduced Groebner basis of our ideal, we are sure that the system cannot be solved.

Example[1].

Consider the system

f₁ = x²y - z³ = 0
f₂ = 2xy - 4z - 1 = 0
f₃ = y²- z = 0
f₄ = x³- 4yz = 0.

The reduced Groebner basis for I = <f₁,f₂,f₃,f₄> w.r.t. lex order is {1}, hence the system is unsolvable.

Theorem 4 (Systems with finitely many solutions). Let I = <f₁,f₂,...,f_s> be an ideal in C[x₁,x₂,...,x_n]. Then the system of polynomial equations f₁(x₁,x₂,...,x_n) = f₂(x₁,x₂,...,x_n) = ... = f_s(x₁,x₂,...,x_n) = 0 has finitely many solutions if and only if for all i (1<=i<=s) there exists a power of x_i which belongs to <LT(I)>.

For an example of a polynomial system with finitely many solutions see the first example in this page. You will see that x, y² and z⁶ are in <LT(I)>.

Last updated

Last updated