1.1. Root Finding by Interval Halving (Bisection)

Last revised on August 27, 2025.

References:

[Sullivan, 2021] Section 2.2, The Bisection Method.
[Sauer, 2022] Section 1.1, The Bisection Method.
[Burden et al., 2016] Section 2.1, The Bisection Method.
[Dionne, 2023] Sections 2.2, Bisection Method, and 2.3, Interruption criteria.
[Cheney and Kincaid, 2013] Section 3.1, Bisection Method.
[Dahlquist and Björck, 2008] Section 6.1.2, The Bisection Method.

1.1.1. Introduction

One of the most basic tasks in numerical computing is finding the roots (or “zeros”) of a function — solving the equation \(f(x) = 0\) where \(f:\mathbb{R} \to \mathbb{R}\) is a continuous function from and to the real numbers. As with many topics in this course, there are multiple methods that work, and we will often start with the simplest and then seek improvement in several directions:

reliability or robustness — how good it is at avoiding problems in hard cases, such as division by zero.
accuracy and guarantees about accuracy like estimates of how large the error can be — since in most cases, the result cannot be computed exactly.
speed or cost — often measured by minimizing the amount of arithmetic involved, or the number of times that a function must be evaluated.

We use the package PyPlot; see the notes on plotting graphs and on using packages and modules in Notes on the Julia Language.

using PyPlot

Example 1.1 (Solve \(x = \cos x\))

This is a simple equation for which there is no exact formula for a solution, but we can easily ensure that there is a solution, and moreover, a unique one. It is convenient to put the equation into “zero-finding” form \(f(x) = 0\), by defining

\[f(x) := x - \cos x.\]

Also, note that \(|\cos x| \leq 1\), so a solution to the original equation must have \(|x| \leq 1\). So we will start by graphing the function on the interval \([a, b] = [-1, 1]\).

f(x) = x - cos(x);

Remark 1.1 (On Julia)

For notes on this compact version of Julia function syntax, see Functions, part 1 in Notes on the Julia Language.

a = -1.0
b = 1.0;
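Before plotting, the sign change that guarantees a root can also be checked numerically. The following is a minimal sketch; it repeats the definitions of f, a and b so that it is self-contained, and the names fa, fb and has_sign_change are introduced here purely for illustration.

```julia
# Same function and interval as in Example 1.1, repeated so this block runs on its own
f(x) = x - cos(x)
a = -1.0
b = 1.0

fa = f(a)
fb = f(b)
println("f(a) = $fa, f(b) = $fb")

# For a continuous f, opposite signs at the two ends guarantee a root in between
has_sign_change = fa * fb < 0
println("Sign change on [a, b]: $has_sign_change")
```

Since f is continuous, a negative value at one end and a positive value at the other means the graph must cross zero somewhere in between.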

Remark 1.2 (On Julia)

See the notes on the function range used below in Notes on the Julia Language.

x = range(a, b, 100)
figure(figsize=[10,6])
plot(x, f.(x));
plot([a, b], [0, 0], "g");  # Mark the x-axis in green
grid(true)  # Add a graph paper background

../_images/d2081e8eef525fcc5e74cbbc11d741953c1b41ad3d59872ee9ef9e287923247d.png

This shows that the zero lies between 0.5 and 0.75, so zoom in:

a = 0.5
b = 0.75
x = range(a, b, 100)
figure(figsize=[10,6])
plot(x, f.(x))
plot([a, b], [0, 0], "g")
grid(true)

../_images/5bd6ec55a941b0768edeee6ddbab2d31428165532589372b7241bec66513cee6.png

And we could repeat, getting an approximation of any desired accuracy.

However, this has two weaknesses: it is very inefficient (the function is evaluated a hundred times at each step in order to draw the graph), and it requires lots of human intervention.

To get a procedure that can be efficiently implemented in Julia (or another programming language of your choice), we extract one key idea here: find an interval in which the function changes sign, and then repeatedly find a smaller such interval within it. The simplest way to do this is to repeatedly divide an interval known to contain the root in half and check which half has the sign change in it.

Graphically, let us start again with interval \([a, b] = [-1, 1]\), but this time focus on three points of interest: the two ends and the midpoint, where the interval will be bisected:

a = -1.0
b = 1.0
c = (a+b)/2
println("a=$a, b=$b, c=$c")

a=-1.0, b=1.0, c=0.0

Remark 1.3 (On Julia)

If you are unfamiliar with println see the notes on Displaying values in Notes on the Julia Language.

acb = [a c b]
figure(figsize=[10,6])
plot(acb, f.(acb), "b*")
# And just as a visual aid:
x = range(a, b, 100)
plot(x, f.(x), "b-.")
plot([a, b], [0, 0], "g")
grid(true)

../_images/99b27000b71a3aa5a975ff3ead326310b98f710c5120cc6d13309e741b3b31a8.png

\(f(a)\) and \(f(c)\) have the same sign, while \(f(c)\) and \(f(b)\) have opposite signs, so the root is in \([c, b]\); update the a, b, c values and plot again:

a = c  # new left end is old center
b = b  # redundant, as the right end is unchanged
c = (a+b)/2
println("a=$a, b=$b, c=$c")

a=0.0, b=1.0, c=0.5

acb = [a c b]
figure(figsize=[10,6])
plot(acb, f.(acb), "b*")
x = range(a, b, 100)
plot(x, f.(x), "b-.")
plot([a, b], [0, 0], "g")
grid(true)

../_images/922904413b8489ad35cb7f1277cf77a64193dfa0c148028f82e4ef6bc0273b07.png

Again \(f(c)\) and \(f(b)\) have opposite signs, so the root is in \([c, b]\), and …

a = c  # new left end is old center again
# skipping the redundant "b = b" this time
c = (a+b)/2
println("a=$a, b=$b, c=$c")

a=0.5, b=1.0, c=0.75

acb = [a c b]
figure(figsize=[10,6])
plot(acb, f.(acb), "b*")
x = range(a, b, 100)
plot(x, f.(x), "b-.")
plot([a, b], [0, 0], "g")
grid(true)

../_images/17f62ce09594c5960e6b4cde5e3c4195bc84bb66b1ba172ec2eaf07e0e0af15d.png

This time \(f(a)\) and \(f(c)\) have opposite signs, so the root is at left, in \([a, c]\):

# this time, the value of a does not need to be updated ...
b = c  # ... and the new right end is the former center
c = (a+b)/2
println("a=$a, b=$b, c=$c")

a=0.5, b=0.75, c=0.625

acb = [a c b]
figure(figsize=[10,6])
plot(acb, f.(acb), "b*")
x = range(a, b, 100)
plot(x, f.(x), "b-.")
plot([a, b], [0, 0], "g")
grid(true)

../_images/44992a9d6835e2268f220a68c9ead587f59e38274d28b3f947b4403212d8ba85.png

1.1.2. A first algorithm for the bisection method

Now it is time to dispense with the graphs, and describe the procedure in mathematical terms:

if \(f(a)\) and \(f(c)\) have opposite signs, the root is in interval \([a, c]\), which becomes the new version of interval \([a, b]\).
otherwise, \(f(c)\) and \(f(b)\) have opposite signs, so the root is in interval \([c, b]\), which becomes the new version of interval \([a, b]\).

1.1.2.1. Pseudo-code for describing algorithms

As a useful bridge from the mathematical description of an algorithm with words and formulas to actual executable code, these notes will often describe algorithms in pseudo-code: a mix of words and mathematical formulas with notation that somewhat resembles code in a language like Julia.

This is also preferable to going straight to code in a particular programming language (such as Julia) because it makes it easier if, later, you wish to implement algorithms in a different language.

Note well one feature of the pseudo-code used here: assignment is denoted with a left arrow:

\(x \leftarrow a\)

is the instruction to cause the value of variable x to become the current value of a.

Remark 1.4

A related mathematical notation for this is “:=”, also often used to introduce a definition, as with \(\displaystyle \tan(x) := \frac{\sin(x)}{\cos(x)}\).

This is to distinguish from

\(x = a\)

which is a comparison: the true-or-false assertion that the two quantities already have the same value.

Unfortunately, Julia (like most programming languages) does not use this notation: instead assignment is done with

x = a

so that asserting equality needs a different notation: this is done with

x == a

Note well that double equal sign!
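As a small illustration of the difference, here is a sketch using two arbitrary integer values chosen only for this example:

```julia
a = 3
x = 1

println(x == a)   # comparison only: x is still 1, so this prints false
x = a             # assignment: x now holds the current value of a
println(x == a)   # prints true
```

The comparison leaves both variables unchanged, while the assignment overwrites x.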

With that notational issue out of the way, the key step in the bisection strategy is the update of the interval:

Algorithm 1.1 (one step of bisection)

\(\displaystyle c \leftarrow \frac{a + b}{2}\)
if \(f(a) f(c) < 0\) then
\(\quad\) \(b \leftarrow c\)
else
\(\quad\) \(a \leftarrow c\)
end
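One possible way to express this single step in Julia is sketched below. The function name bisection_step is hypothetical (it does not come from these notes), and returning the updated endpoints as a pair is just one design choice.

```julia
# Hypothetical helper, named here for illustration only:
# performs one step of Algorithm 1.1 and returns the updated interval.
function bisection_step(f, a, b)
    c = (a + b) / 2
    if f(a) * f(c) < 0
        b = c   # the sign change is in [a, c]
    else
        a = c   # otherwise keep [c, b]
    end
    return a, b
end

f(x) = x - cos(x)
a, b = bisection_step(f, -1.0, 1.0)
println("a=$a, b=$b")
```

Applied to the example above with \([a, b] = [-1, 1]\), this reproduces the first update seen in the graphs: the new interval is \([0, 1]\).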

This needs to be repeated a finite number of times, and the simplest way is to specify the number of iterations. (We will consider more refined methods soon.)

Algorithm 1.2 (bisection, first version)

Get an initial interval \([a, b]\) with a sign-change: \(f(a) f(b) < 0\).
Choose \(N\), the number of iterations.
for i from 1 to N
\(\quad\) \(\displaystyle c \leftarrow \frac{a + b}{2}\)
\(\quad\) if \(f(a) f(c) < 0\) then
\(\quad\)\(\quad\) \(b \leftarrow c\)
\(\quad\) else
\(\quad\)\(\quad\) \(a \leftarrow c\)
\(\quad\) end
end
The approximate root is the final value of \(c\).

A Julia version of the iteration is not a lot different:

for i in 1:N
    c = (a+b)/2
    if f(a) * f(c) < 0
        b = c
    else
        a = c
    end
end

Remark 1.5 (On Julia)

See the notes on Iteration and Conditionals for the syntax seen here for the first time.

See Exercise 1.1.

1.1.3. Error bounds, and a more refined algorithm

The above method of iteration for a fixed number of times is simple, but usually not what is wanted in practice. Instead, a better goal is to get an approximation with a guaranteed maximum possible error: a result consisting of an approximation \(\tilde{r}\) to the exact root \(r\) and also a bound \(E_{max}\) on the maximum possible error; a guarantee that \(|r - \tilde{r}| \leq E_{max}\). To put it another way, a guarantee that the root \(r\) lies in the interval \([\tilde{r} - E_{max}, \tilde{r} + E_{max}]\).

In the above example, each iteration gives a new interval \([a, b]\) guaranteed to contain the root, and its midpoint \(c = (a+b)/2\) is within a distance \((b-a)/2\) of any point in that interval, so at each iteration, we can have:

\(\tilde{r}\) is the current value of \(c = (a+b)/2\)
\(E_{max} = (b-a)/2\)
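To see these two quantities evolve, the following sketch records the midpoint of each new interval together with its error bound for the example \(f(x) = x - \cos x\). The function name bisection_bounds and the variable names r_approx and history are assumptions made for this illustration, not part of the notes.

```julia
f(x) = x - cos(x)

# Hypothetical helper: runs N bisection steps and records, after each one,
# the midpoint of the new interval and the bound on its error.
function bisection_bounds(f, a, b, N)
    history = Tuple{Float64, Float64}[]
    for i in 1:N
        c = (a + b) / 2
        if f(a) * f(c) < 0
            b = c
        else
            a = c
        end
        r_approx = (a + b) / 2   # midpoint of the new interval
        E_max = (b - a) / 2      # half-width: bound on |r - r_approx|
        push!(history, (r_approx, E_max))
    end
    return history
end

for (i, (r_approx, E_max)) in enumerate(bisection_bounds(f, -1.0, 1.0, 5))
    println("iteration $i: root ≈ $r_approx, error at most $E_max")
end
```

Each line of output should show an error bound half as large as the one before it, which is the property used in the cost analysis later in this section.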

1.1.4. Error tolerances and stopping conditions

The above algorithm can passively state an error bound, but it is better to be able to solve to a desired degree of accuracy; for example, if we want a result “accurate to three decimal places”, we can specify \(E_{max} \leq 0.5 \times 10^{-3}\).

So our next goal is to actively set an accuracy target or error tolerance \(E_{tol}\) and keep iterating until it is met. This can be achieved with a while loop; here is a suitable algorithm:

Algorithm 1.3 (bisection with error tolerance)

Input function \(f\), interval endpoints \(a\) and \(b\), and an error tolerance \(E_{tol}\)
\(E_{max} \leftarrow (b-a)/2\)
while \(E_{max} > E_{tol}\)
\(\quad c \leftarrow (a+b)/2\)
\(\quad\) if \(f(a) f(c) < 0\) then
\(\quad\quad b \leftarrow c\)
\(\quad\) else
\(\quad\quad a \leftarrow c\)
\(\quad\) end
\(\quad E_{max} \leftarrow (b-a)/2\)
end
Output \(\tilde{r} = (a+b)/2\), the midpoint of the final interval, as the approximate root and \(E_{max}\) as a bound on its absolute error.

1.1.5. Cost/speed

The bisection method is unusual in that we can calculate in advance how many iterations are needed, using the fact that each iteration halves the maximum possible error.

Theorem 1.1

Starting with an interval \([a,b]\) in which a continuous function \(f\) is known to change sign and thus have a root, the error bound will be no greater than a target accuracy of \(E_{tol}\) in

\[ N = \left\lceil \log_2\left( \frac{b-a}{E_{tol}} \right) \right\rceil \tag{1.1} \]

steps (where \(\lceil \cdot \rceil\) is the “ceiling” function, rounding up to an integer).

Proof. The error in the midpoint \(c = (a+b)/2\) as an approximation of the root is at most \((b-a)/2\), and each iteration shrinks the interval — and thus the error bound on the new interval’s midpoint — by a factor of two. Thus after \(N\) steps, the error is at most \((b-a)/2^{N+1}\).

The error is small enough once this is no greater than the tolerance \(E_{tol}\), so we are done once

\[ \frac{b-a}{2^{N+1}} \leq E_{tol} \]

Solving this inequality for \(N\) gives

\[ N \geq \log_2\left(\frac{b-a}{E_{tol}}\right) - 1, \]

so the simpler condition

\[ N \geq \log_2\left(\frac{b-a}{E_{tol}}\right) \]

is certainly sufficient.

However, \(N\) must be an integer, so we round up to the nearest integer that satisfies this condition, giving Equation (1.1).
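As a quick numerical check of Equation (1.1), the sketch below evaluates it for the interval \([-1, 1]\) with an illustrative tolerance of \(10^{-4}\), and then confirms that the error bound from the proof is indeed within the tolerance. The variable names are chosen for this example only.

```julia
a = -1.0
b = 1.0
E_tol = 1e-4

# Equation (1.1): a number of bisection steps that is guaranteed to suffice
N = ceil(Int, log2((b - a) / E_tol))
println("N = $N")

# Bound from the proof on the error of the midpoint after N halvings
E_max = (b - a) / 2^(N + 1)
println("Error bound after $N steps: $E_max")
println("Meets tolerance: $(E_max <= E_tol)")
```

Here \(\log_2(2/10^{-4}) = \log_2(20000) \approx 14.3\), so the formula gives \(N = 15\) without performing any bisection steps.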

1.1.6. Exercises

Exercise 1.1

Create a Julia function bisection1 which implements the first algorithm for bisection above, which performs a fixed number \(N\) of iterations; the usage should be: root = bisection1(f, a, b, N)

Test it with the above example: \(f(x) = x - \cos x = 0\), \([a, b] = [-1, 1]\)

Remark 1.6

Julia newcomers: see the notes introducing Julia Functions.

Exercise 1.2

Create a Julia function implementing the better Algorithm 1.3, with usage

Test it with the above example: \(f(x) = x - \cos x\), \([a, b] = [-1, 1]\), this time accurate to within \(10^{-4}\).

Use the fact that there is a solution in the interval \((-1, 1)\).

Exercise 1.3

Consider the equation \(x^5 = x^2 + 10\).

a) Find an interval \([a,b]\) of length one in which there is guaranteed to be a root.

b) Compute, by hand, the next two improved approximations given by the bisection method.

c) Determine how many iterations of the bisection method would then be needed to approximate the root with an absolute error of at most \(10^{-10}\). Do this without actually computing those extra iterations or computing that approximation of the root!