Subsection Section Preview
Investigate!
You have a large collection of \(1\times 1\) squares and \(1\times 2\) dominoes. You want to arrange these to make a \(1 \times 15\) strip. How many ways can you do this?
What if the squares come in three different colors and the dominos come in four different colors? And why is this second question easier than the first?
Hint.
Start by creating a recurrence relation. How are the different \(1\times 3\) strips and \(1 \times 4\) strips related to the \(1 \times 5\) strips?
In
Section 4.3 we saw that if a sequence has some sequence of differences that is constant, then the sequence has a polynomial closed formula. Are there sequences that never have constant differences? And what would their closed formulas look like?
Consider the Fibonacci sequence:
\begin{equation*}
0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, \ldots\text{.}
\end{equation*}
If we look at the first differences, we get this sequence:
\begin{equation*}
1, 1, 2, 3, 5, 8, 13, 21, 34, \ldots\text{,}
\end{equation*}
which is the Fibonacci sequence itself. This is not surprising, since the Fibonacci sequence is defined by the recurrence relation \(F_n = F_{n-1} + F_{n-2}\text{.}\) That is saying precisely that to get the next turn of the sequence, we take the current term and add... a term in the sequence!
Of course, if we take another difference, we will get the same sequence back, and again and again, so no \(n\)th differences will be constant.
Another sequence that has this behavior is the powers of 2:
\begin{equation*}
1, 2, 4, 8, 16, 32, 64, 128, 256, 512, \ldots
\end{equation*}
which has differences
\begin{equation*}
1, 2, 4, 8, 16, 32, 64, 128, 256, \ldots\text{.}
\end{equation*}
We can also see this from the recurrence relation, since
\begin{equation*}
a_n = 2a_{n-1} = a_{n-1} + a_{n-1}\text{.}
\end{equation*}
The rate of growth for this, and in fact any geometric sequence, is the sequence itself.
If you have studied calculus, you may recall that the functions that have themselves (or close) as their rate of change (derivative, in the calculus context) are exactly the exponential functions. Here too, geometric sequences, which have exponential closed formulas, have themselves as their rate of change.
In this section, we will explore sequences that are changing at a rate proportional to the sequence itself, and see how these all have an exponential closed formula, or some variation of that.
Worksheet Preview Activity
1.
Subsection Summing Geometric Sequences: Multiply, Shift, and Subtract
Suppose a candy machine dispenses candy in a geometric sequence by first giving 1 candy, then 2 candies, then 4, then 8, and so on. How many candies will you have received in total after 10 turns of the machine?
We can create the sequence of partials sum as \(1, 1+2, 1+2+4, 1+2+4+8, \ldots\) that gives
\begin{equation*}
1, 3, 7, 15, 31, 63, \ldots
\end{equation*}
This is not a geometric sequence, but is almost. In fact, if we add 1 to each term, we get what sure looks like the geometric sequence \(2, 4, 8, 16, 32, 64,\ldots\text{,}\) so we might guess that the closed formula for the sequence of sums is \(2^{n+1} - 1\text{.}\) If this is correct, then the answer to the candy question would be \(2^{11} - 1 = 2047\text{.}\)
More intriguing though is the observation that the sequence of partial sums of a geometric sequence is again geometric-ish. Let’s consider how to find the sum of a geometric sequence in general.
We cannot just reverse and add as we did for the sum of an arithmetic sequence. Do you see why? The reason we got the same term added to itself many times is because there was a constant difference. So as we added that difference in one direction, we subtracted the difference going the other way, leaving a constant total. For geometric sums; we have a different technique.
Example 4.4.1.
What is \(3 + 6 + 12 + 24 + \cdots + 12288\text{?}\)
Solution.
Multiply each term by 2, the common ratio. We get \(2S = 6 + 12 + 24 + \cdots + 24576\text{.}\) Now subtract: \(2S - S = -3 + 24576 = 24573\text{.}\) Since \(2S - S = S\text{,}\) we have our answer.
To better see what happened in the above example, we can write it this way:
\(S=\) |
\(3 \, +\) |
\(6 + 12 + 24 + \cdots + 12288\) |
|
\(- \qquad 2S=\) |
|
\(6 + 12 + 24 + \cdots + 12288\) |
\(+ 24576\) |
\(-S = \) |
\(3 \, +\) |
\(0 + 0 + 0 + \cdots + 0 \) |
\(-24576\) |
Then divide both sides by \(-1\) and we have the same result for \(S\text{.}\) The idea is, by multiplying the sum by the common ratio, each term becomes the next term. We shift over the sum to get the subtraction to mostly cancel out, leaving just the first term and the new last term.
Example 4.4.2.
Find a closed formula for \(S(n) = 2 + 10 + 50 + \cdots + 2\cdot 5^n\text{.}\)
Solution.
The common ratio is 5. So we have
\(S\) |
\(= 2 + 10 + 50 + \cdots + 2\cdot 5^n\) |
\(- \qquad 5S\) |
\(= ~~~~~~10 + 50 + \cdots + 2\cdot 5^n + 2\cdot5^{n+1}\) |
\(-4S\) |
\(= 2 - 2\cdot5^{n+1}\) |
Thus \(S = \dfrac{2-2\cdot 5^{n+1}}{-4}\)
Even though this might seem like a new technique, you have probably used it before.
Example 4.4.3.
Express \(0.464646\ldots\) as a fraction.
Solution.
Let \(N = 0.46464646\ldots\text{.}\) Consider \(0.01N\text{.}\) We get:
\(N =\) |
\(0.4646464\ldots\) |
\(- \qquad 0.01N =\) |
\(0.00464646\ldots\) |
\(0.99N =\) |
\(0.46\) |
So \(N = \frac{46}{99}\text{.}\) What have we done? We viewed the repeating decimal \(0.464646\ldots\) as a sum of the geometric sequence \(0.46, 0.0046, 0.000046, \ldots\) The common ratio is \(0.01\text{.}\) The only real difference is that we are now computing an infinite geometric sum, we do not have the extra “last” term to consider. Really, this is the result of taking a limit as we would in calculus when we compute infinite geometric sums.
To summarize, we now can find a closed formula for a sequence \(a_n\) that has a rate of growth that is an exponential function: \(a_n - a_{n-1} = b_n\text{,}\) where \(b_n\) is a geometric sequence (i.e., an exponential function). What sort of closed formula do we get here? It’s another exponential function!
Subsection The Characteristic Root Technique
Suppose we want to solve a recurrence relation expressed as a combination of the two previous terms, such as \(a_n = a_{n-1} + 6a_{n-2}\text{.}\) In other words, we want to find a function of \(n\) which satisfies \(a_n - a_{n-1} - 6a_{n-2} = 0\text{.}\) Think about how we build up this sequence iteratively.
\begin{align*}
a_2 \amp = a_1 + 6a_0 \\
a_3 \amp = a_2 + 6a_1 = a_1 + 6a_0 + 6a_1\\
a_4 \amp = a_3 + 6a_2 = a_1 + 6a_0 + 6a_1 + 6^2a_0 +6^2a_1
\end{align*}
Let’s stop there and agree this is getting very complicated. However, we do notice that in each step, we would, among other things, multiply a previous iteration by 6. So our closed formula would include \(6\) multiplied some number of times. Thus it is reasonable to guess the solution will contain parts that look geometric. Perhaps the solution will take the form \(r^n\) for some constant \(r\text{.}\)
The nice thing is, we know how to check whether a formula is actually a solution to a recurrence relation: plug it in. What happens if we plug in \(r^n\) into the recursion above? We get
\begin{equation*}
r^n - r^{n-1} - 6r^{n-2} = 0\text{.}
\end{equation*}
Now solve for \(r\text{:}\)
\begin{equation*}
r^{n-2}(r^2 - r - 6) = 0\text{,}
\end{equation*}
so by factoring, \(r = -2\) or \(r = 3\) (or \(r = 0\text{,}\) although this does not help us). This tells us that \(a_n = (-2)^n\) is a solution to the recurrence relation, as is \(a_n = 3^n\text{.}\) Which one is correct? They both are, unless we specify initial conditions. Notice we could also have \(a_n = (-2)^n + 3^n\text{.}\) Or \(a_n = 7(-2)^n + 4\cdot 3^n\text{.}\) In fact, for any \(a\) and \(b\text{,}\) \(a_n = a(-2)^n + b 3^n\) is a solution (try plugging this into the recurrence relation). To find the values of \(a\) and \(b\text{,}\) use the initial conditions.
This points us in the direction of a more general technique for solving recurrence relations. Notice we will always be able to factor out the \(r^{n-2}\) as we did above. So we really only care about the other part. We call this other part the characteristic equation for the recurrence relation. We are interested in finding the roots of the characteristic equation, which are called (surprise) the characteristic roots.
Characteristic Roots.
Given a recurrence relation \(a_n + \alpha a_{n-1} + \beta a_{n-2} = 0\text{,}\) the characteristic polynomial is
\begin{equation*}
x^2 + \alpha x + \beta
\end{equation*}
giving the characteristic equation:
\begin{equation*}
x^2 + \alpha x + \beta = 0\text{.}
\end{equation*}
If \(r_1\) and \(r_2\) are two distinct roots of the characteristic polynomial (i.e., solutions to the characteristic equation), then the solution to the recurrence relation is
\begin{equation*}
a_n = ar_1^n + br_2^n\text{,}
\end{equation*}
where \(a\) and \(b\) are constants determined by the initial conditions.
Example 4.4.4.
Solve the recurrence relation \(a_n = 7a_{n-1} - 10 a_{n-2}\) with \(a_0 = 2\) and \(a_1 = 3\text{.}\)
Solution.
Rewrite the recurrence relation \(a_n - 7a_{n-1} + 10a_{n-2} = 0\text{.}\) Now form the characteristic equation:
\begin{equation*}
x^2 - 7x + 10 = 0
\end{equation*}
and solve for \(x\text{:}\)
\begin{equation*}
(x - 2) (x - 5) = 0\text{,}
\end{equation*}
so \(x = 2\) and \(x = 5\) are the characteristic roots. We therefore know that the solution to the recurrence relation will have the form
\begin{equation*}
a_n = a 2^n + b 5^n\text{.}
\end{equation*}
To find \(a\) and \(b\text{,}\) plug in \(n =0\) and \(n = 1\) to get a system of two equations with two unknowns:
\begin{align*}
2 \amp = a 2^0 + b 5^0 = a + b\\
3 \amp = a 2^1 + b 5^1 = 2a + 5b\text{.}
\end{align*}
Solving this system gives \(a = \frac{7}{3}\) and \(b = -\frac{1}{3}\text{,}\) so the solution to the recurrence relation is
\begin{equation*}
a_n = \frac{7}{3}2^n - \frac{1}{3} 5^n\text{.}
\end{equation*}
Perhaps the most famous recurrence relation is \(F_n = F_{n-1} + F_{n-2}\text{,}\) which together with the initial conditions \(F_0 = 0\) and \(F_1= 1\) defines the Fibonacci sequence. But notice that this is precisely the type of recurrence relation on which we can use the characteristic root technique. When we do, the only thing that changes is that the characteristic equation does not factor, so we must use the quadratic formula to find the characteristic roots. In fact, doing so gives the third most famous irrational number, \(\varphi\text{,}\) the golden ratio.
Before leaving the characteristic root technique, we should think about what might happen when solving the characteristic equation. We have an example above in which the characteristic polynomial has two distinct roots. These roots can be integers, or perhaps irrational numbers (requiring the quadratic formula to find them). In these cases, we know what the solution to the recurrence relation looks like.
However, it is possible for the characteristic polynomial to have only one root. This can happen if the characteristic polynomial factors as \((x - r)^2\text{.}\) It is still the case that \(r^n\) would be a solution to the recurrence relation, but we won’t be able to find solutions for all initial conditions using the general form \(a_n = ar_1^n + br_2^n\text{,}\) since we can’t distinguish between \(r_1^n\) and \(r_2^n\text{.}\) We are in luck though:
Characteristic Root Technique for Repeated Roots.
Suppose the recurrence relation \(a_n = \alpha a_{n-1} + \beta a_{n-2}\) has a characteristic polynomial with only one root \(r\text{.}\) Then the solution to the recurrence relation is
\begin{equation*}
a_n = ar^n + bnr^n
\end{equation*}
where \(a\) and \(b\) are constants determined by the initial conditions.
Notice the extra \(n\) in \(bnr^n\text{.}\) This allows us to solve for the constants \(a\) and \(b\) from the initial conditions.
Example 4.4.5.
Solve the recurrence relation \(a_n = 6a_{n-1} - 9a_{n-2}\) with initial conditions \(a_0 = 1\) and \(a_1 = 4\text{.}\)
Solution.
The characteristic polynomial is \(x^2 - 6x + 9\text{.}\) We solve the characteristic equation
\begin{equation*}
x^2 - 6x + 9 = 0
\end{equation*}
by factoring:
\begin{equation*}
(x - 3)^2 = 0,
\end{equation*}
so \(x =3\) is the only characteristic root. Therefore we know that the solution to the recurrence relation has the form
\begin{equation*}
a_n = a 3^n + bn3^n
\end{equation*}
for some constants \(a\) and \(b\text{.}\) Now use the initial conditions:
\begin{align*}
a_0 = 1 \amp = a 3^0 + b\cdot 0 \cdot 3^0 = a\\
a_1 = 4 \amp = a\cdot 3 + b\cdot 1 \cdot3 = 3a + 3b\text{.}
\end{align*}
Since \(a = 1\text{,}\) we find that \(b = \frac{1}{3}\text{.}\) Therefore the solution to the recurrence relation is
\begin{equation*}
a_n = 3^n + \frac{1}{3}n3^n\text{.}
\end{equation*}
Although we will not consider examples more complicated than these, this characteristic root technique can be applied to much more complicated recurrence relations. For example, \(a_n = 2a_{n-1} + a_{n-2} - 3a_{n-3}\) has characteristic polynomial \(x^3 - 2 x^2 - x + 3\text{.}\) Assuming we see how to factor such a degree 3 (or more) polynomial, we can easily find the characteristic roots and as such solve the recurrence relation (the solution would look like \(a_n = ar_1^n + br_2^n + cr_3^n\) if there were 3 distinct roots). It is also possible that the characteristic roots are complex numbers.
However, the characteristic root technique is only useful for solving recurrence relations in a particular form: \(a_n\) is given as a linear combination of some number of previous terms. These recurrence relations are called linear homogeneous recurrence relations with constant coefficients. The “homogeneous” refers to the fact that there is no additional term in the recurrence relation other than a multiple of \(a_j\) terms. For example, \(a_n = 2a_{n-1} + 1\) is non-homogeneous because of the additional constant 1. There are general methods of solving such things, but we will not consider them here, other than through the use of telescoping or iteration described above.