The exponential function y = e^{x} (solid red curve) and the corresponding Taylor polynomial of degree four (dashed green curve) around the origin.
In calculus, Taylor's theorem gives an approximation of a k times differentiable function around a given point by a kth order Taylor polynomial. For analytic functions the Taylor polynomials at a given point are finite order truncations of its Taylor series, which completely determines the function in some neighborhood of the point. The exact content of "Taylor's theorem" is not universally agreed upon. Indeed, there are several versions of it applicable in different situations, and some of them contain explicit estimates on the approximation error of the function by its Taylor polynomial.
Taylor's theorem is named after the mathematician Brook Taylor, who stated a version of it in 1712. Yet an explicit expression of the error was not provided until much later on by JosephLouis Lagrange. An earlier version of the result was already mentioned in 1671 by James Gregory.^{[1]}
Taylor's theorem is taught on introductory level calculus courses and it is one of the central elementary tools in mathematical analysis. Within pure mathematics it is the starting point of more advanced asymptotic analysis, and it is commonly used in more applied fields of numerics as well as in mathematical physics. Taylor's theorem also generalizes to multivariate and vector valued functions f : R^{n} → R^{m} on any dimensions n and m. This generalization of Taylor's theorem is the basis for the definition of socalled jets which appear in differential geometry and partial differential equations.
Motivation
If a realvalued function f is differentiable at the point a then it has a linear approximation at the point a. This means that there exists a function h_{1} such that

f(x) = f(a) + f'(a)(xa) + h_1(x)(xa), \qquad \lim_{x\to a}h_1(x)=0.
Here

P_1(x) = f(a) + f'(a)(xa) \
is the linear approximation of f at the point a. The graph of y = P_{1}(x) is the tangent line to the graph of f at x = a. The error in the approximation is

R_1(x) = f(x)P_1(x) = h_1(x)(xa). \
Note that this goes to zero a little bit faster than x − a as x tends to a.
Graph of f(x)=e^{x} (blue) with its quadratic approximation P_{2}(x) = 1 + x + x^{2}/2 (red) at a = 0. Note the improvement in the approximation.
If we wanted a better approximation to f, we might instead try a quadratic polynomial instead of a linear function. Instead of just matching one derivative of f at a, we can match two derivatives, thus producing a polynomial that has the same slope and concavity as f at a. The quadratic polynomial in question is

P_2(x) = f(a) + f'(a)(xa) + \frac{f''(a)}{2}(xa)^2. \,
Taylor's theorem ensures that the quadratic approximation is, in a sufficiently small neighborhood of the point a, a better approximation than the linear approximation. Specifically,

f(x) = P_2(x) + h_2(x)(xa)^2, \qquad \lim_{x\to a}h_2(x)=0.
Here the error in the approximation is

R_2(x) = f(x)P_2(x) = h_2(x)(xa)^2 \
which, given the limiting behavior of h_{2}, goes to zero faster than (x − a)^{2} as x tends to a.
Approximation of f(x) = 1/(1 + x^{2}) by its Taylor polynomials P_{k} of order k = 1, ..., 16 centered at x = 0 (red) and x = 1 (green). The approximations do not improve at all outside (1,1) and (1√2,1+√2), respectively.
Similarly, we get still better approximations to
f if we use
polynomials of higher degree, since then we can match even more derivatives with
f at the selected base point. In general, the error in approximating a function by a polynomial of degree
k will go to zero a little bit faster than
(x − a)^{k} as
x tends to
a.
This result is of asymptotic nature: it only tells us that the error R_{k} in an approximation by a kth order Taylor polynomial P_{k} tends to zero faster than any nonzero kth degree polynomial as x → a. It does not tell us how large the error is in any concrete neighborhood of the center of expansion, but for this purpose there are explicit formulae for the remainder term (given below) which are valid under some additional regularity assumptions on f. These enhanced versions of Taylor's theorem typically lead to uniform estimates for the approximation error in a small neighborhood of the center of expansion, but the estimates do not necessarily hold for neighborhoods which are too large, even if the function f is analytic. In that situation one may have to select several Taylor polynomials with different centers of expansion to have reliable Taylorapproximations of the original function (see animation on the right.)
It is also possible that increasing the degree of the approximating polynomial does not increase the quality of approximation at all even if the function f to be approximated is infinitely many times differentiable. An example of this behavior is given below, and it is related to the fact that unlike analytic functions, more general functions are not (locally) determined by the values of their derivatives at a single point.
Taylor's theorem in one real variable
Statement of the theorem
The precise statement of the most basic version of Taylor's theorem is as follows:
The polynomial appearing in Taylor's theorem is the kth order Taylor polynomial

P_k(x) = f(a) + f'(a)(xa) + \frac{f''(a)}{2!}(xa)^2 + \cdots + \frac{f^{(k)}(a)}{k!}(xa)^k
of the function f at the point a. The Taylor polynomial is the unique "asymptotic best fit" polynomial in the sense that if there exists a function h_{k} : R → R and a kth order polynomial p such that

f(x) = p(x) + h_k(x)(xa)^k, \quad \lim_{x\to a}h_k(x)=0,
then p = P_{k}. Taylor's theorem describes the asymptotic behavior of the remainder term

\ R_k(x) = f(x)  P_k(x),
which is the approximation error when approximating f with its Taylor polynomial. Using the littleo notation the statement in Taylor's theorem reads as

R_k(x) = o(xa^k), \quad x\to a.
Explicit formulae for the remainder
Under stronger regularity assumptions on f there are several precise formulae for the remainder term R_{k} of the Taylor polynomial, the most common ones being the following.
These refinements of Taylor's theorem are usually proved using the mean value theorem, whence the name. Also other similar expressions can be found. For example, if G(t) is continuous on the closed interval and differentiable with a nonvanishing derivative on the open interval between a and x, then

R_k(x) = \frac{f^{(k+1)}(\xi)}{k!}(x\xi)^k \frac{G(x)G(a)}{G'(\xi)}
for some number ξ between a and x. This version covers the Lagrange and Cauchy forms of the remainder as special cases, and is proved below using Cauchy's mean value theorem.
The statement for the integral form of the remainder is more advanced than the previous ones, and requires understanding of Lebesgue integration theory for the full generality. However, it holds also in the sense of Riemann integral provided the (k+1)st derivative of f is continuous on the closed interval [a,x].
Due to absolute continuity of f^{(k)} on the closed interval between a and x its derivative f^{(k+1)} exists as an L^{1}function, and the result can be proven by a formal calculation using fundamental theorem of calculus and integration by parts.
Estimates for the remainder
It is often useful in practice to be able to estimate the remainder term appearing in the Taylor approximation, rather than having an exact formula for it. Suppose that f is (k+1)times continuously differentiable in an interval I containing a. Suppose that there are real constants q and Q such that

q\le f^{(k+1)}(x)\le Q
throughout I. Then the remainder term satisfies the inequality^{[8]}

q\frac{(xa)^{k+1}}{(k+1)!}\le R_k(x)\le Q\frac{(xa)^{k+1}}{(k+1)!},
if x > a, and a similar estimate if x < a. This is a simple consequence of the Lagrange form of the remainder. In particular, if

f^{(k+1)}(x)\leq M
on an interval I = (a−r,a+r) with some r>0, then

R_k(x) \le M\frac{xa^{k+1}}{(k+1)!}\le M\frac{r^{k+1}}{(k+1)!}
for all x∈(a−r,a+r). The second inequality is called a uniform estimate, because it holds uniformly for all x on the interval (a−r,a+r).
Example
Approximation of e^{x} (blue) by its Taylor polynomials P_{k} of order k=1,...,7 centered at x=0 (red).
Suppose that we wish to
approximate the function
f(x) = e^{x} on the interval
[−1,1] while ensuring that the error in the approximation is no more than 10
^{−5}. In this example we pretend that we only know the following properties of the exponential function:

(*) \qquad e^0=1, \qquad \frac{d}{dx} e^x = e^x, \qquad e^x>0, \qquad x\in\mathbb{R}.
From these properties it follows that f^{(k)}(x) = e^{x} for all k, and in particular, f^{(k)}(0) = 1. Hence the kth order Taylor polynomial of f at 0 and its remainder term in the Lagrange form are given by

P_k(x) = 1+x+\frac{x^2}{2!}+\cdots+\frac{x^k}{k!}, \qquad R_k(x)=\frac{e^\xi}{(k+1)!}x^{k+1},
where ξ is some number between 0 and x. Since e^{x} is increasing by (*), we can simply use e^{x} ≤ 1 for x ∈ [−1, 0] to estimate the remainder on the subinterval [−1, 0]. To obtain an upper bound for the remainder on [0,1], we use the property e^{ξ}<e^{x} for 0<ξ to estimate

e^x = 1 + x + \frac{e^\xi}{2}x^2 < 1 + x + \frac{e^x}{2}x^2, \qquad 0 < x\leq 1
using the second order Taylor expansion. Then we solve for e^{x} to deduce that

e^x \leq \frac{1+x}{1\frac{x^2}{2}} = 2\frac{1+x}{2x^2} \leq 4, \qquad 0 \leq x\leq 1
simply by maximizing the numerator and minimizing the denominator. Combining these estimates for e^{x} we see that

R_k(x) \leq \frac{4x^{k+1}}{(k+1)!} \leq \frac{4}{(k+1)!}, \qquad 1\leq x \leq 1,
so the required precision is certainly reached, when

\frac{4}{(k+1)!} < 10^{5} \quad \Leftrightarrow \quad 4\cdot 10^5 < (k+1)! \quad \Leftrightarrow \quad k \geq 9.
(See factorial or compute by hand the values 9!=362 880 and 10!=3 628 800.) As a conclusion, Taylor's theorem leads to the approximation

e^x = 1+x+\frac{x^2}{2!} + \ldots + \frac{x^9}{9!} + R_9(x), \qquad R_9(x) < 10^{5}, \qquad 1\leq x \leq 1.
For instance, this approximation provides a decimal expression e≈2.71828, correct up to five decimal places.
Relationship to analyticity
Taylor expansions of real analytic functions
Let I⊂R be an open interval. By definition, a function f:I→R is real analytic if it is locally defined by a convergent power series. This means that for every a ∈ I there exists some r > 0 and a sequence of coefficients c_{k} ∈ R such that (a − r, a + r) ⊂ I and

f(x) = \sum_{k=0}^\infty c_k(xa)^k = c_0 + c_1(xa) + c_2(xa)^2 + \cdots, \qquad xa
In general, the radius of convergence of a power series can be computed from the Cauchy–Hadamard formula

\frac{1}{R} = \limsup_{k\to\infty}c_k^\frac{1}{k}.
This result is based on comparison with a geometric series, and the same method shows that if the power series based on a converges for some b∈R, it must converge uniformly on the closed interval [a − r_{b}, a + r_{b}], where r_{b} = b − a. Here only the convergence of the power series is considered, and it might well be that (a − R,a + R) extends beyond the domain I of f.
The Taylor polynomials of the real analytic function f at a are simply the finite truncations

P_k(x) = \sum_{j=0}^k c_j(xa)^j, \qquad c_j = \frac{f^{(j)}(a)}{j!}
of its locally defining power series, and the corresponding remainder terms are locally given by the analytic functions

R_k(x) = \sum_{j=k+1}^\infty c_j(xa)^j = (xa)^k h_k(x), \qquad xa
Here the functions

h_k:(ar,a+r)\to \R; \qquad h_k(x) = (xa)\sum_{j=0}^\infty c_{k+1+j}(xa)^j
are also analytic, since their defining power series have the same radius of convergence as the original series. Assuming that [a − r, a + r] ⊂ I and r < R, all these series converge uniformly on (a − r, a + r). Naturally, in the case of analytic functions one can estimate the remainder term R_{k}(x) by the tail of the sequence of the derivatives f′(a) at the center of the expansion, but using complex analysis also another possibility arises, which is described below.
Taylor's theorem and convergence of Taylor series
There is a source of confusion on the relationship between Taylor polynomials of smooth functions and the Taylor series of analytic functions. One can (rightfully) see the Taylor series

f(x) \approx \sum_{k=0}^\infty c_k(xa)^k = c_0 + c_1(xa) + c_2(xa)^2 + \ldots
of an infinitely many times differentiable function f:R→R as its "infinite order Taylor polynomial" at a. Now the estimates for the remainder of a Taylor polynomial implies that for any order k and for any r>0 there exists a constant M_{k,r}>0 such that

(*) \quad R_k(x)\leq M_{k,r}\frac{xa^{k+1}}{(k+1)!}
for every x∈(ar,a+r). Sometimes these constants can be chosen in such way that M_{k,r} → 0 when k → ∞ and r stays fixed. Then the Taylor series of f converges uniformly to some analytic function

T_f:(ar,a+r)\to\mathbb R; \qquad T_f(x) = \sum_{k=0}^\infty \frac{f^{(k)}(a)}{k!}(xa)^k.
Here comes the subtle point. It may well be that an infinitely many times differentiable function f has a Taylor series at a which converges on some open neighborhood of a, but the limit function T_{f} is different from f. An important example of this phenomenon is provided by

f:\mathbb R \to \mathbb R; \qquad f(x) = \begin{cases} e^{\frac{1}{x^2}} & x>0, \\ 0 & x\leq 0.\end{cases}
Using the chain rule one can show inductively that for any order k,

f^{(k)}(x) = \begin{cases} \frac{p_k(x)}{x^{3k}}e^{\frac{1}{x^2}} & x>0 \\ 0 & x\leq 0\end{cases}
for some polynomial p_{k} of degree 2(k1). The function e^{\frac{1}{x^2}} tends to zero faster than any polynomial as x → 0, so f is infinitely many times differentiable and f^{(k)}(0) = 0 for every positive integer k. Now the estimates for the remainder for the Taylor polynomials show that the Taylor series of f converges uniformly to the zero function on the whole real axis. Nothing is wrong in here:

The Taylor series of f converges uniformly to the zero function T_{f}(x)=0.

The zero function is analytic and every coefficient in its Taylor series is zero.

The function f is infinitely many times differentiable, but not analytic.

For any k∈N and r>0 there exists M_{k,r}>0 such that the remainder term for the kth order Taylor polynomial of f satisfies (*).
Taylor's theorem in complex analysis
Taylor's theorem generalizes to functions f:\mathbb C\to\mathbb C which are complex differentiable in an open subset U ⊂ C of the complex plane. However, its usefulness is dwarfed by other general theorems in complex analysis. Namely, stronger versions of related results can be deduced for complex differentiable functions f : U → C using Cauchy's integral formula as follows.
Let r > 0 such that the closed disk B(z, r) ∪ S(z, r) is contained in U. Then Cauchy's integral formula with a positive parametrization γ(t)=re^{it} of the circle S(z,r) with t ∈ [0,2π] gives

\begin{align}& f(z) = \frac{1}{2\pi i}\int_\gamma \frac{f(w)}{wz}dw, \quad f'(z) = \frac{1}{2\pi i}\int_\gamma \frac{f(w)}{(wz)^2}dw, \\& \ldots, \quad f^{(k)}(z) = \frac{k!}{2\pi i}\int_\gamma \frac{f(w)}{(wz)^{k+1}}dw. \end{align}
Here all the integrands are continuous on the circle S(z, r), which justifies differentiation under the integral sign. In particular, if f is once complex differentiable on the open set U, then it is actually infinitely many times complex differentiable on U. One also obtains the Cauchy's estimates^{[9]}

f^{(k)}(z) \leq \frac{k!}{2\pi}\int_\gamma \frac{M_r}{wz^{k+1}}dw = \frac{k!M_r}{r^k}, \quad M_r = \max_{wc=r}f(w)
for any z ∈ U and r > 0 such that B(z, r) ∪ S(c, r) ⊂ U. These estimates imply that the complex Taylor series

T_f(z) = \sum_{k=0}^\infty \frac{f^{(k)}(c)}{k!}(zc)^k
of f converges uniformly on any open disk B(c, r) ⊂ U with S(c, r) ⊂ U into some function T_{f}. Furthermore, using the contour integral formulae for the derivatives f^{(k)}(c),

\begin{align} T_f(z) = \ & \sum_{k=0}^\infty \frac{(zc)^k}{2\pi i}\int_\gamma \frac{f(w)}{(wc)^{k+1}}dw = \frac{1}{2\pi i} \int_\gamma \frac{f(w)}{wc} \sum_{k=0}^\infty \left(\frac{zc}{wc}\right)^k dw \\ = \ & \frac{1}{2\pi i} \int_\gamma \frac{f(w)}{wc}\left( \frac{1}{1\frac{zc}{wc}} \right) dw = \frac{1}{2\pi i} \int_\gamma \frac{f(w)}{wz} dw = f(z), \end{align}
so any complex differentiable function f in an open set U ⊂ C is in fact complex analytic. All that is said for real analytic functions here holds also for complex analytic functions with the open interval I replaced by an open subset U ∈ C and acentered intervals (a − r, a + r) replaced by ccentered disks B(c, r). In particular, the Taylor expansion holds in the form

f(z) = P_k(z) + R_k(z), \quad P_k(z) = \sum_{j=0}^k \frac{f^{(j)}(c)}{j!}(zc)^j,
where the remainder term R_{k} is complex analytic. Methods of complex analysis provide some powerful results regarding Taylor expansions. For example, using Cauchy's integral formula for any positively oriented Jordan curve γ which parametrizes the boundary ∂W ⊂ U of a region W ⊂ U, one obtains expressions for the derivatives f^{(j)}(c) as above, and modifying slightly the computation for T_{f}(z) = f(z), one arrives at the exact formula

R_k(z) = \sum_{j=k+1}^\infty \frac{(zc)^j}{2\pi i} \int_\gamma \frac{f(w)}{(wc)^{j+1}}dw = \frac{(zc)^{k+1}}{2\pi i} \int_\gamma \frac{f(w)dw}{(wc)^{k+1}(wz)} , \qquad z\in W.
The important feature here is that the quality of the approximation by a Taylor polynomial on the region W ⊂ U is dominated by the values of the function f itself on the boundary ∂W ⊂ U. Similarly, applying Cauchy's estimates to the series expression for the remainder, one obtains the uniform estimates

R_k(z) \leq \sum_{j=k+1}^\infty \frac{M_r zc^j}{r^j} = \frac{M_r}{r^{k+1}} \frac{zc^{k+1}}{1\frac{zc}{r}} \leq \frac{M_r \beta^{k+1}}{1\beta} , \qquad \frac{zc}{r}\leq \beta < 1.
Example
Complex plot of f(z) = 1/(1 + z^{2}). Modulus is shown by elevation and argument by coloring: cyan=0, blue=π/3, violet=2π/3, red=π, yellow=4π/3, green=5π/3.
The function f:R→R defined by

f(x) = \frac{1}{1+x^2}
is real analytic, that is, locally determined by its Taylor series. This function was plotted above to illustrate the fact that some elementary functions cannot be approximated by Taylor polynomials in neighborhoods of the center of expansion which are too large. This kind of behavior is easily understood in the framework of complex analysis. Namely, the function f extends into a meromorphic function

f:\mathbb C\cup\{\infty\} \to \mathbb C\cup\{\infty\}; \quad f(z) = \frac{1}{1+z^2}
on the compactified complex plane. It has simple poles at z=i and z=−i, and it is analytic elsewhere. Now its Taylor series centered at z_{0} converges on any disc B(z_{0},r) with r<zz_{0}, where the same Taylor series converges at z∈C. Therefore Taylor series of f centered at 0 converges on B(0,1) and it does not converge for any z∈C with z>1 due to the poles at i and −i. For the same reason the Taylor series of f centered at 1 converges on B(1,√2) and does not converge for any z∈C with z1>√2.
Generalizations of Taylor's theorem
Higherorder differentiability
A function f: R^{n} → R is differentiable at a ∈ R^{n} if and only if there exists a linear functional L : R^{n} → R and a function h : R^{n} → R such that

f(\boldsymbol{x}) = f(\boldsymbol{a}) + L(\boldsymbol{x}\boldsymbol{a}) + h(\boldsymbol{x})\mathbf{x}\mathbf{a}, \qquad \lim_{\boldsymbol{x}\to\boldsymbol{a}}h(\boldsymbol{x})=0.
If this is the case, then L = df(a) is the (uniquely defined) differential of f at the point a. Furthermore, then the partial derivatives of f exist at a and the differential of f at a is given by

df( \boldsymbol{a} )( \boldsymbol{v} ) = \frac{\partial f}{\partial x_1}(\boldsymbol{a})v_1 + \cdots + \frac{\partial f}{\partial x_n}(\boldsymbol{a})v_n.
Introduce the multiindex notation

\alpha = \alpha_1+\cdots+\alpha_n, \quad \alpha!=\alpha_1!\cdots\alpha_n!, \quad \boldsymbol{x}^\alpha=x_1^{\alpha_1}\cdots x_n^{\alpha_n}
for α ∈ N^{n} and x ∈ R^{n}. If all the kth order partial derivatives of f : R^{n} → R are continuous at a ∈ R^{n}, then by Clairaut's theorem, one can change the order of mixed derivatives at a, so the notation

D^\alpha f = \frac{\partial^{\alpha}f}{\partial x_1^{\alpha_1}\cdots \partial x_n^{\alpha_n}}, \qquad \alpha\leq k
for the higher order partial derivatives is justified in this situation. The same is true if all the (k − 1)th order partial derivatives of f exist in some neighborhood of a and are differentiable at a.^{[10]} Then we say that f is k times differentiable at the point a .
Taylor's theorem for multivariate functions
If the function f : R^{n} → R is k+1 times continuously differentiable in the closed ball B, then one can derive an exact formula for the remainder in terms of (k+1)th order partial derivatives of f in this neighborhood. Namely,

\begin{align}& f( \boldsymbol{x} ) = \sum_{\alpha\leq k} \frac{D^\alpha f(\boldsymbol{a})}{\alpha!} (\boldsymbol{x}\boldsymbol{a})^\alpha + \sum_{\beta=k+1} R_\beta(\boldsymbol{x})(\boldsymbol{x}\boldsymbol{a})^\beta, \\& R_\beta( \boldsymbol{x} ) = \frac{\beta}{\beta!} \int_0^1 (1t)^{\beta1}D^\beta f \big(\boldsymbol{a}+t( \boldsymbol{x}\boldsymbol{a} )\big) \, dt. \end{align}
In this case, due to the continuity of (k+1)th order partial derivatives in the compact set B, one immediately obtains the uniform estimates

\leftR_\beta(\boldsymbol{x})\right \leq \frac{1}{\beta!} \max_{\alpha=\beta} \max_{\boldsymbol{y}\in B} D^\alpha f(\boldsymbol{y}), \qquad \boldsymbol{x}\in B.
Example in two dimensions
For example, the third order Taylor polynomial of a function f: R^{2} → R is, denoting x − a = v,

\begin{align} P_3(\boldsymbol{x}) = f ( \boldsymbol{a} ) + &\frac{\partial f}{\partial x_1}( \boldsymbol{a} ) v_1 + \frac{\partial f}{\partial x_2}( \boldsymbol{a} ) v_2 + \frac{\partial^2 f}{\partial^2 x_1}( \boldsymbol{a} ) \frac {v_1^2}{2!} + \frac{\partial^2 f}{\partial x_1 \partial x_2}( \boldsymbol{a} ) v_1 v_2 + \frac{\partial^2 f}{\partial^2 x_2}( \boldsymbol{a} ) \frac{v_2^2}{2!} \\ & + \frac{\partial^3 f}{\partial x_1^3}( \boldsymbol{a} ) \frac{v_1^3}{3!} + \frac{\partial^3 f}{\partial^2 x_1 \partial x_2}( \boldsymbol{a} ) \frac{v_1^2 v_2}{2!} + \frac{\partial^3 f}{\partial x_1 \partial^2 x_2}( \boldsymbol{a} ) \frac{v_1 v_2^2}{2!} + \frac{\partial^3 f}{\partial^3 x_2}( \boldsymbol{a} ) \frac{v_2^3}{3!} \end{align}
Proofs
Proof for Taylor's theorem in one real variable
Let^{[12]}

h_k(x) = \begin{cases} \frac{f(x)  P(x)}{(xa)^k} & x\not=a\\ 0&x=a \end{cases}
where, as in the statement of Taylor's theorem,

P(x) = f(a) + f'(a)(xa) + \frac{f''(a)}{2!}(xa)^2 + \cdots + \frac{f^{(k)}(a)}{k!}(xa)^k.
It is sufficient to show that

\lim_{x\to a} h_k(x) =0. \,
The proof here is based on repeated application of L'Hôpital's rule. Note that, for each j = 0,1,...,k−1, f^{(j)}(a)=P^{(j)}(a). Hence each of the first k−1 derivatives of the numerator in h_k(x) vanishes at x=a, and the same is true of the denominator. Also, since the condition that the function f be k times differentiable at a point requires differentiability up to order k−1 in a neighborhood of said point (this is true, because differentiability requires a function to be defined in a whole neighborhood of a point), the nominator and its k2 derivatives are differentiable in a neighborhood of a. Clearly, the denominator also satisfies said condition, and additionally, doesn't vanish unless x=a, therefore all conditions necessary for L'Hopital's rule are fulfilled, and its use is justified. So

\begin{align} \lim_{x\to a} \frac{f(x)  P(x)}{(xa)^k} &= \lim_{x\to a} \frac{\frac{d}{dx}(f(x)  P(x))}{\frac{d}{dx}(xa)^k} = \cdots = \lim_{x\to a} \frac{\frac{d^{k1}}{dx^{k1}}(f(x)  P(x))}{\frac{d^{k1}}{dx^{k1}}(xa)^k}\\ &=\frac{1}{k!}\lim_{x\to a} \frac{f^{(k1)}(x)  P^{(k1)}(x)}{xa}\\ &=\frac{1}{k!}(f^{(k)}(a)  f^{(k)}(a)) = 0 \end{align}
where the second to last equality follows by the definition of the derivative at x = a.
Derivation for the mean value forms of the remainder
Let G be any realvalued function, continuous on the closed interval between a and x and differentiable with a nonvanishing derivative on the open interval between a and x, and define

F(t) = f(t) + f'(t)(xt) + \frac{f''(t)}{2!}(xt)^2 + \cdots + \frac{f^{(k)}(t)}{k!}(xt)^k.
Then, by Cauchy's mean value theorem,

(*) \quad \frac{F'(\xi)}{G'(\xi)} = \frac{F(x)  F(a)}{G(x)  G(a)}
for some ξ on the open interval between a and x. Note that here the numerator F(x) − F(a) = R_{k}(x) is exactly the remainder of the Taylor polynomial for f(x). Compute

\begin{align} F'(t) = & f'(t) + \big(f''(t)(xt)  f'(t)\big) + \left(\frac{f^{(3)}(t)}{2!}(xt)^2  \frac{f^{(2)}(t)}{1!}(xt)\right) + \cdots \\ & \cdots + \left( \frac{f^{(k+1)}(t)}{k!}(xt)^k  \frac{f^{(k)}(t)}{(k1)!}(xt)^{k1}\right) = \frac{f^{(k+1)}(t)}{k!}(xt)^k, \end{align}
plug it into (*) and rearrange terms to find that

R_k(x) = \frac{f^{(k+1)}(\xi)}{k!}(x\xi)^k \frac{G(x)G(a)}{G'(\xi)}.
This is the form of the remainder term mentioned after the actual statement of Taylor's theorem with remainder in the mean value form. The Lagrange form of the remainder is found by choosing \ G(t)=(xt)^{k+1} \ and the Cauchy form by choosing \ G(t) = ta.
Remark. Using this method one can also recover the integral form of the remainder by choosing

G(t) = \int_a^t \frac{f^{(k+1)}(s)}{k!} (xs)^k \, ds,
but the requirements for f needed for the use of mean value theorem are too strong, if one aims to prove the claim in the case that f^{(k)} is only absolutely continuous. However, if one uses Riemann integral instead of Lebesgue integral, the assumptions cannot be weakened.
Derivation for the integral form of the remainder
Due to absolute continuity of f^{(k)} on the closed interval between a and x its derivative f^{(k+1)} exists as an L^{1}function, and we can use fundamental theorem of calculus and integration by parts. This same proof applies for the Riemann integral assuming that f^{(k)} is continuous on the closed interval and differentiable on the open interval between a and x, and this leads to the same result than using the mean value theorem.
The fundamental theorem of calculus states that

f(x)=f(a)+ \int_a^x \, f'(t) \, dt.
Now we can integrate by parts and use the fundamental theorem of calculus again to see that

\begin{align} f(x) &= f(a)+\Big(xf'(x)af'(a)\Big)\int_a^x tf''(t) \, dt \\ &= f(a) + x\left(f'(a) + \int_a^x f''(t) \,dt \right) af'(a)\int_a^x tf''(t) \, dt \\ &= f(a)+(xa)f'(a)+\int_a^x \, (xt)f''(t) \, dt, \end{align}
which is exactly Taylor's theorem with remainder in the integral form in the case k=1. The general statement is proved using induction. Suppose that

(*) \quad f(x) = f(a) + \frac{f'(a)}{1!}(x  a) + \cdots + \frac{f^{(k)}(a)}{k!}(x  a)^k + \int_a^x \frac{f^{(k+1)} (t)}{k!} (x  t)^k \, dt.
Integrating the remainder term by parts we arrive at

\begin{align} \int_a^x \frac{f^{(k+1)} (t)}{k!} (x  t)^k \, dt = &  \left[ \frac{f^{(k+1)} (t)}{(k+1)k!} (x  t)^{k+1} \right]_a^x + \int_a^x \frac{f^{(k+2)} (t)}{(k+1)k!} (x  t)^{k+1} \, dt \\ = & \ \frac{f^{(k+1)} (a)}{(k+1)!} (x  a)^{k+1} + \int_a^x \frac{f^{(k+2)} (t)}{(k+1)!} (x  t)^{k+1} \, dt. \\ \end{align}
Substituting this into the formula in (*) shows that if it holds for the value k, it must also hold for the value k + 1. Therefore, since it holds for k = 1, it must hold for every positive integer k.
Derivation for the remainder of multivariate Taylor polynomials
We prove the special case, where f : R^{n} → R has continuous partial derivatives up to the order k+1 in some closed ball B with center a. The strategy of the proof is to apply the onevariable case of Taylor's theorem to the restriction of f to the line segment adjoining x and a.^{[13]} Parametrize the line segment between a and x by u(t) = a + t(x − a). We apply the onevariable version of Taylor's theorem to the function g(t) = f(u(t)):

f(x)=g(1)=g(0)+\sum_{j=1}^k\frac{1}{j!}g^{(j)}(0)\ +\ \int_0^1 \frac{(1t)^k }{k!} g^{(k+1)}(t)\, dt.
Applying the chain rule for several variables gives

\begin{align} g^{(j)}(t)&=\frac{d^j}{dt^j}f(u(t)) = \frac{d^j}{dt^j} f(\mathbf{a}+t(\mathbf{x}\mathbf{a})) \\ &= \sum_{\alpha=j} \left(\begin{matrix} j \\ \alpha\end{matrix} \right) (D^\alpha f) (\mathbf{a}+t(\mathbf{x}\mathbf{a})) (\mathbf{x}\mathbf{a})^\alpha \end{align}
where \left(\begin{matrix}j \\ \alpha\end{matrix}\right) is the multinomial coefficient. Since \frac{1}{j!}\left(\begin{matrix}j\\ \alpha\end{matrix}\right)=\frac{1}{\alpha!}, we get

f(\mathbf x)= f(\mathbf a)+\sum_{\alpha\leq k}\frac{1}{\alpha!} (D^\alpha f) (\mathbf a)(\mathbf x\mathbf a)^\alpha+\sum_{\alpha=k+1}\frac{k+1}{\alpha!} (\mathbf x\mathbf a)^\alpha \int_0^1 (1t)^k (D^\alpha f)(\mathbf a+t(\mathbf x\mathbf a))\,dt.
See also

^ Kline 1972, p. 442,464

^ Genocchi, Angelo; Peano, Giuseppe (1884), Calcolo differenziale e principii di calcolo integrale, (N. 67, p.XVIIXIX): Fratelli Bocca ed.

^

^ Hazewinkel, Michiel, ed. (2001), "Taylor formula",

^ Klein 1998, §20.3; Apostol 1967, §7.7.

^ Apostol 1967, §7.7.

^ Apostol 1967, §7.5.

^ Apostol 1967, §7.6

^ Rudin, 1987, §10.26.

^ This follows from iterated application of the theorem that if the partial derivatives of a function f exist in a neighborhood of a and are continuous at a, then the function is differentiable at a. See, for instance, Apostol 1974, Theorem 12.11.

^ Königsberger Analysis 2, p. 64 ff.

^ Stromberg 1981

^ Hörmander 1976, pp. 12–13
References

.

Apostol, Tom (1974), Mathematical analysis, Addison—Wesley .

Bartle; Sherbert (2000), Introduction to Real Analysis (3rd ed.), John Wiley & Sons, Inc., .

.

.

Kline, Morris (1998), Calculus: An Intuitive and Physical Approach, Dover, .

Pedrick, George (1994), A First Course in Analysis, SpringerVerlag, .

Stromberg, Karl (1981), Introduction to classical real analysis, Wadsworth, Inc., .

Rudin, Walter (1987), Real and complex analysis, 3rd ed., McGrawHill Book Company, .
External links
This article was sourced from Creative Commons AttributionShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and USA.gov, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for USA.gov and content contributors is made possible from the U.S. Congress, EGovernment Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a nonprofit organization.