My watch list

Heat equation

The heat equation is an important partial differential equation which describes the variation of temperature in a given region over time.

Additional recommended knowledge

How to ensure accurate weighing results every day?

Essential Laboratory Skills Guide

Daily Visual Balance Check

1 General-audience description
2 The physical problem and the equation
3 Solving the heat equation using Fourier series
- 3.1 Generalizing the solution technique
4 Heat conduction in non-homogeneous anisotropic media
5 Particle diffusion
6 Schrödinger equation for a free particle
7 Applications
8 See also
9 References

General-audience description

Suppose one has a function u which describes the temperature at a given location (x, y, z). This function will change over time as heat spreads throughout space. The heat equation is used to determine the change in the function u over time. The image below is animated and has a description of the way heat changes in time along a metal bar. One of the interesting properties of the heat equation is the maximum principle which says that the maximum value of u is either earlier in time than the region of concern or on the edge of the region of concern. This is essentially saying that temperature comes either from some source or from earlier in time because heat permeates but is not created from nothingness. This is a property of parabolic partial differential equations and is not difficult to prove mathematically (see below).

Another interesting property is that even if u has a discontinuity at an initial time t = t₀, then the temperature becomes instantly smooth as soon as t > t₀. For example, if a bar of metal has temperature 0 and another has temperature 100 and they are stuck together end to end, then instantaneously the temperature at the point of connection is 50 and the graph of the temperature is smoothly running from 0 to 100. This is not physically possible, since there would then be information propagation at infinite speed, which would violate causality. Therefore this is a property of the mathematical equation rather than of heat conduction itself. However, for most practical purposes, the difference is negligible.

The heat equation is used in probability and describes random walks. It is also applied in financial mathematics for this reason.

It is also important in Riemannian geometry and thus topology: it was adapted by Richard Hamilton when he defined the Ricci flow that was later used to solve the topological Poincaré conjecture.

The physical problem and the equation

In the special case of heat propagation in an isotropic and homogeneous medium in the 3-dimensional space, this equation is

${\partial u\over \partial t} = k \left({\partial^2 u\over \partial x^2 } + {\partial^2 u\over \partial y^2 } + {\partial^2 u\over \partial z^2 }\right) = k ( u_{xx} + u_{yy} + u_{zz} ) \quad$

where:

$u=u(t,x,y,z) \,\!$ is temperature as a function of time and space;

$\frac{\partial u}{\partial t}$ is the rate of change of temperature at a point over time;

$u_{xx}\,\!$ , $u_{yy}\,\!$ , and $u_{zz}\,\!$ are the second spatial derivatives (thermal conductions) of temperature in the x, y, and z directions, respectively

k is a material-specific quantity depending on the thermal conductivity , the density and the heat capacity.

The heat equation is a consequence of Fourier's law of cooling (see heat conduction).

If the medium is not the whole space, in order to solve the heat equation uniquely we also need to specify boundary conditions for u. To determine uniqueness of solutions in the whole space it is necessary to assume an exponential bound on the growth of solutions, this assumption is consistent with observed experiments.

Solutions of the heat equation are characterized by a gradual smoothing of the initial temperature distribution by the flow of heat from warmer to colder areas of an object. Generally, many different states and starting conditions will tend toward the same stable equilibrium. As a consequence, to reverse the solution and conclude something about earlier times or initial conditions from the present heat distribution is very inaccurate except over the shortest of time periods.

The heat equation is the prototypical example of a parabolic partial differential equation.

Using the Laplace operator, the heat equation can be simplified, and generalized to similar equations over spaces of arbitrary number of dimensions, as

$u_t = k \nabla^2 u = k \Delta u, \quad \,\!$

where the Laplace operator, Δ or $\nabla^2$ , the divergence of the gradient, is taken in the spatial variables.

The heat equation governs heat diffusion, as well as other diffusive processes, such as particle diffusion or the propagation of action potential in nerve cells. Although they are not diffusive in nature, some quantum mechanics problems are also governed by a mathematical analog of the heat equation (see below). It also can be used to model some phenomena arising in finance, like the Black-Scholes or the Ornstein-Uhlenbeck processes. The equation, and various non-linear analogues, has also been used in image analysis.

The heat equation is, technically, in violation of special relativity, because its solutions involve instantaneous propagation of a disturbance. The part of the disturbance outside the forward light cone can usually be safely neglected, but if it is necessary to develop a reasonable speed for the transmission of heat, a hyperbolic problem should be considered instead -- like a partial differential equation involving a second-order time derivative.

Solving the heat equation using Fourier series

The following solution technique for the heat equation was proposed by Joseph Fourier in his treatise Théorie analytique de la chaleur, published in 1822. Let us consider the heat equation for one space variable. This could be used to model heat conduction in a rod. The equation is

$(1) \ u_t = k u_{xx} \quad$

where u = u(t, x) is a function of two variables t and x. Here

x is the space variable, so x ∈ [0,L], where L is the length of the rod.
t is the time variable, so t ≥ 0.

We assume the initial condition

$(2) \ u(0,x) = f(x) \quad \forall x \in [0,L] \quad$

where the function f is given and the boundary conditions

$(3) \ u(t,0) = 0 = u(t,L) \quad \forall t > 0 \quad$ .

Let us attempt to find a solution of (1) which is not identically zero satisfying the boundary conditions (3) but with the following property: u is a product in which the dependence of u on x, t is separated, that is:

$(4) \ u(t,x) = X(x) T(t). \quad$

This solution technique is called separation of variables. Substituting u back into equation (1),

$\frac{T'(t)}{kT(t)} = \frac{X''(x)}{X(x)}. \quad$

Since the right hand side depends only on x and the left hand side only on t, both sides are equal to some constant value − λ. Thus:

$(5) \ T'(t) = - \lambda kT(t) \quad$

and

$(6) \ X''(x) = - \lambda X(x). \quad$

We will now show that solutions for (6) for values of λ ≤ 0 cannot occur:

Suppose that λ < 0. Then there exist real numbers B, C such that
$X(x) = B e^{\sqrt{-\lambda} \, x} + C e^{-\sqrt{-\lambda} \, x}.$
From (3) we get
$X(0) = 0 = X(L). \quad$
and therefore B = 0 = C which implies u is identically 0.
Suppose that λ = 0. Then there exist real numbers B, C such that
$X(x) = Bx + C. \quad$
From equation (3) we conclude in the same manner as in 1 that u is identically 0.
Therefore, it must be the case that λ > 0. Then there exist real numbers A, B, C such that
$T(t) = A e^{-\lambda k t} \quad$
and
$X(x) = B \sin(\sqrt{\lambda} \, x) + C \cos(\sqrt{\lambda} \, x).$
From (3) we get C = 0 and that for some positive integer n,
$\sqrt{\lambda} = n \frac{\pi}{L}.$

This solves the heat equation in the special case that the dependence of u has the special form (4).

In general, the sum of solutions to (1) which satisfy the boundary conditions (3) also satisfies (1) and (3). We can show that the solution to (1), (2) and (3) is given by

$u(t,x) = \sum_{n = 1}^{+\infty} D_n \left(\sin \frac{n\pi x}{L}\right) e^{-\frac{n^2 \pi^2 kt}{L^2}}$

where

$D_n = \frac{2}{L} \int_0^L f(x) \sin \frac{n\pi x}{L} \, dx.$

Generalizing the solution technique

The solution technique used above can be greatly extended to many other types of equations. The idea is that the operator u_xx with the zero boundary conditions can be represented in terms of its eigenvectors. This leads naturally to one of the basic ideas of the spectral theory of linear self-adjoint operators.

Consider the linear operator Δ u = u_{x x}. The infinite sequence of functions

$e_n(x) = \sqrt{\frac{2}{L}}\sin \frac{n\pi x}{L}$

for n ≥ 1 are eigenvectors of Δ. Indeed

$\Delta e_n = -\frac{n^2 \pi^2}{L^2} e_n.$

Moreover, any eigenvector f of Δ with the boundary conditions f(0)=f(L)=0 is of the form e_n for some n ≥ 1. The functions e_n for n ≥ 1 form an orthonormal sequence with respect to a certain inner product on the space of real-valued functions on [0, L]. This means

$\langle e_n, e_m \rangle = \int_0^L e_n(x) e_m(x) dx = \left\{ \begin{matrix} 0 & n \neq m \\ 1 & m = n\end{matrix}\right..$

Finally, the sequence {e_n}_{n ∈ N} spans a dense linear subspace of L²(0, L). This shows that in effect we have diagonalized the operator Δ.

Heat conduction in non-homogeneous anisotropic media

In general, the study of heat conduction is based on several principles. Heat flow is a form of energy flow, and as such it is meaningful to speak of the time rate of flow of heat into a region of space.

The time rate of heat flow into a region V is given by a time-dependent quantity q_t(V). We assume q has a density, so that

$q_t(V) = \int_V Q(t,x)\,d x \quad$

Heat flow is a time-dependent vector function H(x) characterized as follows: the time rate of heat flowing through an infinitesimal surface element with area d S and with unit normal vector n is

$\mathbf{H}(x) \cdot \mathbf{n}(x) \, dS$

Thus the rate of heat flow into V is also given by the surface integral

$q_t(V)= - \int_{\partial V} \mathbf{H}(x) \cdot \mathbf{n}(x) \, dS$

where n(x) is the outward pointing normal vector at x.

The Fourier law states that heat energy flow has the following linear dependence on the temperature gradient

$\mathbf{H}(x) = -\mathbf{A}(x) \cdot \nabla u (x)$

where A(x) is a 3 × 3 real matrix that is symmetric and positive definite.

By Green's theorem, the previous surface integral for heat flow into V can be transformed into the volume integral

$q_t(V) = - \int_{\partial V} \mathbf{H}(x) \cdot \mathbf{n}(x) \, dS$

$= \int_{\partial V} \mathbf{A}(x) \cdot \nabla u (x) \cdot \mathbf{n}(x) \, dS$

$= \int_V \sum_{i, j} \partial_{x_i} a_{i j}(x) \partial_{x_j} u (t,x)\,dx$

The time rate of temperature change at x is proportional to the heat flowing into an infinitesimal volume element, where the constant of proportionality is dependent on a constant κ

$\partial_t u(t,x) = \kappa(x) Q(t,x)\,$

Putting these equations together gives the general equation of heat flow:

$\partial_t u(t,x) = \kappa(x) \sum_{i, j} \partial_{x_i} a_{i j}(x) \partial_{x_j} u (t,x)$

Remarks.

The coefficient κ(x) is the inverse of specific heat of the substance at x × density of the substance at x.

In the case of an isotropic medium, the matrix A is a scalar matrix equal to thermal conductivity.

In the anisotropic case where the coefficient matrix A is not scalar (i.e., if it depends on x), then an explicit formula for the solution of the heat equation can seldom be written down. Though, it is usually possible to consider the associated abstract cauchy problem and show that it is a well-posed problem and/or to show some qualitative properties (like preservation of positive initial data, infinite speed of propagation, convergence toward an equilibrium, smoothing properties). This is usually done by one-parameter semigroups theory: for instance, if A is a symmetric matrix, then the elliptic operator defined by

$Au(x):=\sum_{i, j} \partial_{x_i} a_{i j}(x) \partial_{x_j} u (x)$

is self-adjoint and dissipative, thus by the spectral theorem it generates a one-parameter semigroup.

Particle diffusion

Particle diffusion equation

One can model particle diffusion by an equation involving either:

the volumetric concentration of particles, denoted c, in the case of collective diffusion of a large number of particles, or
the probability density function associated with the position of a single particle, denoted P.

In either case, one uses the heat equation

$c_t = D \Delta c, \quad$

$P_t = D \Delta P. \quad$

Both c and P are functions of position and time. D is the diffusion coefficient that controls the speed of the diffusive process, and is typically expressed in meters squared over second.

If the diffusion coefficient D is not constant, but depends on the concentration c (or P in the second case), then one gets the nonlinear diffusion equation.

The random trajectory of a single particle subject to the particle diffusion equation is a brownian motion.

If a particle is placed in $\vec R = \vec 0$ at time $t = 0$ , then the probability density function associated to the vector $\vec R$ will be the following:

$P(\vec R,t) = G(\vec R,t) = \frac{1}{(4 \pi D t)^{3/2}} e^{-\frac {\vec R^2}{4 D t}}$

It is related to the probability density functions associated to each of its components $R x$ , $R y$ and $R z$ in the following way:

$P(\vec R,t) = \frac{1}{(4 \pi D t)^{3/2}} e^{-\frac {R_x^2+R_y^2+R_z^2}{4 D t}} = P(R_x,t)P(R_y,t)P(R_z,t)$

The random variables $R x$ , $R y$ and $R z$ are distributed according to a normal distribution of mean 0 and of variance $2\,D\,t$ . In 3D, the random vector $\vec R$ is distributed according to a normal distribution of mean $\vec 0$ and of variance $6\, D\,t$ .

At t=0, the expression of $P(\vec R,t)$ above is singular. The probability density function corresponding to the initial condition of a particle located in a known position $\vec R = \vec 0$ is the Dirac delta function, noted $\delta (\vec R)$ (the generalisation to 3D of the Dirac delta function is simply $\delta (\vec R) = \delta (R_x) \delta (R_y) \delta (R_z)$ ). The solution of the diffusion equation associated to this initial condition is also called a Green function.

Historical origin of the diffusion equation

The particle diffusion equation was originally derived by Adolf Fick in 1855.

Solving the diffusion equation through Green functions

Green functions are the solutions of the diffusion equation corresponding to the initial condition of a particle of known position. For another initial condition, the solution to the diffusion equation can be expressed as a decomposition on a set of Green Functions.

Say, for example, that at t=0 we have not only a particle located in a known position $\vec R = \vec 0$ , but instead a large number of particles, distributed according to a spatial concentration profile $c(\vec R, t=0)$ . Solving the diffusion equation will tell us how this profile will evolve with time.

As any function, the initial concentration profile can be decomposed as an integral sum on Dirac delta functions:

$c(\vec R, t=0) = \int c(\vec R^0,t=0) \delta(\vec R - \vec R^0) dR_x^0\,dR_y^0\,dR_z^0$

At subsequent instants, given the linearity of the diffusion equation, the concentration profile becomes:

$c(\vec R, t) = \int c(\vec R^0,t=0) G(\vec R - \vec R^0,t) dR_x^0\,dR_y^0\,dR_z^0$ , where $G(\vec R - \vec R^0,t)$ is the Green function defined above.

Although it is more easily understood in the case of particle diffusion , where an initial condition corresponding to a Dirac delta function can be intuitively described as a particle being located in a known position, such a decomposition of a solution into Green functions can be generalized to the case of any diffusive process, like heat transfer, or momentum diffusion, which is the phenomenon at the origin of viscosity in liquids.

List of Green function solutions in 1D

$\begin{cases} u_{t}=ku_{xx} & -\infty<x<\infty,\,0<t<\infty \\ u(x,0)=g(x) & IC \end{cases}$

$u(x,t)=\frac{1}{\sqrt{4\pi kt}} \int_{-\infty}^{\infty} \exp\left(-\frac{(x-y)^2}{4kt}\right)g(y)\,dy$

$\begin{cases} u_{t}=ku_{xx} & \, 0\le x<\infty, \, 0<t<\infty \\ u(x,0)=g(x) & IC \\ u(0,t)=0 & BC \end{cases}$

$u(x,t)=\frac{1}{\sqrt{4\pi kt}} \int_{0}^{\infty} \left(\exp\left(-\frac{(x-y)^2}{4kt}\right)-\exp\left(-\frac{(x+y)^2}{4kt}\right)\right) g(y)\,dy$

$\begin{cases} u_{t}=ku_{xx} & \, 0\le x<\infty, \, 0<t<\infty \\ u(x,0)=g(x) & IC \\ u_{x}(0,t)=0 & BC \end{cases}$

$u(x,t)=\frac{1}{\sqrt{4\pi kt}} \int_{0}^{\infty} \left(\exp\left(-\frac{(x-y)^2}{4kt}\right)+\exp\left(-\frac{(x+y)^2}{4kt}\right)\right) g(y)\,dy$

$\begin{cases} u_{t}=ku_{xx}+f & -\infty<x<\infty,\,0<t<\infty \\ u(x,0)=0 & IC \end{cases}$

$u(x,t)=\int_{0}^{t}\int_{-\infty}^{\infty} \frac{1}{\sqrt{4\pi k(t-s)}} \exp\left(-\frac{(x-y)^2}{4k(t-s)}\right)f(s)\,dyds$

$\begin{cases} u_{t}=ku_{xx}+f(x,t) & 0\le x<\infty,\,0<t<\infty \\ u(x,0)=0 & IC \\ u(0,t)=0 & BC \end{cases}$

$u(x,t)=\int_{0}^{t}\int_{0}^{\infty} \frac{1}{\sqrt{4\pi k(t-s)}} \left(\exp\left(-\frac{(x-y)^2}{4k(t-s)}\right)-\exp\left(-\frac{(x+y)^2}{4k(t-s)}\right)\right) f(y,s)\,dyds$

$\begin{cases} u_{t}=ku_{xx} & 0\le x<\infty,\,0<t<\infty \\ u(x,0)=0 & IC \\ u(0,t)=h(t) & BC \end{cases}$

$u(x,t)=\int_{0}^{t} \frac{x}{\sqrt{4\pi k(t-s)^3}} \exp\left(-\frac{x^2}{4k(t-s)}\right)h(s)\,ds$

$\begin{cases} u_{t}=ku_{xx}+f & -\infty<x<\infty,\,0<t<\infty \\ u(x,0)=g(x) & IC\end{cases}$

$\quad{u=w+v}$

$\begin{cases} v_{t}=kv_{xx}+f, \, w_{t}=kw_{xx} \, & -\infty<x<\infty,\,0<t<\infty \\ v(x,0)=0,\, w(x,0)=g(x) \, & IC\end{cases}$

$\begin{cases} u_{t}=ku_{xx}+f & 0\le x<\infty,\,0<t<\infty \\ u(x,0)=g(x) & IC \\ u(0,t)=h(t) & BC\end{cases}$

$\quad{u=w+v+r}$

$\begin{cases} v_{t}=kv_{xx}+f, \, w_{t}=kw_{xx}, \, r_{t}=kr_{xx} \, & 0\le x<\infty,\,0<t<\infty \\ v(x,0)=0, \; w(x,0)=g(x), \; r(x,0)=0 & IC \\ v(0,t)=0, \; w(0,t)=0, \; r(0,t)=h(t) & BC \end{cases}$

Schrödinger equation for a free particle

With a simple division, the Schrödinger equation for a single particle of mass m in the absence of any applied force field can be rewritten in the following way:

$\psi_t = \frac{i \hbar}{2m} \Delta \psi$ , where i is the unit imaginary number, and $\hbar$ is Planck's constant divided by

2π

, and

ψ

is the wavefunction of the particle.

This equation is a mathematical analogue of the particle diffusion equation, which one obtains through the following transformation:

$c(\vec R,t) \to \psi(\vec R,t)$

$D \to \frac{i \hbar}{2m}$

Applying this transformation to the expressions of the Green functions determined in the case of particle diffusion yields the Green functions of the Schrödinger equation, which in turn can be used to obtain the wavefunction at any time through an integral on the wavefunction at t=0:

$\psi(\vec R, t) = \int \psi(\vec R^0,t=0) G(\vec R - \vec R^0,t) dR_x^0\,dR_y^0\,dR_z^0$ , with

$G(\vec R,t) = \bigg( \frac{m}{2 \pi i \hbar t} \bigg)^{3/2} e^{-\frac {\vec R^2 m}{2 i \hbar t}}$

Remark: this analogy between quantum mechanics and diffusion is a purely mathematical one. In physics, the evolution of the wavefunction according to Schrödinger equation is not a diffusive process.

Diffusion (of particles, heat, momentum...) describes the return to global thermodynamic equilibrium of an inhomogeneous system, and as such is a time-irreversible phenomenon, associated to an increase in the entropy of the universe: in the case of particle diffusion, if $c(\vec R,t)$ is a solution of the diffusion equation, then $c(\vec R,-t)$ isn't. Intuitively we know that particle diffusion tends to resorb spatial concentration inhomogeneities, and never amplify them.

As a generalization of classical mechanics, quantum mechanics involves only time-reversible phenomena: if $\psi(\vec R,t)$ is a solution of the Schrödinger equation, then the complex conjugate of $\psi(\vec R,-t)$ is also a solution. Note that the complex conjugate of a wavefunction has the exact same physical meaning as the wavefunction itself: the two react exactly in the same way to any series of quantum measurements.

It is the imaginary nature of the equivalent diffusion coefficient $i \hbar/(2m)$ that makes up for this difference in behavior between quantum and diffusive systems.

On a related note, it is interesting to notice that the imaginary exponentials that appear in the Green functions associated to the Schrödinger equation create interferences between the various components of the decomposition of the wavefunction. This is a symptom of the wavelike properties of quantum particles.

Applications

The heat equation arises in the modeling of a number of phenomena and is often used in financial mathematics in the modeling of options. The famous Black-Scholes option pricing model's differential equation can be transformed into the heat equation allowing relatively easy solutions from a familiar body of mathematics. Many of the extensions to the simple option models do not have closed form solutions and thus must be solved numerically to obtain a modeled option price. The heat equation can be efficiently solved numerically using the Crank-Nicolson method and this method can be extended to many of the models with no closed form solution. (Wilmott, 1995)

An abstract form of heat equation on manifolds provides a major approach to the Atiyah-Singer index theorem, and has led to much further work on heat equations in Riemannian geometry.

References

Einstein, A. "Über die von der molekularkinetischen Theorie der Wärme geforderte Bewegung von in ruhenden Flüssigkeiten suspendierten Teilchen." Ann. Phys. 17, 549, 1905. [1]

Wilmott P., Howison S., Dewynne J. (1995) The Mathematics of Financial Derivatives:A Student Introduction. Cambridge University Press.

L.C. Evans, Partial Differential Equations, American Mathematical Society, Providence, 1998. ISBN 0-8218-0772-2.

Categories: Heat conduction | Diffusion

This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article "Heat_equation". A list of authors is available in Wikipedia.