Lecture 3: Diffusion: Fick’s first law
Today’s topics
• What is diffusion? What drives diffusion to occur?
• Understand why diffusion can “surprisingly” occur against the concentration gradient?
• Learn how to deduce the Fick’s first law, and understand the meaning of diffusion
coefficient and diffusion length.
In last two lectures, we learned the thermodynamics, concerned mainly with stable or equilibrium
systems. The study of materials Kinetics, like phase transformation, concerns those mechanism
by which a system attempts to reach the equilibrium state and how long it takes. One of the most
fundamental processes that control the rate at which many transformation occur is the diffusion,
as we will learn in the following several lectures and use throughout the whole semester.
General review of diffusion:
Diffusion describes the spread of particles (which can also be atoms, molecules) through random
motion usually (but not always) from regions of higher concentration to regions of lower
concentration. The time dependence of the statistical distribution in space is given by the
diffusion equation. The concept of diffusion is tied to that of mass transfer driven by a
concentration gradient, but diffusion can still occur when there is no concentration gradient (but
there will be no net flux).
History: The concept of diffusion emerged from physical sciences. The paradigmatic examples
were heat diffusion, molecular diffusion and Brownian motion. Their mathematical description
was elaborated by Joseph Fourier in 1822, Adolf Fick in 1855 and by Albert Einstein in 1905.
Specifically, atomic diffusion is a diffusion process whereby the random thermally-activated
movement of atoms in a solid results in the net transport of atoms. For example, helium atoms
inside a balloon can diffuse through the wall of the balloon and escape, resulting in the balloon
slowly deflating. Other air molecules (e.g. oxygen, nitrogen) have lower mobilities and thus
diffuse more slowly through the balloon wall. There is a concentration gradient in the balloon
wall, because the balloon was initially filled with helium, and thus there is plenty of helium on
the inside, but there is relatively little helium on the outside (helium is not a major component of
air). The rate of transport is governed by the diffusivity and the concentration gradient.
Diffusion in solid: In the crystal solid state, diffusion within the crystal lattice occurs by either
interstitial or substitutional mechanisms and is referred to as lattice diffusion. In interstitial lattice
diffusion, a diffusant (such as C in an iron alloy), will diffuse in between the lattice structure of
another crystalline element. In substitutional lattice diffusion (self-diffusion for example), the
1
atom can only move by substituting place with another atom. Substitutional lattice diffusion is
often contingent upon the availability of point vacancies throughout the crystal lattice. Diffusing
particles migrate from point vacancy to point vacancy by the rapid, essentially random jumping
about (jump diffusion). See below in this Lecture for details about interstitial or substitutional.
Diffusion is driven by decrease in Gibbs free energy or chemical potential difference.
As a simple illustration of this, consider the figure below, where two blocks of the same A-B
solid solution, but with different compositions (concentrations), are welded together and held at a
temperature high enough for long range diffusion to occur.
Case 1: along concentration gradient
① A ②
μB2
μA1
G1 Ga G2
A- rich B- rich
B
Gm
μA2 μB1
A 1 2 B
For case 1: μA1 > μA2 => A goes from 1 to 2
μB2 > μB1 => B goes from 2 to 1
The molar free energy diagram of the A-B alloy is shown at right: the molar free energy of each
part is given by G1 and G2, and the initial total free energy of the welded block is Ga, upon the
diffusion as illustrated on the left (so as to eliminate the concentration difference), the total free
energy decreases towards Gm, the free energy characteristic of a homogeneous system. In this
case, the decrease in free energy is produced by A and B diffusion along the concentration
gradient.
However, diffusion needs not always be along the concentration gradient. For example, in the
alloy systems that contain a miscibility gap, the free energy curve can have a negative curvature
at low temperature, as shown below in Case 2. In such a case, A and B atoms would diffuse
against the concentration gradient (i.e., toward the regions of high concentration as shown at left
below), though this diffusion is still a natural (spontaneous) process as it reduces the total free
energy from Ga to Gm as shown.
For case 2: μA2 > μA1 => A goes from 2 to 1
μB1 > μB2 => B goes from 1 to 2
2
Case 2: against concentration gradient
① A ② μB1
μA2
A- rich B- rich
G1 G2
B
Ga
Gm
μA1 μB2
A 1 2 B
Diffusion is always along chemical potential gradient!
Diffusion stops when chemical potential of all species everywhere are same. In most cases,
chemical potential increases with increasing concentration, so it is convenient to express
diffusion in term of concentration.
Now lets consider Interstitial diffusion vs. Substitutional diffusion:
Interstitial Substitutional (by vacancy)
Considered at low concentration
Now Let’s deduce the Diffusion Equation for interstitial diffusion based on the following
illustration, moving from position ① to ②.
3
x x+a
① ②
dc
c (x) ~ concentration at x, c (x+a) = c(x) + a ~ concentration at x+a
dx
dµ ( x )
μ(x) ~ chemical potential at x, μ(x+a) =μ(x) + a ~ chemical potential at x+a
dx
ΔGA ~ activation barrier
∆G
The forward rate ∝ e- ∆G
A
/RT
;
RT
--- see last Lecture for how to deduce this equation with an assumption that ΔG << RT, and this is
usually true for diffusion, where the free energy change within short distance is small.
ΔG = G (x+a) - G (x) = μ(x+a) - μ(x)
(here ΔG is only contributed by the change in chemical potential )
∆G µ ( x + a) µ ( x) a dµ dµ ( x )
Now = − = (as defined above, μ(x+a) =μ(x) + a )
RT RT RT RT dx dx
Let ν be vibrational frequency, then an atom will have enough energy in unit time (per second) to
overcome the barrier ν · e- ∆G
A
/RT
times, i.e., the number of jump of step “a” per second, which we
can define as “Γ”.
Γ = ν · e- ∆G
A
/RT
In a three-dimensional space (lattice), an atom can jump in six equivalent directions (±x, ±y, ±z),
ν
e- ∆G
A
/RT
so the number of jumps toward right (along “x”) in one second is .
6
So, the net forward rate for an atom to jump from x to x+a is
4
ν a dµ dµ
e- ∆G
A
Rate = /RT
(- ) (where <0)
6 RT dx dx
Let n(x) be the number of atoms (solute) per unit area at x, then the net flux along “x”. (i.e.,
number of atom passing through unit area in unit time along “x”) is
ν a dµ
J = n(x)· e - ∆G
A
/RT
(- ) (1)
6 RT dx
Since n(x) = a· c(x), Eq. (1) can be written as
a 2ν - ∆G A /RT c( x) dµ
J=- e · · (2)
6 RT dx
Now let’s consider the expression of chemical potential as we learned in the course of
Thermodynamics:
μ(x) = μ0 + RTlna(x) = μ0 + RT ln γ c( x)
where a(x) is the activity of solute (atom) at x, γ is the activity coefficient, a constant.
Then, we have
dµ RT dc( x)
= · (3)
dx c(x) dx
Substituting Eq. (3) into Eq. (2), we have
a 2ν - ∆G A /RT dc( x)
J=- e · (4)
6 dx
a 2ν - ∆G A /RT
Let’s define D = e as the intrinsic diffusion coefficient
6
Then we have
dc( x)
J = -D· (Unit: D: cm2/sec; J: number/cm2/sec)
dx
This is the Fick’s first law of diffusion.
5
Now, let’s consider diffusion in a three-dimensional space, i.e., random walk of an atom:
As defined on page 4,
Γ = ν · e- ∆G
A
/RT
a 2ν - ∆G A /RT
Combining with D = e
6
We have
1 a 2ν - ∆G A /RT
D= Γ·a2 = e
6 6
Then we have
Γ= ν e - ∆G
A
/RT
= 6D / a2 (Γ: number of jump an atom make per second)
For a random walk in three-dimensions, an atom travels an average distance of n ·a after n
steps (i.e., n times of jump) with a step length of a.
Thus, after time t, the atom jumps Γ t times (steps). So it moves a distance of
6 Dt
r=a Γt = a = 6 Dt ≈ 2.4 Dt
a2
This is usually defined as the diffusion length.