Parameter estimation of an acoustic
      differential equation by optimal control
William S. Harlan
1984, 1986, 2002
 
Usually numerical methods of inversion and 
optimization begin with a discretized version of the
simulation equations.  
Differential equations are often
easier to manipulate directly, so that discretization
can be delayed until last.  Here is an example
of optimally perturbing acoustic parameters
to fit recorded acoustic waves.  This approach
is seen most often by researchers in optimal control,
such as J.L. Lions [3]
who also publishes often in the field of geophysics.
This example derives from an explanation I received from 
Patrick Lailly in 1984
at the Institut Français du Petrole.  
In 1986, I expanded on his example in my lab notebook.
After all these years, I still find it a useful example
of multi-dimensional seismic inversion, so I've finally typed it up.
In a separate paper [1], I show a detailed
application with the one-dimensional wave-equation.
The methods in this particular paper should be much easier
to generalize to other equations and model parameters.
My preferred methods of optimization (Gauss-Newton) 
require linearizing simulation equations and calculating 
the adjoint of those linearized equations.  Here is how
you can do most of that work analytically.
In the course of this derivation, I will derive what
is called reverse-time migration, with an image directly related 
to perturbations of physical parameters.  
Let us consider the experiment of a single exploration
seismic field gather with a seismic source places at a surface
point 
 and with receivers placed at points
 and with receivers placed at points
 .  Each trace is recorded for a finite time
.  Each trace is recorded for a finite time 
 .
Assume that land experiments record the derivative of pressure with depth.  
(Marine experiments record the pressure directly, which is simpler.)
Let
.
Assume that land experiments record the derivative of pressure with depth.  
(Marine experiments record the pressure directly, which is simpler.)
Let 
 be the acoustic velocity, and
 be the acoustic velocity, and 
 the density.
Describe the pressure field
 the density.
Describe the pressure field 
 with the following two-dimensional
acoustic equation.
 with the following two-dimensional
acoustic equation.
     (1)
 
(1)
 
where the source term 
    ![$\displaystyle
{{f_s}(t)}= {{\svector \nabla}\cdot}[ {\svector{f}(t)}/ \rho({{\svector x}_s})],
$](img8.svg) (2)
 
(2)
 
and 
 contains body forces.
 contains body forces.
Use the free surface and causal time boundary conditions:
     (3)
 
(3)
 
where  is the vertical dimension of
 is the vertical dimension of  .
.
The surface boundary condition on  is not sufficient and
must be extended before
 is not sufficient and
must be extended before  is well-determined.
Choose a semi-spherical surface
 is well-determined.
Choose a semi-spherical surface 
 that cannot be reached by waves
traveling at the maximum velocity
 that cannot be reached by waves
traveling at the maximum velocity  for the maximum recorded time
 for the maximum recorded time  :
:
The radius should be large enough so that no wave reaching this
boundary 
 can be recorded during the time
 
can be recorded during the time 
 .
We can arbitrarily extend the free surface around this closed boundary
without affecting the modeled pressures within the recorded time:
.
We can arbitrarily extend the free surface around this closed boundary
without affecting the modeled pressures within the recorded time:
     (6)
 
(6)
 
The acoustic differential equation (1) 
can also be written in the following variational integral form,
also called the weak form:
|  |  |  | (7) | 
 
for all  functions
 functions 
 .
This integral is true for all
.
This integral is true for all  (whose second derivatives
are square integrable) if and only if the original equation 
(1) also holds.
 (whose second derivatives
are square integrable) if and only if the original equation 
(1) also holds.  
Conveniently, finite elements [2] also uses the weak form.
Zero-value boundary conditions (3) and (4)
are implied by the weak form, and are called natural boundary conditions.
Let  be the pressure field recorded within the volume 
at known receiver positions
 be the pressure field recorded within the volume 
at known receiver positions 
 for a source located at
 for a source located at 
 .
Define an objective function to match a recorded pressure field
.
Define an objective function to match a recorded pressure field
 with a modeled field
 with a modeled field 
 in a least-squares sense.
The objective function is a functional of the
three unknown functions of physical parameters
 in a least-squares sense.
The objective function is a functional of the
three unknown functions of physical parameters  ,
,  , and
, and  .
.
 is a smooth weighting function that we will find convenient.
 is a smooth weighting function that we will find convenient.
The optimization problem can be stated as follows: find the
model parameters  ,
,  , and
, and  so that the field
 so that the field 
 defined by (7), (6),
and (3) minimize the objective function (8).
The objective function has a least-squares form.
The weighting factor
defined by (7), (6),
and (3) minimize the objective function (8).
The objective function has a least-squares form.
The weighting factor  compensates for
geometric spreading and absorption to give approximately equal 
weight to all parts of the modeled data.
 compensates for
geometric spreading and absorption to give approximately equal 
weight to all parts of the modeled data.
To optimize the model parameters model parameters 
 ,
,  , and
, and  according to equations
(7), (6), and (8), one must
discover how a perturbation of the model parameters
will affect the objective function (8).
Let us abbreviate the variational form (7) 
of our differential equations in the following form:
 according to equations
(7), (6), and (8), one must
discover how a perturbation of the model parameters
will affect the objective function (8).
Let us abbreviate the variational form (7) 
of our differential equations in the following form:
     (9)
 
(9)
 
    ![$\displaystyle
\mbox{where } {{\svector m}{({\svector x}, t)}}= [ {c({\svector x})}, {\rho({\svector x})}, {f_s}].
$](img38.svg) (10)
 
(10)
 
 is also understood to be a implicit function of
 is also understood to be a implicit function of 
 ,
as determined by the acoustic equations.
,
as determined by the acoustic equations.
The exact form of  is unimportant for the remainder of this derivation,
except that it must be expressible as a linear functional operator on
 is unimportant for the remainder of this derivation,
except that it must be expressible as a linear functional operator on  .
.
 is a nonlinear operator on
 is a nonlinear operator on 
 .
By a different choice of model parameters (
.
By a different choice of model parameters ( ,
 ,  , and
, and  ),
one could also linearize
),
one could also linearize  with respect to the model
 with respect to the model  , but this
is not necessary.
, but this
is not necessary.
Similarly, we can rewrite our objective function (8) in
a more general form as
    ![$\displaystyle
J [ {{\svector m}{({\svector x}, t)}}] = {1 \over 2} {\sum_{r,s}}...
... \}^2 {\delta ({\svector x}- {{\svector x}_r})}{~w(t)}{~d {\svector x}\, d t}.
$](img45.svg) (11)
 
(11)
 
First, let us use the variational equation (9) to relate
perturbations of the model 
 to perturbations of the pressure
 to perturbations of the pressure 
 :
:
     (12)
 
(12)
 
The gradient product can be understood as meaning
     (13)
 
(13)
 
and both  and
 and  are functions of
 are functions of  .
.
In this form the linear operators  and
 and 
 both
operate on perturbed quantities.  Let us replace this form
by one that uses the adjoint operators on
 both
operate on perturbed quantities.  Let us replace this form
by one that uses the adjoint operators on  .
If
.
If  and
 and  are functions, the adjoint
 are functions, the adjoint  of a linear operator
 of a linear operator
 is defined by
 is defined by 
 .  The brackets
indicate the scalar products in the domains of
.  The brackets
indicate the scalar products in the domains of  and
 and  .
For example
.
For example
So we can rewrite our perturbation equation (12) as follows:
| ![$\displaystyle {{\int_0^T}\! {\int_{\Omega}}}{\delta P_s}L_1^* q {~d {\svector x...
...box{\scriptsize\boldmath m}}}(L_1 P)]^* q ~ dt {~d {\svector x}}, ~ \forall q .$](img62.svg) |  |  | (17) | 
 
If  contains only linear differential operators,
then we can obtain their adjoints
 contains only linear differential operators,
then we can obtain their adjoints  by integrating by parts, as I will
show in a later section.
This makes explicit the implicit dependence of
 by integrating by parts, as I will
show in a later section.
This makes explicit the implicit dependence of  on
 on  .
.
Next let us relate perturbations in the objective function  to perturbations in the pressure
to perturbations in the pressure  and modeling parameters
used in
 and modeling parameters
used in  .
.
Now for the clever part.  The perturbed integral (17)
must be true for all  .  Choose
.  Choose  then so that 
(17) equals the first term of the gradient (19)
for all
 then so that 
(17) equals the first term of the gradient (19)
for all  :
:
| ![$\displaystyle {{\int_0^T}\! {\int_{\Omega}}}\delta P L_1^* q {~d {\svector x}\,...
...svector x}- {{\svector x}_r})}({D_{rs}(t)}- L_2P) ~w ] {~d {\svector x}\, d t}.$](img73.svg) |  |  | (20) | 
 
For this equation to be true for all  ,
,  must satisfy
 must satisfy
| ![$\displaystyle L_1^* q = {\sum_{r,s}}L_2^* [ {\delta ({\svector x}- {{\svector x}_r})}({D_{rs}(t)}- L_2 P) w] .$](img74.svg) |  |  | (21) | 
 
This adjoint system of equations is very similar to the original
set of equations for  , but with a new source term — the
difference between the modeled and the actual data.
, but with a new source term — the
difference between the modeled and the actual data.
If  satisfies this equation, then we can combine the variational
equation (17) and the perturbation (19)
of the objective function to remove intermediate dependence on
 satisfies this equation, then we can combine the variational
equation (17) and the perturbation (19)
of the objective function to remove intermediate dependence on  .
.
We can express this perturbation (22) as
a gradient of the objective function with respect 
to model parameters 
 at a specific location
 at a specific location  .
.
The first term of this gradient (23) is the most important
because it provides information about  at all positions.
The second term is non-zero only at points
 at all positions.
The second term is non-zero only at points 
 where the
data are recorded.
 where the
data are recorded.
The simplest possible steepest descent optimization would perturb
a reference value of  by a scaled version of its gradient.
That is,
 by a scaled version of its gradient.
That is,
|  |  |  | (24) | 
 
One would also sum the perturbations over all source positions
to for a single perturbation of the model.
After one perturbation, the new reference value of 
 could be used to calculate a new reference wavefield and a new
perturbation.
could be used to calculate a new reference wavefield and a new
perturbation.  
Let us now leave the general formulation and see
how to handle the acoustic equation in particular.
Let us first perturb the objective function (8)
as the general perturbed form (19):
Next we integrate by parts over  .
.
| ![$\displaystyle - \delta J [ \rho, c, {f_s}] = {\sum_{r,s}}{{\int_0^T}\! {\int_{\...
...r x}, t)}}}\over{\partial z}}\right ] \right \} {~d {\svector x}\, w(t) \, d t}$](img90.svg) |  |  |  | 
|  |  |  | (26) | 
 
The boundary term disappears because we require 
 to vanish
at the boundary, as does
 to vanish
at the boundary, as does 
 .
.
The next step is to place the variational acoustic equation 
(7) as in our general perturbed form (17).
First perturb:
We can integrate the time derivatives in (27) 
by parts twice over time.  
The divergence and gradient terms of (27)
can be rewritten with the following chain rule and divergence theorem:
 is a scalar and
 is a scalar and  is a vector.
 is a vector.  
 is the unit normal
vector to the boundary
 is the unit normal
vector to the boundary 
 of the volume
 of the volume  .
.
So here is the rewritten form of the perturbed variational form
(27):
The boundary terms at  disappear because
 disappear because  must
observe the boundary conditions of
 must
observe the boundary conditions of  .  The
.  The 
 term is
the derivative in the direction of the normal to the surface.
Such terms do not disappear.  Other integrals that evaluate
 term is
the derivative in the direction of the normal to the surface.
Such terms do not disappear.  Other integrals that evaluate  on the surface
on the surface 
 do disappear.
Now set the right side of the expanded (29)
equal to the first term of of the perturbation 
of the objective function (26) for all perturbations
 do disappear.
Now set the right side of the expanded (29)
equal to the first term of of the perturbation 
of the objective function (26) for all perturbations
 of the wavefield.  Then
 of the wavefield.  Then
| ![$\displaystyle \left \{ {1 \over {{c({\svector x})}^2 {\rho({\svector x})}}}{{\p...
...rho({\svector x})}}}{\svector \nabla}\right ] \right \} {q {({\svector x}, t)}}$](img112.svg) |  |  |  | 
| ![$\displaystyle = - {\sum_{r,s}}{{\partial}\over{\partial z}}\left \{ {\delta ({\...
...x})}}}{{\partial {P_s {({\svector x}, t)}}}\over{\partial z}}\right ] \right \}$](img113.svg) |  |  | (30) | 
 
in the interior.  The boundary terms on 
 and
 and
 must vanish so
 must vanish so
|  |  |  | (31) | 
  
These adjoint equations (30) and (31) 
require an extrapolation on  that is very similar to that on
 that is very similar to that on  ,
except that extrapolation must now proceed backards in time from
,
except that extrapolation must now proceed backards in time from  .
A single application of this extrapolation is called reverse-time
migration.
Errors between the modeled and recorded data act as sources through 
the extrapolation at the locations of the receivers.
.
A single application of this extrapolation is called reverse-time
migration.
Errors between the modeled and recorded data act as sources through 
the extrapolation at the locations of the receivers.
The first term of  in equation (25)
now equals the right side of (29).  The boundary
conditions on
 in equation (25)
now equals the right side of (29).  The boundary
conditions on  make the surface
 make the surface 
 integral
disappear.  Thus
 integral
disappear.  Thus
To perturb  we take the dot product of the extrapolated
 we take the dot product of the extrapolated  with a filtered version of the previous guess of the wavefield
with a filtered version of the previous guess of the wavefield
 .
The perturbation of
.
The perturbation of  requires the
application of a differential operator on
 requires the
application of a differential operator on  .  The second term of
.  The second term of 
 (from equation [26]) applies only
at the receiver points and acts as an amplification of the recorded
trace.  The perturbation of
 (from equation [26]) applies only
at the receiver points and acts as an amplification of the recorded
trace.  The perturbation of  is simply equal to the adjoint
wavefield
 is simply equal to the adjoint
wavefield  evaluated at the source position.
 evaluated at the source position.
To start the iterations set 
 and the parameters
 and the parameters
 and
 and  to reasonable guesses.  The first perturbation
will attempt to invert the primary reflections and is equivalent to a 
so-called reverse-time migration.  The second iteration ostensibly would
invert second-order scattering, but the poor choice of 
background velocities would make it difficult to predict the second-order
form from the first-order scattering.  This inversion, like all inverse
scattering methods, can only invert high-frequency parameters changes
that are likely to create reflections.  Smoother changes in
 to reasonable guesses.  The first perturbation
will attempt to invert the primary reflections and is equivalent to a 
so-called reverse-time migration.  The second iteration ostensibly would
invert second-order scattering, but the poor choice of 
background velocities would make it difficult to predict the second-order
form from the first-order scattering.  This inversion, like all inverse
scattering methods, can only invert high-frequency parameters changes
that are likely to create reflections.  Smoother changes in  and
and  must be known beforehand.
 must be known beforehand.
- 1
- 
William S. Harlan.
 Separation of signal and noise applied to vertical seismic profiles.
 Geophysics, 53(7):932–946, 1988.
- 2
- 
Thomas J. R. Hughes.
 The Finite Element Method: Linear Static and Dynamic Finite
  Element Analysis.
 Dover Publications Inc., 2000.
- 3
- 
J.L. Lions.
 Optimal Control of Systems Governed by Partial Differential
  Equations.
 Springer-Verlag, 1971.