This note exists because no one explained to me satisfactorily why I should care about infinitesimal generators. These mysterious creatures pop up in the study of certain continuous-time Markov processes, such as stochastic differential equations driven by Lévy noise.
To learn: connection to Koopman operators.
Infinitesimal generators have a simple natural interpretation in terms of the evolution of the laws of Markov chains. TBD: going from infinitesimal generator to stochastic Taylor expansion. A popular treatment of these objects is Revuz and Yor (2004); but there are treatments in every stochastic calculus book AFAICS. Reading about them is complicated by the fact that many sources assume these apply only to Wiener/Itô processes or finite-state continuous-time Markov chains. Those two models are dominant tools in queueing theory and finance. But there are many more general Markov processes out there, and the ambiguous specificity of this concept is not helpful for those of us who operate in more general spaces.
I would like some specific but diverse examples to prime my intuition. I would, further, like a treatment that does not presume the Markov process in question is some kind of integral of a Brownian motion, which is not the most interesting case. Aït-Sahalia, Hansen, and Scheinkman (2010) and Reiß (2007) suit that description.
1 Getting started
First, we need to define Feller processes. I found George Lowther to be clarifying:
[Feller Processes] are Markov processes whose transition function
satisfies certain continuity conditions. […] it is often not possible to explicitly write out the transition function describing a Feller process. Instead, the infinitesimal generator is used. This approximately describes the transition kernel for small times , and can be viewed as the derivative of at time 0, .
Let us skip some boilerplate about convergence for now.
[The operator
] is called the infinitesimal generator of the semigroup . [This] can alternatively be written as
[…] So, the generator
gives the first-order approximation to for small . Restricted to
, the operator is differentiable with derivative given by .
That is, Feller processes are more general than Lévy processes and less general than the class of all continuous-time Markov processes. We can gloss over that for the current exposition and just think “well-behaved Markov” or SDEs.
An infinitesimal generator is a kind of linearization of the local Markov transition kernel for a Feller process, i.e., for a non-pathological Markov process.
OK, so now what can we do with this? Well, we might observe that because Feller processes have a kind of stochastic smoothness, we can hope that linearizations of these processes behave nicely, and in particular (conditional) something like a “local” Taylor expansion might be possible and even useful. Saz says:
For a Markov process
we define the generator by
whenever the limit exists in . Here denotes the semigroup of .
(Here
By Taylor’s formula this means that
for small
. So, basically, the generator describes the movement of the process in an infinitesimal time interval. One can show that
i.e., the generator is the time derivative of the mapping
. Reading [this] as a (partial) differential equation we see that is a solution to the PDE
This is one important reason why generators are of interest. Another, more probabilistic, reason is that the process
is a martingale. This means that we can associate with
a whole bunch of martingales, and this martingale property comes in handy very often, for example whenever we deal with expectations of the form . This leads to Dynkin’s formula. Generators are also connected with the martingale problem which in turn can be used to characterize (weak) solutions of stochastic differential equations.
The connection to the stochastic Taylor expansions is pretty glaring at this point. 🚧TODO🚧 clarify.
Then Saz specialises to Brownian motion to recover the lazy version:
Example: Brownian motion In the case of (one-dimensional) Brownian motion
, we see that
for small
. This formula can be motivated by Taylor’s formula: Indeed,
using that
and . From [this] we see that
is the (unique) solution of the heat equation
Moreover, one can show that the solution of the Dirichlet problem is also related to the Brownian motion. Furthermore, […]
is a martingale. Having Itô’s formula in mind, this is not surprising since
2 Example: Gamma process
How does this work for other processes? What if our driving noise is, say, a Gamma process,
Here
3 Other stuff about generators
Cranking the handle further we see that
So our expression involves all derivatives of
Note that we have used functional moment calculations and independence of increments, so we can assume that similar methods will work with more general Lévy processes. 🏗
4 Connection to Stein’s method
Barbour (n.d.) uses the infinitesimal generator to derive a Stein equation for the Poisson distribution, and in fact for rather general families.