Theory 3 papers

Theory Digest — Apr 4, 2026

Today’s Digest at a Glance

Today’s papers explore sophisticated probability theory and sequential analysis, focusing on optimal transport formulations, multi-armed bandit change detection, and information-theoretic entropy measures.

Weak Optimal Transport

Weak optimal transport (WOT) extends classical optimal transport by relaxing the marginal constraints to allow for “mass creation” and “mass destruction” during transport. While classical Wasserstein optimal transport requires exact preservation of marginal distributions $\pi_1 = \mu$ and $\pi_2 = \nu$ for transport plan $\pi$, weak optimal transport only requires $\pi_1 \leq \mu$ and $\pi_2 \leq \nu$, where the inequalities are in the sense of measures.

The mathematical formulation becomes:

\[\inf_{\pi \in \Pi_w(\mu,\nu)} \int c(x,y) d\pi(x,y)\]

where $\Pi_w(\mu,\nu) = {\pi : \pi_1 \leq \mu, \pi_2 \leq \nu}$ is the set of weak transport plans. This relaxation allows the transport to “waste” some mass, making it particularly suitable for problems where exact mass conservation is too restrictive, such as in certain stochastic control and bridge problems.

The key insight is that weak transport can equivalently be formulated as a static optimization problem even when the underlying dynamics are temporal, enabling powerful duality theory and computational algorithms. Intuitively, weak transport asks “what’s the cheapest way to move at most this much mass to get at least that much mass at the destination?”

Multi-Armed Bandit Quickest Change Detection

Quickest change detection in multi-armed bandit settings combines two fundamental challenges: identifying when a change occurs in a stochastic process and deciding which “arm” (data source) to observe at each time step. The classical change detection problem assumes continuous observation of all channels, but in resource-constrained settings, an agent can only observe one channel per time step.

The problem setup involves $K$ independent channels, each generating observations from distributions that may change at unknown times. The agent must sequentially choose which channel to observe while simultaneously detecting if and when changes occur. The naive approach of round-robin sampling fails because it may miss critical change periods in unobserved channels.

The Upper Confidence Bound (UCB) approach addresses this by treating log-likelihood ratios as “rewards” in a bandit framework. At each decision point, UCB selects the arm with highest upper confidence bound:

\[\text{UCB}(a,n) = \hat{\mu}_{a,n} + \sqrt{\frac{2\log n}{N_{a,n}}}\]

where $\hat{\mu}_{a,n}$ is the empirical mean reward from arm $a$ after $n$ rounds, and $N_{a,n}$ is the number of times arm $a$ was selected. The square root term provides an optimistic estimate that balances exploitation of promising arms with exploration of uncertain ones.

Progressive Filtration Enlargement and Entropy

Progressive filtration enlargement studies how additional information revealed over time affects the structure of stochastic processes and martingales. Unlike initial enlargement where all additional information is available from time zero, progressive enlargement models scenarios where information arrives gradually according to some schedule.

The mathematical framework involves two filtrations: a base filtration $\mathbb{F} = (\mathcal{F}_t)$ representing “public” information, and an enlarged filtration $\mathbb{G} = (\mathcal{G}_t)$ where $\mathcal{G}_t = \mathcal{F}_t \vee \sigma(\xi \mathbf{1}_{\tau \leq t})$ incorporates progressive revelation of discrete random variable $\xi$ at random time $\tau$. The entropy measure quantifies how much this additional information disrupts the martingale structure.

The core insight is that finite entropy ensures that martingales in the base filtration remain “close” to martingales in the enlarged filtration, preventing arbitrage opportunities in financial applications. Intuitively, progressive enlargement asks “how much does gradually learning insider information change the fair pricing of financial instruments?”

Reading Guide

The papers are connected through their treatment of dynamic optimization under incomplete information. The first paper shows how dynamic transport problems can be reformulated as static optimization, the second addresses optimal information acquisition under resource constraints, and the third quantifies the value of progressively revealed information. All three involve sophisticated probability theory applied to control and decision-making problems.


A weak transport approach to the Schrödinger-Bass bridge

Authors: Manuel Hasenbichler, Gudmund Pammer, Stefan Thonhauser · Institution: Graz University of Technology · Category: math.PR

Establishes static weak optimal transport formulation of the parametric Schrödinger-Bass semimartingale transport problem via infimal convolution, with complete duality theory and Sinkhorn-type algorithm.

Tags: optimal transport weak transport Schrödinger bridge semimartingale transport martingale optimal transport stochastic control quantitative finance Sinkhorn algorithm

arXiv · PDF

Problem Formulation

Motivation: The Schrödinger-Bass problem provides a unified framework for interpolating between important optimal transport problems: the classical Schrödinger bridge (entropic transport), the Brenier-Strassen problem, and the martingale Benamou-Brenier problem. This parametric family is relevant in quantitative finance for option pricing and stochastic control.

Mathematical setup: Let $\mu, \nu \in \mathcal{P}_2(\mathbb{R}^d)$ be probability measures with finite second moment on $\mathbb{R}^d$. For $\beta > 0$, consider continuous semimartingales $X = (X_t)_{t \in [0,1]}$ of the form:

\[dX_t = a_t dt + b_t dB_t\]

where $B = (B_t)_{t \in [0,1]}$ is $d$-dimensional Brownian motion, $(a_t)_{t \in [0,1]}$ is $\mathbb{R}^d$-valued square-integrable progressive, and $(b_t)_{t \in [0,1]}$ is matrix-valued square-integrable progressive.

Assumptions:

  1. $X_0 \sim \mu$ and $X_1 \sim \nu$
  2. $\mathbb{E}\left[\int_0^1 a_t ^2 + b_t - \text{Id} ^2_{HS} dt\right] < \infty$
  3. $b_t$ takes values in positive-definite $d \times d$ matrices

    Toy example: When $d=1$, $\mu = \delta_0$, $\nu = \delta_1$, and $\beta = 1$, the problem seeks the least costly way to transport mass from $0$ to $1$ while penalizing both drift deviations and volatility deviations from Brownian motion with unit coefficients.

    Formal objective:

    \[V^\beta_{SB}(\mu, \nu) := \inf_{X} \mathbb{E}\left[\int_0^1 \left(\frac{1}{2}|a_t|^2 + \frac{\beta}{2}|b_t - \text{Id}|^2_{HS}\right) dt\right]\]
Method

The main methodological contribution is showing that the dynamic Schrödinger-Bass problem admits an equivalent static weak optimal transport (WOT) formulation with explicit cost function $C^\beta_{SB}$.

Key steps:

  1. Static WOT formulation: The problem becomes:

    \[V^\beta_{SB}(\mu, \nu) = \min_{\pi \in \text{Cpl}(\mu,\nu)} \int_{\mathbb{R}^d} C^\beta_{SB}(x, \pi_x) \mu(dx)\]
  2. Cost construction: The cost is built via infimal convolution and deconvolution:

    \[C^\beta_{SB} = C_{V_{EOT}} \square C_{W_\beta}\]

    where $V_{EOT}$ is entropic transport and $W_\beta(\mu,\nu) = \frac{\beta}{2}W_2^2(\mu,\nu)$ is scaled Wasserstein transport.

  3. Dual formulation:

    \[V^\beta_{SB}(\mu, \nu) = \max_{f \in L^1(\nu), \beta\text{-semiconcave}} \left[\int f d\nu - \int q_\beta \square (-T_\beta[f]) d\mu\right]\]
    where $T_\beta[f] := -\log(\exp(-q_\beta \square (-f)) * \gamma)$ and $q_\beta(x) = \frac{\beta}{2} x ^2$.
  4. Sinkhorn-type algorithm: Algorithm 1 alternates between updating $\alpha_i$ and $f_i$ via the transforms in the Schrödinger-Bass system.

    Application to toy example: For $d=1$, $\mu = \delta_0$, $\nu = \delta_1$, the method would compute the dual potential $f^*$ satisfying the optimality conditions, then construct the optimal semimartingale via the Schrödinger-Bass system with drift $a_t = \nabla \log(g_{Y_0,t}(Y_t))$ and diffusion $b_t = \text{Id} + \frac{1}{\beta}\nabla^2 \log(g_{Y_0,t}(Y_t))$.

Novelty & Lineage

Prior work:

  1. Schrödinger bridge (Föllmer 1985, Léonard 2014): established static-dynamic duality for entropic transport
  2. Martingale Benamou-Brenier (Backhoff-Veraguas et al. 2020): introduced martingale transport with quadratic cost
  3. Weak optimal transport (Gozlan et al. 2017): provided framework for transport costs depending on conditional measures

    Delta: This paper introduces the parametric Schrödinger-Bass problem interpolating between these classical problems and establishes:

    • Static WOT formulation via infimal convolution/deconvolution
    • Complete duality theory with explicit dual representations
    • Sinkhorn-type numerical algorithm with convergence guarantees
    • Asymptotic analysis recovering limiting problems as $\beta \to 0, \infty$

    Theory-specific assessment:

    • The main static-dynamic equivalence (Theorem 4.1) is somewhat predictable given the WOT framework, but the explicit construction of the cost $C^\beta_{SB}$ via infimal convolution is non-trivial
    • The proof technique combines standard WOT duality with novel infimal convolution analysis (Section 3)
    • No lower bounds are established; tightness of the characterization unclear
    • The asymptotic regimes connecting to known problems are expected but require careful analysis

    Verdict: INCREMENTAL — Solid extension of existing WOT theory to a new parametric family, with thorough analysis but limited conceptual novelty beyond applying known frameworks.

Proof Techniques

Main strategy: Establish equivalence between dynamic semimartingale problem and static WOT via infimal convolution theory.

Key steps:

  1. Infimal convolution analysis (Section 3): For WOT problems $V, W$, prove:

    \[V \square W(\mu,\nu) = \inf_{\pi \in \text{Cpl}(\mu,\nu)} \int C_{V \square W}(x, \pi_x) \mu(dx)\]

    where:

    \[C_{V \square W}(x,\eta) = \inf_{\rho \in \mathcal{P}_p(Y)} [C_V(x,\rho) + W(\rho,\eta)]\]
  2. Coercivity and continuity (Theorem 3.10): Under assumptions (Crc), (Cnt), (G), prove stability:

    \[V \square W(\mu_k, \nu_k) \to V \square W(\mu,\nu)\]

    Key inequality: For minimizing sequence $(\rho_k)$:

    \[\liminf_{k \to \infty} C_{V \square W}(x_k, \eta_k) \geq C_V(x,\rho) + W(\rho,\eta) \geq C_{V \square W}(x,\eta)\]
  3. Static-dynamic bridge (Lemma 4.2): Show:

    \[C^\beta_{SB}(x,\eta) = \sup_{y \in \mathbb{R}^d} \left[-q_\beta(x-y) + \inf_{\rho} [H(\rho|\gamma_y) + \frac{\beta}{2}W_2^2(\rho,\eta)]\right]\]

    Key equation: For optimal $f^*$:

    \[C^\beta_{SB}(x,\eta) = \int f^* d\eta - q_\beta \square (-T_\beta[f^*])(x)\]
  4. Semimartingale construction: Given dual optimizer $f^*$, construct optimal $X$ via:

    \[dX_t = \nabla \log(g_{Y_0,t}(Y_t)) dt + \left(\text{Id} + \frac{1}{\beta}\nabla^2 \log(g_{Y_0,t}(Y_t))\right) dB_t\]

    where $g_y = \frac{e^{-q_\beta \square (-f^*)}}{(e^{-q_\beta \square (-f^*)} * \gamma)(y)}$.

  5. Algorithm convergence (Theorem 5.4): Monotonicity of dual functional:

    \[D_\beta[f_{i+1}] \geq D_\beta[f_i]\]

    where $D_\beta[f] = \int f d\nu - \int q_\beta \square (-T_\beta[f]) d\mu$.

Experiments & Validation

Purely theoretical - No numerical experiments or empirical validation provided.

The paper develops Algorithm 1 (Schrödinger-Bass Sinkhorn) with convergence guarantees, but implementation details and computational performance are not studied. Empirical validation would require:

  1. implementation of the infimal convolution operators $q_\beta \square (\cdot)$ and $T_\beta[\cdot]$
  2. comparison with direct semimartingale simulation methods
  3. verification of convergence rates in practical settings, and
  4. applications to financial option pricing problems.
Limitations & Open Problems

Limitations:

  1. TECHNICAL: Algorithm convergence requires $\nu$ to have all exponential moments - restrictive for heavy-tailed distributions commonly used in finance
  2. TECHNICAL: Uniqueness results are up to additive constants, requiring normalization choices in practice
  3. NATURAL: Restriction to finite second moment measures $\mathcal{P}_2(\mathbb{R}^d)$ is standard in optimal transport
  4. RESTRICTIVE: Static formulation requires measures on $\mathbb{R}^d$, limiting applicability to more general spaces
  5. TECHNICAL: Coercivity assumptions (Crc), (Cnt), (G) needed for stability results may not hold in all applications

    Open problems:

  6. Computational complexity: What is the convergence rate of Algorithm 1 and how does it scale with dimension $d$?
  7. Extensions: Can the framework be extended to more general semimartingale models (e.g., with jumps) or infinite-dimensional settings?

Learning Where to Look: UCB-Driven Controlled Sensing for Quickest Change Detection

Authors: Yu-Han Huang, Argyrios Gerogiannis, Subhonmesh Bose, Venugopal V. Veeravalli · Institution: University of Illinois Urbana-Champaign · Category: cs.IT

Proposes UCB-driven procedures for multichannel bandit quickest change detection that achieve asymptotic optimality while offering computational advantages over existing methods.

Tags: quickest change detection multi-armed bandits controlled sensing sequential analysis UCB algorithm asymptotic optimality false alarm control surveillance systems

arXiv · PDF

Problem Formulation
  1. Motivation: Quick change detection (QCD) has important applications in surveillance, quality control, and online learning. This work studies the multichannel bandit QCD problem where an agent observes one of $K$ data streams per timestep and must detect distribution changes while controlling false alarms. This extends basic QCD to settings with controlled sensing and bandit feedback.

  2. Mathematical setup: Consider $K$ sequences of observations ${X_{a,n} : a \in [K], n \in \mathbb{N}}$. At unknown change-point $\nu$, the distribution of observations shifts:

    \[X_{a,n} \sim \begin{cases}\] \[f_{a,1}, & n \geq \nu \text{ and } a \in \mathcal{A} \\\] \[f_{a,0}, & \text{otherwise}\] \[\end{cases}\]

    where $\mathcal{A} \subseteq [K]$ is the unknown subset of affected channels. Assume:

  3. All KL divergences are positive: $D(f_{a,1} | f_{a,0}) > 0$ for all $a \in [K]$
  4. Log-likelihood ratios are sub-Gaussian with parameter $v < \infty$

    Agent selects actions $A_n$ based on filtration $\mathcal{F}_{n-1}$ and stops at time $T$.

  5. Toy example: With $K = 2$, $\nu = 10$, and $\mathcal{A} = {1}$ where channel 1 shifts from $\mathcal{N}(0,1)$ to $\mathcal{N}(1,1)$ while channel 2 remains $\mathcal{N}(0,1)$. The agent must learn to concentrate on channel 1 after the change.

  6. Formal objective: Minimize worst-case detection delay

    \[J_{\mathcal{A}}(\mathbf{A}, T) := \sup_{\nu \in \mathbb{N}} \text{ess sup } \mathbb{E}_{\nu,\mathcal{A}}^{\mathbf{A}} [(T - \nu + 1)^+ | \mathcal{F}_{\nu-1}]\]

    subject to false alarm constraint $\mathbb{E}_{\infty}^{\mathbf{A}}[T] \geq \gamma$.

Method

The UCB-CuSum procedure combines Upper Confidence Bound action selection with CuSum change detection.

  1. Partition time horizon into intervals of length $W$
  2. Within each interval, use UCB to select actions treating log-likelihood ratios as rewards:

    \[\text{UCB}(a,n) = \hat{\mu}_{a,n} + \sqrt{\frac{4v \log W}{N_{a,n}}}\]

    where $\hat{\mu}_{a,n}$ is empirical mean reward from action $a$ and $N_{a,n}$ is number of pulls.

  3. Maintain CuSum statistic accumulating LLRs across all actions:

    \[C_n^{\mathbf{A}} = \max\{C_{n-1}^{\mathbf{A}}, 0\} + \text{LLR}(A_n, X_{A_n,n})\]
  4. Stop when $C_n^{\mathbf{A}} \geq b$ where $b = \log \gamma$.

    The PA-UCB-CuSum variant maintains separate CuSum statistics per action:

    \[C_{a,n}^{\mathbf{A}} = \begin{cases}\] \[\max\{C_{a,n-1}^{\mathbf{A}}, 0\} + \text{LLR}(a, X_{a,n}), & A_n = a \\\] \[C_{a,n-1}^{\mathbf{A}}, & \text{otherwise}\] \[\end{cases}\]

    and stops when $C_{a,n}^{\mathbf{A}} \geq b$ for any $a \in [K]$.

    Toy example application: With the $K=2$ Gaussian setup, after change-point $\nu=10$, UCB learns that action 1 yields higher average LLR ($D(\mathcal{N}(1,1) | \mathcal{N}(0,1)) = 0.5$) than action 2 (LLR = 0), so concentrates sampling on action 1 to accumulate evidence faster.

Novelty & Lineage

Step 1 — Prior work:

  • Xu & Mei (2021): Greedy multichannel QCD algorithm that sequentially samples single channels, proven optimal only when single channel affected or identical shifts
  • Gopalan et al. (2021): ε-Greedy Change Detector for general controlled sensing QCD, not asymptotically optimal
  • Veeravalli et al. (2024): Windowed-Chernoff-CuSum (WCC) procedure, first asymptotically optimal method for controlled sensing QCD using MLE over exponentially many subsets

Step 2 — Delta: This paper proposes UCB-based action selection policies specifically tailored for multichannel bandit QCD. Key additions:

  1. UCB-driven control policy treating LLRs as rewards
  2. periodic restarts to maintain exploration
  3. per-action variant enabling extension to unknown distributions.

    Step 3 — Theory-specific assessment:

    • Main theorems establish asymptotic optimality with delay scaling as $\log \gamma / I_{\mathcal{A}}$ where $I_{\mathcal{A}} = \max_{a \in \mathcal{A}} D(f_{a,1} | f_{a,0})$, matching known lower bound
    • Proofs use standard martingale techniques and regret bounds from UCB analysis - routine assembly of known results
    • Bounds are tight, matching the fundamental lower bound from Veeravalli et al. (2024)
    • Extension to unknown distributions via GLR statistics is novel but lacks theoretical analysis

    The UCB connection to bandit QCD is natural but not deeply surprising. The periodic restart mechanism is a standard trick to handle non-stationarity.

    Verdict: INCREMENTAL — solid extension applying UCB methodology to multichannel QCD with matching theoretical guarantees but routine proof techniques.

Proof Techniques
  1. False alarm constraint: Apply Optional Sampling Theorem to Shiryaev-Roberts-like statistics. For PA-UCB-CuSum, construct separate SR statistics per action and show their sum minus time forms a martingale.

  2. Detection delay bound: Key inequality establishing delay upper bound:

    \[J_{\mathcal{A}}(\mathbf{A}^{\text{UCB}}, T_b^{\mathbf{A}^{\text{UCB}}}) \leq W + \mathbb{E}_{1,\mathcal{A}}^{\mathbf{A}^{\text{UCB}}}[T_b^{\mathbf{A}^{\text{UCB}}}]\]

    This reduces worst-case delay to post-change delay analysis.

  3. Regret analysis connection: View cumulative LLR over $W$-length intervals as rewards $Y_i^{\mathbf{A}^{\text{UCB}}}$ with expected value:

    \[\mathbb{E}[Y_i^{\mathbf{A}^{\text{UCB}}} | \mathcal{G}_{i-1}] \geq W I_{\mathcal{A}} - \sum_{a: \Delta_a > 0} \frac{3\Delta_a + 16 \log W}{\Delta_a}\]

    where $\Delta_a = I_{\mathcal{A}} - D(f_{a,1} | f_{a,0})$ and the regret bound comes from Theorem 7.1 of Lattimore & Szepesvári (2020).

  4. Level crossing analysis: Apply Proposition 1 from Veeravalli et al. (2024) to bound expected hitting time:

    \[\mathbb{E}_{1,\mathcal{A}}^{\mathbf{A}^{\text{UCB}}}[T_b^{\mathbf{A}^{\text{UCB}}}] \leq \frac{W(b + c(1 + \sqrt{b}))}{W I_{\mathcal{A}} - \sum_{a: \Delta_a > 0} \frac{3\Delta_a + 16 \log W}{\Delta_a}}\]
  5. Asymptotic analysis: With $W = o(\log \gamma)$ and $b = \log \gamma$, the regret term becomes negligible, yielding first-order optimality.

Experiments & Validation

Synthetic benchmarks across four distributions (Gaussian, Exponential, Laplace, Beta) with $K=10$ channels and sparse heterogeneous change vector $\xi = [0, 0, 0.1, 0, 0, 0.1, 0, 0, 1, 0]^T$.

Baselines: WCC (state-of-art), Greedy, RoundRobin, PA-RoundRobin. Parameters: $W = \lceil 8 \log b \rceil$, $w = \lceil 5 \log b \rceil$ and $q = \lceil \log w \rceil$ for WCC.

Key results: UCB-CuSum and PA-UCB-CuSum achieve best MTFA-delay tradeoff across all distributions, consistently outperforming WCC and substantially better than Greedy/RoundRobin. Computational cost is 10-100x lower than WCC while achieving better statistical performance.

Extension to unknown distributions using Bernoulli GLR statistic shows PA-UCB-GLR significantly outperforms PA-RoundRobin-GLR, especially at high MTFA levels. 200K Monte Carlo trials used for evaluation.

Limitations & Open Problems

Limitations:

  1. Sub-Gaussian assumption on log-likelihood ratios - TECHNICAL (standard in QCD literature, enables concentration inequalities)
  2. Known pre/post-change distributions required for UCB-CuSum - RESTRICTIVE (limits applicability, though PA-UCB-GLR addresses this partially)
  3. Periodic restart parameter $W = o(\log \gamma)$ requires tuning - TECHNICAL (could potentially be made adaptive)
  4. Sparse change assumption not explicitly stated but implicitly needed for good performance - NATURAL (common in applications)

    Open problems:

  5. Theoretical analysis of PA-UCB-GLR procedure for unknown distributions - paper shows empirical results but no optimality guarantees
  6. Adaptive selection of restart interval $W$ without prior knowledge of $\gamma$ or change characteristics

Entropy and additional utility of a discrete information disclosed progressively in time

Authors: Anna Aksamit · Institution: University of Sydney · Category: math.PR

Extends Yor’s entropy concept from initial to progressive filtration enlargements, establishing equivalence between entropy finiteness and martingale space stability, with applications to insider trading utility.

Tags: progressive enlargement filtration theory entropy measures martingale stability insider trading logarithmic utility thin random times semimartingale decomposition

arXiv · PDF

Problem Formulation

Motivation: This paper studies how to measure the “additional information” gained when a filtration is progressively enlarged with a discrete random variable at a random time. This extends classical work by Meyer and Yor on initial enlargements to the progressive case, which matters for insider trading models and stochastic analysis.

Mathematical setup: Let $(\Omega, \mathcal{G}, P)$ be a complete probability space with filtration $\mathbb{F} = (\mathcal{F}_t)_{t \geq 0}$ satisfying usual conditions. Let $\tau$ be a thin random time with exhausting sequence $(T_n)_{n \geq 0}$, meaning:

\[[[τ]] \subset \bigcup_n [[T_n]]\] \[[[T_n]] \cap [[T_m]] = \emptyset \text{ for } n \neq m\]

Let $\xi$ be a discrete random variable taking values in $\mathbb{N}$. Define partition sets:

\[C_n^k = \{\tau = T_n < \infty, \xi = k\}\] \[C_0^0 = \{\tau = \infty\}\]

The progressively enlarged filtration is:

\[\mathcal{G}_t = \bigcap_{s > t} \mathcal{F}_s \vee \sigma(\xi \mathbf{1}_{\{u \geq \tau\}} : u \leq s)\]

Define conditional probabilities:

\[z_t^{n,k} := P(C_n^k | \mathcal{F}_t)\]

Assumptions:

  1. All $\mathbb{F}$-martingales are continuous
  2. $\tau$ is a thin random time with respect to $\mathbb{F}$
  3. $\xi$ is $\mathbb{F}_\infty$-measurable

    Toy example: When $\tau = T_1$ (single jump time) and $\xi \in {1,2}$ with equal probability, we have $C_1^1 = {\tau = T_1, \xi = 1}$ and $C_1^2 = {\tau = T_1, \xi = 2}$. Then $z_t^{1,1} = P(\tau = T_1, \xi = 1 \mathcal{F}_t)$ measures how much the filtration $\mathcal{F}$ “knows” about the event before time $T_1$.

    Formal objective: Define and analyze the $\gamma$-entropy:

    \[H_\gamma(\xi, \tau) := \sum_{n,k} \mathbb{E}\left[\mathbf{1}_{C_n^k} \left[\log \frac{1}{z_{T_n}^{n,k}}\right]^\gamma\right]\]
Method

The method defines a new entropy measure for progressive information disclosure and establishes its connection to martingale stability.

Key Definition: The $\gamma$-entropy of progressive information is:

\[H_\gamma(\xi, \tau) = \sum_{n,k} \mathbb{E}\left[\mathbf{1}_{C_n^k} \left[\log \frac{1}{z_{T_n}^{n,k}}\right]^\gamma\right]\]

Main Steps:

  1. Semimartingale Decomposition: For any $\mathbb{G}$-predictable process $G$ and $\mathbb{F}$-local martingale $Y$, the integral $X = \int G dY$ has decomposition:

    \[X_t = \hat{X}_t + \int_0^{t \wedge \tau} \frac{G_s}{Z_{s-}} d\langle Y, m \rangle_s + \sum_{n,k} \mathbf{1}_{C_n^k} \int_0^t \mathbf{1}_{\{s > T_n\}} \frac{G_s}{z_{s-}^{n,k}} d\langle Y, z^{n,k} \rangle_s\]

    where $\hat{X}$ is a $\mathbb{G}$-local martingale.

  2. Stability Condition: The finiteness of $H_{\gamma/2}(\xi, \tau)$ is equivalent to the continuous embedding of $H^p(\mathbb{F})$ into $S^r(\mathbb{G})$ where $\frac{1}{r} = \frac{1}{p} + \frac{1}{\gamma}$.

  3. Utility Connection: The additional logarithmic utility equals:

    \[\text{aU}(\mathbb{F}, \mathbb{G}) = \mathbb{E}[\langle Y^b \rangle_\tau] + H(\xi, \tau)\]

    Application to Toy Example: For $\tau = T_1$ and $\xi \in {1,2}$, if $P(\xi = 1) = P(\xi = 2) = 1/2$ and both events are independent of $\mathcal{F}_{T_1}$, then $z_{T_1}^{1,1} = z_{T_1}^{1,2} = 1/2$, giving:

    \[H(\xi, \tau) = 2 \cdot \mathbb{E}\left[\mathbf{1}_{\{\tau = T_1\}} \log 2\right] = 2 \log 2 \cdot P(\tau = T_1 < \infty)\]
Novelty & Lineage

Prior Work:

  1. Meyer (1978): Studied initial enlargement with atomic $\sigma$-field and martingale stability
  2. Yor (1985): Introduced entropy for measuring additional information in initial enlargements
  3. Aksamit-Choulli-Jeanblanc (2021): Developed theory of thin times and progressive enlargements

    Delta: This paper extends Yor’s entropy concept from initial to progressive enlargements. The key innovation is defining entropy through $H_\gamma(\xi, \tau) = \sum_{n,k} \mathbb{E}[\mathbf{1}_{C_n^k} [\log(1/z_{T_n}^{n,k})]^\gamma]$ instead of classical partition entropy.

    Theory-specific Assessment:

    • Main Result: The equivalence between $H_{\gamma/2}(\xi, \tau) < \infty$ and continuous embedding $H^p(\mathbb{F}) \hookrightarrow S^r(\mathbb{G})$ is a natural extension but requires non-trivial technical work.
    • Proof Technique: The proof assembles known techniques (dual projections, Itô’s formula, Burkholder-Davis-Gundy inequalities) but the specific application to progressive enlargements requires careful handling of the partition structure.
    • Tightness: No explicit lower bounds are provided or compared against.

    The connection to logarithmic utility provides concrete economic interpretation, but the utility formula follows fairly directly from the semimartingale decomposition.

    Verdict: INCREMENTAL — Solid extension of Yor’s classical results to progressive case, but the techniques are largely standard and the result somewhat predictable from prior work on progressive enlargements.

Proof Techniques

Main Strategy: The proof of Theorem 3.2 (equivalence of entropy condition and embedding) uses several key steps:

  1. Semimartingale Decomposition: Apply Theorem 2.2 to write any $X = \int G dY$ as:

    \[X_t = \hat{X}_t + \langle X, Y^b \rangle_t + \langle X, Y^a \rangle_t\]

    where $Y^b$ and $Y^a$ are $\mathbb{G}$-local martingales defined by:

    \[Y^b_t = \int_0^{τ \wedge t} \frac{1}{Z_{s-}} d\hat{m}_s\] \[Y^a_t = \sum_{n,k} \mathbf{1}_{C_n^k} \int_0^t \mathbf{1}_{\{s > T_n\}} \frac{1}{z_{s-}^{n,k}} d\hat{z}^{n,k}_s\]
  2. Key Reduction: By Yor’s Lemma 2 from 1985, the embedding condition is equivalent to:

    \[\mathbb{E}[\langle Y^b \rangle_\infty^{\gamma/2}] < \infty \text{ and } \mathbb{E}[\langle Y^a \rangle_\infty^{\gamma/2}] < \infty\]
  3. First Term: For $Y^b$, show $|\langle Y^b \rangle_\infty^{1/2}|_{L^\gamma} < \infty$ for all $\gamma > 0$ using the bound:

    \[\left\|\left(\int_0^\tau \frac{1}{Z_s^2} d\langle m \rangle_s\right)^{1/2}\right\|_{L^\gamma} \leq C\left[1 + \left\|\left(\log \frac{1}{U}\right)^{1/2}\right\|_{L^\gamma}\right] < \infty\]

    where $U$ is uniform on $[0,1]$.

  4. Second Term: For $Y^a$, use Itô’s formula applied to $f(x) = x - x \log x$:

    \[\mathbf{1}_{C_n^k} = z_t^{n,k} - z_t^{n,k} \log z_t^{n,k} - \int_t^\infty \log z_s^{n,k} dz_s^{n,k} - \frac{1}{2} \int_t^\infty \frac{1}{z_s^{n,k}} d\langle z^{n,k} \rangle_s\]

    Taking conditional expectation gives the crucial identity:

    \[\mathbb{E}\left[\int_t^\infty \frac{1}{z_s^{n,k}} d\langle z^{n,k} \rangle_s \Big| \mathcal{F}_t\right] = 2z_t^{n,k} \log \frac{1}{z_t^{n,k}}\]
  5. Final Equivalence: Use auxiliary Lemmas B.1 and B.2 to show:

    \[\mathbb{E}[\langle Y^a \rangle_\infty^{\gamma/2}] < \infty \iff H_{\gamma/2}(\xi, \tau) < \infty\]

    The technical heart is the Itô calculation combined with careful analysis of the infimum process $I^k_n = \inf_{t \geq T_n} z_t^{n,k}$.

Experiments & Validation

Purely theoretical. Empirical validation would require:

  1. Testing the entropy condition in concrete financial models with insider information
  2. Numerical computation of additional logarithmic utilities in realistic market scenarios
  3. Simulation studies comparing progressive vs initial enlargement effects on portfolio performance
Limitations & Open Problems

Limitations:

  1. Continuity assumption: All $\mathbb{F}$-martingales must be continuous - TECHNICAL (needed for proof technique, but discontinuous case likely tractable with more work)

  2. Thin random times: Restriction to thin times with exhausting sequences - NATURAL (standard setting in progressive enlargement theory)

  3. Discrete information: Only discrete random variables $\xi$ are considered - RESTRICTIVE (continuous case would be more general but technically much harder)

  4. Constant orthogonal parts: In utility result, requires $L^m$ and $L^{n,k}$ to be constant - TECHNICAL (simplifying assumption for explicit computation)

  5. Complete market assumption: Financial model satisfies structure condition - NATURAL (standard in logarithmic utility literature)

    Open Problems:

  6. Extend entropy definition and stability results to non-thin random times or more general progressive information structures
  7. Characterize the relationship between entropy finiteness and market completeness under progressive enlargements