First published on Thursday, May 1, 2025 and last modified on Thursday, May 1, 2025
pseudo-differential equation, square root of operator, numerical range, well-posedness, sectorial operator
Applied and Computational Mathematics, University of Wuppertal, Germany
Instituto de Matematica Pura e Aplicada, Rio de Janeiro, Brazil
Functional Analysis, University of Wuppertal, Germany
Stochastics, University of Wuppertal, Germany
Pseudodifferential parabolic equations with an operator square root arise in wave propagation problems as a one-way counterpart of the Helmholtz equation. The expression under the square root usually involves a differential operator and a known function. We discuss a rigorous definition of such operator square roots and show well-posedness of the pseudodifferential parabolic equation by using the theory of strongly continuous semigroups. This provides a justification for a family of widely-used numerical methods for wavefield simulations in various areas of physics.
AMS classification: 35S10, 47G30, 76Q05
Keywords: pseudo-differential equation, square root of operator, numerical range, well-posedness, sectorial operator
A large family of one-way propagation equations used in the numerical modeling of wave phenomena is known as the parabolic wave equations (PWEs). They have their origin in the work of Leontovich and Fock [1], in which a model for the simulation of radio waves was proposed. Since then, a wide variety of PWEs have been developed to solve practical problems in seismics, acoustics, optics, and electrical engineering [2, 3, 4, 5] (sometimes referred to as beam propagation methods). In this approach, boundary value problems for the Helmholtz equation in a waveguide are replaced by Cauchy problems for PWEs, which are more convenient to handle numerically using marching solution techniques. Another reason for the success of this approach is the ability to cancel out certain oscillatory terms, thus removing a wavelength resolution limitation associated with the step size.
Currently, the standard approach to deriving PWEs is based on a formal factorization of the Helmholtz operator. Such factorization leads to evolutionary equations involving the square root of a differential operator [6], sometimes called pseudodifferential parabolic equations (PDPEs) (they also appear in the literature under a variety of names, e.g., very-wide-angle parabolic equations). Many modern wave propagation techniques are designed to solve PDPEs directly [4, 7, 8] (rather than first rewriting them, e.g., using some approximation of the square root operator). An important example is a powerful method called Split-Step Padé (SSP) [9, 10], which was a breakthrough in both accuracy and performance in underwater acoustics.
Despite the existence of a large number of works on PDPEs and their practical importance, to the best of our knowledge the questions of uniqueness, existence and well-posedness for such equations have not been addressed in the literature. In fact, most of the research on this topic does not even rigorously define the square root operator [5, 8, 7]. This letter aims to fill this gap. We provide a derivation of the most common PDPE form and rigorously define the square root operator in this equation. We then prove the uniqueness and existence of the PDPE solution using the semigroup property of the latter operator. Our discussion includes the piecewise constant dependence of the propagation medium on the range (i.e., on the evolutionary variable).
Consider the two-dimensional Helmholtz equation in Cartesian coordinates \( (x,y)\) ,
\[ \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} + k^2 u = 0, \tag{1} \]
where \( u = u(x,y)\) is the unknown function and \( k = k(x,y)\) is a coefficient called medium wavenumber. In practice, \( k(x,y)\) is usually a complex quantity with both real and imaginary parts being positive and bounded both from above and from below (the imaginary part represents wave absorption, and it is usually much smaller than the real part).
Assuming that the coefficient \( k\) does not depend on \( x\) (the preferred direction of propagation, called the waveguide axis, along which the medium parameters change very slowly [1, 3, 7]), we can factorize the operator in Eq. (1) to get
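\[ \left(\frac{\partial}{\partial x} + \mathrm{i}\sqrt{\frac{\partial^2}{\partial y^2} + k^2}\right)\left(\frac{\partial}{\partial x} - \mathrm{i}\sqrt{\frac{\partial^2}{\partial y^2} + k^2}\right) u = 0, \tag{2} \]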
where the factors on the left-hand side correspond to leftward and rightward one-way propagation of the wave, respectively. Without loss of generality, hereafter we consider the latter case and rewrite the one-way counterpart of Eq. (2) as the PDPE
\[ \frac{\partial u}{\partial x} = \mathrm{i}\sqrt{A}\, u \tag{3} \]
for \( x>0,\;-\infty<y<\infty\) , where \( A = \frac{\partial^2}{\partial y^2} + k^2\) is a differential operator acting on the functions of the coordinate \( y\) representing a direction transverse to the waveguide axis. Eq. (3) arises in various physical settings. For example, in underwater acoustics, \( u(x,y)\) can describe the acoustic pressure field in a vertical plane [9, 3], where \( x\) denotes the range and \( y\) the depth (in this case, Eq. (3) is usually supplemented by boundary conditions, representing the sea surface and the bottom, e.g., \( u(x,0)=0\) and \( u(x,H)=0\) ). Eq. (3) on the entire half-space \( x\geq 0\) can be considered as a one-way counterpart of the horizontal refraction equation for one mode amplitude in the adiabatic approximation [3, 7].
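As a check on the factorization, the product of the two one-way factors can be expanded directly. The following sketch treats \( \sqrt{A}\) formally and assumes the sign convention in which the rightward factor is \( \partial/\partial x - \mathrm{i}\sqrt{A}\) (the convention consistent with \( \mathrm{i}\sqrt{A}\) acting as the generator of the evolution in \( x\) ):

```latex
\begin{align*}
\left(\frac{\partial}{\partial x} + \mathrm{i}\sqrt{A}\right)
\left(\frac{\partial}{\partial x} - \mathrm{i}\sqrt{A}\right) u
&= \frac{\partial^2 u}{\partial x^2}
 - \mathrm{i}\,\frac{\partial}{\partial x}\big(\sqrt{A}\,u\big)
 + \mathrm{i}\,\sqrt{A}\,\frac{\partial u}{\partial x}
 + \big(\sqrt{A}\big)^2 u \\
&= \frac{\partial^2 u}{\partial x^2} + A u
 + \mathrm{i}\left[\sqrt{A},\, \frac{\partial}{\partial x}\right] u .
\end{align*}
```

The commutator \( [\sqrt{A}, \partial/\partial x]\) vanishes when \( k\) does not depend on \( x\) , which is the assumption made above, so the product reduces to the Helmholtz operator \( \partial^2/\partial x^2 + A\) .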
The derivation of Eq. (3) is somewhat heuristic. However, once it is obtained, it is desirable to establish that, for an initial condition \( u(0,y) = u_0(y)\) at \( x=0\) , the Cauchy problem for Eq. (3) is well-posed in an appropriate function space. Well-posedness means existence and uniqueness of the solution together with continuous dependence on the initial value (see, e.g., [11, Section II.6] for a more thorough discussion). For Eq. (3), well-posedness is equivalent to the property that \( \mathrm{i}\sqrt{A}\) generates a strongly continuous semigroup \( (T_x)_{x \geq 0}\) [11, Theorem II.6.7]. In this case, for each initial value \( u_0 = u_0(y)\) the unique solution of Eq. (3) is given by \( u(x,y) = (T_x u_0)(y)\) .
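In one special case the semigroup can be written down explicitly, which gives a quick sanity check of the statement above. Assume a constant wavenumber \( k\) and the Dirichlet conditions \( u(x,0)=u(x,H)=0\) . Then \( A\) acts on \( \sin(n\pi y/H)\) as multiplication by \( \lambda_n = k^2 - (n\pi/H)^2\) , and \( T_x\) multiplies the \( n\) -th sine coefficient by \( \exp(\mathrm{i} x \sqrt{\lambda_n})\) . The Python sketch below uses hypothetical values of \( k\) and \( H\) chosen only for illustration; the function name `modal_multipliers` is not taken from the paper.

```python
import cmath
import math


def modal_multipliers(k, H, n_modes, x):
    """Multipliers exp(i*x*sqrt(lambda_n)) for constant k and Dirichlet
    conditions on [0, H], where lambda_n = k^2 - (n*pi/H)^2."""
    multipliers = []
    for n in range(1, n_modes + 1):
        lam = k ** 2 - (n * math.pi / H) ** 2  # eigenvalue of A on sin(n*pi*y/H)
        root = cmath.sqrt(lam)                 # principal branch, Re(root) >= 0
        multipliers.append(cmath.exp(1j * x * root))
    return multipliers


# Hypothetical parameters: small positive imaginary part models absorption.
k = 2 * math.pi / 10.0 * (1 + 0.01j)
H = 100.0

m1 = modal_multipliers(k, H, 50, 30.0)
m2 = modal_multipliers(k, H, 50, 70.0)
m12 = modal_multipliers(k, H, 50, 100.0)

# Semigroup property T_{x1 + x2} = T_{x1} T_{x2}, checked mode by mode.
semigroup_error = max(abs(a * b - c) for a, b, c in zip(m1, m2, m12))
# Contractivity: every multiplier has modulus at most one.
max_modulus = max(abs(c) for c in m12)
print(semigroup_error, max_modulus)
```

Because \( \mathrm{Im}(k^2) > 0\) , every \( \sqrt{\lambda_n}\) has a positive imaginary part, so each multiplier decays in \( x\) ; this is the scalar version of the contractivity established later in the text.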
In this section we discuss criteria for the existence and uniqueness of square roots of unbounded operators on a Hilbert space. This will give a precise meaning to the expression \( \sqrt{A}\) in the PDPE (3). Let \( H\) be a complex Hilbert space. We use the convention that the inner product is anti-linear in the first argument and linear in the second. Let \( A\colon H \supseteq D(A) \to H\) be a closed linear operator. We denote the spectrum of \( A\) by \( \sigma(A)\) ; its complement \( \rho(A) := \mathbb{C} \setminus \sigma(A)\) is called the resolvent set of \( A\) . We will first discuss uniqueness and then existence. Even for complex numbers (rather than operators) the complex square root is only unique if one imposes an additional assumption on the argument of the root. Similarly, the spectral conditions in the following proposition ensure the uniqueness of a square root operator, provided that it exists.
Proposition 1 (Uniqueness of square roots)
Let \( H\) be a complex Hilbert space and let \( A\colon H \supseteq D(A) \to H\) be a closed linear operator such that \( (-\infty,0] \subseteq \rho(A)\) . There exists at most one closed linear operator \( B\colon H \supseteq D(B) \to H\) with the following properties:
(I) \( B^2 = A\) ;
(II) \( \sigma(B) \subseteq \{\lambda \in \mathbb{C} \mid \mathrm{Re} \lambda \ge 0\}\) .
Proof
Consider two closed linear operators \( B\colon H\supseteq D(B) \to H\) and \( \tilde{B}\colon H\supseteq D(\tilde{B}) \to H\) which satisfy the properties (I) and (II). It follows from the spectral mapping theorem for polynomials [12, Proposition A.6.2] that the imaginary axis intersects neither \( \sigma(B)\) nor \( \sigma(\tilde{B})\) . In particular, \( B^{-1}\) and \( \tilde{B}^{-1}\) are bounded linear operators on \( H\) whose squares are equal to \( A^{-1}\) and whose spectra are also contained in the open right half plane \( \mathbb{C}_+ := \{z \in \mathbb{C} \mid \mathrm{Re}(z) > 0 \}\) . Now consider the holomorphic functions
\[ f\colon \mathbb{C}_+ \to \mathbb{C} \setminus (-\infty,0], \quad z \mapsto z^2, \qquad g\colon \mathbb{C} \setminus (-\infty,0] \to \mathbb{C}_+, \quad z \mapsto \sqrt{z}, \]
i.e., \( g\) is the principal branch of the complex square root. Note that the composition \( g \circ f\) is the identity function on the domain \( \mathbb{C}_+\) .
Since \( B^{-1}\) and \( A^{-1}\) are bounded operators whose spectra are contained in \( \mathbb{C}_+\) and \( \mathbb{C} \setminus (-\infty,0]\) , respectively, one can use the Dunford functional calculus to compute \( f(B^{-1})\) and \( g(A^{-1})\) (see, e.g., [13, Section V.8] or [14, Section VIII.7] for details on this functional calculus). One has \( f(B^{-1}) = \big(B^{-1}\big)^2 = \big(B^2\big)^{-1} = A^{-1}\) (here we used the multiplicativity of the functional calculus, see, e.g., [13, Theorem V.8.1] or [14, Theorem in Section VIII.7]) and
\[ g(A^{-1}) = (g \circ f)(B^{-1}) = B^{-1}; \]
the first equality uses that the functional calculus is compatible with compositions of functions, see [14, Corollary 2 in Section VIII.7, p. 227]. The same reasoning can be applied to \( \tilde{B}\) instead of \( B\) , giving \( B^{-1} = g(A^{-1}) = \tilde{B}^{-1}\) . Hence, \( B = \tilde{B}\) , as claimed.
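The role of the half-plane condition in the proof can be illustrated in the scalar case, where squaring and the principal square root are ordinary complex functions. The short Python sketch below (the test points are arbitrary choices, not taken from the paper) checks that the principal root of \( z^2\) recovers \( z\) for \( \mathrm{Re}(z) > 0\) , while for \( \mathrm{Re}(z) < 0\) it returns \( -z\) instead.

```python
import cmath


def g_after_f(z):
    """Principal square root applied to z**2."""
    return cmath.sqrt(z * z)


# Points in the open right half plane: the composition is the identity.
right_points = [complex(0.5, 3.0), complex(2.0, -7.0), complex(1e-3, 1.0)]
right_errors = [abs(g_after_f(z) - z) for z in right_points]

# Points in the open left half plane: the composition returns -z.
left_points = [complex(-0.5, 3.0), complex(-2.0, -7.0)]
left_errors = [abs(g_after_f(z) + z) for z in left_points]

print(max(right_errors), max(left_errors))
```

This is why the spectral condition is needed: without it, both \( B\) and \( -B\) would square to \( A\) .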
Proposition 1 justifies the following definition.
Definition 1 (The square root of an operator)
Let \( H\) be a complex Hilbert space and let \( A\colon H \supseteq D(A) \to H\) be a closed linear operator such that \( (-\infty,0] \subseteq \rho(A)\) . We say that \( A\) has a square root if there exists a closed linear operator \( B\colon H \supseteq D(B) \to H\) which satisfies \( B^2 = A\) and \( \sigma(B) \subseteq \{\lambda \in \mathbb{C} \mid \mathrm{Re} \lambda \ge 0\}\) . In this case we call \( B\) the square root of \( A\) and denote it by \( B =: A^{1/2}\) .
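In finite dimensions the definition can be tried out directly. The sketch below is only an illustration under stated assumptions: it replaces \( \partial^2/\partial y^2\) by a standard second-difference matrix with Dirichlet ends, uses a hypothetical wavenumber profile, and computes the square root through an eigendecomposition (which presumes the matrix is diagonalizable). It then checks \( B^2 = A\) and the location of \( \sigma(B)\) .

```python
import numpy as np


def principal_sqrt(A):
    """Square root of a diagonalizable matrix with spectrum in Re >= 0,
    built from the principal branch applied to the eigenvalues."""
    eigvals, V = np.linalg.eig(A)
    roots = np.sqrt(eigvals.astype(complex))  # principal branch: Re(root) >= 0
    return V @ np.diag(roots) @ np.linalg.inv(V)


n, h = 40, 0.5                      # grid size and spacing (hypothetical)
y = h * np.arange(1, n + 1)
# Second-difference approximation of d^2/dy^2 with Dirichlet conditions.
L = (np.diag(-2.0 * np.ones(n))
     + np.diag(np.ones(n - 1), 1)
     + np.diag(np.ones(n - 1), -1)) / h ** 2
k = 1.0 + 0.05j + 0.2 * np.sin(y / 3.0)   # hypothetical complex wavenumber
A = L + np.diag(k ** 2)

B = principal_sqrt(A)
residual = np.linalg.norm(B @ B - A) / np.linalg.norm(A)
min_real_part = np.linalg.eigvals(B).real.min()
print(residual, min_real_part)
```

The eigendecomposition is chosen here for transparency rather than robustness; it is not the method used in the paper, which works with unbounded operators.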
The existence of such a square root is not guaranteed in general. In the following we discuss an operator theoretic result which ensures the existence of a square root under assumptions that are well-suited to the PDPE (3). We need the following concept: For a closed linear operator \( A\colon H \supseteq D(A) \to H\) on a complex Hilbert space \( H\) , the numerical range of \( A\) is defined to be the set
\[ W(A) := \{ \langle x, Ax \rangle \mid x \in D(A), \ \lVert x \rVert = 1 \}. \]
The set \( W(A)\) is always convex [15, Theorem V.3.1, p. 267]. The complement of its closure \( \overline{W(A)}\) has either one or two connected components, and if one of them intersects \( \rho(A)\) , then this entire connected component is contained in \( \rho(A)\) [15, Theorem V.3.2].
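For matrices, convexity makes the numerical range easy to bound numerically: the largest value of \( \mathrm{Re}(e^{-\mathrm{i}\theta} z)\) over \( z \in W(A)\) equals the largest eigenvalue of the Hermitian part of \( e^{-\mathrm{i}\theta} A\) . The following Python sketch uses this standard supporting-line construction on a random test matrix (an arbitrary choice made for illustration) and checks that both the eigenvalues and sampled values \( \langle x, Ax \rangle\) stay inside every supporting half plane.

```python
import numpy as np


def support_function(A, theta):
    """max of Re(exp(-i*theta) * z) over z in W(A), computed as the largest
    eigenvalue of the Hermitian part of exp(-i*theta) * A."""
    rotated = np.exp(-1j * theta) * A
    hermitian_part = (rotated + rotated.conj().T) / 2
    return np.linalg.eigvalsh(hermitian_part).max()


rng = np.random.default_rng(0)
A = rng.standard_normal((6, 6)) + 1j * rng.standard_normal((6, 6))
thetas = np.linspace(0.0, 2 * np.pi, 181)

# Sample points <x, A x> with unit vectors x; np.vdot conjugates its first
# argument, matching an inner product that is anti-linear in the first slot.
samples = []
for _ in range(200):
    x = rng.standard_normal(6) + 1j * rng.standard_normal(6)
    x /= np.linalg.norm(x)
    samples.append(np.vdot(x, A @ x))
samples = np.array(samples)
eigs = np.linalg.eigvals(A)

# Largest violation of the supporting half planes (should be <= 0 up to rounding).
worst_eig = max((np.exp(-1j * t) * eigs).real.max() - support_function(A, t)
                for t in thetas)
worst_sample = max((np.exp(-1j * t) * samples).real.max() - support_function(A, t)
                   for t in thetas)
print(worst_eig, worst_sample)
```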
Proposition 2 (Existence of square roots)
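Let \( H\) be a complex Hilbert space and let \( A\colon H\supseteq D(A)\to H\) be a closed linear operator. Assume that there is a \( \delta > 0\) such that
\[ \sigma(A) \cup \overline{W(A)} \subseteq \mathbb{C}_{\mathrm{Im} \ge \delta} := \{ z \in \mathbb{C} \mid \mathrm{Im}(z) \ge \delta \}. \]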
Then \( A\) has a square root and the spectrum and numerical range of \( A^{1/2}\) are located in the first quadrant of \( \mathbb{C}\) and satisfy \( \sigma(A^{1/2}) \subseteq \overline{W(A^{1/2})}\) .
Proof
By assumption, the numerical range of the operator \( -\mathrm{i} A\) is contained in the closed right half plane of \( \mathbb{C}\) , which means in the terminology of [15, Section V.3.10, p. 279] that \( -\mathrm{i} A\) is accretive. It also follows from the assumption that the closed left half plane (and thus, in particular, the open left half plane) is in the resolvent set of \( -\mathrm{i} A\) which means, again in the terminology of [15, Section V.3.10, p. 279], that \( -\mathrm{i} A\) is even \( m\) -accretive.
Hence, one can apply [15, Theorem V.3.35, p. 281] which says that there is a closed linear operator \( C\) on \( H\) that satisfies \( C^2 = -\mathrm{i} A\) and whose numerical range satisfies
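\[ W(C) \subseteq \{ r e^{\mathrm{i}\theta} \mid r \ge 0, \ \lvert \theta \rvert \le \pi/4 \}. \tag{4} \]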
Moreover, by the same theorem the operator \( C\) is \( m\) -sectorial (see [15, Section V.3.10, p. 280] for the definition of this notion), so in particular all \( \lambda \in \mathbb{C}\) with sufficiently negative real part are in the resolvent set of \( C\) . So the complement of \( \overline{W(C)}\) intersects the resolvent set of \( C\) ; moreover, it follows from (4) and the convexity of \( W(C)\) [15, Theorem V.3.1, p. 267] that the complement of \( \overline{W(C)}\) is connected. The facts mentioned before Proposition 2 thus imply \( \sigma(C) \subseteq \overline{W(C)}\) . We conclude that the operator \( B := e^{\mathrm{i} \frac{\pi}{4}} C\) has all the required properties.
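The final step of the proof can be spelled out in two lines. The sketch below assumes that (4) is the sector bound \( \lvert \arg z \rvert \le \pi/4\) provided by the cited theorem for the \( m\) -accretive square root:

```latex
\begin{align*}
B^2 &= e^{\mathrm{i}\pi/2}\, C^2 = \mathrm{i}\,(-\mathrm{i} A) = A, \\
W(B) &= e^{\mathrm{i}\pi/4}\, W(C)
 \subseteq \{ r e^{\mathrm{i}\theta} \mid r \ge 0,\ 0 \le \theta \le \pi/2 \}.
\end{align*}
```

Rotating the sector by \( \pi/4\) thus places \( W(B)\) in the closed first quadrant, and \( \sigma(B) = e^{\mathrm{i}\pi/4} \sigma(C) \subseteq \overline{W(B)}\) follows from \( \sigma(C) \subseteq \overline{W(C)}\) .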
Note that the reference [15, Theorem V.3.35, p. 281], which we used in the proof, does not only give existence but also uniqueness of square roots – but only among all accretive operators. As a consequence of Proposition 2 one gets the following well-posedness result for differential equations.
Corollary 1 (Generation theorem for \( \mathrm{i}\) times a square root)
Under the assumptions of Proposition 2 the operator \( \mathrm{i} A^{1/2}\) generates a contractive \( C_0\) -semigroup on \( H\) .
Proof
According to Proposition 2 the square root \( A^{1/2}\) exists, and the spectrum and numerical range of \( \mathrm{i} A^{1/2}\) are located in the second quadrant of \( \mathbb{C}\) . This means that \( \mathrm{i} A^{1/2}\) is \( m\) -dissipative (which is another term for saying that minus the operator is \( m\) -accretive), so it generates a contractive \( C_0\) -semigroup according to the Lumer–Phillips generation theorem [11, Corollary II.3.20].
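A finite-dimensional experiment illustrates the corollary. The Python sketch below is an illustration only: it assumes a second-difference matrix in place of a self-adjoint operator, a hypothetical potential \( m\) with \( \mathrm{Im}(m) \ge 0.05\) , and a diagonalizable matrix so that \( \exp(\mathrm{i} x A^{1/2})\) can be formed from an eigendecomposition. It checks that the operator norm of the propagator does not exceed one and that the semigroup law holds.

```python
import numpy as np


def propagator(A, x):
    """exp(i*x*sqrt(A)) for a diagonalizable matrix A, using the principal
    square root of each eigenvalue."""
    eigvals, V = np.linalg.eig(A)
    phases = np.exp(1j * x * np.sqrt(eigvals.astype(complex)))
    return V @ np.diag(phases) @ np.linalg.inv(V)


n, h = 30, 0.5                      # grid size and spacing (hypothetical)
y = h * np.arange(1, n + 1)
L = (np.diag(-2.0 * np.ones(n))
     + np.diag(np.ones(n - 1), 1)
     + np.diag(np.ones(n - 1), -1)) / h ** 2
# Hypothetical potential with imaginary part bounded below by 0.05.
m = 1.0 + 0.3 * np.cos(y) + 1j * (0.05 + 0.02 * np.sin(y) ** 2)
A = L + np.diag(m)

# Spectral norms of T_x for a few ranges x.
norms = [np.linalg.norm(propagator(A, x), 2) for x in (0.1, 1.0, 10.0)]
# Semigroup law T_1 T_2 = T_3.
semigroup_error = np.linalg.norm(
    propagator(A, 1.0) @ propagator(A, 2.0) - propagator(A, 3.0))
print(norms, semigroup_error)
```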
The following example shows why the operator \( A = \frac{\partial^2}{\partial y^2} + k^2\) in the PDPE (3) satisfies the assumptions of Proposition 2 and Corollary 1 if the real and imaginary parts of \( k\) are positive and bounded away from \( 0\) . In this case one can choose \( L = \partial^2/\partial y^2\) and \( m = k^2\) .
Example 1
Let \( H = L^2(\Omega)\) for a domain \( \Omega \subseteq \mathbb{R}^n\) , let \( L\colon H \supseteq D(L) \to H\) be a self-adjoint linear operator and let \( m\colon \Omega \to \mathbb{C}\) be a bounded and continuous (or, more generally, bounded and measurable) function that satisfies \( \mathrm{Im}(m(\omega)) \ge \delta\) for a \( \delta > 0\) and all \( \omega \in \Omega\) . Then the operator \( A := L+m\) with domain \( D(A) := D(L)\) satisfies \( \sigma(A) \cup \overline{W(A)} \subseteq \mathbb{C}_{\mathrm{Im} \ge \delta}\) , so Proposition 2 and Corollary 1 are applicable to \( A\) .
Proof
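Since \( L\) is self-adjoint one has \( W(L) \subseteq \mathbb{R}\) , and it is easy to check that \( W(m)\) is contained in the closed convex hull of the range of \( m\) . Hence,
\[ W(A) \subseteq W(L) + W(m) \subseteq \mathbb{C}_{\mathrm{Im} \ge \delta}, \]
and, since the set on the right is closed, it also contains \( \overline{W(A)}\) .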
Since \( m\) is a bounded perturbation, all numbers with sufficiently negative imaginary part are contained in \( \rho(L+m)\) . To see this, first note that \( \left\lVert (\lambda-L)^{-1}\right\rVert \le \frac{1}{\left\lvert \mathrm{Im} \lambda\right\rvert }\) for all \( \lambda \in \mathbb{C} \setminus \mathbb{R}\) since \( L\) is self-adjoint and then conclude that \( \lambda - (L+m) = \big(1 - m(\lambda-L)^{-1} \big)(\lambda-L)\) is invertible for \( \left\lVert m\right\rVert _\infty < \left\lvert \mathrm{Im} \lambda\right\rvert \) by using the Neumann series. So the connected component of \( \mathbb{C} \setminus \overline{W(A)}\) that contains \( \mathbb{C} \setminus \mathbb{C}_{\mathrm{Im} \ge \delta}\) intersects the resolvent set \( \rho(A)\) and hence is contained in \( \rho(A)\) , as pointed out before Proposition 2. So \( \sigma(A) \subseteq \mathbb{C}_{\mathrm{Im} \ge \delta}\) .
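The two inclusions of Example 1 can be observed on a discretized model. In the Python sketch below, the symmetric second-difference matrix stands in for the self-adjoint operator \( L\) and the potential values are random numbers with \( \mathrm{Im}(m) \ge \delta\) ; both are assumptions made for illustration. Since \( \mathrm{Im}\langle x, Ax \rangle = \langle x, \tfrac{1}{2\mathrm{i}}(A - A^*) x \rangle\) , the lowest point of \( W(A)\) in the imaginary direction is the smallest eigenvalue of that Hermitian matrix.

```python
import numpy as np

n, h = 50, 0.2                      # grid size and spacing (hypothetical)
delta = 0.1
rng = np.random.default_rng(1)

# Real symmetric stand-in for the self-adjoint operator L.
L = (np.diag(-2.0 * np.ones(n))
     + np.diag(np.ones(n - 1), 1)
     + np.diag(np.ones(n - 1), -1)) / h ** 2
# Bounded potential with imaginary part at least delta.
m = rng.uniform(-1.0, 1.0, n) + 1j * (delta + rng.uniform(0.0, 1.0, n))
A = L + np.diag(m)

# Hermitian matrix whose quadratic form is Im <x, A x>.
imag_part = (A - A.conj().T) / 2j
lowest_im_W = np.linalg.eigvalsh(imag_part).min()    # inf of Im over W(A)
lowest_im_spec = np.linalg.eigvals(A).imag.min()     # inf of Im over sigma(A)
print(lowest_im_W, lowest_im_spec)
```

In this matrix setting the spectrum lies inside the numerical range automatically; the resolvent argument in the proof is what replaces this fact for unbounded operators.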
Remark 1
Example 1 gives the well-posedness of the PDPE (3) under the assumptions discussed before the example. The argument assumed that \( k^2(x,y)\) does not depend on \( x\) , but it can be directly generalized to the case where \( k^2(x,y)\) is piecewise-constant in \( x\) . More precisely, assume that the interval \( x\in [0,L]\) is divided into a set of \( N\) subintervals \( [x_{j-1},x_{j}]\) , \( j = 1,\dots,N\) , where \( x_0=0\) , \( x_{N}=L\) , and \( k^2(x,y) = k_j^2(y)\) for all \( x\in [x_{j-1},x_{j}]\) . Then the Cauchy problem (3) can be solved piecewise on the subintervals \( [x_{j-1},x_{j}]\) .
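The piecewise procedure of Remark 1 amounts to marching: the solution at the end of one subinterval is the initial value for the next. The Python sketch below illustrates this under several assumptions that are not from the paper: a second-difference discretization in \( y\) , three hypothetical wavenumber profiles \( k_j(y)\) , a Gaussian starting field, and propagation on each subinterval through an eigendecomposition. It records the discrete \( L^2\) norm after each step, which should not increase.

```python
import numpy as np


def step(A, dx, u):
    """Apply exp(i*dx*sqrt(A)) to u via an eigendecomposition of A
    (assumed diagonalizable), using the principal square root."""
    eigvals, V = np.linalg.eig(A)
    phases = np.exp(1j * dx * np.sqrt(eigvals.astype(complex)))
    return V @ (phases * np.linalg.solve(V, u))


n, h = 40, 0.5                      # grid size and spacing (hypothetical)
y = h * np.arange(1, n + 1)
L = (np.diag(-2.0 * np.ones(n))
     + np.diag(np.ones(n - 1), 1)
     + np.diag(np.ones(n - 1), -1)) / h ** 2

# Range breakpoints x_0 < x_1 < x_2 < x_3 and one profile k_j(y) per subinterval.
breakpoints = [0.0, 5.0, 12.0, 20.0]
k_profiles = [
    1.0 + 0.02j + 0.1 * np.sin(y / 4.0),
    1.1 + 0.02j + 0.0 * y,
    0.9 + 0.03j + 0.1 * np.cos(y / 5.0),
]

u = np.exp(-((y - 10.0) / 2.0) ** 2).astype(complex)  # starting field u_0
norms = [np.linalg.norm(u)]
for j, k in enumerate(k_profiles):
    A_j = L + np.diag(k ** 2)
    u = step(A_j, breakpoints[j + 1] - breakpoints[j], u)
    norms.append(np.linalg.norm(u))
print(norms)
```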
We provided a theoretical foundation for the theory of PDPEs, which is widely used in the numerical simulation of wave dynamics [3, 4, 5, 8, 9]. First, we gave a rigorous definition of the square root operator appearing in such equations. Second, we established the well-posedness of the corresponding Cauchy problem. Due to the abstract nature of the proofs, they cover most typical Cauchy problem setups that arise in practice (e.g., it is sufficient that the initial data \( u_0\) is square integrable), although the dependence of the problem parameters on the range (i.e., on \( x\) ) is restricted to piecewise constant functions. In many physics and engineering problems, this is exactly the way the information about the medium is usually given [3]. On the other hand, it is desirable to establish the same results for more general types of range-dependent media (we plan to address this in future work).
We are grateful to Markus Haase and Christian Wyss for several helpful discussions.
[1] M. Leontovich, V. Fock, Solution of the problem of electromagnetic wave propagation along the earth's surface by the method of parabolic equation, J. Phys. USSR 10 (1946) 13–23.
[2] J. Claerbout, Fundamentals of Geophysical Data Processing: With Applications to Petroleum Prospecting, Blackwell Scientific Publications, 1985.
[3] F. B. Jensen, M. B. Porter, W. A. Kuperman, H. Schmidt, Computational Ocean Acoustics, 2nd Edition, Springer, 2011.
[4] Y. Y. Lu, Some techniques for computing wave propagation in optical waveguides, Commun. Comput. Phys. 1 (6) (2006) 1056–1075.
[5] M. Lytaev, Rational interpolation of the one-way Helmholtz propagator, J. Comput. Sci. 58 (2022) 101536.
[6] L. Fishman, J. McCoy, Derivation and application of extended parabolic wave theories. I. The factorized Helmholtz equation, J. Math. Phys. 25 (2) (1984) 285–296.
[7] P. S. Petrov, X. Antoine, Pseudodifferential adiabatic mode parabolic equations in curvilinear coordinates and their numerical solution, J. Comput. Phys. 410 (2020) 109392.
[8] T. He, J. Liu, S. Ye, X. Qing, S. Mo, A novel model order reduction technique for solving horizontal refraction equations in the modeling of three-dimensional underwater acoustic propagation, J. Sound Vibr. 591 (2024) 118617.
[9] M. D. Collins, Generalization of the split-step Padé solution, J. Acoust. Soc. Amer. 96 (1) (1994) 382–385.
[10] P. S. Petrov, M. Ehrhardt, S. B. Kozitskiy, A generalization of the split-step Padé method to the case of coupled acoustic modes equation in a 3d waveguide, J. Sound Vibr. 577 (2024) 118304.
[11] K.-J. Engel, R. Nagel, One-Parameter Semigroups for Linear Evolution Equations, Vol. 194 of Graduate Texts in Mathematics, Springer, 2000.
[12] M. Haase, The Functional Calculus for Sectorial Operators, Vol. 169 of Operator Theory: Advances and Applications, Springer, 2006.
[13] A. Taylor, D. Lay, Introduction to Functional Analysis, 2nd Edition, John Wiley & Sons, 1980.
[14] K. Yosida, Functional Analysis, 6th Edition, Vol. 123 of Grundlehren Math. Wiss., Springer, Cham, 1980.
[15] T. Kato, Perturbation Theory for Linear Operators, Vol. 132 of Class. Math., Springer, 1995.