162 61 6MB
German Pages 294 Year 1984
Christian Berg Jens Peter Reus Christensen Paul Ressel
Harmonic Analysis on Semigroups Theory of Positive Definite and Related Functions
Springer-Verlag New York Berlin Heidelberg Tokyo
Christian Berg Jens Peter Reus Christensen Matematisk Institut K0benhavns U niversitet Universitetsparken 5 DK-2100 K0benhavn ~ Denmark
Editorial Board P. R. Halmos Managing Editor Department of Mathematics Indiana University Bloomington, IN 47405 U.S.A.
Paul Ressel Mathematisch-Geographische FakulHit Katholische UniversiHit EichsHitt Residenzplatz 12 D-8078 EichsHitt Federal Republic of Germany
F. W. Gehring
C. C. Moore
Department of Mathematics University of Michigan Ann Arbor, MI 48109 U.S.A.
Department of Mathematics University of California at Berkeley Berkeley, CA 94720 U.S.A.
AMS Classification (1980) Primary: 43-02, 43A35 Secondary: 20M14, 28C15, 43A05, 44AI0, 44A60, 46A55, 52A07, 60E 15 Library of Congress Cataloging in Publication Data Berg, Christian Harmonic analysis on semigroups. (Graduate texts in mathematics; 100) Bibliography: p. Includes index. 1. Harmonic analysis. 2. Semigroups. I. Christensen, Jens Peter Reus. II. Ressel, Paul. III. Title. IV. Series. QA403.B39 1984 515'.2433 83-20122 With 3 Illustrations.
© 1984 by Springer-Verlag New York Inc. All rights reserved. No part of this book may be translated or reproduced in any form without written permission from Springer-Verlag, 175 Fifth Avenue, New York, New York 10010, U.S.A. Typeset by Composition House Ltd., Salisbury, England. Printed and bound by R. R. Donnelley & Sons, Harrisonburg, Virginia. Printed in the United States of America.
9 87 6 54 32 1
ISBN 0-387-90925-7 Springer-Verlag New York Berlin Heidelberg Tokyo ISBN 3-540-90925-7 Springer-Verlag Berlin Heidelberg New York Tokyo
Preface
The Fourier transform and the Laplace transform of a positive measure share, together with its moment sequence, a positive definiteness property which under certain regularity assumptions is characteristic for such expressions. This is formulated in exact terms in the famous theorems of Bochner, Bernstein-Widder and Hamburger. All three theorems can be viewed as special cases of a general theorem about functions qJ on abelian semigroups with involution (S, +, *) which are positive definite in the sense that the matrix (qJ(sj + Sk)) is positive definite for all finite choices of elements Sl' . . . , Sn from S. The three basic results mentioned above correspond to (IR, +, x* = -x), ([0,00[, +, x* = x) and (No, +, n* = n). The purpose of this book is to provide a treatment of these positive definite functions on abelian semigroups with involution. In doing so we also discuss related topics such as negative definite functions, completely monotone functions and Hoeffding-type inequalities. We view these subjects as important ingredients of harmonic analysis on semigroups. It has been our aim, simultaneously, to write a book which can serve as a textbook for an advanced graduate course, because we feel that the notion of positive definiteness is an important and basic notion which occurs in mathematics as often as the notion of a Hilbert space. The already mentioned Laplace and Fourier transformations, as well as the generating functions for integervalued random variables, belong to the most important analytical tools in probability theory and its applications. Only recently it turned out that positive (resp. negative) definite functions allow a probabilistic characterization in terms of so-called Hoeffding-type inequalities. As prerequisites for the reading of this book we assume the reader to be familiar with the fundamental principles of algebra, analysis and probability, including the basic notions from vector spaces, general topology and abstract
vi
Preface
measure theory and integration. On this basis we have included Chapter 1 about locally convex topological vector spaces with the main objective of proving the Hahn-Banach theorem in different versions which will be used later, in particular, in proving the Krein-Milman theorem. We also present a short introduction to the idea of integral representations in compact convex sets, mainly without proofs because the only version of Choquet's theorem which we use later is derived directly from the Krein-Milman theorem. For later use, however, we need an integration theory for measures on Hausdorff spaces, which are not necessarily locally compact. Chapter 2 contains a treatment of Radon measures, which are inner regular with respect to the family of compact sets on which they are assumed finite. The existence of Radon product measures is based on a general theorem about Radon bimeasures on a product of two Hausdorff spaces being induced by a Radon measure on the product space. Topics like the Riesz representation theorem, adapted spaces, and weak and vague convergence of measures are likewise treated. Many results on positive and negative definite functions are not really dependent on the semigroup structure and are, in fact, true for general positive and negative definite matrices and kernels, and such results are placed in Chapter 3. Chapters 4-8 contain the harmonic analysis on semigroups as well as a study of many concrete examples of semigroups. We will not go into detail with the content here but refer to the Contents for a quick survey. Much work is centered around the representation of positive definite functions on an abelian semigroup (S, +, *) with involution as an integral of semicharacters with respect to a positive measure. It should be emphasized that most of the theory is developed without topology on the semigroup S. The reason for this is simply that a satisfactory general representation theorem for continuous positive definite functions on topological semigroups does not seem to be known. There is, of course, the classical theory of harmonic analysis on locally compact abelian groups, but we have decided not to include this in the exposition in order to keep it within reasonable bounds and because it can be found in many books. As described we have tried to make the book essentially self-contained. However, we have broken this principle in a few places in order to obtain special results, but have never done it if the results were essential for later development. Most of the exercises should be easy to solve, a few are more involved and sometimes require consultations in the literature referred to. At the end of each chapter is a section called Notes and Remarks. Our aim has not been to write an encyclopedia but we hope that the historical comments are fair. Within each chapter sections, propositions, lemmas, definitions, etc. are numbered consecutively as 1.1,1.2,1.3, ... in §1, as 2.1,2.2,2.3, ... in §2, and so on. When making a reference to another chapter we always add the number of that chapter, e.g. 3.1.1.
vii
Preface
We have been fascinated by the present subject since our 1976 paper and have lectured on it on various occasions. Research projects in connection with the material presented have been supported by the Danish Natural Science Research Council, die Thyssen Stiftung, den Deutschen Akademischen Austauschdienst, det Danske U ndervisningsministerium, as well as our home universities. Thanks are due to Flemming Topsq,e for his advice on Chapter 2. We had the good fortune to have Bettina Mann type the manuscript and thank her for the superb typing.
March 1984
CHRISTIAN BERG lENS PETER REUS CHRISTENSEN PAUL RESSEL
Contents
CHAPTER 1
Introduction to Locally Convex Topological Vector Spaces and Dual Pairs §1. Locally Convex Vector Spaces §2. Hahn-Banach Theorems §3. Dual Pairs Notes and Remarks
1 1 5 11 15
CHAPTER 2
Radon Measures and Integral Representations §1. §2. §3. §4. §5.
Introduction to Radon Measures on Hausdorff Spaces The Riesz Representation Theorem Weak Convergence of Finite Radon Measures Vague Convergence of Radon Measures on Locally Compact Spaces Introduction to the Theory of Integral Representations Notes and Remarks
16 16 33 45 50 55 61
CHAPTER 3
General Results on Positive and Negative Definite Matrices and Kernels
66
§1. Definitions and Some Simple Properties of Positive and Negative Definite Kernels §2. Relations Between Positive and Negative Definite Kernels §3. Hilbert Space Representation of Positive and Negative Definite Kernels Notes and Remarks
66 73 81 84
Contents
X
CHAPTER 4
Main Results on Positive and Negative Definite Functions on Semigroups §1. Definitions and Simple Properties §2. Exponentially Bounded Positive Definite Functions on Abelian Semigroups §3. Negative Definite Functions on Abelian Semigroups §4. Examples of Positive and Negative Definite Functions §5. r-Positive Functions §6. Completely Monotone and Alternating Functions Notes and Remarks
86 86 92 98 113 123 129 141
CHAPTER 5
Schoenberg-Type Results for Positive and Negative Definite Functions §1. §2. §3. §4. §5.
Schoenberg Triples Norm Dependent Positive Definite Functions on Banach Spaces Functions Operating on Positive Definite Matrices Schoenberg's Theorem for the Complex Hilbert Sphere The Real Infinite Dimensional Hyperbolic Space Notes and Remarks
144 144 151 155 166 173 176
CHAPTER 6
Positive Definite Functions and Moment Functions §1. §2. §3. §4. §5.
Moment Functions The One-Dimensional Moment Problem The Multi-Dimensional Moment Problem The Two-Sided Moment Problem Perfect Semigroups Notes and Remarks
178 178 185 190 198 203 222
CHAPTER 7
Hoeffding's Inequality and Multivariate Majorization §1. The Discrete Case §2. Extension to Nondiscrete Semigroups §3. Completely Negative Definite Functions and Schur-Monotonicity Notes and Remarks
226 226 235 240 250
CHAPTER 8
Positive and Negative Definite Functions on Abelian Semigroups Without Zero §1. Quasibounded Positive and Negative Definite Functions §2. Completely Monotone and Completely Alternating Functions Notes and Remarks
252 252 263 271
References
273
List of Symbols
281
Index
285
CHAPTER 1
Introduction to Locally Convex Topological Vector Spaces and Dual Pairs
§1. Locally Convex Vector Spaces The purpose of this chapter is to provide a quick introduction to some of the basic aspects of the theory of topological vector spaces. Various versions of the Hahn-Banach theorem will be used later in the book and the exposition therefore centers around a fairly detailed treatment of these fundamental results. Other parts of the theory are only sketched, and we suggest that the reader consult one of the many books on the subject for further information, see e.g. Robertson and Robertson (1964), Rudin (1973) and Schaefer (1971). 1.1. We assume that the reader is familiar with the concept of a vector space E over a field IK, which is always either IK = IR or IK = C, and of a topology (!) on a set X, where (!) means the system of open subsets of X. Generally speaking, whenever a set is equipped with both an algebraic and a topological structure, we will require that the structures match in the sense that the algebraic operations become continuous mappings. To be precise, a vector space E equipped with a topology (!) is called a topological vector space if the mappings (x, y) ~ x + y of E x E into E and (A, x) ~ AX of IK x E into E are continuous. Here it is tacitly assumed that E x E and IK x E are equipped with the product topology and IK = IR or IK = C with its usual topology. A topological vector space E is, in particular, a topological group in the sense that the mappings (x, y) ~ x + y of E x E into E and x ~ - x of E into E are continuous. For each u E E the translation L u : X ~ X + u is a homeomorphism of E, so if fJl is a base for the filter 0/1 of neighbourhoods of zero, then u + fJl is a base for the filter of neighbourhoods of u. Therefore the whole topological structure of E is determined by a base of neighbourhoods of the origin.
2
1. Introduction to Locally Convex Topological Vector Spaces and Dual Pairs
A subset A of a vector space E is called absorbing if for each x E E there exists some M > 0 such that x E AA for all A E lK with IAI ~ M; and it is called balanced, if AA f; A for all A E lK with I AI ~ 1. Finally, A is called absolutely convex, if it is convex and balanced.
1.2. Proposition. Let E be a topological vector space and let 0/1 be the filter of neighbourhoods of zero. Then: (i) every V E 0/1 is absorbing; (ii) for every V E 0/1 there exists V E 0/1 with V + V f; U; (iii) for every V E 0/1, b(V) = nlJlI~ 1 J1V is a balanced neighbourhood of zero
contained in V.
°
PROOF. For a E E the mapping A ~ Aa of lK into E is continuous at A = and this implies (i). Similarly the continuity at (0,0) of the mapping (x, y) ~ x + y implies (ii). Finally, by the continuity of the mapping (A, x) ~ AX at (0, 0) E lK x E we can associate with a given U E 0/1 a number e > and V E 0/1 such that AV f; U for I AI ~ c. Therefore
°
eV
f;
b(V)
~
V
so V contains the balanced set b(V) which is a neighbourhood of zero because e V is so, x ~ ex being a homeomorphism of E. D From Proposition 1.2 it follows that in every topological vector space the filter 0/1 has a base of balanced neighbourhoods. A topological vector space need not have a base for 0/1 consisting of convex sets, but the spaces we will discuss always have such a base.
1.3. Definition. A topological vector space E over lK is called locally convex if the filter of neighbourhoods of zero has a base of convex neighbourhoods. 1.4. Proposition. In a locally convex topological vector space E the filter of neighbourhoods of zero has a base gj with the following properties: (i) Every V E gj is absorbing and absolutely convex. (ii) If V E gj and A 9= 0, then AV E (JI.
Conversely, given a base (JI for a filter on E with the properties (i) and (ii), there is a unique topology on E such that E is a (locally convex) topological vector space with gj as a base for the filter of neighbourhoods of zero. PROOF. If V is a convex neighbourhood of zero then b(V) is absolutely convex. If gjo is a base of convex neighbourhoods, then the family gj = {Ab( V) I V E gj 0' A 9= o} is a base satisfying (i) and (ii). Conversely, suppose that gj is a base for a filter ff on E and satisfies (i) and (ii). Then every set V E ff contains zero. The only possible topology on E which makes E to a topological vector space, and which has ff as the filter of neighbourhoods of zero, has the filter a + ff as filter of neigh-
3
§l. Locally Convex Vector Spaces
bourhoods of a E E. Calling a nonempty subset G s; E "open" if for every a E G there exists U E gj such that a + U S; G., it is easy to see that these "open" sets form a topology with a + f7 as the filter of neighbourhoods of a., and that E is a topological vector space. D In applications of the theory of locally convex vector spaces the topology on a given vector space E is often defined by a family of seminorms.
1.5. Definition. A function p: E following properties:
~
[0,
00 [
is called a seminorm if it has the
(i) homogeneity:p(:1x) = 1:1lp(x)for:1E lK,XEE; (ii) subadditivity: p(x + y) ~ p(x) + p(y) for x, y E E. If, in addition, P- 1( {O}) = {O}, then P is called a norm. If P is a seminorm and (X > 0 then the sets {x EEl p(x) < (X} are absolutely convex and absorbing. For a nonempty set A S; E, we define a mapping PA: E ~ [0, 00] by
PA(X) = inf{:1 > Olx E :1A} (where PA(X) = 00, if the set in question is empty). The following lemma is easy to prove. 1.6. Lemma. If A
S;
E is
(i) absorbing, then PA(X) < 00 for x E E; (ii) convex, then PAis subadditive; (iii) balanced, then PAis homogeneous, and
{x E ElpA(X) < I}
S;
A
S;
{x E ElpA(X)
~
I}.
If A satisfies (i)-(iii) then PA is called the seminorm determined by A. A seminorm P satisfies Ip(x) - p(y) I ~ p(x - y). In particular, if E is a topological vector space then P is continuous if and only if it is continuous at 0 and this is equivalent with {xlp(x) < (X} being a neighbourhood of zero for one (and hence for all) (X > o. We will now see how a family (Pi)iel of seminorms on a vector space E induces a topology on E.
1.7. Proposition. There exists a coarsest topology on E with the properties that E is a topological vector space and each Pi is continuous. Under this topology E is locally convex and the family of sets
is a base for the filter of neighbourhoods of zero.
4
1. Introduction to Locally Convex Topological Vector Spaces and Dual Pairs
PROOF. Let gj denote the above family of sets. Then gj is a base for a filter on E having the properties (i) and (ii) of Proposition 1.4, and the unique topology asserted there is the coarsest topology on E making E to a topological vector space in which each Pi is continuous. D The above topology is called the topology induced by the family (pJiEI of seminorms. Note that in this topology a net (x a ) from E converges to x if and only if lima plx - x a ) = 0 for all i E I. The topology of an arbitrary locally convex topological vector space E is always induced by a family of seminorms, e.g. by the family of all continuous seminorms as is easily seen by 1.4 and 1.6. 1.8. Proposition. Let E be a locally convex topological vector space, where the topology is induced by a family (pJiEI of seminorms. Then E is a Hausdorff space if and only iffor every x E E\ {OJ there exists i E I such that plx) =f O. PROOF. Suppose x =f y and that (pJiEI has the above separation property. Then there exist i E I and G > 0 such that Pi(X - y) = 2c;. The sets
{ulplx - u) < c;}, {ulply - u) < c;} are open disjoint neighbourhoods of x and y. For the converse we prove the apparently stronger statement that the separation property of (pJi E I is a consequence of E being a TI -space (i.e. the one point sets are closed). In fact, if x =f 0 and {x} is closed there exists a neighbourhood U of zero such that x ¢ U. By Proposition 1.7 there exist c; > 0 and finitely many indices iI, , in E I such that
{yIPil(y) < c;,
, Piney) < c;}
so for some i E {iI' ... , in} we have plx) ~ c;.
~
U,
D
1.9. Finest Locally Convex Topology. Let Ebe a vector space over IK. Among the topologies on E., which make E into a locally convex topological vector space., there is a finest one., namely the topology induced by the family of all seminorms on E. This topology is called the finest locally convex topology on E. An alternative way of describing this topology is by saying that the system of all absorbing absolutely convex sets is a base for the filter of neighbourhoods of zero, cf. 1.4. The finest locally convex topology is Hausdorff. In fact, let e E E\ {OJ be given. We choose an algebraic basis for E containing e and let qJ be the linear functional determined by qJ( e) = 1 and qJ being zero on the other vectors of the basis. Then p = IqJ I is a seminorm with pee) = 1, and the result follows from 1.8. Notice that every linear functional is contin uous in the finest locally convex topology. In Chapter 6 the finest locally convex topology will be used on the vector space of polynomials in one or more variables.
§2. Hahn-Banach Theorems
5
1.10. Exercise. Let E be a topological vector space, and let A, B, C, F s; E. (a) Show that A (b) Show that F
+ B is open in E if A is open and B is arbitrary. + C is closed in E if F is closed and C is compact.
1.11. Exercise. Let E be a topological vector space. Show that the interior of a convex set is convex. Show that if U is an absolutely convex neighbourhood of in E then its interior is absolutely convex. It follows that a locally convex topological vector space has a base for the filter of neighbourhoods of consisting of open absolutely convex sets.
°
°
1.12. Exercise. Show that a Hausdorff topological vector space is a regular topological space. (It is actually completely regular, but that is more difficult to prove.) 1.13. Exercise. Let E be a topological vector space and A s; E a nonempty and balanced subset. Then: (i) if A is open, A = {x E ElpA(X) < I}; (ii) if A is closed, A = {x E EI PA(X) ~ I}.
1.14. Exercise. Let P, q be two seminorms on a vector space E. Then if {x E Elp(x) ~ I} = {x E Elq(x) ~ I} it follows that P = q. 1.15. Exercise. Let the topology of the locally convex vector space E be induced by the family (pJiEI of seminorms, and let f be a linear functional on E. Then.f is continuous if and only if there exist C E JO, w[ and some finite subset J s; I such that I f(x) I ~ C • max{pi(x) liE J} for all x E E.
§2. Hahn-Banach Theorems One main result in the theory of locally convex topological vector spaces is the Hahn-Banach theorem about extensions of linear functionals. In the following we treat this and closely related results under the name of HahnBanach theorems. We recall that a hyperplane H in a vector space E over lK is a maximal proper linear subspace of E or, equivalently, a linear subspace of codimension one (i.e. dim E/H = 1). Another equivalent formulation is that a hyperplane is a set of the form qJ - I( {O}) for a linear functional qJ: E ~ lk\ not identically zero. Neither local convexity nor the Hausdorff separation property is needed in our first version of the Hahn-Banach theorem. However the existence of a nonempty open convex set A =F E is a strong implicit assumption on E.
6
1. Introduction to Locally Convex Topological Vector Spaces and Dual Pairs
2.1. Theorem (Geometric Version). Let E be a topological vector space over lK and let A be a nonempty open convex subset of E. If M is a linear subspace of E with A n M = 0, there exists a closed hyperplane H containing M with An H = 0. We first consider the case lK = IR. By Zorn's lemma there exists a maximal linear subspace H of E such that M ~ H and A n H = 0. Let C= H + > 0 AA. The sum of an open set and an arbitrary set is open, hence C is open, cf. Exercise 1.10. We now derive four properties of C and H by contradiction: PROOF.
U;.
(a) C n ( - C) = 0. In fact, if we assume x E C n (-C), we have x = h l + Alal = h 2 - A2a2 with hi E H, ai E A, Ai > 0, i = 1,2. By the convexity of A
which is impossible.
(b) H u C u ( - C) = E. In fact, if there exists x E E\(H u C u ( - C)) we define H = H + IRx, so H is a proper subspace of H. Furthermore A n H = 0 because yEA n H can be written y = h + AX with h E H and A =F (A n H = 0), and then x = (l/A)Y - (l/A)h E C u (-C), which is incompatible with the choice of x. Finally the existence of H is inconsistent with the maximality of H so (b) holds.
°
(c) Hn(Cu(-C))=0. In fact, if x E H n C then x = h + Aa with h E H, a E A and A > 0, but then a = (l/A)(x - h) E A n H, which is a contradiction. From (b) and (c) follows that H is the complement of the open set C u ( - C), hence closed.
(d) H is a hyperplane. If H is not a hyperplane there exists x E E\H such that H = H + IRx =F E. Without loss of generality we may assume x E C and we can choose y E ( - C)\H. The function f: [0, 1] ~ E defined by f(A) = (1 - A)X + Ay is continuous, sO.r - l(C) and.r - l( - C) are disjoint open subsets of [0, 1] and 1. Since [0" 1] is connected there exists containing" respectively" ct E ]0, 1[such thatf(ct) E H. But this implies y = (l/ct)(f(ct) - (1 - ct)x) E H, which is a contradiction. This finishes the proof of the real case. A complex vector space can be considered as a real vector space, and if H denotes a real closed hyperplane containing M and such that A n H = 0, then H n (iH) is a complex hyperplane with the desired properties. D
°
7
§2. Hahn-Banach Theorems
The following important criterion for continuity of a linear functional will be used several times. 2.2. Proposition. Let E be a topological vector space over IK, let qJ: E -+ IK be a nonzero linear functional and let H = qJ - 1( {O}) be the corresponding hyperplane. Then precisely one of the following two statements is true: (i) qJ is continuous and H is closed; (ii) qJ is discontinuous and H is dense.
PROOF. The closure H is a linear subspace of E. By the maximality of H we therefore have either H = H or H = E. If qJ is continuous then H = qJ - 1( {O}) is closed. Suppose next that H is closed. Let a E E\H be chosen such that qJ(a) = 1. By Proposition 1.2 there exists a balanced neighbourhood V of zero such that (a + V) n H = 0, and therefore qJ(V) is a balanced subset of lK such that 0 fll + qJ(V), hence qJ(V) f; {x E IK Ilx I < l}. It follows that IqJ(X) I < G for all x E GV, G > 0, so qJ is continuous at zero, and hence continuous. D 2.3. Theorem of Separation. Let E be a locally convex topological vector space over IK. Suppose F and C are disjoint nonempty convex subsets of E such that F is closed and C is compact. Then there exists a continuous linear functional qJ: E -+ IK such that sup Re qJ(x) < inf Re qJ(x). xeC
xeF
PROOF. Let us first suppose IK = ~, and consider the set B = F - C. Obviously B is convex, and using the compactness of C it may be seen that B is closed, cf. Exercise 1.10. Since F n C = 0 we have 0 €I B, so by 1.4 there exists an absolutely convex neighbourhood U of 0 such that U n B = 0. The interior V of U is an open absolutely convex neighbourhood (cf. Exercise 1.11) so A = B + V = B - V is a nonempty open convex set (1.10) such that 0 €I A. Since {O} is a linear subspace not intersecting A, there exists by Theorem 2.1 a closed hyperplane H with A n H = 0. Let qJ be a linear functional on E with H = qJ - 1( {O}). By 2.2, qJ is continuous. Now qJ(A) is a convex subset of ~, hence an interval, and since 0 €I qJ(A) we may assume qJ(A) f; ]0, 00 [. (If this is not the case we replace qJ by - qJ). We next claim inf qJ(x) > 0, xeB
which is equivalent to the assertion. If the contrary was true there exists a sequence (x n ) from B such that qJ(x n ) -+ O. Since V is absorbing there exists u E V with qJ(u) < 0, but X n + u E A so that qJ(x n) + qJ(u) > 0 for all n, which is in contradiction with qJ(x n) -+ O. In the case IK = C we consider E as a real vector space and find a ~-linear functional qJ: E -+ ~ as above. To finish the proof we notice that there exists precisely one C-linear functional t/J: E -+ C with Re t/J = qJ namely t/J(x) = qJ(x) - iqJ(ix), which is continuous since qJ is so. D
8
1. Introduction to Locally Convex Topological Vector Spaces and Dual Pairs
Applying the theorem to two one-point sets we find
2.4. Corollary. Let E be a locally convex Hausdorff topological vector space. For a, bEE, a =f= b, there exists a continuous linear functional f on E such that f(a) =f= f(b). We shall now treat the versions of the Hahn-Banach theorem which are called extension theorems. Although they may be derived from the geometric version, we give a direct proof using Zorn's lemma. The first extension theorem is purely algebraic and very useful in the theory of integral representations. It uses the following weakened form of the concept of a seminorm.
2.5. Definition. Let E be a vector space. A function p: E -+ IR is called sublinear if it has the following properties: (i) positive homogeneity: p(AX) = Ap(X) for A ~ 0, X E E; (ii) subadditivity: p(x + y) ~ p(x) + p(y) for x, y E E. A functionf: E
-+
IR is called dominated by p iff(x)
~
p(x) for all x E E.
2.6. Theorem (Extension Version). Let M be a linear subspace ofa real vector space E and let p: E -+ IR be a sublinear function. If f: M -+ IR is linear and dominated by p on M, there exists a linear extension 1: E -+ IR off, which is dominated by p. PROOF. We first show that it is always possible to perform one-dimensional extensions assuming M =f= E. Let e E E\M and define M' = span(M u {e}). Every element x' E M' has a unique representation as x' = x + te with x E M, r E IR. For every rx E IR the functional f~: M' -+ IR defined by f~(x + te) = f(x) + trx is a linear extension off. We shall see that rx may be chosen such thatf~ is dominated by p. By the subadditivity of p we get for all x, y E M
f(x)
+
f(y) = f(x
+ y)
~
p(x
+ y)
~
p(x - e)
or
f(x) - p(x - e)
~
p(e
+ y)
- f(y).
Defining
k = sup{f(x) - p(x - e)lx EM}, K = inf{p(e
+ y)
- f(y)ly EM},
we have - 00
< k
~
K
0 be such that x E AV. Then A-1 X E V n M and hence I f(x) I ~ A. This shows that the seminorm Pu determined by V (cf. 1.6) satisfies If(x)1 ~ Pu(x) for x EM. Let]be a linear extension offsatisfying 1](x)1 ~ Pu(x) for x E E. Then 1](x)1 ~ G for x E GV, which shows that]is continuous. D
2.10. If E denotes a topological vector space we denote by E' the vector space of continuous linear functionals on E, and E' is called the topological dual space, which is a linear subspace of the algebraic dual space E* of all linear functionals on E.
2.11. Exercise. Let E be a real vector space and p a sublinear function on E. Show that
p(x) = sup{f(x)lfEE*,f
~
p}.
2.12. Exercise. Let P1' ... , Pn: E -+ IR be sublinear functions on a real vector space E and let.f: E -+ IR be linear and satisfyingf(x) ~ P1(X) + ... + Pn(x) for x E E. Show that there exist linear functions f1' ... , fn: E -+ IR such that f = f1 + ... + fn and such that h is dominated by Pi for i = 1, ... , n. Hint: Consider the product space En. 2.13. Exercise. With the notation as in Theorem 2.6 we denote by A(f, E) the set of linear extensions]: E -+ IR offwhich are dominated by p. Clearly A (f, E) is convex. Show by a Zorn's lemma argument that A(f, E) has extreme points. Let Atf, E) such that
Xo E
E. Show that there exists an extreme point]o in
(For the notion of an extreme point see 2.5.1. The result of the exercise is due to Vincent-Smith (1966, private communication). For a generalization see Andenaes (1970).)
11
§3. Dual Pairs
§3. Dual Pairs Let r\J o = {O, 1,2, ... }, let E = ~~o be the vector space of real sequences s = (Sk)k~O and let F be the vector space of polynomials p(x) = Lk=O CkX" with real coefficients. Note that F can be identified with the subspace of sequences SEE with only finitely many nonzero terms. For sEE and p E F we can define 00
(s, p> =
L SkCk k=O
and (., . > is a bilinear mapping of E x F into ~, which clearly satisfies the axioms in the following definition, so E and F form a dual pair under ( ., .
>.
3.1. Definition. Let E and F be vector spaces over IK and (., . >: E x F -+ IK a bilinear form, i.e. separately linear. We say that E and F form a dual pair under (., . >if the following conditions hold: (i) For every e E E\{O} there existsf E F such that (e,f> =f= (ii) For every f E F\ {O} there exists e E E such that (e,f> =f=
o. o.
3.2. A locally convex Hausdorff topological vector space E and its topological dual space E' form a dual pair under the bilinear form (x, qJ> = qJ(x) for x E E, qJ E E'. The condition (ii) is clearly true and (i) follows from Corollary 2.4. A vector space E and its algebraic dual space E* form a dual pair under the bilinear form (x, qJ) = qJ(x). This example is a special case of the above example if E is equipped with the finest locally convex topology, cf. 1.9. We see below that every dual pair (E, F, (., . arises in the above way in the sense that there exist a topology 1J on E, such that E is a locally convex Hausdorff topological vector space, and an isomorphism j: F -+ E' such thatj(f)(e) = (e,f> for e E E,f E F. Such a topology 1J is called compatible with the duality between E and F. In general there exist many different topologies on E of this kind, and we will now define one, which turns out to be the coarsest compatible with the duality and therefore is called the weak topology.
»
3.3. Definition. Let E and F be a dual pair under (., . >. The weak topology a(E, F) on E is the topology induced by the family (Pf)feF of seminorms, where Pf(e) = I(e,f> I· Condition (i) of 3.1 implies that a(E, F) is Hausdorff, cf. 1.8. By reasons of symmetry there is also a weak topology a(F, E) on F.
3.4. Proposition. The topology a(E, F) is the coarsest of the topologies compatible with the duality between E and F. PROOF. If 1J is a topology compatible with the duality then e ~ (e,f> is 1J-continuous for all f E F, and so are the seminorms (Pf)feF. By 1.7 it
12
1. Introduction to Locally Convex Topological Vector Spaces and Dual Pairs
follows that aCE, F) is coarser than fJ. If E is equipped with the weak topology then e 1--+ is a continuous linear functional on E for each f E F, and the mapping j: F -+ E' given by j(f)(e) = is linear and one-to-one (condition (ii) of 3.1). To see thatj is onto we consider a aCE, F)-continuous linear functional qJ on E. By 1.7 there exists £ > 0 and .fl' ... ,In E F such that pflx) < £, i = 1, ... , n, implies IqJ(x) I ~ 1. This gives at once that
{x EEl = 0, i = 1, ... , n} ~ qJ Let us consider the linear mapping tjJ: E
tjJ(x) = «X,fl >,
-+
1(
{O} ).
(1)
/Kn defined by
... , for x E E we find
~~~ Re(b,~ f) < 1 < Re(e, ~ f)' The first inequality shows that (1/ A)f E A ° and the last inequality is then incompatible with e E A 00. D 3.7. Remark. If A is balanced we have
AO= {fEF/I(e,f>1 ~ IforalleEA}. This is often used as a definition of the (absolute) polar set. If A is a cone (i.e. AA £; A for all A ~ 0) we have
AO = {f E FI Re(e,f>
~
0 for all e E A},
which is a convex cone. With A £; E we also associate another convex cone Ai- £; F, which is closed in any topology on F compatible with the duality between E and F, namely Ai- = {fEFI(e,f> ~ Ofor alleE A}. Clearly Ai-£; - A ° and if E and F are real vector spaces and if A is a cone then Ai- = -AO. For a set A containing 0 the bipolar theorem states that A 00 is the 1]-closed convex hull of A. Using translations we therefore have the following consequence of the bipolar theorem: 3.8. Proposition. The closed convex hull of a subset of E is the same for all topologies on E compatible with a given duality. If E is a finite dimensional vector space over IK, hence isomorphic with IK n where n is the dimension of E, there is exactly one topology on E compatible with the duality between E and E*. More generally there is exactly one Hausdorff topology on E such that E is a topological vector space. We will refer to this topology as the canonical topology of E. These assertions are contained in the following result. 3.9. Proposition. Let E be a finite dimensional subspace of a Hausdorff topological vector space F. Then E is closed in F, and any algebraic isomorphism qJ: IK n -+ E (n = dim(E)) is a homeomorphism, when IK n is equipped with the topology generated by the euclidean norm.
14
1. Introduction to Locally Convex Topological Vector Spaces and Dual Pairs
We first show by induction that any isomorphism qJ: IK n -+ E is a homeomorphism. For n = 1 we put qJ(l) = e. The continuity of scalar multiplication implies that qJ: A 1--+ Ae is continuous from IK to E. The inverse qJ - 1 is a linear functional on E, and its kernel is the hyperplane {O}, which is closed since E is Hausdorff. By 2.2 it follows that qJ - 1 is continuous. Let us assume that the above statement is true for all dimensions less than n and let qJ: IK n -+ E be an algebraic isomorphism. As before the continuity of the algebraic operations shows that qJ is continuous. To see that n qJ - 1 : E -+ IK is continuous it suffices to prove that each linear functional on E is continuous. To get a contradiction let us assume that t/J: E -+ IK is a discontinuous linear functional and put H = t/J - 1({O}). Then H is a (n - 1)dimensional hyperplane, which is dense in E by 2.2. Let 11·11 be the euclidean norm (or any norm) on H. By the induction hypothesis the norm topology on H coincides with the topology induced from E, so there exists an open set V in E such that V n H = {xEHlllxll < I}. PROOF.
Since H is dense in E and V is open, we have V n H = V, where the closures are in E. But the set {x EHlllxli ~ I} is compact in H, hence in E and in particular closed in E, so we get V ~
V = V n H ~ {x EHlllxli ~ I}. Since V is absorbing in E we get E = H. By this absurdity
qJ is indeed a homeomorphism. We finally show that E is closed in F. If this is not true there exists x E E\E. Then E = span(E u {x}) is a (n + I)-dimensional space. If et, ... , en is an algebraic basis for E, then qJ: IK n + 1 -+ E given by qJ(A t , .•. , An' A) = 1 Ai ei + AX is an algebraic isomorphism, hence a homeomorphism. It follows that E is closed in E, hence x E EnE = E, which is a contradiction.
Li=
D
.>.
3.10. Exercise. Let E and F be a dual pair under (., Then the weak topology (J(E, F) is the coarsest topology on E for which the mappings e 1--+ (e,f> are continuous when .f ranges over F. 3.11. Exercise (Theorem of Alaoglu-Bourbaki). Let E be a locally convex Hausdorff topological vector space with topological dual space E' and let V be a neighbourhood of zero in E. Show that VO is (J(E', E)-compact. Hint: Show that for x E E there exists A > 0 such that I(x, .f> I ~ A for all
f
E
vo.
3.12. Exercise. Let E, F be a dual pair under (., . >and let t1 be a topology on E compatible with the duality. Let V be a closed, absolutely convex neighbourhood of zero in E and let Pu be the seminorm determined by V, cf. 1.6. Show that XE E. Pu(x) = sup{ I(x,f> f E Va},
II
15
Notes and Remarks
3.13. Exercise (Theorem of Mackey-Arens). Let E, F be a dual pair under (., .), and let d be the family of all absolutely convex and u(F, E)-compact subsets of F. For A E t91 we define
IlellA
= sup{ I(e,f)
II f
E
A},
e E E.
Show that II·IIA is a seminorm on E. Use 3.11 and 3.12 to show that if fJ is a topology on E compatible with the duality then fJ is induced by some subfamily of (11·11 A)A E.>4· Show finally that the topology induced by the family (11·IIA)AEd is the Mackey topology, i.e. the finest topology on E compatible with the duality.
Notes and Remarks In the period up to the 1940's most results in functional analysis were about normed spaces. The development of the theory of distributions of Schwartz was one main motivation for a study of general spaces, since the basic spaces of test functions and distributions are nonnormable in their natural topology. Today locally convex Hausdorff topological vector spaces are a natural frame for many theories and problems in functional analysis, e.g. the theory of integral representations, which we shall discuss in the next chapter. For historical information on the theory of topological vector spaces we refer the reader to the book by Dieudonne (1981).
CHAPTER 2
Radon Measures and Integral Representations
§1. Introduction to Radon Measures on Hausdorff Spaces It is well known that the pure set-theoretical theory of measure and integration has its limitations, and many interesting results need a topological frame because measure spaces without an underlying "nice" topological structure may be very pathological. In classical analysis this difficulty was overcome by introducing the theory of Radon measures on locally compact spaces. On these spaces there is a particularly important one-to-one relationship between Radon measures and certain linear functionals (see below) which in many treatments on analysis leads to the definition, that a Radon measure is a linear functional with certain properties. Another branch of mathematics with a need for a highly developed measure theory is probability theory. Here the class of locally compact spaces turned out to be far too narrow, partly due to the fact that an infinite dimensional topological vector space never can be locally compact. For example, it was found that the class of polish spaces (i.e. separable and completely metrizable spaces) was much more appropriate for probabilistic purposes. Later on it became clear that a very satisfactory theory of Radon measures can be developed on arbitrary Hausdorff spaces. This has been done, for example, in L. Schwartz's monograph (1973). We shall follow an approach to Radon measure theory which has been initiated by Kisynski and developed by Tops~e. It deviates, for example, from the Schwartz-Bourbaki theory in working only with inner approximation, but we hope to show that it gives an easy and elegant access to the main results.
§1. Introduction to Radon Measures on Hausdorff Spaces
17
In the following let X denote an arbitrary Hausdorff space. The natural a-algebra on which the measures considered will be defined will always be the a-algebra f!4 = f!4(X) of all Borel subsets of X, i.e. the a-algebra generated by the open subsets of X. In our terminology a measure will always be nonnegative; a measure defined on f!4(X) will be called a Borel measure on X. Later on we also need to consider a-additive functions on f!4(X) which may assume negative values, these functions will be called signed measures.
1.1. Definition. A Radon measure J1 on the Hausdorff space X is a Borel measure on X satisfying (i) Jl(C) < 00 for each compact subset C £ X, (ii) Jl(B) = sup{Jl(C) I C £ B, C compact} for each B E f!4(X). The set of all Radon measures on X is denoted M +(X).
Remark. Many authors require a Radon measure to be locally finite, i.e. each point has an open neighbourhood with finite measure. There are good reasons for not having this condition as part of the definition, see Notes and Remarks at the end of this chapter. A finite Radon measure Jl (i.e. Jl(X) < (0) satisfies for B E f!4(X) Jl(B) = inf{Jl(G)IB £ G, G open} as is easily seen by considering the property (ii) for Be. However for arbitrary Radon measures this need not be true as is shown by Exercise 1.30 below. Let .ff = .ff(X) denote the family of all compact subsets of X. Clearly the restriction to .ff of a Radon measure Jl is a set function
A: .ff
-+
[0, oo[
satisfying the axioms of a Radon content below.
1.2. Definition. A Radon content is a set function A:.ff -+ [0, oo[ which satisfies (1) A(C 2 ) - A(C 1 ) = sup{A(C)IC £ C 2 \C 1 , CE.ff} for all C 1 , C 2 E.ff with C 1 £ C 2 • The key result in our approach to Radon measure theory is the extension theorem (1.4) below, the proof of which will need the following lemma.
1.3. Lemma. A Radon content A has the following properties: (i) A(C t ) ~ A(C 2 ),for all Cf, C 2 E ~ C t £ C 2 , i.e. A is monotone. (ii) A(C t U C 2 ) + A(C t n C 2 ) = A(C 1 ) + A(C 2 )'1 i.e. A is modular. (iii) lj'a net (Ca)(J. E A in.ff is decreasing with C = na C a then A( C) = lima A(Ca) = infa A( C a). In particular for a decreasing sequence C 1 ;2 C 2 ;2 . . . of compact sets we have lim n _ oo A(C n) = A(n:=l Cn).
18
2. Radon Measures and Integral Representations
(i) as well as A(0) = 0 is obvious. (ii) We have (C 1 u C 2 )\C 2 = C 1 \(C 1 n C 2 ) and therefore
PROOF.
A(C 1 u C 2 )
A(C 2 ) = A(C 1)
-
A(C 1 n C 2 )
-
as an immediate consequence of (1). (iii) Assume that b := in(iA(C a) - A(C» > O. We choose a fixed set C ao and C' £ Cao \ C, C' E .ff such that
A(Cao) - A( C) - A(C') < b. Now na~ao(C' n C a) = 0 and therefore C' n Cal = 0 for some Cal £ Cao since C' is compact and (Ca)aE A is decreasing. From (ii) we get
A(C' u Ca) = A(C') < A(C')
+ A(Ca) ~ A(Cao ) + A(C) + b
o
implying A(C a) - A(C) < J, a contradiction.
1.4. Theorem. Any Radon content on a Hausdorffspace has a unique extension to a Radon measure. PROOF. Let A be a Radon content on X. We define for any subset A £ X the inner measure by A*(A):= sup{A(C)IC £ A, C E.ff}
and have to show that J1 := A* I~ is a measure. Of course A* is an extension of A, but it may assume the value + 00, if Ais unbounded. In a certain analogy with Caratheodory's famous abstract measure extension theorem we consider the set system
.91:= {A
£
X/A*(C n A)
+
A*(C n A C ) = A*(C) for all C
E
.ff},
and we will show that d is a a-algebra containing ~, on which the restriction of A* is a-additive. From the very definition d is closed under complements and contains the empty set. The defining property (1) of a Radon content shows that .91 even contains all open subsets of X. Let AI' A 2 E d be disjoint and let C 1 £ A b C 2 £ A 2 be compact. Then the modularity of A gives
A(C 1 )
+
A(C 2 ) = A(C 1 u C 2 ) ~ A*(A 1 u A 2 )
and hence
A*(A 1 )
+
A*(A 2 ) ~ A*(A 1 u A 2 ),
i.e. A* is "superadditive". As a consequence d may also be written as
d = {A £ X/A*(C n A)
+
A*(C n A C ) ~ A*(C) for all C
E
.ff}.
Now let a sequence AI' A 2 , ••• E.91 be given and fix C E.ff as well as o. Then there exist compact sets K j £ C n A j and L j £ C n Aj such that
G
>
j
= 1,2, ....
(2)
19
§1. Introduction to Radon Measures on Hausdorff Spaces
From the modularity of A we get
A(V: K
i)
+ A(K n + 1n
/)1 Ki )
n
~Cn
=
A(VI K
i)
+ A(K n +
I)
(3)
as well as
We have
and
I n := L n + hence
K
n
and
1 U
j-= 1
Li
(n
A~+ I)'
Aj u
j= 1
In are disjoint compact subsets of C, so that A(K n ) + A(I n ) ~ A(C).
(5)
Adding the equalities (3) and (4) and inserting (2) and (5) give
ACV: K
i)
+ AC6: Li ) =
A(~I K i ) - A(K n )
+
-
A(OI L
i)
I)
+ A(K n + + A(Ln +
A(I n )
(6)
~ A() Ki ) + A(.n Li ) - 2n~ )=1 )=1 Ifwe add (6) over n
I)
l'
= 1,2, ... , N - 1 and use (2) for j = 1 we get
A(~I K i )
+ A(6 Li )
~ A(C) -
/;-
JI ~.
(7)
Put A:= Uf=1 A j ; then A(U7=1 K j ) ~ A*(C n A) for all Nand
A(C\ Li ) =
!~n:, A(b Li ) ~ A*(C n
A
C )
(8)
by Lemma 1.3(iii), hence letting N tend to infinity in (7) gives
A*(C n A)
+ A*(C n
A
C )
~
A(C) -
B,
and since this holds for all B > 0, we have in fact shown A E or", hence d is a a-algebra containing the open sets and therefore the Borel sets. Let us now furthermore assume that the sets A l' A 2' ... E d are pairwise disjoint and that C £ A. Then lim N _ oo A(n7=1 L j ) = 0 by (8), and taking again the limit in (7) gives
Jc(C) -
B
~ ~~~ A(~I Ki)= ~~ itl A(K) = JI A(K) ~ i~1 A*(A)
20
for all
2. Radon Measures and Integral Representations
G
> O. Letting G -+ 0 we find 00
A*(A) = sup{A(C)IC £ A, C
E
L A*(A
$'} ~
j ),
j= 1
and since the reverse inequality is obvious by the superadditivity of A* we have that A* I.rtf is a measure, thus finishing our proof. D The result we are now going to prove is a kind of monotone convergence theorem for Radon measures. The usual form of this theorem on general measure spaces deals with an increasing sequence of nonnegative measurable functions; however, if the underlying measure is a Radon measure and if the functions to be integrated are lower semicontinuous (i.e. {.f > t} is open for all t E [R), then the sequence may be replaced by an arbitrary increasing net of functions, as we shall see. In the sequel we shall make repeated use of the obvious inequality 1
00
o ~ J.:= 2" i~l l u >i/2"j ~ f
(9)
being valid for arbitrary functions f with values in [0, 00]. Iff is finite the infinite series in (9) reduces to a finite sum (pointwise) and f - .in ~ 1/2n • Note thatfn increases tofalso iff assumes the value 00. Let us mention that the family of all lower semicontinuous functions is closed under finite sums, multiplication with a nonnegative constant, and that the supremum of an arbitrary subfamily of these functions is still lower semicontinuous. Noting finally that an indicator functionf = I G is lower semicontinuous if and only if G is open, we see that the functionsfn defined in (9) are lower semicontinuous if fis.
1.5. Theorem. Let J1 be a Radon measure on the Hausdorffspace X. Then the .following holds: (i) If a net (Ga)aEA of open subsets of X is increasing with
Ua Ga=
G then
J1(G) = sup J1(G a) = lim J1(G a). (ii) Ifa net (fa)(J.EA oflower semicontinuousfunctions X with sUPa j~ = f then
f
f dJl =
f
s~p f~ dJl = li~
ffa
-+
[0, 00] is increasing
dJlo
PROOF. (i) Let C £ G be compact. Then finitely many Gal' ... , Gak cover C and by assumption there is some CXo such that Gal U ••. U Gak £ Gao' implying J1(C) ~ J1(G ao ) ~ SUPa J1(G a) and therefore
J1(G) = sup{J1(C)I C £ G, C The reverse inequality is trivial.
E
$'}
~
sup J1(G a).
21
§l. Introduction to Radon Measures on Hausdorff Spaces
(ii) For every t E IR the open sets {fa> t} increase to {f > t}. Using the functions.in and the correspondingfa,n as defined in (9) we find
If~d~
=
;n ~~({f > ;n}) = ;n ~li~~({fa > ;n})
=
li~ ;n ~ ~ ({fa > ;n}) = li~ If~. nd~,
where the interchange of limits is justified, both limits being suprema, and using this device once more we get
I
f d~ = s~p I fn d~ = s~p s~p I f~.
n
d~
= sup sup Ifa,n dfl = sup I fa dfl, a
n
a
D
applying, of course, the usual monotone convergence theorem.
1.6. Remark. Theorem 1.5 can be applied to an upwards filtering family A of sets or functions by defining an increasing net in the following way: The index set and the mapping of the net will be A and the identical mapping. A Borel measure satisfying property (i) of the above theorem is usually called a r-smooth measure. The class of these measures is in general larger than the class of Radon measures, however, for finite Borel measures on locally compact spaces the two notions coincide. The generalized monotone convergence theorem expressed as property (ii) of the above theorem uses only the r-smoothness of the underlying Radon measure and therefore remains valid for r-smooth measures as well, see Top;¢>e (1970) and Varadarajan (1965). We shall need in the following the notion of restriction ofa Radon measure to a Borel subset. If X is a Hausdorff space and B E 81(X), then B is again a Hausdorff space with respect to the trace topology {B n GIG open in X} and it is easy to see that the Borel subsets of B are given by
81(B) = {B n AlA
E
81(X)} = {D
so that in fact 81(B) £ 81(X). For fl
E
E
81(X)ID £ B}
M +(X) we now define
fll B: 81(B)
~
[0, 00]
as the restriction of fl to 81(B)'l i.e. (fll B)(A) := fl(A) for A mediately seen that fll B is again a Radon measure.
E
81(B). It is im-
1.7. Proposition. Let J1 be a Radon measure on X. lfthefunctionf: X is Borel measurable, then
If d~
= sup Ke,*,"
r f d~,
JK
~
[0,00]
(10)
22 and
2. Radon Measures and Integral Representations
if f:
X
-+ [o~
oo[ is continuous then v: &I(X) v(B):=
-+ [o~
00] defined by
{! d,u
is again a Radon measure. The measure v is often denoted fJ1 or f dJ1. PROOF. If I = I B for some B E &I(X), then (10) follows from the definition of a Radon measure. It is obvious that (10) remains true iff'is an elementary measurable nonnegative function, i.e. f = = I CXi 1Bi with pairwise disjoint Borel sets Bf, ... , Bn and CXI' ••• , CXn ~ o. But it is well known that an arbitrary Borel measurable I ~ 0 is the pointwise limit of some increasing sequence of elementary functions, so that the usual monotone convergence theorem and the possibility of interchanging two suprema give (10) in the general case also. Let now f: X -+ IR+ be continuous and v(B) = SBI dj1, BE &I(X). Obviously, v is finite on compact sets. Applying (10) to the restrictions J1IB and.flB we find
Ii
{!
{{!
d,u = su p
d,uIK
E
Jf; K
~ B}
0
1.8. Let j1 be a Radon measure on X and consider the family ~ of all open J1-zero sets in X. The system of all finite unions of sets in ~ filters upwards to the union G of all sets in ~ and J1(G) = 0 by Theorem 1.5. The open set G is therefore maximal in ~ and its complement is called the support of j1 or abbreviated supp(J1). It is immediate that supp(j1) is closed and that sUPP(J1) = {x
E
X IJ1(U) > 0 for each open set U such that x
E
U}.
Particularly simple examples of Radon measures are those with a finite support which we will call molecular measures, and among these are the one-point or Dirac measures Gx defined by Gx ( {x}) = 1 and Gx ( {x }C) = o. Of course supp(Gx ) = {x} and if J1 = I CXiG Xi is a molecular measure with Xi Xj for i j, then sUPP(j1) = {XdCXi > O}. The set of molecular measures is denoted Mol+(X).
+
+
I?=
In the usual set-theoretical measure theory, as well as in the theory of Radon measures, the notion of a product measure is of central importance. In the latter case we are immediately confronted with the following problem: Let X and Y be two Hausdorff spaces; then the product of the two u-algebras of Borel sets~ usually denoted &I(X) ® &I(Y)~ is by definition the smallest u-algebra on X x Y rendering the two canonical projections 1Cx: X x Y -+ X and1Cy: X x Y -+ Ymeasurable,i.e.&I(X) ® &I(Y)istheu-algebragenerated by 1Cx I (&I(X» U 1Cf 1(&I( Y». By definition of the product topology these two projections are continuous on X x Y and therefore Borel measurable, so that always
&I(X) ® &I(Y) £ &I(X x Y).
23
§l. Introduction to Radon Measures on Hausdorff Spaces
On "nice" spaces we even have equality of these two a-algebras on X x Y, but this need not always hold, see the exercises below. Our next goal will be to show existence and uniqueness of the product of two arbitrary Radon measures. This stands in some contrast to set-theoretical measure theory where usually a-finiteness of the measures is required in order to guarantee a uniquely determined product measure. We begin with a lemma.
1.9. Lemma. Let Z be a Hausdorffspace and let stl be an algebra ofsubsets of Z containing a base for the topology. If A: .91 --+ [0, 00 [ is finitely additive then A: %(Z) --+ [0, oo[ defined by ,1(C):= inf{A(G)I C £ G, G open, G E d} is a Radon content on Z. PROOF. Let C £ Z be compact, then every point x E C has an open neighbourhood Gx E d. Finitely many of these neighbourhoods cover C and their union is still in d. Hence ,1( C) is certainly finite. Now let two compact sets C 1 £ C 2 be given. For B > 0 there is an open set G 1 ;2 C b G 1 Ed such that A(G 1 ) - ,1(C 1 ) < B. The set C:= C 2 n G1 is compact, too, allowing us to choose a further open set G E .91, G ;2 C with A(G) - ,1(C) < B. Of course, C 2 £ G U G 1 E .91 so that ,1(C 2 ) ~ A(G) + A(G 1) and therefore ,1(C 2 ) - ,1(C 1) ~ A(G) + A(G 1 ) + B - A(G 1 ) < ,1(C) + 2B. Hence
,1(C 2 )
-
,1(C 1 )
~
sup{,1(C) I C £ C 2 \C 1 , C
E
%}.
The reverse inequality will follow immediately if we can show that A is additive on disjoint compact sets. Therefore let K, L E % with K n L = 0 be given. One direction, namely
,1(K u L)
~
,1(K)
+ ,1(L)
is obvious, so it remains to be shown that for arbitrary
,1(K)
+ ,1(L)
~
,1(K u L)
B
> 0
+ B.
By definition there is an open set WEd containing K u L such that A(W) - ,1(K u L) < B. The assumption made on the algebra .91 implies that K and L may be separated by open sets G, H belonging to .91, i.e. we have K £G,
L £H,
Gn H = 0.
Hence
,1(K)
+ ,1(L)
~
A(G n W)
+ A(H n
W)
= A«G u H) n W) ~
thus finishing the proof.
A(W) < ,1(K u L)
+ B, D
24
2. Radon Measures and Integral Representations
Later on we shall need existence and unicity of certain Radon measures on the product of two Hausdorff spaces X and Y not only for the product of two measures, but also for so-called Radon bimeasures. If(X, d) and (Y,!!l) are just two measurable spaces (without an underlying topological structure) then a bimeasure is by definition a function : .91 x !!l
-+
[0,
00]
such that for fixed A E .91 the partial function B ~ (A, B) is a measure on !!l and for fixed B E !!l the function A ~(A, B) is a measure on d. Obviously, if K is a measure on .91 @ !!l, then (A, B) ~ K(A x B) is a bimeasure, but in general not even a bounded bimeasure is induced in this way, cf. Exercise 1.31. Our next result will, however, show that for Radon bimeasures such pathologies do not exist, where by definition is a Radon bimeasure if is a bimeasure defined on !!l(X) x !!l( Y) such that (K, L) < 00 for all compact sets K, Land (A, B) = sup{(K, L)IA ;2 K E g(X), B;2 L E g(y)} for all Borel sets A and B. 1.10. Theorem. Let X and Y be two Hausdorff spaces and let : !!l(X) x !!l(Y) -+ [0, 00] denote a Radon bimeasure. Then there is a uniquely determined Radon measure K on X x Y with the property
(K, L) = K(K x L)
.for all
K
E
g(X),
L
E
g(y).
Furthermore, the equality (A, B) = K(A x B) holds for all Borel sets A
E
!!l(X), B
E
!!l( Y).
PROOF. Denote Z:= X X Y and let .91 be the algebra generated by the "measurable rectangles" A x B, where A E !!l(X) and BE !!l(Y). This algebra contains, of course, the products of open sets in X (resp. Y) and therefore a base for the topology on Z. It is easy to see that there is a uniquely determined finitely additive set function A on .91 fulfilling
A(A x B) = (A, B)
for all
A
E
!!l(X)
and
B E !!l( Y).
Let us now first assume that (X, Y) < 00. Then we may apply Lemma 1.9 which, combined with the extension theorem 1.4, shows the existence of a Radon measure K on Z such that K(C)
= inf{A(G)IC
£
G E .91, G open}
for each compact set C £ Z. If C = K x L is the product of two compact sets K £ X, L £ Y, then C E .91 and K(K x L)
~
A(K x L) = (K, L)
by monotonicity of A. On the other hand, we may use the two finite Radon measures ,u(A):= (A, Y) on X and v(B):= (X, B) on Y to provide us
25
§ 1. Introduction to Radon Measures on Hausdorff Spaces
with open sets G
;2
K, H
;2
L such that J.1(G\K)
if f(x)
= g(x) = 0,
if f(x)
+ g(x) >
0,
0,
if.f(x) = g(x) = 0,
f, h"
~
g and h'
T(h)
~
T+(f)
+ h" = h, implying + T+(g)
so that finally T+ is additive on C +(X). We put T- := T+ - T which also is additive, nonnegative and positively homogeneous on C + (X). By Corollary 2.3 there are two Radon measures J1b J12 on X such that
41
§2. The Riesz Representation Theorem
implying for f E C(X) TC{) = T(f+) - T(f-) = T+(f+) - T-(f+) - [T+(f-) - T-(f-)]
= I f+ d(J.11 - J.12) - I f- d(J.11 - J.12) J1
= II dJ.1,
:=
J1l - J12
so that T = ~. By Corollary 2.3 we also get immediately that there is only one signed Radon measure J1 with this property. Let J1 = J1+ - J1- denote the Jordan-Hahn decomposition of J1, i.e. J1 +(B) = J1(B n D) and J1- (B) = J1(B n DC) where D E &leX) is chosen in such a way that both J1 + and J1- are nonnegative (see, for example, Billingsley (1979, p. 373)). Then J1+ and J1- are Radon measures and furthermore
I I dJ.11
=
sup{Ih
~
I f dJ.1+
dJ.1+ - I h dJ.1-IO
~ h ~ f, h E C(X)}
for all.{EC+(X), hence J1l ~ J1+ by Exercise 1.27 and similarly J12 ~ J1-. Consequently we get J1l (DC) = 0 = J12(D) and therefore J1l = J1 +, J12 = J1as well as 1IJ111 = J1l (D) + J12(D C). We still have to show the reverse inequality II TJlII ~ 1IJ111. Given e > 0 we choose compact sets K+ £; D, K- £; DC such that
By Urysohn's lemma there is a continuous functionf: X thatflK+ == 1 andflK- == -1, implying
If
f dJ.11
~ J.11(K+) + J.1iK-) -
2/:
~ 11J.111 -
~
[ -1,1] such
4/:.
This shows II TJlII ~ 1IJ111 and finishes the proof of the theorem.
D
We shall now consider the case where X is locally compact. For a given compact subset K £; X we denote by C K(X) the vector space of all continuous functions on X whose support is contained in K; of course, CK(X) is a Banach space with respect to the supremum norm and CC(X)
=
U
CK(X).
K E Jf"(X)
A linear functional T on CC(X) is called continuous if all the restrictions TIC K(X) are continuous in the usual sense.
2.5. Theorem. For a locally compact Hausdorff space X the continuous linear jiinctionals T on CC(X) are in a bijective linear relation with the signed
42
2. Radon Measures and Integral Representations
Radon measures J1 on X via the natural formula T(.f) =
f f dJ1.
If J1 is a signed Radon measure, i.e. J1 = J1l - J12 for Radon measures J1l' J12' then TCf) = f dJ1 := f dJ1l .f dJ12 is well defined and continuous in the above sense; the unicity of the representing measure J1 is obvious by Corollary 2.3. Now let a continuous linear functional T: CC(X) ---+ IR be given. Imitating the proof of Theorem 2.4 we define for f E CC+ (X) PROOF.
J
J
J
T+(f):= sup{T(h)l.f
~
hE CC+(X)}.
Any function h occurring here belongs to Csupp(flX), implying that for some constant a > 0 IT(h) I ~ allhll ~ all.!'II, hence
o~
T+(j)
0 there exists a compact subset K elg(x)l.for x E X\K.
£;
X such that If(x)1
~
2.6. Definition. A convex cone C £; C +(X) of nonnegative continuous functions is called an adapted cone if: (i) For every x (ii) For everY.f
E E
X there exists.f E C such that.f(x) > O. C there exists gEe such that.f E o(g).
A linear subspace V
£;
C(X) is called an adapted space if:
(iii) V = V+ - V+, where V+ = V n C +(X). (iv) V+ is an adapted cone.
If.f E CC+(X) then.f E o(f) and for.f E C1(X) we have.f E o(J]), which shows that CC(X) and CO(X) are adapted spaces. If p and q are two real
43
§2. The Riesz Representation Theorem
polynomials in one variable then p E o(q) if and only if deg(p) < deg(q). It follows easily that the space of polynomials in one variable is an adapted space of continuous functions on X = IR. Similarly the space of polynomials in k variables is adapted on X = IRk. The main property of adapted spaces is given in the following: 2.7. Theorem. Let V be an adapted space of continuous functions on a locally compact space X. For every positive linear functional L: V ~ IR there exists a Radon measure J1 on X such that V £; ,pl(J1) and
PROOF.
L(f)
= ff
{f
C(X)llfl ~ g for some g E V+}.
for all .f E V.
dp
We define
v=
E
Then V is a subspace of C(X) containing V, and a simple compactness argument combined with (i) shows that CC(X) £; V. We claim that V= V+ + Jt: In fact, iff E Ii and g E V+ is such that If I ~ g, thenf = (g + f) + (-g) shows the assertion. By Corollary 1.2.7 it follows that L can be extended to a positive linear functional L: V ~ IR, and by the Riesz representation theorem there exists a Radon measure J1 on X representing [ ICC(X). For g E V+ and qJ E CC(X) satisfying 0 ~ qJ ~ g we have
f qJ dp.
L(g) = L(g) ;;;; L(qJ) = By U rysohn's lemma the family ff to g, so by Theorem 1.5
f
g dp
=
:=
{qJ
CC(X) I0 ~ qJ ~ g} filters upwards
E
sup{f qJ dpl qJ E.9F} ~ L(g) < 00, J
hence g E !fJl(J1). To see that g dJ1 = L(g) we choose h E V+ such that g E o(h). Let e > 0 be given. There exists a compact set K £; X such that ~
g(x) We choose qJ
E
eh(x)
CC(X) such that Ix
x
for ~
o~ g -
qJ
~
gqJ
E
X\K.
1 and find ~
eh,
hence
o ~ L(g) -
l(gqJ) ~ eL(h),
or
L(g)
~
f
gqJ dp
+ eL(h) ~
f
g dp
+ eL(h),
44
2. Radon Measures and Integral Representations
which suffices since e > 0 is arbitrary and independent of h. The equality
L(g) =
f
g d/l
D
extends clearly from V+ to V.
2.8. Exercise. Let X be locally compact. Show that the dual (Co(X))' of the Banach space CO(X) equals Mb(X), the space of all finite signed Radon measures on X (which is again a Banach space with respect to the total variation norm) in the sense, that there exists a linear bijective isometry from Mb(X) onto (CO(X))'. Hint: Use the one-point compactification of X. 2.9. Exercise. Show that the following conditions are equivalent for a locally compact space X: (i) X is a-compact, i.e. X is a countable union of compact sets. (ii) There is a strictly positive function.r E CO(X). (iii) There is a function f E C(X) with f(x) ---+ 00 for x ---+ 00, i.e. such that tr ~ a} is compact for all a E IR. (iv) C(X) is an adapted space.
2.10. Exercise. Try to find a finite signed Radon measure J1 on IR such that supp(J1+) = sUPP(J1-) composition of J1.
= IR~ where J1 = J1+ - J1- is the Jordan-Hahn de-
2.11. Exercise. Let X be locally compact and consider on CC(X) the following four families (Pi)ielj,j
XI0 =f
II
=
12
= f(X),
{A
£;
= 1, ... ,4, of seminorms: PAC.{):= max 1.{(x)l,
A finite},
xeA
PK(f):= max If(x)l, xeK
13 = {oo},
Poo(f)
:=
sup I f(x) I, xeX
{pili
E
I 4 } = {pip is a seminorm on CC(X) whose restriction to CK(X) is continuous for all K
E
f(X)},
where CK(X) := {f E CC(X) Isupp(f) £; K} is considered as a Banach space with respect to the supremum norm. Let (!) l' ... , (!) 4 denote the corresponding locally convex topologies on CC(X) (which are all Hausdorff). Show that the topological duals of CC(X) with respect to these four topologies are given (by natural identification) in the following way: (CC(X), (D.)' = {J1
E
M(X)lsupp(J1) is finite},
(CC(X), (D2)' = {J1
E
M(X)lsupp(J1) is compact},
(CC(X), (D3)' = {J1
E
M(X) 111J111 < oo},
(CC(X), (D4)'
=
M(X),
~3.
45
Weak Convergence of Finite Radon Measures
where supp(J1):= supp(,u+) decomposition of ,u.
U
supp(J1-) if J1 = J1+ - J1- is the Jordan-Hahn
2.12. Exercise. Let V be an adapted space of continuous functions on a locally compact space X, let F
X be a closed subset and put
£;
V~ = {f E V I f(x) ~ 0 for all x E F}.
Show that any linear functional L: V be represented as
L(f) =
ff
---+
IR which is nonnegative on V~ can for
dp
f
E
V,
where ,u E M +(X) is supported by F.
2.13. Exercise. Let X, Y be locally compact spaces and f: X ---+ Y a continuous surjective mapping such that f-l(K) is compact in X for each compact subset K £; Y. Show that each v E M +(Y) is of the form J11 for some ,u E M +(X). Hint: Use Corollary 1.2.7 and the Riesz representation theorem. 2.14. Exercise. Let X and Y be locally compact spaces and T: CC(X) x CC(Y) ---+ IR a bilinear mapping which is positive in the sense that T(f, g) ~ 0 if.f E CC+ (X) and g E CC+ (Y). Show that
T(f, g) = for some uniquely determined J1
E
ff
® 9 dp
M + (X x Y).
§3. Weak Convergence of Finite Radon Measures The theory of weak convergence of finite Radon measures is a well-developed theory which is of great importance in probability theory, in particular when dealing with stochastic processes. We will later need only a very few basic facts which we are going to develop in this section. Let X be a Hausdorff space and denote by M~(X) the set of all finite Radon measures on X, i.e. all Radon measures J1 with J1(X) < 00. The weak topology on M~ (X) is the coarsest topology such that the functions J1 ~ f dJ1 become lower semicontinuous for every bounded lower semicontinuous .f: X ---+ IR. The family of sets
J
Gf,I:=
{p
E
M:(X)! ff
dp >
t}
is a subbase for the weak topology when f ranges over the bounded lower semicontinuous functions on X and t E IR. The following result is part of the so-called portmanteau theorem (cf. Tops~e 1970, Theorem 8.1).
46
2. Radon Measures and Integral Representations
3.1. Theorem. For J1 E M~(X) and a net (J1(l)(lEA in M~(X) the following properties are equivalent: (i) (ii) (iii) (iv) (v)
J1(l
~
lim lim lim lim
J1 weakly, i.e. in the weak topology; inf J1(l(G) ~ J1(G)for all open G £; X and lim J1(l(X) = J1(X); sup J1(l(F) ~ J1(F)for all closed F £; X and lim J1(l(X) = J1(X); inf Jf dJ1(l ~ Jf dJ1for all bounded lower semicontinuousf: X ~ ~; sup Jf dJ1(l ~ Jf dJ1for all bounded upper semicontinuousf: X -+ IR.
If (i)-(v) are fulfilled, then lim Jf dJ1(l = Jf dJ1 for each bounded continuous f: X ~ IR, and this property implies (i)-(v) if X is in addition a completely regular space. PROOF. By definition (i) is equivalent with (iv), and the equivalence of (ii) and (iii) (resp. (iv) and (v)) is immediate. "(ii) => (iv)" Let f: X -+ IR be lower semicontinuous and assume without restriction 0 ~ f ~ 1. From the obvious inequality
1)+ ~ -I L n
(f - -
n
-
l
1{!>i/n}
ni=l
~
f
we get
lim inf
f
f
d~a ~ ~ ~ lim inf ~a({f > ~}) ~ ~ ~ ~({f > ~}). J
where the last expression converges to f dJ1 as n tends to 00. The implication "(iv) => (ii)" is obvious. If f: X ~ lR is bounded and continuous and J1(l ~ J1 weakly, then by (iv) and (v) f dJ1(l ~ f dJ1. Now suppose that X is completely regular and lim f dJ1(l = f dJ1 for all continuous bounded f: X ~ IR. To show (ii) let G ~ X be open and let K £; G be compact. As an immediate consequence of the very definition of complete regularity we fi~d a continuous function f: X ~ [0,1] such thatflK == 1 andflG c == o. Then
J
J
J
J
~a 0 be given. There exist compact sets K 1 ~ B, K 2 ~ BC such that J.l1 (B\K 1) < e and J.l2(B C\K2) < e. A simple compactness argument shows the existence of two open sets G l' G 2 ~ X separating K 1 and K 2 , i.e. K 1 ~ G 1, K 2 ~ G2 and G 1 n G 2 = 0. Then
K 1 ~ G1 ~ G1 ~ G~ ~ K~ and hence G ~
J.l1(B) This holds for all
G
J.l1(K 1) ~ J.l2(K~) ~ J.l2(B)
> 0 so that J.l1 (B)
~
+ e. D
J.l2(B).
In §1 it was shown that given two Radon measures J.l, v on spaces X and ~ there is a unique product Radon measure J.l ® v on X x Ycharacterized by
J.l ® v(C x D) = J.l(C)v(D)
for all compact
C
~
X,
D
~
Y,
and giving for all measurable rectangles the" right" value. In particular the product of two finite Radon measures is finite again (as it should be), and it is only natural to guess that J.l ® v depends continuously (in the weak topology) on both of its arguments.
3.3. Theorem. Let X and Y be two Hausdorff spaces. Then the mapping (J.l, v) ~ J.l ® v from M~(X) x M~(Y) to M~(X x Y) is weakly continuous. PROOF.
In a first step we show that the mapping
X x
M~(Y) ~ M~(X x
Y)
(x, v)~Gx ® v
is continuous. Assume that XiX ~ x and Vix ~ v. Let G 1 , ••• , Gn ~ X and H l' ... , H n ~ Y be open and put U := 1 (G i x Hi). We show first that
u?=
lim inf e xex ® vix(U) ~ This holds trivially if x ~
u?=
1
® v(U).
ex
G i • Suppose now that
1:= {i ~ nix Then there exists some ~o such that we get
E
GJ
+0.
XiX E niEI
G i for all ~ ~ ~o and for those
~
hence lim inf Gxex ® vix(U)
~ lim inf vix(.U Hi) lEI
~ v(.U Hi) lEI
=
ex
® v(U).
48
2. Radon Measures and Integral Representations
Every open set U ~ X x Y has the form U = U;'EA (G;. x H;.) for suitable open sets G;. ~ X and H;. ~ Y. By Theorem 1.5 we can find, given G > 0, finitely many At, ... , An E A such that
ex ®
V(Vl(G A, x RAJ) > ex ® v(U) -
e.
This implies lim inf ex « Q9 v,,(U)
~ lim inf ex
«
Q9
V{Vl (G A, x RAJ)
~ ex Q9 V(Vl (G A, x RAJ) > ex Q9 v(U) -
e.
Hence lim inf Gx ", (8) v(X(U) ~ Gx ® v(U) and 3.1 implies Gx ", ® V(X ~ Gx ® v. The second step will now be an easy consequence of the first one. Let f: X x Y ~ [0, 00] be lower semicontinuous. From the continuity of (x, v) ........ Gx ® v we get that
X x
M~(Y) ~
(x, v) 1-+
[0, 00]
r f(x, y) dv(y) = sup Jyr (n
Jy
1\
f(x, y» dv(y)
n
is also lower semicontinuous, and using the fact that M~(Y) is again a Hausdorff space as well as the Fubini theorem for lower semicontinuous functions (1.12), we may repeat this argument and conclude that M~(X) x M~(Y) ~
(Ji, v) ........
[0, 00],
fxJr
f(x, y) dv(y) dJi(x) =
y
f
f d(Ji (8) v)
Xxy
is lower semicontinuous, too. If now f: X x Y ~ IR is lower semicontinuous and bounded, then f + c ~ for some C E IR, and
°
(j1, v) 1-+
f f d(j1 Q9 v) = f (f + c) d(j1 Q9 v) -
Cj1(X)v(Y)
is again lower semicontinuous, the second term on the right being a continuous function of Ji and v. This finishes our proof. D
3.4. Corollary. If S is a Hausdorff topological semigroup, then so is with respect to convolution.
M~(S)
PROOF. The mapping (Ji, v) ........ Ji * v is continuous as composition of (Ji, v) ........ Ji (8) v and the mapping : M~(S x S) ~ M~(S) defined by (K):= K+, where + denotes the semigroup operation and K+ is the image of K under + ~ cf. 1.15~ 1.16 and Exercise 3.7 below. D
49
§3. Weak Convergence of Finite Radon Measures
It is a general principle in mathematics to approximate complicated functions (or other objects) by simpler ones. Among the Radon measures the so-called molecular measures (i.e. finite positive linear combinations of one-point (or "atomic ") measures) are considered to be "simple" objects. They are dense in the set of all finite Radon measures even in a stronger sense than with respect to the weak topology.
3.5. Proposition. For every Hausdorffspace X the set of molecular measures is a dense subset of M~ (X) with respect to the pointwise convergence on 86(X). PROOF. As the directed set we choose the family A of all finite Borel partitions of X, i.e. the set of all ri = {B 1 , ••• , B n } ~ 86(X) such that B i =F 0, B i n Bj = 0 for i =F j and U7= 1 B i = X, ordered by refinement. For such ri we put, given any f.1 E M~(X), f.1rJ.:= 2:7=1 f.1(BJe xi , where Xi E B i is chosen arbitrarily. Now let B E 86(X)\ {0, X} be given; then for all ri E A finer than rio = {B, Be} we have f.1rJ.(B) = f.1(B); and f.1rJ.(B) = f.1(B) for all ri E A when BE {0, X}. D 3.6. Exercise. Let (f.1rJ.) and f.1 be finite Radon measures on the Hausdorff space X and let them all be concentrated on the Borel subset Y ~ X. Then f.1rJ. -+ f.1 weakly in M~ (X) if and only if f.1rJ.1 Y -+ f.11 Y weakly in M~ (Y). 3.7. Exercise. Let X and Y be two Hausdorff spaces and letf: X -+ Y be a continuous mapping. Then for any f.1 E M~(X) the image measure f.1 f belongs to M~(Y), cf. Proposition 1.15. Show that the transformation f.1 ~ f.1 f from M~(X) to M~(Y) is continuous. 3.8. Exercise. Let X be a Hausdorff space and M~(X) the set of Radon probability measures on X, i.e. M~(X) = {f.1 E M~(X)If.1(X) = I}. Let E := {ex IX E X} be the set of all one-point measures. Show that every {O, 1}valued measure f.1 E M~(X) already belongs to E. Show further that E is a weakly closed subset of M~(X) homeomorphic to X and that E = ex(M~(X». (For the notion of an extreme point see 5.1.) 3.9. Exercise. Let f.1 be a Radon measure on the Hausdorff space X. For K E %(X) define f.1K(B):= f.1(B n K) and f.1K(B):= f.1(B n K)/f.1(K) (if f.1(K) > 0). Show that the net (f.1K) converges to f.1 pointwise on 86(X) and that
li~
f
g d,uK =
f
g d,u
for every f.1-integrable function g: X -+ C. Show that if f.1(X) < (f.1K) (resp. (f.1K» converges weakly to f.1 (resp. f.1/f.1(X».
00
then
3.10. Exercise. Let (prJ.) denote a net of probability Radon measures on the product X x Y of two Hausdorff spaces, and denote by JlrJ. (resp. vrJ.) the
50
2. Radon Measures and Integral Representations
marginal distribution of Pa. on X (resp. Y) (i.e. the image measures under the two canonical projections). If (/la.) converges to /l and ( va.) converges to some one-point measure ey, then (Pa.) tends to /l (8) eye
3.11. Exercise. Let (S, +, 0) denote a Hausdorff topological abelian semigroup with neutral element 0 (cf. 1.16) and let Ji E M~(S). Then 00
Ji*n
n=O
n!
L- =
lim (eo
+ Ji/n)*n
n-+oo
holds in the sense that both limits exist and agree for all Borel subsets of S. Their common value is often called the exponential of Ji and abbreviated exp(Ji). In particular both limits exist and agree with respect to the weak topology.
3.12. Exercise. Let Sand T be two Hausdorff topological semigroups and let h: S ~ T be a continuous homomorphism. Then for Ji, v E M~(S) we have
§4. Vague Convergence of Radon Measures on Locally Compact Spaces In this section X denotes a locally compact Hausdorff space. The vector space CC(X) of continuous functions j: X ~ IR with compact support and the vector space M(X) of signed Radon measures on X form a dual pair under the bilinear form Ji
E
M(X),
4.1. Definition. The vague topology on M(X) is the weak topology a(M(X), CC(X», i.e. the coarsest topology in which the mappings /l ~ (Ji,j> are continuous, when j ranges over CC(X), cf. 1.3.10. In particular the vague topology is a Hausdorff topology. We first remark that for any lower semicontinuous functionj: X the function J11---+
~
[0,00]
f f dJ1
is lower semicontinuous on M + (X) with the vague topology. In fact, from Urysohn's lemma it follows that any suchjis the supremum of the upward filtering family of functions qJ E CC(X) satisfying 0 ~ qJ ~ j,
§4. Vague Convergence of Radon Measures on Locally Compact Spaces
51
so by Theorem 1.5
f f dll sup{f cp dill cp =
E
CC(X), 0
~ cp ~ f}-
A combination of this remark and Theorem 3.1 immediately gives the following relationship between the weak topology on M~(X) and the restriction of the vague topology to M~ (X):
4.2. Proposition. Let (/l(l)(l E A be a net on M ~ (X) and let /l E M ~ (X). Then (/l(l) converges weakly to /l if and only if (/l(l) converges vaguely to /l and lim /l(l(X) = /leX). 4.3. Corollary. The vague topology and the weak topology coincide on the set of Radon probability measures.
M~(X)
Under some extra assumptions on a net (/l(l)(lEA in M +(X) the vague convergence implies the convergence of f d/l(l)(lEA for certain functions f E C(X)\CC(X). As an important example we have
(J
4.4. Proposition. Let (/l(l)(lEA be a net on /l E M~(X). If C :=
then /leX) ~ c, andfor allf
E
M~ (X)
sup /l(l(X)
0 there exists
qJ E
CC(X) such that
qJ II 00 ~ E, and therefore we find
and now it is easy to see that (1) holds.
D
The following result characterizes the relatively compact subsets M ~ M +(X) in the vague topology, i.e. the subsets M for which the vague closure M is vaguely compact.
52
2. Radon Measures and Integral Representations
~
4.5. Theorem. A subset M only if
M +(X) is relatively vaguely compact if and
sup {III EM}
O. The set {II
E M~(X)IIl(X) ~
c}
is vaguely compact. PROOF. The set in question is closed by Proposition 4.4 and relatively compact by Theorem 4.5. D
4.7. Corollary. Suppose X is a compact space. Then weak (or vague) topology.
M~(X)
is compact in the
PROOF. By Proposition 4.6 we have that M~(X) is relatively compact in the vague topology, but also closed since 1 E CC(X). The proof is finished by the observation in Corollary 4.3. D The following result was established by Choquet (1962) in his treatment of the moment problem.
4.8. Proposition. Let V be an adapted space of continuous .functions on a locally compact space X and let L: V -+ IR be a positive linear functional. The set of representing measures, i.e. the set for all
f
E
V}
is convex and compact in the vague topology. PROOF. It is clear that C is convex. For qJ E CC+(X) there existsf that qJ ~ f (cf. the proof of Theorem 2.7), hence
;£
ff
dll = LU)
for
11
E
C,
E
V+ such
§4. Vague Convergence of Radon Measures on Locally Compact Spaces
53
so C is relatively compact by Theorem 4.5. Let (1l(l)(lEA be a net from C converging vaguely to 11 E M + (X) and let.f E V+ . By the remark following 4.1 we get
If
I
~ lim inf f
dll
dll a
= L(f).
We choose g E V+ such thatf E o(g), and let e > 0 be given. There exists a compact set K ~ X such thatf(x) ~ eg(x) for x E X\K, and if qJ E C~(X) is such that 1K ~ qJ ~ 1 we havef(1 - qJ) ~ ego From
I
f d(1l - Ila) =
I
f(1 - cp) d(1l - Ila)
+
I
fcp d(1l - Ila)
we therefore get
and sincefqJ
E
CC(X) the last term tends to zero, so we have
IIf Since
f
E
f,
dll - L(f)1
~ 2t:L(g).
> 0 is arbitrary we get Jf dll = L(f) for all.f E V+ and then for all 'I
D
V, thus proving that 11 E C.
Let 11 be a signed Radon measure on X and let G ~ X be an open subset. Then G is itself locally compact and if.f E CC(G) is extended to X by f(x) = {f(X),
0,
X E
G,
x E X\G,
then .f E CC(X). The mapping f 1--+ Jf dll is a continuous linear functional on CC(G), hence by Theorem 2.5 represented by a signed Radon measure on G, denoted III G, and called the restriction of 11 to G. In case of a (nonnegative) Radon measure 11 on X the restriction III G is ofcourse the usual set-theoretical restriction, introduced already after Remark 1.6. Vague convergence of Radon measures is a local concept as the following result shows: 4.9. Theorem. Let (G(l)(lED be an open covering of a locally compact space X. A net (lli)iEI of signed Radon measures on X converges vaguely to 11 E M(X) if and only ~f (Ili IGCX)iE I converges vaguely to (Ill G(l) for each tYw E D. PROOF. The" only if" part is obvious, so suppose that (Ili IG(l)i E I converges to (Ill G(l) for each tYw E D. Letf E CC(X) have the compact support C, and choose tYw 1 , . . . , tYw n ED such that C ~ G(ll U G(l2 U ... u G(ln. By Lemma 1.17 there exist compact sets C k ~ GCXk k = 1, ... , n such that C = C 1 U ... U Cn. By Urysohn's lemma there exist functions qJk E CC(X) such that 1Ck ~ qJk ~ 1Gock
54
2. Radon Measures and Integral Representations
and SUPP(qJk) ~ G(lk' k
= 1, ... , n. The functions f(x)
!k(x)
nqJk(X)
,
XE
C,
X E
X\C,
L qJj(x)
:=
j= 1
0,
belong to CC(X), sUPP(h) ~ G(lk andf
= Lk= 1 fk' so we have for
and the assertion follows.
i E I,
D
The following result will be used occasionally. For the proof, see e.g. Bauer (1978, p. 233).
4.10. Proposition. Suppose that the locally compact space X has a countable base for the topology. Then the vague topology on M + (X) is metrizable. 4.11. Exercise. Let X be locally compact. For a net (Ji(l) on M +(X) and Ji E M + (X) the following conditions are equivalent: (i) Ji(l ~ Ji vaguely; (ii) lim sup Ji(l(K) ~ Ji(K) for each compact K ~ X and lim inf Ji(l(G) ~ Ji(G) for all relatively compact open sets G ~ X; (iii) lim Ji(l(B) = Ji(B) for all relatively compact Borel sets B ~ X such that Ji(oB) = O.
4.12. Exercise. Show that the set of all Radon measures on a locally compact space taking only values in No = {O, 1, 2, ... , oo} is vaguely closed. 4.13. Exercise. Let the Radon measures Ji(l tend vaguely to Ji and assume that all the Ji(l are concentrated on the closed subset Y ~ X. Then Ji is concentrated on Y, too, and Ji(l tends vaguely to Ji also on the locally compact space Y. 4.14. Exercise. Show that (Ji, v) 1---+ Ji ® v is vaguely continuous as a mapping from M +(X) x M +(Y) to M +(X x Y) for two locally compact spaces X and Y. Hint: Use the Stone-Weierstrass theorem. 4.15. Exercise. Property (ii) in Exercise 4.11 above makes sense on any Hausdorff space X and hence can be used to define vague convergence of Radon measures in this generality. Show however that if M +(X) with respect to vague convergence is a Hausdorff space, then X is necessarily locally compact. Hint: If X o E X has no relatively compact neighbourhood, then for a certain net (X(l) in X we have X(l ~ XO , GXat ~ GXQ and GXat ~ o.
§5. Introduction to the Theory of Integral Representations
55
§5. Introduction to the Theory of Integral Representations Throughout this section E denotes a locally convex Hausdorff topological vector space over IR. The theory below can be applied to complex vector spaces by restricting the multiplication with scalars to real scalars. The basic observation, which should be kept in mind when reading the following general theorems, is that a convex polyhedron is the convex hull of its corners, or equivalently, that any point in the convex polyhedron is the centre of gravity of a molecular probability measure on the corners. The basic notion is that of an extreme point of a set, which is a generalization of a corner of a convex polyhedron and defined below. We shall limit ourselves here to those parts of the theory of integral representations which will be needed in the sequel. A detailed exposition can be found in Alfsen (1971) or Phelps (1966).
5.1. Definition. Let A ~ B ~ E be subsets of E. Then A is called an extreme subset of B if for all x, y E B and A E ]0, 1[: AX
+ (1
- A)Y
E
A ~ x, YEA.
A point a E A is called an extreme point of A if {a} is an extreme subset of A. It is easy to see that if A is convex then a only if for all x, yEA: a = !(X
+ y) ~ x
E
A is an extreme point if and
= y = a.
The set of extreme points of A is denoted ex(A) and in some literature called the extreme boundary of A. Notice that" extreme subset of" is a transitive relation in the set of subsets of E. The above definitions of course make sense for an arbitrary real vector space without topology. Let Xl' .•• , X n be points in E and let ,1.1' ••. , An be numbers E [0, 1] with with 1 Ai = 1. The corresponding convex combination of Xl' ..• , X n is the point
L?=
n
b
=
L Ai x i·
i= 1
If J.1 denotes the molecular measure on E with mass Ai at Xi' i.e. n
J.1
=
L AiGxi'
i= 1
56
2. Radon Measures and Integral Representations
then the point b is called the barycentre of JL For any (continuous) linear functional.f on E we have
f(b) =
Jl
AJ(X i ) =
f
f dJ1,
and this is the motivation for the following:
5.2. Definition. Let X be a compact subset of E and let J1 E M~(X) be a Radon probability measure on X. A point bEE is called the barycentre of J1 if and only if
f(b) =
Ix f dJ1
for all j'
E
E'.
Remark. There exists at most one barycentre of J1 E M~(X). In fact, if b l , b 2 are barycentres of J1 we have .f(b l ) = j'(b 2 ) for all.f E E', and since E' separates the points of E we find b l = b2 . As well as barycentre one also encounters the words centre o.f gravity and resultant. The barycentre is "the value of" the vector integral
Ix x dJ1(x), but such a vector integral need not always converge in E, so a barycentre need not exist. We have, however, the following result:
5.3. Proposition. Let X be a compact subset of E such that K = conv(X) is compact. Then.for every J1 E M~(X) the barycentre exists and belongs to K. Conversely, every point x E K is barycentre of some J1 E M ~ (X). PROOF.
For fEE' and J1
E M~(X)
HI =
{x
E
we put
Elf(x) =
ff
dJ1}
The intersection of the H f's for fEE' is the set of barycentres of J1, hence either empty or a singleton. We shall show
nH
feE'
f
n K =t=
0·
The set H f n K is a closed subset of K, so by a result from general topology it suffices to prove that (1)
for an arbitrary finite subset {ft, ... ,fn} ~ E'. For such a subset we define a continuous linear mapping T: E ~ lR n by
T(x) = Cft(x), ... ,fn(x»,
~5.
57
Introduction to the Theory of Integral Representations
and claim that p:=
(5
fl djl, ... ,
5
fn djl)
E
T(K),
which shows (1). If the contrary is true, there exists by the separation theorem 1.2.3 a linear form qJ: IR n ~ IR such that sup qJ(T(K)) < qJ(p).
(2)
The linear form qJ is given as qJ(x) = for some a and defining g = = 1 a i /; E E', (2) can be expressed
I?
= (at, . .. ,an) E ~n,
sup g(K) < { g djl, which is impossible, J1 being a probability. We next have to prove that any x E conv(X) is the barycentre of some J1 E M~(X). This is clear if x E conv(X), in fact such a point is the barycentre of a molecular measure as remarked earlier. For x E conv(X) there exist nets (x~)~ E A of points from con v(X) con verging to x and (J1~)~ E A of molecular measures from M~(X) such that x~ is the barycentre of J1~ for each ~ E A. By Corollary 4.7 there exist J1 E M~(X) and a subnet (J1~p) converging weakly to J1. For fEE' we then have
5
f djl
= li~
5
f djlap
= li~
f(x ap )
which shows that x is the barycentre of J1.
= f(x), D
5.4. Remark. If E is complete then conv(X) is compact for every compact X ~ E, so every J1 E M~(X) has a barycentre in this case. This applies in particular to Frechet spaces and Banach spaces. For details, see, e.g. Robertson and Robertson (1964). Already Minkowski proved that a compact convex set K in ~n is the convex hull of ex(K). In 1940 Krein and Milman found a far-reaching generalization of Minkowski's result:
5.5. Theorem. Every compact convex set K in E is the closed convex hull of its extreme points, i.e. K
= conv(ex(K)).
We first show that any nonempty compact set C has extreme points. We form the family ~ of nonempty, closed extreme subsets of C. Notice that C E~. A Zorn's lemma argument shows that ~ contains a minimal element M with respect to inclusion. To see that M has only one point, which is then an extreme point of C, we assume the existence of x, y E M, PROOF.
58
x
2. Radon Measures and Integral Representations
+y, and choosef E E' such thatf(x) > f(y). Then M o = {zEMlf(z) = supf(M)}
is easily seen to be an extreme subset of M, hence MoE ~ Since M 0 is a proper subset of M we are led to a contradiction. Clearly conv(ex(K» is a compact convex subset of K. If they are not equat there exists by the separation theorem 1.2.3 an fEE' such that
supf(K) > supf(conv(ex(K»).
(3)
The set
M = {x
E
Klf(x) = supf(K)}
is a nonempty compact subset of K, and by the first part of the proof ex(M) 0. Since M is an extreme subset of K we have ex(M) ~ ex(K), but this is impossible due to (3). D
+
For every subset A ~ E we have conv(A) = conv(A). An equivalent formulation of the Krein-Milman theorem is therefore that K = conv(ex(K». Using Proposition 5.3 we can reformulate the theorem in the following way:
5.6. Theorem. Let K be a compact convex set in E. Every x centre of a measure Jl E M~(ex(K».
E
K is the bary-
A natural and important question in connection with the above theorem is whether Jl can be chosen such that Jl is concentrated on the set of extreme points, i.e. such that Jl(K\ex(K» = O. The answer is yes if K is metrizable, and this is the content of Choquet's theorem. Tbe answer is no in general if K is nonmetrizable, simply because ex(K) can be a nonmeasurable subset of K in this case. There is however a satisfactory solution to the question in the nonmetrizable case also, related to the notio.n of a boundary measure. For a compact convex subset K of E we define a partial ordering t.
Show that!r, f - !r E K and deduce that the extreme points of K are the following functions qJo == 0, qJoo == 1, qJt(x) = (1 - x/t) + , 0 < t < 00. Show finally that K is a Bauer simplex. (This way of finding the extreme points of K is taken from Johansen (1967).) 5.19. Exercise. Show that the Riesz representation theorem on a compact Hausdorff space is a special case of the Krein-Milman theorem. 5.20. Exercise. (Douglas 1964). Let V be an adapted space of continuous functions on a locally compact space X and let C be the convex set of representing measures for a positive linear functional L: V -+- IR. Show that J1 E C is an extreme point of C if and only if V is dense in 2 1(X, J1). 5.21. Exercise. Let K and L be compact convex subsets of locally convex spaces~ and letf: K -+- Lbe continuous~ affine and onto. Show that ex(L) ~ f(ex(K))~ and use this result to determine the set of extreme points of
L:= {( ft df.1(t), ft df.1(t),···, ft" df.1(t)) If.1 E 2
M~([O, I])}.
Notes and Remarks We have not assumed a Radon measure to be locally finite. The reason for this is simply that we do not need this condition to derive any of the main results and, on the other hand, on many spaces a Radon measure is automatically locally finite. Certainly this is the case for locally compact spaces
62
2, Radon Measures and Integral Representations
but it holds, for example, also for metric spaces: Let fl be a Radon measure on the metric space X and suppose there is some x E X such that fl(G) = 00 for each open set G containing x, then in particular fl(B n ) = 00 if B n is the open ball of radius lin around x, and hence fl(K n ) ~ n for a suitable compact set K n ~ B n , where without restriction x E K n • Now the point is that K:= U:= 1 K n is again compact, because if K ~ UAEA GA for a family of open sets (G A)' then for some Ao we have x E GAo' and then for some no x E B no ~ GAo implying U:=no K n ~ GAO' The fact that fl(K) = 00 shows that our assumption was wrong. Exercise 1.30 shows that non locally finite Radon measures may occur, and this depends on the fact that each compact subset is finite. As another example where this is the case we mention the fine topology of potential theory, for instance on [R3, cf. Helms (1969). The fine topology is by definition the coarsest topology on 1R 3 in which all superharmonic functions are continuous. The fine topology is completely regular (Brelot 1971, p. 5), and the fact that every finely compact set is finite is proved in Helms (1969,
p.208). Next we want to relate our approach to Radon measures with the "classical" one as developed, for example, in Bourbaki (1965-1969). There, as already mentioned in the introduction, the "functional point of view" is prevalent, a (Radon) measure fl being by definition a positive linear form on CC(X), if X is locally compact. Two set functions are then considered, fa mesure exterieure fl* defined by fl*(G) = sup{l.f
E
CC(X), 0 ~
f
~
IG}
for open
G
~
X
and /l*(A)
= inf{fl*(G)IA
~
G, G open}
for
A ~ X,
and fl' derived from l'integrale superieure essentielle and given by /l'(A)
=
SUP{fl*(A n K) IKE %(X)}
for
A ~ X.
Both fl* and fl' are Borel measures, i.e. a-additive when restricted to 8l(X), and fl' is a Radon measure in our sense whereas /l* is not so in general. One has fl' ~ fl* and they agree on open sets and on Borel sets B with /l*(B) < 00, and in particular on compact sets. It follows that for locally compact and a-compact spaces one has /l'(B) = /l*(B) for all BE 8l(X), so for these spaces Bourbaki's notion of a Radon measure is equivalent to ours. This holds in particular for compact spaces. Bourbaki (1965-1969, Ch. IV, §L Ex. 5) gives an example where fl'(r') = 0 and fl*(r') = 00 for a certain closed subset r' in some locally compact space, and this shows that fl* is not a Radon measure in our sense. Bourbaki defines a (Radon) premeasure on a Hausdorff space X as a mapping W which to every compact subset K ~ X associates a Radon measure WK on K such that WK IL = WL if L is a compact subset of K. Then the following set function W' is considered W'(A) = sup{(WK)'(A n K)IK
E
%(X)}
for
A
~
X.
63
Notes and Remarks
The restriction of W· to f4(X) is a Radon measure in our sense. Bourbaki calls W a (Radon) measure if W· is in addition locally finite. It follows that Bourbaki's notion of (Radon) measures is equivalent with locally finite Radon measures in our sense. Let X be a Hausdorff space and consider Borel measures v: f4(X) --+ [0, 00 ] satisfying (a) (b) (c)
v(K) < 00 for K E f(X); v(G) = sup{v(K)IK s; G,KEf(X)} foropen G s; X; v(B) = inf{ v(G) I B s; G, G open} for B E fJ6(X).
There is a one-to-one correspondence between locally finite Radon measures J.l on X and Borel measures v satisfying (a), (b) and (c). In fact, if J.l is a locally finite Radon measure then J.l*(B) = inf{J.l(G)IB s; G, G open}
for
B
E
fJ6(X)
is a Borel measure satisfying (a), (b) and (c), and if v has these properties then v·(B)
= sup{ v(K) I K
s; B, K E f(X)}
for
B
E
fJ6( X)
is a locally finite Radon measure. Furthermore (J.l*r = J.l and (v·)* = v. For the proof of these assertions see Schwartz (1973), where a third equivalent definition of a locally finite Radon measure is given, namely as a pair (m, M) of Borel measures satisfying certain conditions realized by (J.l, J.l*) and (v·, v) in the above notation. Notice that a Borel measure v satisfying (b) and (c) is locally finite if and only if (a) holds. In the locally compact case the exterior measure J.l* satisfies (a), (b) and (c) and (J.l*r = J.l., (J.l.)* = J.l*. The generalized monotone convergence theorem (1.5) involves only the values of the underlying Radon measure J.l on open sets, so it holds for any Borel measure v which agrees with J.l on open sets. In particular, we have
°
for each lower semicontinuous function f ~ on X. For a continuous real-valued functionj'integrability with respect to J.l and v are equivalent and Sf dJ.l = Sf dv in case of integrability. Bourbaki (Ch. IX, §3) also considers the possibility of "extending" a set function A: f(X) --+ [0, oo[ to a Radon measure, however the crucial property (1) of our §1, the defining property of a Radon content, which goes back to Kisynski (1968), is not discussed there. Theoreme 1 of §3 in Bourbaki should be compared with our Lemma 1.3. Theorem 1.4 is due to Kisynski. Replacing the Hausdorff space X by an abstract set and the family f(X) of compact subsets of X by a suitable set system called "compact paving", Tops¢>e (1978) proved an abstract measure extension theorem which not only contains Theorem 1.4 but also, for example, Caratheodory's classical result.
64
2. Radon Measures and Integral Representations
It should be mentioned that on many "nice" spaces a finite Borel measure is automatically a Radon measure. This holds in particular on Polish spaces, i.e. separable and completely metrizable spaces, cf. for example, Bauer (1978, Satz 41.3), but it can even be shown for so-called analytic spaces, i.e. Hausdorff spaces which are the continuous image of some Polish space; see Dellacherie and Meyer (1978, Chap. III) for a proof. The last-mentioned book also contains a proof of the bimeasure theorem for finite Radon bimeasures on separable metric spaces. A slightly more general version may be found in Morando (1969), but both results are in fact a special case of a theorem of Marczewski and Ryll-Nardzewski (1953) about nondirect products of measures. The Riesz representation theorem is certainly a cornerstone of functional analysis. Our proof is based on Pollard and Tops¢e (1975) and we refer to the references given there, in particular to Batt (1973) for further information on this important topic. The theory of adapted spaces has had important applications in potential theory, see Sibony (1967-1968). The theory of weak convergence of finite (or probability) Radon measures is mainly motivated by its applications in probability theory and mathematical statistics. For a thorough treatment on metric spaces we refer to Billingsley (1968) and Parthasarathy (1967). Later on Tops¢>e (1970) discovered that a satisfactory theory of weak convergence can be developed on arbitrary Hausdorff spaces. For a completely regular space X the weak topology on M~(X) is induced by the weak topology a(Mb(X), Cb(X)). However, for Hausdorff spaces in general it is not possible to extend the weak topology from M~(X) to Mb(X) in such a way that Mb(X) is a Hausdorff topological vector space. In fact, if such an extension was possible then Mb(X) would be completely regular and so would {£xlx EX}, which is homeomorphic to X by Exercise 3.8. A particularly important topic for probabilistic applications is the characterization of relatively compact subsets of M~(X) in the weak topology, and we should mention the striking result due to Prohorov: For Polish spaces X a subset M ~ M~(X) is weakly relatively compact if and only if for each £ > 0 there is a compact set K ~ X such that SUPjiEM J1(X\K) < £, a condition on M called uniform tightness, see Billingsley (1968, Theorems 6.1 and 6.2). Theorem 3.3 may be found in Ressel (1977); Exercise 3.10 is a generalization of Slutsky's theorem, cf. Ressel (1982b). For a locally compact space X the space CC(X) is often equipped with the inductive limit topology of the Banach spaces CK(X), K E f(X), appearing before Theorem 2.5. With this topology, which is equal to the topology gi ven in Exercise 2.11 by the family 14 , CC( X) is a barrelled space, and the topological dual space is M(X) with the vague topology. This approach is the starting point in Bourbaki (1965-1969), who also seems to be the first who has systematically studied the vague topology. Theorem 4.5 is a special case of the Alaoglu- Bourbaki theorem, cf. Exercise 1.3.11. In the special case of X = IR Theorem 4.5 is sometimes called Helly's selection theorem. In
Notes and Remarks
65
fact it follows from 4.5 and 4.10 that any sequence of measures in M~(IR) with bounded total mass has a vaguely convergent subsequence. The result in Exercise 4.15 is due to TopscPe. The importance of the theory of integral representations lies undoubtedly in the fact that it gives a unified approach to a great number of classical formulas and theorems, cf. Phelps (1966). Let usjust mention here Herglotz' formula for nonnegative harmonic functions in a ball, the far more general Martin representation, and the theorems of Bernstein and Bochner. In Chapter 4 we shall use the theory to prove integral representation theorems for positive definite functions on abelian semigroups. The idea of considering a point in a metrizable compact convex set K as the barycentre of a probability measure concentrated on ex(K) is due to Choquet, and the whole theory is often called Choquet theory. In our applications of the theory we use only the special case where ex(K) is closed, in which case the representation theorem is equivalent with the Krein-Milman theorem. Therefore we have given a complete proof of the latter and only indicated the general results, which can be found in many books, for example, Alfsen (1971) and Phelps (1966).
CHAPTER 3
General Results on Positive and Negative Definite Matrices and Kernels
§1. Definitions and Some Simple Properties of
Positive and Negative Definite Kernels When dealing with positive and negative definite kernels a certain amount of confusion often arises concerning terminology. A positive definite kernel defined on a finite set is usually called a positive semidefinite matrix. Sometimes it is only called "positive", which may be misleading. When working on groups, the name positive definite function is used traditionally. In our previous papers on abelian semigroups we also followed this tradition. Instead of calling a kernel t/J negative definite, some authors call the kernel - t/J "conditionally positive definite" or "almost positive." In this book we use mainly the larger class of "semidefinite" kernels of all kinds and therefore prefer to avoid the prefix" semi" which otherwise would appear several hundred times. Adapting the above point of view, an n x n matrix A = (ajk) of complex numbers is called positive definite if and only if n
L
cjcka jk
~ 0
j,k= 1
for all {c 1 , ••• , Cn} ~ c. It is well known that this is the case if and only if A is hermitian (i.e. ajk = akj for j, k = 1, ... , n) and the eigenvalues of A are all ~ o. Similarly A is called negative definite if and only if A is hermitian and n
L
j,k= 1
cjckajk
~0
§I. Definitions and Some Simple Properties of Positive and Negative Definite Kernels
67
for all {c l , ... , Cn} ~ C with the extra condition Lj= I Cj = O. (This definition requires n ~ 2. Any 1 x 1 matrix A = (all) with real all is called negative definite.)
1.1. Definition. Let X be a nonempty set. A function qJ: X x X called a positive definite kernel if and only if
--+
C is
n
L
j,k= I
CjCkqJ(Xj, Xk) ~ 0
for all n E N, {Xl' ... ' X n } ~ X and {C l , ... , Cn} ~ C. We call the function qJ a negative definite kernel if and only if it is hermitian (i.e. qJ(Y, x) = qJ(x, y) for all x, Y E X) and n
L
j,k= I
CjCkqJ(xj, Xk) ~ 0
for all n ~ 2, {Xl' ... , x n} ~ X and {c l , ... , cn} ~ C with Lj= I Cj = O. If the above inequalities are strict whenever X I' ... 'Xn are different and at least one of the C I' ... , Cn does not vanish, then the kernel qJ is called strictly positive (resp. strictly negative) definite.
1.2. Remark. In the above definitions it is enough to consider mutually different elements Xl' ... , Xn E X. In fact, if Xl' ... , Xn E X are arbitrary and x a1 ' • • • , x ap are the mutually different elements among the x/s, then n
L
p
j,k= I
cjCkqJ(x j , Xk) =
L
j,k= I
djdkqJ(x aj , x ak ),
where dk
:=
L
Ci ,
k = 1, ... ,p.
{iIXi=X czk }
Furthermore, if (1: X --+ X is a bijection, then qJ is a positive (resp. negative) definite kernel if and only if qJ «(1 x (1) is a positive (resp. negative) definite kernel. If X is a finite set, say X = {Xl' ... , x n }, then plainly qJ is positive (resp. negative) definite if and only if the n x n matrix 0
(qJ(Xj' Xk»l ~j,k~n is positive (resp. negative) definite. We now list some simple properties and examples of positive and negative definite kernels.
1.3. A kernel qJ on X x X is positive (resp. negative) definite if and only if for every finite subset X 0 ~ X the restriction of qJ to X 0 x X 0 is positive (resp. negative) definite.
68
3. General Results on Positive and Negative Definite Matrices and Kernels
1.4. If qJ is positive definite, then qJ(x, x) ~ 0 for all x on the diagonal L\:= {(x, x)1 x EX}. 1.5. Let (:
E
X, i.e. qJ is nonnegative
:) be a positive definite 2 x 2 matrix. Then
o~
(1,1)(: :)C) =
a
+b+c+d
= - 1m c. Further,
implying 1m b
o~
0,0(: :)( _~) =
implying Re b = Re c, i.e. b definite kernel is hermitian.
= c.
a - ib
+ ic + d
It follows immediately that any positive
qJ on X x X is positive (resp. negative) definite if and only if qJ is symmetric (i.e. qJ(x, y) = qJ(Y, x) for all x, Y E X) and
1.6. A real-valued kernel n
L
~ 0
CjCkqJ(X j , x k )
(resp.
~O)
j,k= I
for all n E N, {Xl' ... , X n} £ X and {c l , ... , en} £ IR (resp. addition). For, if Cj = Qj + ib j , Qj and b j being real, then n
L
LJ= 1 Cj
=
0 in
n
Cj0c qJ(Xj' x k )
=
j,k= I
L
(QjQ k
+ bjbk)qJ(x j , x k)
j,k= I n
+i L
(bjQ k -
Qjbk)qJ(X j , X k ),
j, k= I
and the last sum is zero if qJ is symmetric.
1.7. A 2 x 2 matrix (:
~) is negative definite if and only if a, d E IR, b = c
and
and this inequality is equivalent with a
+d
~
2 Re b.
Therefore, we have for any negative definite kernell/J the inequality
l/J(X, x)
+
l/J(y, y) ~ 2 Re l/J(x, y).
69
§l. Definitions and Some Simple Properties of Positive and Negative Definite Kernels
1.8. Let (:
(w,
~) be a hermitian 2 x
Z)(: ~)(~) = alwI
2
2 matrix. Then for z,
iC we have
+ 2 Re(bzw) + dlzl 2 2
b 1 a w + ~z
=
WE
2
Izl + -;;(ad -lbI 2 )
(for a =F 0).
I
The matrix is therefore positive definite if and only if a det(:
~) = ad -
Ibl 2
~
0, d
~
0 and
~ O.
Hence for any positive definite kernel qJ we have I
qJ(x, y) 12 ~ qJ(X, x) . qJ(Y, y).
1.9. Iff: X -+ C is an arbitrary function, then qJ(x, y) definite, because
j.~lCjCkqJ(Xj'
Xk)
=
IJl cJ
(X)12
:=
f(x)f(y) is positive
~ O.
The kernel ljJ(x, y) := f(x) + f(y) is negative definite, for if IJ= 1 Cj = 0, then even IJ,k= 1 cjCkljJ(x j , x k) = O. In particular, a constant kernel (x, Y)H C is positive definite if and only if c ~ 0 and negative definite if and only if C E IR. 1.10. The kernelljJ(x, y) = (x - y)2 on IR x IR is negative definite, + Cn = 0 implying (for real numbers Cj' see 1.6)
In j,k=l
CjCk(X j - Xk)2 = -2
(
In
CjX j
C1
+ ...
)2 ~ o.
j=l
1.11. If X is a nonempty set, then the family of all positive (resp. negative) definite kernels on X x X is a convex cone, closed in the topology of pointwise convergence. A very important property of positive definite kernels is their closure under pointwise multiplication which was proved by Schur (1911) (in the case of matrices): 1.12. Theorem. Let qJl' qJ2: X x X -+ C be positive definite kernels. Then qJ2: X x X -+ C is positive definite, too.
qJl .
PROOF. It suffices to prove that if A = (a jk ) and B = (b jk ) are positive definite n x n matrices, then C := (ajkbjk) is positive definite.
70
3. General Results on Positive and Negative Definite Matrices and Kernels
Now it is well known from linear algebra (and also follows from 3.1 below) that there are n functions .fI' ... ,.fn: {I, ... , n} -+ e such that n
ajk = Let
C l' ... , Cn E
L fp(j)fp(k),
for
j, k
p=1
= 1, ... , n.
e be arbitrary, then n
n n cjckajkbjk = L L cjfp(j)ckfp(k)bjk ~ j,k= 1 p= 1 j,k= 1
L
o.
D
1.13. Corollary. Let qJt: X x X -+ e and qJ2: Y x Y -+ e be positive definite kernels. Then their tensor product qJt ® qJ2: (X x Y) x (X x Y) -+ e defined by qJt ® qJ2(X t , Yt, x 2, Y2) = qJt(X t , x 2)· qJ2(Yt, Y2) is also positive definite. PROOF. qJt(X t ,
If (PI: (X x Y) x (X x Y) -+ e is defined by (PI(X t , Yt' X 2, Y2) = x 2) and analogously (P2(X t , Yt, X 2' Y2) = qJ2(Yt, Y2)' then qJt ® qJ2 =
D
(PI . (P2 and is therefore positive definite.
1.14. Corollary. Let qJ: X x X -+ e be positive definite such that IqJ(x, y) I < p for all (x, y) E X X X. Then if f(z) = L:=o anz n is holomorphic in {z E e II z I < p} and an ~ 0 for all n ~ 0, the composed kernel f qJ is again positive definite. In particular if qJ is positive definite, then so is exp( qJ). 0
PROOF. By Theorem 1.12 for each n E ~ the kernel qJn is positive definite, therefore L~= 0 an qJn is positive definite for all N E ~ and so is its pointwise limit f 0 qJ. D
1.15. Remark. In contrast to Theorem 1.12 above the ordinary matrix product of two positive definite matrices is positive definite if and only if the two matrices commute. This follows from the simultaneous diagonalization of these matrices. In particular the matrix exponential of any positive definite matrix again has this property. By using the Jordan decomposition one can show that the matrix exponential of every symmetric real matrix is positive definite (even strictly). The following remarkable criterion for strict positive definiteness is often useful.
1.16. Theorem. Let A strictly positive definite
=
(a jk ) be some hermitian n x n matrix. Then A is
if and only if
det«ajk)j,k~p)
> 0
for p = 1, ... , n. Suppose first A to be strictly positive definite. As in the proof of 1.12 we choose n vectors z t, ... , Zn E en such that
PROOF.
j, k
= 1, ... , n,
§l. Definitions and Some Simple Properties of Positive and Negative Definite Kernels
71
which of course can also be written as A = BB*, where B is the n x n matrix with rows Zl' ..• ' Zn. This implies det A = Idet BI 2 ~ 0 and certainly A cannot be singular so that in fact det A > o. Obviously the same reasoning can be applied to the submatrices (ajk)j,k~P for p = 1, ... , n. For the other direction, we proceed by induction on n. The case n = 1 being trivially true, let us suppose that the theorem holds for n - 1. By assumption all> 0 and we subtract a1k/a11 times the first column from the kth column, k = 2, ... , n. The new matrix (a}k)j,k~n (where the first column remained unchanged whereas for k ~ 2 we have a}k = ajk - (a 1k/a 11 )aj1) has the same principal minors as (ajk), i.e. det«ajk)j,k~p)
=
det«ajk)j,k~p),
p
= 1, ... , n
and if we now change the matrix (a}k) to B, where all
0
0
B=
then still det«bjk)j,k~p) = det«ajk)j,k~p) for all p, and B is furthermore hermitian. Now
for p = 2, 3, ... , n implying by assumption that the (n - 1) x (n - 1) matrix
n
+
L C 1c;a 1 k + k=2
2
I C1 1
all
72
3. General Results on Positive and Negative Definite Matrices and Kernels
If (c 2 , •.. , cn) =t= 0 then the first sum is >0, and for (c 2 , ... , cn) == 0 but CI 0 the second term is strictly positive. Hence A is a strictly positive definite matrix. D
*
One might expect that a corresponding result for positive definite matrices holds if the determinants in the above theorem are only supposed to be nonnegative. The simple counterexample given by the 2 x 2 matrix
(0o 0)
-1 destroys this hope. However, the following result seems to be rather satisfactory, at least theoretically.
1.17. Theorem. Let definite
qJ:
X x X
-+
C be a hermitian kernel. Then qJ is positive
if and only if det«qJ(x j , Xk»j,k~n) ~ 0
for all n
E ~
and all
{Xl' ... ,
~
xn}
X.
PROOF. If qJ is positive definite, then as in the beginning of the proof of the above theorem we see that all determinants in question are nonnegative. Let us on the other hand assume this condition. We define a slightly perturbed kernel qJe:== qJ + e· 1.1' where e > 0 and L\ is the diagonal in X x X. For mutually different elements Xl' ... , X n E X it is easily seen that n
det«qJe(x j , Xk»j,k~n) ==
Ld
p
P e ,
p=o
where dn == 1 and dp ==
L
det«qJ(x j ,
Xk»j,keA)
~ 0
AS{I, ... ,n}
IAI=n-p
for p == 0, 1, ... , n - 1. Therefore det«qJe(x j , Xk»j.k~n) ~ en > 0 implying that qJe is a strictly positive definite kernel. Hence the pointwise limit qJ == lim e _ o qJe is positive definite. D The special case n == 2 has already been derived in 1.8.
1.18. Exercise. If qJ is a positive definite kernel, then also Re qJ, iP and IqJ 12 are positive definite, but not necessarily IqJ I. If t/J is negative definite, then so are Re t/J and If.
1.19. Exercise. For
ZE
C define M z :==
(~ ~ Z
Z
:). Then M z is positive
1
definite if and only if I z I ;£ 1 and [3 - 2 Re(z)] 1 z 12 ;£ 1. For -1 ;£ z < the matrix M z is not positive definite.
-1
73
§2. Relations Between Positive and Negative Definite Kernels
1.20. Exercise. Let H be a complex (pre-) Hilbert space. Then its scalar product > is a positive definite kernel. The squared distance l/J(x, y) := Ilx - YII 2 is negative definite.
O.
o
0 we have !l'j1(tljJ)
=
roo exp( -tsljJ) dj1(s)
pointwise on X x X, which is positive definite, being a mixture of the positive definite kernels exp( - tst/J). If on the other hand !RJl(tt/J) is positive definite for all t > 0, then for each (x, Y) E X X X we get
f.oo
~ [1 _ !l'j1(tljJ(x, Y))] = 1 - exp[ -tsIjJ(x, y)] dj1(s) t o t -+
IjJ(x, y)
roo s dj1(s)
for
t
-+
0,
where we could apply Lebesgue's theorem because of
11 - exp[ -tsljJ(x, Y)]I ~ t
'IjJ(x, y)ls.
-
Being a pointwise limit of negative definite kernels, definite, too.
t/J
itself is negative
0
Choosing Jl = 8 1 in the above theorem, we get back Theorem 2.2 for C+-valued t/J, and the choice of Jl = e- t dt shows, as already mentioned, that t/J: X x X -+ C+ is negative definite if and only if (t + t/J)-l is positive definite for all t > O.
76
3. General Results on Positive and Negative Definite Matrices and Kernels
2.4. Remark. If a probability measure J1 on ~ + has infinite first moment and t/J: X x X -+ C + is negative definite, then 2 J1(tt/J) is still positive definite for all t > 0, but, in general, the converse does not hold. It follows from later results (cf. 2.10 and 4.4.5) that exp( -~) is the Laplace transform of some probability measure J1 on ~+. Now a matrix (a jk ) of the form ajk = (Sj + Sk)2, where S b ..• , Sn ~ 0, is not negative definite in general, C 1 + ... + Cn = 0 cjckajk = 2(I Cj Sj)2, but nevertheless for all t > 0 implying
L
2J1(ta jk )
= exp( -~) = exp( -JiSj) exp( -JiSk)
is positive definite by 1.9. qJ is positive definite and qJ I~ == C for some C E ~ + then obviously C - qJ is negative definite, bounded and vanishes on the diagonal ~. A similar statement in the other direction (which may be found in the literature) is not generally true; see, however, 4.3.15. For Xl = -1, x 2 = 0, X3 = + 1, the 3 x 3 matrix
2.5. Remark. If
is negative definite, vanishes on the diagonal, and is bounded by 4, but for no real number t the matrix (t - ajk) is positive definite, because t
det( t - a jk)
= t- 1 t-4
t -
1
t t-l
t -
4
t -
1 - -8
for all
t E ~.
Negative definite kernels are intimately related to so-called "infinitely divisible" positive definite kernels. 2.6. Definition. A positive definite kernel qJ is called infinitely divisible if for each n E N there exists a positive definite kernel qJn such that qJ = (qJn)n. If t/J is negative definite then qJ = e - '" is infinitely divisible since qJn = exp( -(I/n)t/J) is positive definite and (qJn)n = qJ. Furthermore, qJ has no zeros. Proposition 2.7 below shows, in particular, that every strictly positive infinitely divisible kernel has this form. Let qJ be infinitely divisible. Then
k, n
~
1,
so that the nonnegative kernel IqJ I is infinitely divisible inside the family of all nonnegative positive definite kernels, each (under this restriction uniquely determined) nth root IqJn I again being an infinitely divisible positive definite kernel. Let ~(X) denote the closure of all real-valued negative definite kernels on X x X in the space] - 00, oo]X x x.
~2.
Relations Between Positive and Negative Definite Kernels
77
qJ ~
0 on X x X thefollowing
2.7. Proposition. For a positive definite kernel conditions are equivalent: (i) qJ is infinitely divisible; (ii) -log qJ E ~(X); (iii) qJt is positive definite for all t > O. PROOF. "(i) => (ii)" Let
t/J := -log qJ, then
t/J = lim n[1 - exp( -t/J/n)J E ~(X). "(ii) => (iii)" Let (t/J ex) be a net of (finite) negative definite kernels converging pointwise to t/J = -log qJ. Then for any t > 0 we have exp{ - tt/Jex) -+ exp( - tt/J) = qJt, so that qJt is positive definite by Theorem 2.2. "(iii) => (i)" Take t = !, t, i, ... .
D
2.8. Remarks. (I) The above proof shows that ~(X) is in fact the monotone sequential closure of the subset of all negative definite kernels, bounded above. (2) For
Z E
iC, IZ I ~ 1 the 2 x 2 matrix
G;)
is an infinitely divisible
positive definite kernel with nonuniquely determined positive definite "roots ". We conclude this section by indicating a large class of functions which operate on negative definite kernels. For J1 E M +(]O, oo[) we define g: D(J1) -+ C by
1 00
g(z)
where D{J1) is the set of Z
E
=
(l - e- AZ) df1(A.),
C for which A 1---+ 1 - e-;'z is J1-integrable.
2.9. Proposition. Let t/J: X x X -+ C be a negative definite kernel and let J1 E M +(]O, oo[). Ift/J(X x X) ~ D(J1) then g t/J is negative definite. Furthermore, for X o E X the kernel (x, y) 1---+ g[t/J(x, x o) + t/J(y, xo)J - g[t/J(x, y) + t/J{x o , xo)J is positive definite provided (t/J(X x X) + t/J(x o , x o)) u (t/J(X, x o) + t/J{xo, X)) ~ D(J1). If So A{t + A)-l dJ1{A) < 00 and t/JI~ ~ 0 then g t/J is negative definite and g[t/J(x, x o) + t/J(y, x o)] - g[t/J(x, y)J is positive definite for all X o E X. 0
0
= 1 - e-;'z where A E ]0,00[, X and C t , •.. , Cn E C such that L ci = 0
PROOF. It suffices to prove the result for g(z)
i.e. J1 = we get
G;',
D{J1) = C. If Xl'
L ci Ck{1
i, k
... , X n E
- e-;'t/J(Xj,Xk») = -
L ciCke-;'t/J(Xj,Xk) ~ 0 i, k
as an immediate consequence of Theorem 2.2. For any X o E X the kernel t/J{x, x o) + t/J{y, xo) - t/J(x, y) - t/J(x o , x o) is positive definite by Lemma 2.1,
78
3. General Results on Positive and Negative Definite Matrices and Kernels
as is therefore the kernel exp(A[t/J(X, x o)
+ l/J(y, xo) - t/J(x, y) - t/J(x o , x o)]) -
1
by 1.14, and multiplying with the positive definite kernel exp( - At/J(X, x o)) exp( - At/J(y, x o)) gives exp( - A[t/J(x, y)
+ t/J(xo, xo)]) -
exp( - A[t/J(x, x o)
+ t/J(y, xo)])
= g[l/J(x, x o) + t/J(y, x o)] - g[l/J(x, y) + t/J(x o , x o)], which is positive definite by Theorem 1.12. If J 0 then (Xf3, max((X, 13) and ak are again absolute values.
90
4. Main Results on Positive and Negative Definite Functions on Semigroups
1.11. Definition. A function J: S -. C is called bounded with respect to an absolute value a (shortly: a-bounded) if there exists a constant C > 0 such that
IJ(s) I ~ Ca(s)
s E S,
for
and J is called exponentially bounded if there exists an absolute value with respect to whichJis bounded. The set of exponentially bounded functions is an algebra.
1.12. Proposition. Let qJ a. Then
E
&J(S) be bounded with respect to an absolute value
IqJ(s) I ~ qJ(e)a(s)
Jor
s E S.
PROOF. Without loss of generality we may assume that I
qJ(S) 12 ~ qJ(S*
0
qJ(e) = 1, so that
s). By iteration we get for n E N I
qJ(s) 12n ~ qJ«s* 0 S)2 n - 1),
and using IqJ(t) I ~ Ca(t) for some constant C > 0 we get 1
qJ(s) 12n ~ Ca«s* 0 s)2 n - 1) ~ Ca(s)2 n ,
hence
IqJ(S) I ~ a(s)
lim C 2 -n
= a(s).
D
1.13. Let qJ E &J(S) and let H 0 be the linear subspace of CS generated by the functions {qJs Is E S}, where qJs(t) = qJ(s* t). Then H 0 is equipped with a scalar product
0
O. Choose t; > 0 such that t;2qJ(O) < veL). By the Stone-Weierstrass theorem there exists a real-valued function E.9'1 such that ~ 1 on Land 0 ~ ~ t; on supp(Ji). Then 2 E d, 2 ~ 0 and
which is a contradiction. Finally, the set of restrictions to supp(Ji) of the functions in .9'1 is dense in C(supp(Ji)), so by the Riesz representation theorem J.1= v. D The uniqueness statement in Theorem 2.5 combined with Theorem 2.6 can be expressed in the terminology of the theory of integral representations. 2.7. Corollary. Let a be an absolute value on S. Then
£?J'~(S)
is a Bauer simplex.
96
4. Main Results on Positive and Negative Definite Functions on Semigroups
As a special case we consider the absolute value b(s) = 1, s E S. The cone f!jJb(S) is equal to the cone of bounded positive definite functions, and we define
S:= S*
n f!jJb(S) = {p
E
S*llp(s)1 ~ 1 for s E S},
which is a compact subsemigroup of S* called the restricted dual semigroup. It is easy to see that Sis the set of bounded semicharacters. For this special case Theorem 2.6 and Corollary 2.7 lead to the following result :
2.8. Theorem. The set f!jJ~(S) of bounded positive definite functions qJ with qJ(O) = 1 is a Bauer simplex and its set of extreme points is S. A bounded positive definite function qJ has an integral representation cp(s) where J.1
E
=
L
p(s) dl1(p),
SE
S,
M +(S) is uniquely determined.
In the special case of S being an abelian group with s* = - s, Theorem 2.8 reduces to Bochner's theorem for discrete abelian groups, cf. Rudin (1962):
2.9. Theorem. Let G be a discrete abelian group. A function qJ: G -. C is positive definite in the group sense if and only if it is the Fourier transform of a (nonnegative) Radon measure on the compact dual group G. 2.10. The set of signed measures of the form J.11 - J.12 + i(J.13 - J.14) with J.1j E MC+(S*),j = 1, ... ,4 will be denoted MC(S*). Extending the convolution of measures in MC+ (S*) to MC(S*) by bilinearity, MC(S*) becomes an algebra. For /1 E MC(S*) we denote by (t the function
(t(s) =
f
p(s) dJ.1(p),
sES
s*
and {t is called the generalized Laplace transform of J.1. The mapping J.1 H (t of MC(S*) into C S , the generalized Laplace transformation, has the following properties, where a, f3 E C, J.1, v E MC(S*):
(i) (aJ.1
+
f3V)A = a{t
v;
(ii) (J.1 * v) = {t . (iii) {t = 0 => J.1 = 0; A
+
f3v;
i.e. /1 H (t is an injective algebra homomorphism. Since MC+(S*) is mapped onto the cone f!jJe(s), MC(S*) is mapped onto the subspace of C S spanned by f!jJe(s). Among the above properties (i) and (ii) are straightforward to establish, and (iii) follows from the proof in Theorem 2.5. Indeed, if J.1 = (/11 - J.12) + i(J.13 7114)' with J.1j E M C+ (S*), and K s; S* is compact such that supp(J.1 j) S; K, j = 1, ... ,4, then J dJ.1 = 0 for all E d, and the
§2. Exponentially Bounded Positive Definite Functions on Abelian Semigroups
97
restrictions I K form a dense subspace of C(K). By Theorem 2.2.4 it follows that J.1 = o. We identify M +(S) with the set of J.1 E MC+(S*) for which supp(J.1) £; S. For J.1, v E MC+(S*) with support in S, J.1 * v also has its support in S, so we can consider the convolution as a composition in M + (S). In the following result M +(S) is equipped with the weak topology, which is equal to the vague topology. The result is analogous to the Levy continuity theorem in probability theory, but simpler.
2.11. Theorem. The transformation J.1 H P, is a homeomorphism of M +(S) onto q;>b( S). Only the continuity of the inverse mapping p, H J.1 needs some explanation. Let (J.1~) be a net in M + (S), let J.1 E M +(S) and assume p,~(s) -.. P,(s) for each s E S. In particular there exists a o and C > 0 such that J.1~(S) = P,~(o) ~ C for a ~ ao , showing that (J.1~) is eventually in the compact set of measures in M +(S) with total mass ~ C. It therefore suffices to show that J.1 is the only accumulation point for (J.1~). Indeed, if (J is an accumulation point for (J.1~), then p, = fj by the continuity of hence J.1 = (J. D PROOF.
A ,
2.12. Exercise. Let S be an abelian semigroup with involution. Show that the Banach algebra [l(S) from Exercise 1.17 is abelian. Let ~ be the Gelfand spectrum of nonzero multiplicative linear functionals (Rudin 1973, p. 265), and let ~h denote the set of hermitian elements, i.e. the set of L E ~ such that L(f*) = L(f) for f E [l(S). Let r denote the set of nonzero bounded functions y : S --+ C satisfying y(s + t) = y(s) y(t) for s, t E S with the topology of pointwise convergence. For y E r let L y : [l(S) --+ C be defined by Ly(f) =
L f(s)y(s). seS
Show that y H L y is a homeomorphism of r onto ~ which maps S onto ~h. It can be proved that [l(S) is semisimple if and only if r separates the points of S, cf. Hewitt and Zuckermann (1956).
2.13. Exercise. Let (S, +, *) and (T, +, *) be abelian semigroups with involution, and let (S x T, +, *) be the product semigroup defined by (s, t) + (u, v) = (s + u, t + v), (s, t)* = (s*, t*). Show that there exists a topological semigroup isomorphism of S* x T* onto (S x T)*, which maps
S x tonto s;:-r.
2.14. Exercise. Let (S, +, *) and (T, +, *) be abelian semigroups with involutions and let h: S --+ T be a homomorphism. The dual map h: C T -.. C S is defined by h(f) = f h. Show that h(&>(T)) £; &'(S), h(&>e(T)) £; (!Je(s), h(T*) ~ S*, h(T) ~ S and that h* := hi T* is a continuous homomorphism of T* into S* which, furthermore, is one-to-one if h is onto. Let qJ E &>e(s) have the representing measure J.1 E MC+(S*). Show that C h qJ = f h for some f E &>e(T) if and only if J.1 = v * for some v E M +(T*). 0
0
98
4. Main Results on Positive and Negative Definite Functions on Semigroups
2.15. Exercise. Let a be an absolute value on S and let cp E ,qI>~(s) satisfy 2 I cp(s) 1 = cp(s + s*) for all s E S. Show that cp E S*. Show also that if cp E ,qI>~(S) satisfies Icp(s) I = a(s) for all s E S, then cp E S*. 2.16. Exercise. Let (S, +, *) be an abelian semigroup with involution. Show that S* is a linearly independent subset of C S and conclude that if S is finite then card S* ~ card S. Give an example where card S* < card S < 00. 2.17. Exercise. Show that ([ -1, 1], .).. . can be described as T u sgn· T u {sgn 2 } where T := {t 1-+ It I~, a E [0, oo]}, sgn:= 1]0,1] - 1[-1,0[, and Itl o := 1, Itl oo := 1{-1,1}(t) for t E [-1,1]. 2.18. Exercise. Let cp E ,qI>~(S). Then {s E S Icp(s) = a(s)} {s E S II cp(s) I = a(s)} are *-subsemigroups of S.
as well as
2.19. Exercise. Let (S, +, *) be an abelian semigroup with involution and let T:= {p E 511 p I == I}. If cp E ,qI>b(S) has the representing measure J.1 then J.1(T) = infses cp(s + s*). 2.20. Exercise. Let (S, +, *) be an abelian semigroup and consider S u {iiJ} as in Exercise 1.19. Let cp E ,qI>b(S) have the representing measure J.1. Then an extension iP of cp to S u {iiJ} is positive definite if and only if 0 ~ i1J( iiJ) ~ J.1( {Is})· 2.21. Exercise. Let S be an abelian semigroup with involution, let n E N /'.... -. and let S" be identified with (S)" in accordance with Exercise 2.13. Show that if cp E ,qI>b(S") has the representing measure J.1 E M + (5"), and if (J is a permutation of n elements, then cp 0 (J has the representing measure J.1u- I. Conclude that cp E ,qI>b(sn) is symmetric, i.e. cp 0 (J = cp for all permutations (J, if and only if J.1 is symmetric. 2.22. Exercise. Let S = [0, 1] and define Sot = (s + t - 1)+ for s, t E S. Show that (S, 0) is an abelian semigroup with neutral element 1 and absorbing element o. Show further that S is 2-divisible (cf. 6.8) and that S* = {I, 1{1}}.
§3. Negative Definite Functions on Abelian Semigroups In most of this section, we consider negative definite functions on an abelian semigroup with involution (S, +, *). From 3.20 onwards, we will specialize to semigroups with the identical involution.
99
§3. Negative Definite Functions on Abelian Semigroups
For tjJ E %(S) we know by Theorem 3.2.2 that e-tl/J E [!lJ(S) for t > O. Since we only have an integral representation for exponentially bounded positive definite functions and, in particular, for bounded positive definite functions, the following simple result is of some interest.
3.1. Proposition. Let tjJ E %(S). Then e-tl/J E [!lJe(s) (resp. E q;b(S)) for all t > 0 if and only if there exists an absolute value a ~ 1 and a constant C E ~ such that Re tjJ(s)
~
C - log a(s)
(resp. Re tjJ(s)
~
C
If this is the case then C
s E S,
for
s E S).
for
= tjJ(O) can be used.
PROOF. If e-l/J is exponentially bounded (resp. bounded) there exists an absolute value a, where we may assume a ~ 1 (resp. a = 1), such that
Ie-l/J(s) I =
e-Rel/J(s)
~
s E S,
e-l/J(O)a(s),
and the stated inequalities follow. Conversely, if the first inequality of the proposition holds, we get for t > 0
/e-tl/J(S) I ~ e-tCa(sr, which shows that e- tljl absolute value.
q;e(s) (resp.
E
s
E
S,
[!lJb(S) when a
E
=
1) since at is an
D
The set of negative definite functions tjJ for which Re tjJ is bounded below will be denoted %/(S).
3.2. Corollary. For tjJ
E
%/(S) we have Re tjJ(s)
3.3. Proposition. Let tjJ
E
~
tjJ(O),
SE
S.
%/(S) satisfy tjJ(O) ~ O. Then
JltjJ(s+t)I~~+~, PROOF. By Proposition
s,tES.
1.9 the matrix
2 Re tjJ(s) - tjJ(s* ( tjJ(s) + tjJ(t) - tjJ(s
+ s) + t*)
tjJ(s) + tjJ(t) - tjJ(s* + t)) 2 Re tjJ(t) - tjJ(t* + t)
is positive definite, hence, using inequality (1) of §1, I
tjJ(s)
+ tjJ(t)
- tjJ(s*
+ t)
2
1
~ (2 Re tjJ(s) - tjJ(s*
+ s)) + t))
x (2 Re tjJ(t) - tjJ(t* ~
41 tjJ(s) " tjJ(t) I.
Replacing s by s* and extracting the square root gives
ItjJ(s
+ t)1 ~
and the inequality follows.
/tjJ(s)1
+
/tjJ(t)1
+ 2~~, D
100
4. Main Results on Positive and Negative Definite Functions on Semigroups
In some later applications we shall need the following boundedness result for negative definite functions on semigroups with absorbing element (cf. 1.3).
3.4. Proposition. Let S contain the absorbing element w. Then the.functions in JV'(S) are automatically bounded. More precisely if IjJ E JV'(S) and 1jJ(0) ~ 0 then 111jJ1100 ~ fil/J(w). PROOF.
For fixed s
(
E
S the 3 x 3 matrix
1jJ(0)
ljJ(s*)
ljJ(s) ljJ(w)
ljJ(s + s*) ljJ(w)
IjJ(W)) ljJ(w) ljJ(w)
= (ajk)j,k= 1, 2, 3
is negative definite. Writing down explicitly the inequality for C 1 = C 2 = 1, C 3 = -2 gives
L cjC;a jk ~ 0
+ 2 Re ljJ(s) + ljJ(s + s*) ~ 41jJ(w), + s*) = Re ljJ(s + s*) ~ 0 by Corollary 1jJ(0)
l/J(s 3.2 we get 21jJ(w). Similarly, the choice of C 1 = 1, (2 = i, C3 = -1 - i gives - 1m ljJ(s) ~ ljJ(w), and the coefficients C 1 = 1, C 2 = - i, c 3 = i - I yield 1m ljJ(s) ~ ljJ(w). Hence 11m ljJ(s) 1~ ljJ(w) and therefore 1IIjJI 100 ~
and
since
o ~ Re l/J(s)
~
D
fil/J(OJ).
We shall next establish an important relationship between functions JV'(S) and convolution semigroups of Radon measures on the compact semigroup 5, which is the restricted dual semigroup of S.
l/J E
3.5. Definition. Let T be a Hausdorff topological semigroup with neutral element e. A convolution semigroup on T is a family (J.1t)t~O from M~(T) such that t ~ J.1t is a continuous homomorphism from the semigroup (IR+, +) into the semigroup (M~ (T), *), i.e. such that: (i) J.10 = Be; (ii) J.1t * J.1r = J.1t+r for t, r E IR+ ; (iii) t ~ J.1t is weakly continuous. In the case of T = 5 condition (iii) can be replaced by a seemingly weaker one as the following lemma shows.
3.6. Lemma. Let t ~ J.1t be a homomorphism from (IR+, +) into M +(5) such that lim t -+ 0 J.1t
=
B1
weakly, then (J.1t)t ~ 0 is a convolution semigroup on
5.
By Theorem 2.11 it suffices to prove thatjs(t):= fils) is a continuous mapping from IR+ into C for each s E S. By assumption we have js(t + r) = js(t)js(r) and lim t -+ o js(t) = 1, so there exists A > 0 such that Ijs(t) I ~ 2 for t E [0, A]. It follows that Us(t) I ~ 2n for t E [0, nA], in particular js is locally PROOF.
~3.
101
Negative Definite Functions on Abelian Semigroups
bounded. This combined with the equations (t, r, t - r > 0)
Ijs(t
+ r)
- js(t) I = Ijs(t) IIjs(r) -
11, 11
Ijs(t - r) - js(t) I = Ijs(t - r) Iljs(r) -
D
implies that js is continuous.
3.7. Theorem. There is a one-to-one correspondence between convolution semigroups (J.1t)t ~ 0 on S and negative definite functions l/J E %l(S) established
via the.formula (It(S) = e - tl/l(s)
for
t
~
0,
S E
s.
(1)
PROOF. Let (J.1t)t~O be a convolution semigroup. For s E S the mapping
js: t t---+ (It(s) is continuous by Theorem 2.11 and satisfiesjs(t + r) = js(t)js(r), hence of the form js(t) = e-tl/l(S) for a uniquely determined complex number t/J(s). Since e-tl/l E g>b(S) for each t > 0 it follows by Theorem 3.2.2 and Proposition 3.1 that t/J E %(S) and that Re t/J ~ t/J(O). Conversely, if t/J has these properties there exists a uniquely determined family (J.1t)t > 0 from M +(S) such that (1) holds. Clearly
(l,(s)(lr(s) = (It+r(s) for s
E
(J.1t)t~O
S, so
=
lim (It(s)
and
t-O
= 1 = £1(S)
is a convolution semigroup by 2.10, 2.11 and 3.6.
D
We will now consider two special types of functions t/J E %l(S), namely, purely imaginary homomorphisms and nonnegative quadratic forms. 3.8. Definition. A function t/J: S --+ C is called a purely imaginary homomorphism if it has the form t/J = ii, where I: S --+ IR is *-additive, i.e. I(s + t) = I(s) + I(t) and I(s*) = -/(s) for s, t E S. A function q: S --+ IR is called a quadratic form if
+ 2q(t)
2q(s)
=
q(s
+ t) + q(s + t*)
for
s, t
E
S.
If the involution on S is the identical, every purely imaginary homomorphism is identically zero, and a quadratic form is a homomorphism of (S, +) into (IR, +). If S is an abelian group, with the involution being s* = -s for s E S, the functional equation for a quadratic form q is
+ 2q(t)
2q(s)
=
q(s
+ t) + q(s
- t)
for
s, t
E
S.
3.9. Theorem. Purely imaginary homomorphisms and nonnegative quadratic forms belong to %l(S). PROOF. Let
t/J: S
~ Sand {c 1 ,
--+
•.. ,
C be a purely imaginary homomorphism, let {SI' ... , sn} LJ= 1 Cj = O. Then
cn } ~ C with
n
L
j.k=l hence t/J
E
%l(S).
n
CjCkt/J(sj
+ Sk) = L
j,k=1
CjCk(t/J(Sj)
+ t/J(Sk)) =
0,
102
4. Main Results on Positive and Negative Definite Functions on Semigroups
Let q: S --+ [R be a quadratic form. Note that q(O) = 0 and q(s*) = q(s). We define B: S x S --+ [R by
B(s, t) = q(s)
+ q(t)
- q(s
+ t*),
s, t
E
S,
and claim that B has the following properties for s, t, rES:
(i) (ii) (iii) (iv)
B(s, t) = B(t, s); B(s*, t) = B(s, t*) = -B(s, t); B(s + r, t) = B(s, t) + B(r, t); tB(s, s) = lim n oo (q(ns)/n 2 ). -+
Here (i) is clear, and
B(s*, t) = B(s, t*) = q(s)
+ q(t)
- q(s
= q(s + t*) - q(s) - q(t)
=
+ t)
-B(s, t).
To see (iii), we first remark that
+ q(t + t*)
q(s)
=
q(s
+ t + t*)
for
s, t
E
S,
and therefore we have
B(s, t)
+ B(r, t) =
+ q(t) - q(s + t*) + q(r) + q(t) - q(r + t*) = q(s) + q(r) + 2q(t) - t[q(s + t* + r + t*) + q(s + t* + t + r*)] = q(s) + q(r) + 2q(t) - tq(s + r*) - tq(t + t*) - tq(s + r + t* + t*) = tq(s + r) + 2q(t) - tq(t + t*) - t{2q(s + r + t*) + 2q(t) - q(s + r + t + t*)} = q(s + r) + q(t) - q(s + r + t*) = B(s + r, t). q(s)
To see (iv) we remark that for n
E
q(ns) = n 2 q(s) -
N n(n - 1)
2
q(s
+ s*).
This formula is clearly true for n = 1,2, and assuming it is correct for n - 1, n we get
q((n
+
l)s) = 2q(ns)
+ 2q(s)
- q((n - l)s
+ (s + s*))
= (2n 2 + 2)q(s) - n(n - l)q(s + s*) - q((n - l)s) - q(s + s*) = (n + 1)2q(s) -
(n
+
2
l)n
q(s
+ s*).
It follows that
. q(ns) lIm - 2 - = q(s) - tq(s n
-+00
n
+ s*) =
tB(s, s).
103
§3. Negative Definite Functions on Abelian Semigroups
If q is a nonnegative quadratic form it follows by (iv) that B(s, s) s E S. For {Sl' ... , sn} ~ S, {c 1 , ... , cn} ~ 7L we now get n
I
j,k= 1
~
0 for all
n
cjck(q(Sj)
+ q(Sk)
- q(sj
I
+ Sk)) =
j,k= 1
cjckB(Sj' Sk)
B(t, t)
=
~
0,
where
I
t =
CjSj
I
+
j,Cj~O
(-cj)sj.
j,Cjo be the corresponding convolution semigroup on S. For a E S we have r at/J .: t/J E p;>b(S), and if (Ja E M + (S) is the unique Radon measure such that r al/J - l/J = fj a then .
1
(Ja = hm(l - Re p(a)) - Jlt t
t-O
weakly. PROOF. If 1 is the identity operator on CS we have -1 (ra
t
-
1)(1 - e-t"')(s)
= -1 (1 - ra)(e-t"')(s) = -1 t
t
1 s
p(s)(l - Re p(a)) dJlip),
showing that (ra - 1)«1/t)(1 - e- t",)) E p;>b(S). For t -.. 0 this function tends pointwise to rat/J - t/J, which then necessarily belongs to p;>b(S). The assertion about the measure follows immediately from the Levy continuity theorem (2.11). D We now consider an arbitrary hermitian function t/J: S --+ C with the property that r at/J - t/J E p;>b(S) for all a E S, and we denote by (Ja the unique Radon measure on S such that rat/J - t/J = fJa.
3.12. Lemma. Let t/J: S --+ C be a hermitianfunction such that r at/J - t/J E p;>b(S) for each a E S. There exists a unique Radon measure Jl E M + (S\ {I}) such that (1 - R e p( a)) Jl
= (J a I(S\ { 1})
for
a E S.
(2)
104
4. Main Results on Positive and Negative Definite Functions on Semigroups
If tjJ E JVI(S) and (J.1t)t> 0 is the corresponding convolution semigroup then J.1 = lim t.... o(l/t)J.1tl(S\{l}) vaguely. PROOF.
For a E S we let
(!Ja:= {p
E
SIRe p(a) < I},
+
which is an open subset of S. Since (!Ja = {p E Slp(a) I} the family «(!Ja)aeS is an open covering of the locally compact space S\ {I}. On each (!Ja we consider the Radon measure
Since -(ra
-
I)(r b
-
I)tjJ(s) = -(r b
=
L
=
L
I)(r a
-
-
I)tjJ(s)
p(s)(l - Re p(a)) dUb(P)
p(s)(l - Re p(b)) dua(p),
we get by the unicity assertion of Theorem 2.8 that (1 - Re p(a))ab = (1 - Re p(b))aa'
a, b E S,
so the compatibility condition of Theorem 2.1.18 is satisfied for the measures Consequently, there exists a uniquely determined Radon measure J.1 on S\{I} such that J.11 (!Ja = La for a E S, and it is clear that J.1 is the only Radon measure satisfying (2). To finish the proof it is enough by Theorem 2.4.9 to prove that for each
(La)aeSo
aES . 1 1Im-J.1tl(!Ja=L a t .... O
g(p)
:=
vaguely on (!Jao
t
{f(P )(1
- Re p(a)) -
1,
0, is a bounded continuous function on
S, so by 3.11, D
3.13. Definition. Let tjJ be as in Lemma 3.12. The measure J.1 is called the Levy measure for tjJ or for the corresponding convolution semigroup (J.1t)t~O in case tjJ E JV'(S). -
105
§3. Negative Definite Functions on Abelian Semigroups
The Levy measure might have infinite total mass, but (2) implies that
(l - Re p(a)) djl(p)
b(S) for each a E S. (iii) There exists a triple (I, q, J.l) where I: S ~ [R is *-additive, q is a nonnegative quadratic form and J.l E £ such that
ljJ(s) = ljJ(O)
+ il(s) + q(s) + I. JS\{l}
for all s E S.
(1 - p(s)
+ iL(s, p»
djl.(p)
109
§3. Negative Definite Functions on Abelian Semigroups
The triple (I, q, J.l) is uniquely determined by t/J measure for t/J and _ l' Re t/J(ns) 1m 2
q(s ) -
n
" _ 00
E
%'(S), J.l being the Levy
+ I'1m t/J(n(s + s*» " -+
2n
00
,
SE
S.
PROOF, "(ii) => (iii)" Let J.l be the Levy measure for t/J and let t/J Jl be defined as in Proposition 3.18. The function
r := t/J - t/J(O) - t/J Jl is hermitian and rar - r = fi a - (O"al(S\{I}»A = O"a({1}) for a E S, hence positive definite and constant. By Lemma 3.14 it follows that t/J has the representation stated in (iii). Finally, if (iii) holds, it is clear by 3.9, 3.14 and 3.18 that t/J E %'(S), and that the Levy measure for t/J is J.l. This shows that the triple (I, q, J.l) is uniquely determined by t/J, and we find
= t/J(?) + q(~s) +
Re t/J2(ns)
n t/J(n(s
n
+ s*» =
n
t/J(O) n
n
~ (1
f
_ Re(p(s)"» dJ1.(p),
JS\{l} n
+ q(n(s + s*» + f
JS
n
\{l}
! (l
_ Ip(sW") dJ1.(p).
n
By the proof of Theorem 3.9 we have
,
hm "-+00
q(ns) -2-
n
= q(s) - tq(s + s*),
and by the functional equation for quadratic forms
q(n(s
+ s*» = nq(s + s*),
which gives the formula for q(s) provided the integrals tend to zero. That this is true follows from the dominated convergence theorem because of the inequalities
1 n2 (1 - Re(p(s)"»
~
n2
4
(1 - Re p(s»,
1 -(1 -lp(s)1 2 ") ~ 1 - Re p(s
n
Putting p(s)
= re i8 , r E [0, 1], 0 E ]
1
n2 (1 - r" cos(nO» Clearly 1 - r"
~
~
n2
4
-
+ s*).
n, n], these inequalities become n EN.
(1 - r cos 0),
n(1 - r) for r E [0, 1], n E N, and using the inequalities
ISi: x I ~ 1,
X E ~,
sin x 2 --2x
- n
for
n
Ixl -~ -2'
110
4. Main Results on Positive and Negative Definite Functions on Semigroups
we find for f)
E ]
-n, n]
.
= 2 SIn
1 - cos(nf)
2
nf)
2 =
hence
1
2 (1 -
n
r" cos(nf))
1
~ 2
n
(1 - r")
1
~ ;; (1 -
r)
~
+2
n
(1 - cos nO)
n2
+ 4 r"(1 -
cos f) ~
n2
4
(1 - r cos f).
D
Remark. In the above representation q and J1 are independent of the Levy function L, whereas I depends on the choice of L. In the rest of this section, we assume that S is an abelian semigroup with the identical involution. Then negative definite functions are real-valued and %'(S) is the set of negative definite functions t/J which are bounded below or equivalently satisfy t/J(s) ~ t/J(O). For f E [Rs and a E S, we define ~af E [Rs by ~af(s)
= f(s + a) - f(s),
s E S,
hence ~af = (ra - I)fwith the previous notation. The function L == 0 is a Levy function for S, and from the previous results it is easy to get the following: 3.20. Theorem. Let S have the identical involution. The following conditions are equivalent for a function t/J: S -+ [R: (i) t/J E %'(S). (ii) ~a t/J E pjJb(S) for each a E S. (iii) There exist an additive function q: S J1 E M + (8\ {I}) such that
t/J(s)
= t/J(O) + q(s) +
-+
r
JS
[0, oo[ and a Radon measure
(l - p(s)) dJ1.(p).
\{l}
The decomposition in (iii) is uniquely determined, J1 being the Levy measure for t/J and q (s)
.
= 11m "-+00
t/J(ns) --, n
SE
S.
If q == 0 then t/J is bounded if and only if J1(S\ {I})
b(N o) have the representing measure J.1 on [ - 1, IJ. Show that L~ I qJ(n) 1 < 00 if and only if(1 - Itl)-l E 2 1(J.1). 4.23. Exercise. Let A = (au) be a real positive definite k x k matrix. Show that there exists a function qJ E g>b(Nt) such that for
i,j=I, ... ,k,
where e 1 = (1,0, ... ,0), ... , ek = (0, ... ,0,1). Hint: Assume laul ~ 1. There exists a real k x k matrix B = (b ij ) such that A = B*B. Let Xl' ... ' X k E IRk be the row-vectors of B, and define a measure /l E M +([ - 1, IJk) by putting mass 1 at each of the points X l' . . . , X k .
4.24. Exercise. Let qJ: [0, IJ -+ [0, 00 [ be decreasing and let J.1 be the representing measure on f/ according to Proposition 4.20. Show that qJ(a) - qJ(a
+ 0) =
°
~ a < 1,
/l( {CO, aJ}),
O 0, B is the at most countable set of b E JO, IJ such that p( {CO, b[}) > and J.1c is the continuous part of p. The function s ........ /lc( {I E f/ IS E I}) is continuous and decreasing.
°
4.25. Exercise. Let X be a nonempty set. The set IP(X) of subsets of X is an idempotent semigroup under intersection. Describe the semicharacters of IP(X) explicitly in the case where X has two elements, and show that qJ = 1 - 1{0} is decreasing but not positive definite. 4.26. Exercise. On IR + we define X 0 Y = X + Y + xy. Show that (IR +, 0) is an abelian semigroup and that x ........ log(1 + x) is an isomorphism of (IR+ , 0) onto (IR+, +). Show that the bounded semicharacters on (IR+ ,0) are given by x ........
(_1 )t + 1
x
for
°
~ t ~
00.
4.27. Exercise. An infinite matrix (a jk ) of the form ajk = qJ(j + k) with qJ: No -+ IR is called a Hankel matrix. It is positive (resp. negative) definite if and only if qJ is positive (resp. negative) definite on the semigroup (No, +), and it is infinitely divisible if and only if qJt is positive definite for all t > 0. Show that the so-called Hilbert matrix (1/(j + k + 1))j,k~O is infinitely divisible. -
123
§5. r-Positive Functions
4.28. Exercise. Let t/J: IR + -+ IR + be given. Show that if qJ 0 t/J E g>b(1R +) for all qJ E g>b(1R +).
4.30. Exercise. For M equivalent:
£;
t/J E LN(IR +) if and only
M~ (IR~) the following three conditions are
(i) M is relatively compact with respect to the weak topology. (ii) {fe III Il E M} is uniformly equicontinuous on IR~ . (iii) {fellill E M} is equicontinuous in O.
Hint: Use Prohorov's compactness criterion mentioned in the Notes and Remarks to Chapter 2. 4.31. Exercise. (Continuity Theorem for Laplace Transforms). Let a sequence Ill' Ill'··· E M~(IR~) be given such that qJ(t) = lim n_ oo (feJ1n)(t) exists for all t E IR~ and assume that qJ is continuous at O. Then qJ = feJ1 for some Il E M ~ (lR k+) and Iln -+ Il weakly. 4.32. Exercise. If M ~ (IR~) is equipped with the weak topology and g>~ (IR~) n C(IR~) with the topology of uniform convergence on compact subsets of IR~, the Laplace transformation fe: M ~ (IR~) -+ g>~ (IR~) n C(IR~) is a homeomorphism.
§5. r-Positive Functions In this section, we shall present an approach due to Maserick (1977) to the integral representation of positive definite functions. Let A be a real or complex commutative algebra with identity e and involution *. If A is a real algebra we always assume a* = a for all a E A. The basic idea is to find an integral representation of certain "positive" linear functionals on A, the representing measure being concentrated on a set of multiplicative linear functionals. The connection to semigroups is obtained if A is chosen to be the algebra of shift operators generated by the shifts E s' s E S, where (Esf)(t) = f(s + t) forf E C S . By the relation L(Es) = f(s) functionsfon S are in one-to-one correspondence with linear functionals Lon A. The positivity concept for linear functionals depends on the notion of an admissible subset r of A. 5.1. Definition. A nonempty subset r
(i) a* = a for all a E r; (ii) e - a E alg span+(r) for all a E r; (iii) alg span(r) = A.
£;
A is called admissible if
124
4, Main Results on Positive and Negative Definite Functions on Semigroups
Here alg span(r) (resp. alg span +(r)) is the set of linear combinations ~ 0) and each Xi is a finite product of elements from r. Note that alg span +(r) is the smallest convex cone in A, stable under multiplication and containing r. Furthermore e E alg span+(r) because of (ii).
I?= 1 AiXi, where each Ai is a scalar (resp.
5.2. Lemma. Let r be an admissible subset of A. Then: (i) e - Xl ... Xn E alg span +(r)for Xl' ... ' Xn E r; (ii) for X E alg span +(r) there exists 8(X) > 0 such that e for 8 E [0,8(X)]. PROOF.
8X E
alg span +(r)
The assertion (i) follows from the algebraic identity
"n
n
n
n
e-
=
Xj
(1'( XjJ
~
e-
(1)
X j )1-(1'J,
(1$1 j=l
j=l
where the summation is taken over all a = (a l ' ... , an) E {O, 1}n\ {(I, ... , I)} (note that XO := e), and this identity follows in turn by the formula n
e=
n(x
j
+ (e
-
Xj)).
j= 1
If X = LJ= 1 Ai Yj' where A j > 0 and Yj is a product of elements from r, we put 8(X) = (IJ= 1 Aj ) - 1. For 8 E [0, 8(X)], we find
e-
8X
=
(1 -
(8 8 X
))e + e .± Aie- Yj)
E
alg span+(r)
J= 1
D
because of (i). A linear functional L: A
L(a)
~ 0
-+
C is called r-positive, where r is admissible, if
for all
a E alg span +(r).
This holds if and only if
L(a 1
•••
an)
~
0
{a 1 ,
for all finite sets
••• ,
an}
~
r.
The set Ai of r-positive linear functionals on A is a convex cone in the algebraic dual space A*, closed in the topology a(A*, A). Note that L(x*) = L(x) for X E A when L E Ai. By Lemma 5.2 it follows that L(e) > 0 for L E Ai\ {O}, so
B:= {L is a base for
E
AiIL(e) = I}
Ai.
5.3. Lemma. The base B is compact. PROOF.
For
X E
A there exists a constant K x > 0 such that
for
L
E
Ai.
~5.
125
r-Positive Functions
In fact, if x E alg span +(r) we can use K x = 8(X)-l with 8(X) from 5.2(ii), and everyxEAcanbewrittenx = Xl - X 2 + i(x 3 - x 4 )withx j Ealgspan+(r), j = 1, ... ,4. If (L(X) is a universal net on B the above inequality shows that L(x) := lim(X L(X(x) exists for every x E A, and this implies the compactness
D
ofB.
Let ~ denote the set of r-positive, multiplicative linear functionals on A, which are not identically zero. Clearly ~ is a compact subset of B.
5.4. Theorem. A linear functional L on A is r-positive ~{and only ({there exists a (necessarily unique) Radon measure J.1 E M +(~) such that L(x} =
i
l5(x} dJ1(l5}
for
The base Bfar Ai is a Bauer simplex with ex(B) =
x
A.
E
(2)
~.
A functional of the form (2) is clearly r-positive. Given L E Ai, the existence of J1 such that (2) holds follows from Theorem 2.5.6, once we have shown that ex(B) £; ~. Suppose L E ex(B) and define for a E A the translate La of L by La(x) = L(ax). For a E alg span+(r) we have La E Ai. Since L = La + L e - a and La' L e - a E Ai for a E r, La is proportional to L. The set of elements a E A for which La is proportional to L is a subalgebra of A containing r, hence La = AaL for all a E A. Evaluating at x = e shows Aa = L(a), hence L E ~. Defining y: A ---+ A by y(x) = xx* an application of Corollary 2.5.12 yields ex(B) = ~. The set of functions £5 ~ £5(x), x E A, is a point-separating *-subalgebra of C(~, C) containing the constant functions, hence uniformly dense in C(Ll, C) by the Stone-Weierstrass theorem. This implies that the representing measure is uniquely determined. D PROOF.
From the integral representation in Theorem 5.4 we immediately get:
5.5. Corollary. Any functional L for all x E A.
E
Ai is positive in the sense that L(xx*)
~
0
It is interesting to note that the corollary can be proved directly without use of the integral representation. This was done by Maserick and Szafraniec (1984) as follows: For a function .1': [0, 1] ---+ C and x E A we define
Bn(x;f) = kt/(~)(;)xk(e - x)"-k, and for f'(t) = 1, t, t 2 we find (cf. Davis (1963, p. 109))
Bn(x; 1) = e,
Bn(x; t) = x,
1 Bn(x; t 2 ) = - X n
n- 1
+ - - x2• n
126
4. Main Results on Positive and Negative Definite Functions on Semigroups
Similarly, for a functionf: [0,1]2
-+
C and Xl' X2 E A we define
Bn(x x ;f) = j.~/(~' ~)(;)(:)x{(e - x1)n- jx~(e - x )n-k, 1,
and for f(s, t)
2
=
2
(s - t)2, we find
Bn(x l' x 2 ; (s - t)2) = Bn(x l' X 2 ; S2)
+ Bn(x l' X 2 ; -
= Bn(x l ; S2) - 2B n(x l ; s)B n(x 2 ; t)
= (Xl -
If Xl' X2 E A such that Xl' LE Ai
X 2)
X2'
2
+ -1 (Xl + X2 n
-
+ Bn(x l' X 2 ; t 2) + Bn(x 2 ; t 2)
2st)
2
Xl -
2
X 2 )·
e - Xl' e - X2 E alg span+(r), we have for
hence for n -+ 00: L«x l - X 2)2) ~ O. For Xl' X2 E alg span +(r) there exists 8 > 0 such that 8X l , 8X 2 , e - 8X l , e - 8X 2 E alg span +(r), cf. 5.2, and therefore L(8 2 (X l
-
X2)2) ~ 0,
so L«x l - X2)2) ~ O. An element X E A can be written X = Xl - x 2 + i(X3-X4) with xjEalgspan+(r), j=1, ... ,4, so xx*=(X l -X 2)2+ (x 3 - X 4 )2 and finally L(xx*) ~ o. Let S be a commutative semigroup with involution, and let
A
=
{.f
cjEsjln EN, cj
)=1
E
C, S} Sj
E
be the algebra of shift operators on C S . We recall that Esf(t) = f(s + t) for s, t E S, f E ([s. Clearly, A is a commutative algebra with unit I = Eo, and defining
we get an involution on A. Since (Es)ses is a basis for A, functionsf: S -+ C and linear functionals L: A -+ C are in one-to-one correspondence via the formula L(E s ) = f(s). Iffand L correspond to each other and n
L cjEsj E A
T =
j= 1
then n
L(T) =
L cjf(Sj) = j= 1
Tf(O).
127
§5. r-Positive Functions
It follows that n
L cjCkf(sj + st),
L(TT*) =
j,k= 1
so L is positive if and only iff E (!P(S). If r ~ A is admissible, we call f r-positive if the corresponding linear functional L is r-positive, i.e. if and only if
r.f(O)
~
for all
0
T
E
alg span + (r),
or if and only if
T1 ... T"f(O)
~
0
for all
T1 ,
•• • ,
T" E r.
Note that a semicharacter p is r-positive if and only if Tp(O) T E r, since T1 ... T"p(O) = T1P(O) ... T"p(O).
~
0 for all
5.6. Theorem. Let r be an admissible subset ofthe algebra A ofshift operators.
Every r-positive function f: S -+ C is positive definite, exponentially bounded and has an integral representation f(s)
=
f.
p(s) dJ1(p),
S·
where J1 E M +(S*) is concentrated on the compact set of r-positive semicharacters. PROOF. By Theorem 5.4 the linear functional L corresponding to the rpositive functionfhas a representation
L(T)
=
i
(j(T) dj1«(j),
TEA,
where fi E M +(~). For b E ~ the function s H b(E s ) is a r-positive semicharacter, and the mapping j: ~ -+ s* given by j(b)(s) = b(E s ) is a homeomorphism of ~ onto the compact set j(~) of r-positive semicharacters. The image measure J1 := fij of fi under j is a Radon measure on S* with compact support contained in j(~), and replacing T by Es we get
f(s) =
f.
p(s) dJ1(p),
s
E
S.
S·
This shows thatfis exponentially bounded and positive definite.
D
5.7. A subset G ~ S is called a generator set for S, if every element in S\ {O} is a finite sum of elements from G u {a* Ia E G}. Let ~: S -+ ~+ be an absolute value such that ~(a) > 0 for all a E S. For (J E C, a E S, we define
l(
a) n... a = "2 I + 21X(a) Ea + 21X(a*) Ea· · (J
128
4. Main Results on Positive and Negative Definite Functions on Semigroups
The family
=
rG
{Qa,alaE
{±1, ±i},aEG}
is easily seen to be admissible if G is a generator set. The following converse of Theorem 5.6 holds:
Iff is positive definite and exponentially bounded there exists an admissible r such that f is r-positive. In fact, there exists an absolute value tJ, such that f is tJ,-bounded, and we may assume tJ,(a) > 0 for all a E S. Then
f(s) =
f
pes) djJ.(p),
s E S,
where J.1 E M + (S*) is supported by the compact set of tJ,-bounded semicharacterse With r = rG as above, where G is a generator set, we get
na,af(s) =
f
p(s)t[l
+ lX(a)-l
Re(O"p(a))] djJ.(p),
and since Ipea) I ~ tJ,(a) for p E supp(J.1) it follows that Qa, a f E (!lJCX(S). Hence (!lJCX(S) for a j E {± 1, ±i}, aj E G, in particular,
Qa1,a1 ... Qan,anf E
Qa1,a1 ... Qan,anf(O) ~
showing thatfis rG-positive. Using the absolute value tJ,
==
1 we get" (i)
=>
0 (ii)" of the following result:
5.8. Proposition. F or a function f: S -+- C the following conditions are equivalent: (i) f E (!lJb(S); (ii) f is r-positive, where r
= {Q a , a I a E {± 1, ± i}, a E S} and Qa, a
=
1(I + 2a E + 2(j) E
2"
a
a", .
PROOF. To see" (ii) => (i)" it suffices by Theorem 5.6 to verify that a r-positive semicharacter is bounded. But p E S* is r-positive if and only if 1 + Re(ap(s)) ~ 0 for s E S, a E {± 1, ± i}, i.e. if and only if Re pes), 1m pes) E [ -1, 1] for all s E S, which is equivalent with pES. D
5.9. Remark. In the above proposition it is not possible to replace r by rG
=
{Qa,ala E
{± 1, ±i}, a E G},
where G is a generator set for S. The implication" (i) => (ii)" is, of course, still true, but" (ii) => (i)" might fail to hold. In fact, if S = (N6, +, *) is the semigroup studied in 4.11, then G = {(I, O)} is a generator set and the
129
§6. Completely Monotone and Alternating Functions
rG-positive functions are given by f(n, m)
=
r
znzm dJl(z),
(n, m)
E
)[-1,1]2
N~,
which is a larger class than q;b(S). In the case where the involution is the identical this remark does not apply:
5.10. Proposition. Assume S has the identical involution and let G be a generator set for S. Then the following conditions are equivalent: (i) f E q;b(S); (ii) (1 ± Eat) ... (1
± Ean)f(O)
~ 0
for all aI' ... , an E G, n E N.
It is clear that" (i) => (ii)" holds, and (ii) is equivalent with f being r-positive, where
PROOF.
r = {tel
± Ea)/a E G}
is admissible. Finally every r-positive semicharacter p is bounded, because Ip(a)1 ~ 1 for a E G implies Ip(s) I ~ 1 for all s E S. Note that Q± 1, a = ± Ea) and Q± i, a = tl, and the families
tel
{t(l±Ea)laEG}
and
{Qa,alaE{±l,±i},aEG}
D
lead to the same r-positive functions.
5.11. Exercise. Let S = (N~, +) and let A(k) be the algebra of real polynomials in k variables Xl' •.• , X k . Show that the set r =
{Xl' ... ,
xk , 1 -
Xl -
... -
xk}
is admissible. There is a one-to-one correspondence between functions
f: N~ ~ ~ and linear functionals L: A(k) ~ ~ established via L(x n ) = fen), n E N~. Show that f is r-positive (in the sense that the corresponding linear functional L is r-positive) if and only if there exists J.1 f(n)
=
L
n
x dJl(x),
E
M +(K) such that
n E N~,
§6. Completely Monotone and Alternating Functions In this section, S is an abelian semigroup with the identical involution. We have already introduced the shift operator Ea: ~s -+ ~s defined by Eaf(s) = f(s + a) and ~a = Ea - 1, where 1 denoted the identity operator. Since {E a Ia E S} is a commuting family of operators, the algebra generated by this
130
4. Main Results on Positive and Negative Definite Functions on Semigroups
family is commutative, too; in particular, we have for
a, b E S.
Sometimes it is convenient to use also the operators
Va:=I-Ea=-L\a'
aES,
which are, of course, again commuting; note that Va Vb
n [1 -
= L\aL\b' and that
n
Val···
VanP(S) = p(S)
(1)
p(a i )]
i= t
for p E S*. In his fundamental work on capacities Choquet (1954) studied functions on S which he called monotone (resp. alternating) 0.[ infinite order. Here we shall call these functions completely monotone (resp. completely alternating).
6.1. Definition. A function qJ: S -+- ~ is called completely monotone if it is nonnegative and if for all finite sets {at, ... , an} ~ Sand S E S Val ... Van qJ(S) ~ O. A function and S E S
t/J: S
-+- ~
is called completely alternating if for all {at, ... , an}
~
S
The set of completely monotone (resp. alternating) functions is denoted .A(S) (resp. d(S)). It is clear that .A(S) and sl(S) are closed convex cones in ~s, and the nonnegative (resp. real) constant functions are contained in .A(S) (resp. scl(S)). If qJ E .A(S) then EaqJ E .A(S) and, similarly, if t/J E scl(S) then Eat/J E d(S) for all a E S, since Val ... Van(Eaf) = E aV a1 ... Vanfforf E ~s and since E a is a positive operator. Notice that qJ E Jt(S) and t/J E d(S) satisfy 0 ~ qJ ~ qJ(O) and t/J ~ t/J(O).
6.2. Remark. The following terminology is frequently used for s, at, ... , an E S: Vtf(s; at) = f(s) - f(s and inductively for n
~
+ at),
2
Vn.[(s; at, ... , an) = V n-1.[(s; at, ... , an-t) - Vn-l.[(s
+ an; at,···, an-t).
The following result connects the two concepts of completely monotone and alternating functions, and it is easily established.
131
§6. Completely Monotone and Alternating Functions
6.3. Lemma. A function t/!: S -+- ~ belongs to deS) a E S the function ~a t/! belongs to vII(S). PROOF.
1ft/!
E
deS) then L\at/! = -Vat/!
if and only if for every
~ 0 and
Val··· Van(L\at/!) = ~a Val··· Vant/! = - Va Val··· Vant/! ~ O. On the other hand, if ~a t/!
E
vII(S) for all a E S, then
Val··· Vant/! = -Val··· Van_I(L\ant/!) ~ O.
D
The relation of completely monotone functions to r-positive functions is described in the following:
6.4. Theorem. Let G ~ S be a generator set. The family r = {E a , 1 - Ea Ia E G} is admissible, and for a function qJ: S -+- ~ the following conditions are equivalent: (i) qJ is completely monotone. (ii) qJ is r-positive. (iii) There exists a measure J.1 E M +(S +) such that cp{s)
=
f.
pes) dJl{p),
Js
s
E
S.
+
(I - Ea) ... (1 - Ean)E SI ... Esrn qJ(O) = Val ... Van qJ(S 1
+ ... + Sm)
so (i) => (ii). The implication "(ii) => (iii)" follows from Theorem 5.6 since P E s* is r-positive ifand only if 0 ~ pea) ~ 1 for a E G, but this is equivalent with pES + because every s E S\ {O} is a finite sum of elements from G. Finally, if (iii) holds, then qJ(s) ~ 0 and by (1) Val'"
so qJ
qJ
E
vII(S).
Vancp{s)
=
f.
Js
p{s).n (1 - pea)) dJl{p) +
J= 1
~ 0, D
A function qJ E vII(S) is bounded by qJ(O). The set vIIl(S) of functions JI(S) such that qJ(O) = 1 is therefore a compact convex base for the
E
cone vII(S).
6.5. Theorem. The cone vII(S) is an extreme subset of g>b(S) and Jll(S) is a Bauer simplex with ex(vIIl(S)) = S+ . For qJl' qJ2 E vII(S) also qJl . qJ2 E vII(S). A function qJ E g>b(S) is completely monotone if and only if the representing measure J.1 is concentrated on S+ . The previous theorem shows that A(S) ~ g>b(S). If qJ = qJl + JI(S) and qJi E g>b(S) has representing measure J.1i E M +(S), i = 1, 2, then J.1 = J.1l + J.12' where J.1 E M +(S +) is the representing measure for qJ. It PROOF.
qJ2
E
132
4. Main Results on Positive and Negative Definite Functions on Semigroups
follows that J.11' J.12 are concentrated on S+ so qJl' qJ2 E ..4t(S), and we have shown that vII(S) is an extreme subset of g>b(S). By transitivity of extremality an extreme point of ..4t 1(S) is also an extreme point of g>~ (S), hence ex(vII I(S)) ~ S +, and in fact there is equality since S+ ~vlll(S)nex(g>~(S)). For J.1, vEM+(S+) we have supp(J.1*v)~S+, cf. 2.3, and it follows that ..4t(S) is stable under multiplication. By unicity of the representing measure for qJ E g>b(S) it follows that vII 1(S) is a simplex, and that the representing measure for qJ is concentrated on S + if qJ E g>b(S) is completely monotone. D In analogy with Theorem 6.4 it suffices to check the defining conditions for a completely alternating function for elements in a generator set.
6.6. Proposition. Let G be a generator set for S. A function t/J: S -+pletely alternating if and only if Vat ...
Vant/J(s)~O
for
SES
and
a 1, ... ,an EG,
~
is com-
n~1.
PROOF. Suppose the conditions of the proposition hold. For s E S\ {O} there exist aI' ... , an E G with s = a 1 + ... + an, and the identity (1) in the proof of Lemma 5.2 gives in this case
I - Es =
n
n
L
E~f(I
(1"*1 j==1
- Eaj )I-(1j.
Applying the above procedure to elements s l '
(I - ESt) ... (I - ESk)t/J = VSt hence
... , Sk •••
E S\ {O} we find
VSkt/J ~ 0,
D
t/J E deS).
6.7. Theorem. The cone .s:I(S) is an extreme subset of JVI(S). A function t/J E JV'(S) is completely alternating if and only if the Levy measure J.1 is concentrated on S+ \ {I}. PROOF. If t/J E deS) we have by 6.3 and 6.5 that L\a t/J E g>b(S) for a E S, hence t/J E JVI(S) by Theorem 3.20. If t/J E deS) and t/J = t/Jl + t/J2' where t/Jl' t/J2 E JV'(S), then L\a t/J = L\a t/J 1 + L\a t/J2 E ..4t(S) and L\a t/J l' L\a t/J2 E g>b(S) for a E S. As vII(S) is extreme in g>b(S) we get L\a t/J l' L\at/J2 E vII(S), and by Lemma 6.3 we conclude that t/Jl' t/J2 E deS), and have thereby shown that
deS) is extreme in JV'(S). Let t/J E JV'(S) have the Levy-Khinchin representation t/J(s}
= t/J(O} + q(s} +
f.
JS
(1 - p(s)} dJl.(p}. \{I}
It is easy to see that t/J(O), q E .s:I(S) and that 1 - P E deS) when pES +\ {I}, so if J.1 is concentrated on S +\ {I} then t/J E deS). Let us next suppose that t/J E deS). Since deS) is extreme in JV'(S) we conclude that
t/Jis}:=
i
Ta
(1 - p(s)} dJl.(p}
~6.
133
Completely Monotone and Alternating Functions
belongs to .s:#(S), too, where Ta = {p ~at/Ja E Jt(S), hence
o ~ dat/Jia) =
i
E
SIpea)
b(S) and deS) = JVl(S). PROOF. If Sis 2-divisible every semicharacter is nonnegative, so the assertions follow from 6.5 and 6.7. D
6.9. Remarks. (1) It can happen that vII(S) = g>b(S) without S being 2divisible. Let S = {O, a, b} be the commutative semigroup with neutral element 0 and a + a = a + b = b + b = a. Then S is not 2-divisible, but S* = S has two elements Po == 1, Pl(O) = 1, Plea) = Pl(b) = 0 so vII(S) = q>b(S) = g>(S). (2) Suppose G is an abelian group considered as a semigroup with the identical involution. Then G is the group of homomorphisms of G into the multiplicative group { - 1, I}. In this case vII(G) = g>b(G) if and only if G is 2-divisible. In fact, if G is not 2-divisible then G/G 2 has at least two elements, where G2 = {2glg E G}, and every element s E G/G 2 satisfies 2s = o. If s E G/G 2 , S t 0, then (T) for all pES 0
for a triple (T, S, v) as described above will play some role for a converse of Theorem 1.2.
1.4. Definition. A triple (T, S, v), where T and S are abelian *-semigroups, and v: T -+ S is a mapping fulfilling the conditions (S 1 )-(S4)' will be called a Schoenberg triple. 1.5. Theorem. Let (T, S, v) be a Schoenberg triple. If'thefilnction qJ: S -+ C is bounded and positive definite, then qJ Vn is positive definite on Tn for every nE N. 0
In view of Theorem 4.2.8 it is enough to show that p all pES. This, in fact, is true because PROOF.
p vit 0
I' ... ,
tn ) =
0
Vn E
r!J>(T n ) for
pet V(t ») = }]/(V(t)) j
l
n
=
TI p
0
vonit l '
... ,
t n)
j= 1
(where n j denotes the canonical projection) is a product of positive definite D functions. The restricted dual of the semigroup (~+, +) is given by the exponentials A ~ 00, as we have seen earlier. This shows that a triple (T, S, v) where S = ~+, is a Schoenberg triple if and only if v is a nonnegative negative definite function on T for which (Sl) and (S3) hold. In particular, if T is a connected topological *-semigroup and v: T -+ [0, oo[ is a continuous negative definite function satisfying v(O) = 0 and v $ 0, then (T, ~+ , v) is a Schoenberg triple. sHe-AS, 0 ~
1.6. Example. The space IP of all real sequences whose pth power is summable is a Banach space only for p ~ 1, i.e. the function IIxllp:= IxjIP]l/P is a norm only for p ~ 1. Nevertheless IP as well as II· lip is well defined also for
[I
~ 1.
149
Schoenberg Triples
o< p
r(B) = {q>:
[R + -+ [R I q>
continuous, q>( 11·11) E &>(B)},
where positive definiteness refers to the group sense. The determination in general of &>r(B) seems to be open. If dim B = 00 then it is a subset of {2 J1( t 2 ) IJ1 E M~ ([R + ) }, and the sets are precisely equal when B is a Hilbert space. The following result of Einhorn (1969) shows that &>r(B) may be degenerate.
154
5. Schoenberg-Type Results for Positive and Negative Definite Functions
2.3. Theorem. Let qJ: [R+ ~ [R have the property that qJ(llxll) is positive definite on the Banach space C[O, 1]. For some rJ. E [R + we then have qJ 1]0, 00 [ == rJ. and qJ(O) ~ rJ.. In particular, .9r(C[O, 1]) consists only oj'the nonnegative constants. PROOF. For n ~ 2 let {(i,j) 11 ~ i
If 0 < s
~ t ~
Ojor all t E [ -1, 1] and t/J := -log p,julfils the three conditions: (i) t/J(I) = 0; (ii) t/J is continuous; (iii) for each positive definite matrix (a jk) with entries in [ -1, 1] the matrix (t/J(a jk)) is negative definite.
If the conditions are fulfilled there exist uniquely determined numbers bb b 2 , ••• ~ 0, L bn < 00, such that 00
t/J(s) = L bn(1 - sn) n=l
jor aII
sE
[ -
1, 1].
PROOF. If P, > 0 and t/J := -log p, fulfils (i)-(iii) then t/J is a continuous function in C 2 , hence by 3.2.2 exp( -tt/J) is a continuous function in K 1 , i.e. a generating function for all t > O. In particular
p, = [exp( - t/J In)]n for all n E N, showing that /1 is infinitely divisible. Now let us assume /1 = (/1n)*n for suitable probability measures /11' /12' ... on No; equivalently, p, = (p'n)n for the corresponding generating functions. If P,(t o) = 0 for some to E] -1, 1[, then all derivatives of fi in to would vanish and then p, == 0 in contradiction to P,(I) = 1 and to {l's continuity on [-1, 1]. Hence fi(t) > 0 at least for all t E ] -1, 1] and consequently all the functions {In are strictly positive on ] -1, 1]. Now t/J:= -log p, is well defined, continuous on [ -1, 1] and finite on ] -1, 1], but still we have to exclude that t/J( -1) = 00. Let (a jk ) be a positive definite matrix, ajk E] -1, 1] for allj, k. Then (t/J(ajk)) is negative definite, because for all n E N
(e-o/n)!/I(a jk ») = (p,(a jk)l/n) = (Pn(ajk)) is positive definite by Proposition 3.7. In particular, t/J I] -1, 1] is a nonnegative negative definite function on the semigroup ] -1, 1], therefore bounded by Proposition 4.3.4, and since t/J is continuous on [-1, 1] we get t/J( -1) < 00, i.e. P,( -1) > 0, so that t/J is indeed a (real-valued) continuous function in the cone C 2 • The asserted representation of t/J follows immediately from Corollary 3.12. D
3.15. Corollary. Let /1 E M~(No) be infinitely divisible. Then /1 is the distribution of a random variable X = L:= 1 nXn' where the Xl' X 2' ... are independent, and where each X n is Poisson distributed with parameter bn, and L:=l bn < 00. PROOF.
L bn
0; (iii) qJ2(t) = qJ(t 2) for all t E [ -1, 1].
§4. Schoenberg's Theorem for the Complex Hilbert Sphere In the preceding section we have seen that a continuous function qJ: [ -1, 1] such that qJ( y») is positive definite on the unit sphere of some infinite dimensional real Hilbert space, has a power series representation qJ(s) = ans n with an ~ 0 and an < 00. We are now going to extend this result to the case of a complex Hilbert space H (always infinite dimensional), and of course we have to replace the interval [ -1, 1] by the closed unit disc D:;:::: {z E Clizi ~ I} in the complex plane. Let lr := {z E C/izi = I} denote the torus group. When we consider lr as a discrete group, we shall write lr d instead of lr. In analogy with the real case we introduce C to be the set of all functions qJ: D ~ C operating on positive definite matric'"3 in the sense that (qJ(ajk)) is positive definite whenever (a jk ) is a positive definite matrix with entries in D. By the Schur Product Theorem (3.1.12) C is a multiplicative closed convex cone in CD. It is immediately seen that each qJ E C is nonnegative on [0, 1].
! and IpO(t) I < 1t, which is true for t sufficiently close to 1, we have IpO(t) I ~ O(t k), hence · O(tk) 1· kt k- 10'(tk) k I m - - = 1m = Ip I -~ 1t-l O(t) t-l O'(t) by I'Hopital's rule because PROOF.
lim O'(t) = t-l
-)3.
The direct evaluation of this limit is tedious but elementary. It is, however, no coincidence that this limit is y'(I) because
lim ~ (te i6 (1» = 1 + i lim (}'(t) t-l dt t-l is a tangent vector to y = y(x) at x = 1.
D
4.3. Lemma. Let qJ E C and ZED and define thefunctions qJl' qJ2' qJ3' qJ4: D -+ C by
qJl, 2(W) = qJ(w) qJ3,4(W) = qJ(w)
± ![qJ(zw) + qJ(zw)] ± (1/2i)[qJ(zw) - qJ(zw)].
Then qJl' ... , qJ4 all belong to C, too.
~4,
169
Schoenberg's Theorem for the Complex Hilbert Sphere
Let (ajk) be a fixed positive definite n x n matrix with entries in D, and let (c l' . . . , Cn) E C n also be fixed. Define the function : D -+ C by
PROOF.
n
L CjCk ({J(zajk)·
(z):=
j, k= 1
We claim that E C. In fact, for any positive definite m x m matrix (b pq ) with entries in D and for all (db .. . , dm ) E cm we have m
L
p, q = 1
m
n
L
dpdq(b pq) =
L
p, q = 1 j, k = 1
dpcjdqCk ((J(bpqajk) ~ 0
because the tensor product (b pq ) ® (ajk) is positive definite by Corollary 3.1.13. Therefore n
I(z) I ~
L CjCk ((J(a jk ),
(1) =
j,k= 1
hence, denoting G = Re( 0
[x, y]-t = e-t1og[x,y] = (x oYo)-t(l - cp(x, y»-t. The right-hand side is positive definite by Corollary 3.1.14 so that t/f(x, y) := log[x, y] is a negative definite nonnegative kernel on X x X. The equation
174 cosh d
5. Schoenberg-Type Results for Positive and Negative Definite Functions
= exp(t/J) has for d, t/J
~
0 the unique solution
+ 10g(1 + Jl
d = t/J
- e- 2 t/1),
and therefore d is negative definite, too, by Corollary 3.2.10. Now we can state the above-mentioned result of Faraut and Harzallah (1974).
5.1. Theorem. Letj': [R+
~
[R be an arbitrary given junction. Then
f(log cosh d(x, y» is positive definite on X x X if and only iffE gpb([R+). The kernel f(log cosh d(x, y» is negative definite if and only iff E %'( [R +). PROOF. IffE gpb([R+), thenf(s) and then
=
flO,
00] e-J.s
dJ.l()") for some J.l EM +([0,00])
f(log cosh d(x, y» = f(log[x, y]) =
f
e-. (ii)" Suppose ajXs j = 0 on S*, i.e.
PROOF.
I
S l'
... ,
Sn E
S are distinct points and that
n
I
ajp(Sj)
=
pES*.
for
0
j= 1
In order to prove that a 1 = 0 we choose Pj E s* for j Pj(Sl) =f Pj(Sj), which is possible by (i), and define
=
2, ... , n such that
n
j'(s)
Il (Pj(s) -
=
Pj(Sj»(Pj(Sl) - Pj(Sj»-l,
SE
S.
j= 2
Thenj'belongs to the linear span of S* in CS andf(sl) j = 2, ... ,n, hence
= 1,f(sj) =
0 for
n
a1 =
I
ajf(sj) = O.
j= 1
D
The implication" (ii) => (i)" is trivial.
1.9. Suppose that the equivalent conditions of Proposition 1.8 are verified. Then C S and V(C) form a dual pair under the bilinear mapping
\ cP,
Jl
aj xSj )
=
Jl
ajcp(s),
(5)
which is well-defined because of 1.8(ii). The topology of pointwise convergence on C S is compatible with this duality. In fact, if L: C S -.. C is a continuous linear functional there exist a finite set D £; Sand c; > 0 such that IL((~o) and /lEE+([R, s). For {co, c l , k L~=o CkX and have
j.~/jCkSj+k =
f
... ,
p(X)2
cn}
~
IR we define p(x)
=
dJl(x),
so s is strictly positive definite precisely when supp(/l) is an infinite set.
2.4. Let F be a closed subset of [R. The F -moment problem consists in characterizing the F -moment sequences, i.e. the real sequences s = (sn)n ~ 0 of the furm Sn
=
L
n
x dJl(x),
n
~
where /l E E + ([R) is supported by the closed set F.
0,
187
§2. The One-Dimensional Moment Problem
Hamburger's moment problem corresponds to F
°
= IR. The case of
F = [ -a, a], a ~ has been solved in Proposition 4.4.9, and in the special case a = 1 which we will call Haviland's moment problem, two different characterizations of [-1, 1]-moment sequences s have been found, (a) SE&,b(N o), (b) sE&'(N o) and s - E 2 sE&,(N o), the latter going back to Haviland (1936), cf. 1.2. The case of F = [0, 1] is Hausdorff's moment problem and the [0, 1]-moment sequences are precisely the completely monotone functions on the semigroup (No, +), see 4.6.11. We shall now consider the case F = [0, oo[ which is called Stieltjes' moment problem, solved by Stieltjes in 1894. The [0, 00 [-moment sequences are also called Stieltjes moment sequences.
2.5. Theorem. For a sequence S tions are equivalent:
=
(sn)n
~0
of'real numbers the following condi-
(i) s, E 1 SE &'(N o). (ii) t = (So, 0, Sl' 0, S2'· ..) E &'(N o). (iii) There exists J1 EM +([0, oo[) such that Sn
PROOF.
"(i) ~ (ii)" Let
tn
=
= sn/2
f"
n
n
x d/l(x),
for n even and t n
~
=
0.
°
for n odd. For
we then have n
I
n
n CjCktj+k
=
j,k= 0
I
CjCktj+k
+ L
[(n - 1 )/2]
[nI2]
=
I
CjCkt j + k
j,k= 0 j, k odd
j,k=O j, k even
C2pC2qSp+q
I
+
p,q=O
C 2p + 1 C 2q + 1 Sp+q+ 1
p,q=O
showing that t E 9(N o). "(ii) ~ (iii)" By Hamburger's theorem (2.2) there is t 2n + 1
=
f
x 2n + 1 d(J(x)
(J
E
~
°
E + (IR) such that
= 0,
n
~
0,
hence
n
~
0,
where J1 is the image measure of (J under the continuous mapping x ~ x 2 of IR into [0, 00 [. "(iii) ~ (i)" is clear since E 1 S is represented by the positive Radon measure xJ1. D
188
6, Positive Definite Functions and Moment Functions
In analogy with Theorem 2.2 we can give an integral representation of all functions ljJ E JV(N o) with a representing measure which is not necessarily unique.
2.6. Theorem. Ajilnction ljJ: No ~ lR is negative definite if'and only if'it has a representation oj' the form ljJ(n)
where a,
r
= a + bn - cn 2 + bE
(1 - x n
n(1 - x)) d/l(x),
-
J~\{l}
lR, c ~ 0 and Il E M +(lR\ {I}) satisfies
f.
(1 - X)2 dll(X)
'l(]0, oo[) then tjJ: ]0, oo[
given in the Examples
-+ ~
defined by tjJ(s):=
J1 qJ(t) dt belongs to JVq(]O, oo[). Conversely, the derivative oj' any jiinction in JVq(]O,
00 [)
belongs to g>q(]O,
00
E).
1.14. Exercise. Let H be N k with componentwise addition. Show that fI can be identified with ([ -1, O[ u ]0, 1])k. Show that both n = (n t , ••• , nk) ~ (nt+···+nk)-l and n~(-l)nl+···+nl'(nl+···+nk)-t are positive definite (and quasibounded) and try to find their integral representations. 1.15. Exercise. Let H be the real line with maximum operation. Then g>q(H) is the set of all nonnegative decreasing functions, and JVq(H) consists of all increasing functions on ~. The same is true if we replace the line by an arbitrary nonempty subset of ~.
~2.
263
Completely Monotone and Completely Alternating Functions
1.16. Exercise. If H is the additive semigroup JO, oo[n, then FJ == [R~ (by natural identification). Show that all first partial derivatives of a function in %q(]O, 00 en) belong to gPq(]O, 00 en). 1.17. Exercise. Show that IIxll- I is not positive definite on ]0, oo[n for n > 1, 11·11
denoting the euclidean norm.
1.18. Exercise. Let l/J be a strictly positive, negative definite function on the abelian semigroup H. Then if j'E gPq(]O, oo[) we have f l/J E gPq(H), and if g E%q(]O, oo[), then g ffE %q(H). 0
0
1.19. Exercise . Use 1.18 to show that x 1---+ log(2:7 = I xJ is negative definite 00 [" and that x 1---+ (2:7=1 xJ - t is positive definite for all t > 0.
on JO,
1.20. Exercise. Let K be a compact Hausdorff space and let H be the set of all strictly positive continuous functions on K with pointwise addition. Show that H can be identified with M +(K), the set of Radon measures on K. 1.21. Exercise. Let l/J E %6(H) have the representation given in Theorem 1.9. Then l/J can be extended to a negative definite function on S, bounded below, if and only if SH\{l} [1 - p(h)] dp,(p) < 00 for all hE H.
°
1.22. Exercise. Let S be a perfect semigroup with zero element and identical involution, and assume that H:== S\{O} fulfils H == H + H. Then H is a perfect semigroup without zero in the sense that for every qJ E f2P(H) there is a unique Radon measure J1 on S* \ r 1{a}} such that qJ(h)
=
fp(h)d~(p),
hEH.
§2. Completely Monotone and Completely Alternating Functions Again let (H, +) denote an arbitrary abelian semigroup and put, as in the preceding section, S :== H u {a}, H :== S\ {I fO}}. In the definition of completely monotone (resp. alternating) functions on H no change is necessary compared with the definition for functions on semigroups with neutral element (cf. 4.6.1): we again require (resp.
~
0)
for all n E N, (hI' . .. ,hn) E Hn and all hE H. If one of the h/s is 0, this expression is still well defined and equal to zero, so that the condition is not changed by allowing (hI' ... , hn ) E sn. Of course a completely monotone function is also required to be nonnegative.
264
8" Positive and Negative Definite Functions on Abelian Semigroups Without Zero
We denote by ---'I(H) (resp. d(H)) the set of all completely monotone (resp. alternating) functions on H. Note that in contrast with the positive and negative definite functions which in a natural way are only defined on the subsemigroup H' of nonmaximal elements of H, the completely monotone (resp. alternating) functions are defined on all of H. For f: H -+ [R and h E H let fh: S -+ [R be defined by fh(S) := f(h + s), and for fixed s E S define ~sf: H -+ [R by ~sf(h) := f(h + s) - f(h). Let us first state some simple properties of ---'I(H) and d(H). 2.1. Lemma. (i) Both ---'I(H) and d(H) are closed convex cones in [RH. (ii) - ---'I(H) ~ d(H). (iii) A function qJ: H -+ [R belongs to ---'I(H) (resp. d(H)) if and only
---'I(S) (resp. qJh E d(S))for all hE H. (iv) A function t/J: H -+ ~ belongs to d(H) if and only function ~a t/J belongs to ---'I(H). (v) qJb qJ2 E ---'I(H) implies qJl . qJ2 E ---'I(H). (vi) l/J E d(H) if"and only if e-tl/J E ---'I(H)jor all t > O. PROOF.
if qJh E
if jor every a E S the
(i) and (ii) need no proof. (iii) follows from the identity
Val··· VanqJh(S) = Val··· VanqJ(h valid for all hE H, S E S, n E N, (a 1 , We get (iv) from
••• ,
+ s)
an) E sn.
Val··· Van(~al/J) = - Val··· VanVal/J, which holds for all a E S, n E N and (at, ... , an) E sn. Using (iii) and the fact that ---'I(S) is closed under pointwise multiplication we get property (v) observing also that (qJl . qJ2)h = (qJt)h . (qJ2)h. In the same D way (vi) follows from Proposition 4.6.10. Let qJ be a completely monotone function on H. Then the above lemma tells us that for each h E H the translate qJh belongs to ---'I(S) implying by Theorem 4.6.5 the representation benhavns Universitet. JONES, W. B., W. J. THRON and H. WAADELAND (1980). A strong Stieltjes moment problem. Trans. Amer. Math. Soc. 261, 503-528. JONES, W. B., O. NJASTAD and W. J. THRON (1984). Orthogonal Laurent polynomials and the strong Hamburger moment problem. J. Math. Anal. Appl. 98, 528-554. KAHANE, J.-P. (1979). Sur les fonctions de type positif et de type negatif. Seminar on Harmonic Analysis 1978-79, pp. 21-37. Publ. Math. Orsay 79, No.7, Univ. Paris XI,Orsay. KAHANE, J.-P. (1981). Helices et quasi-helices. In Mathematical Analysis and Applications (Ed. by L. Nachbin). Essays dedicated to Laurent Schwartz on the occasion of this 65th birthday. Part B. Adv. Math. Suppl. Studies 7B. New York-London: Academic Press.
References
277
KARLIN, S. and A. NOVIKOFF (1963). Generalized convex inequalities. Pacific J. Math. 13, 1251-1279. KINGMAN, J. F. C. (1972). Regenerative Phenomena. London-New York: Wiley. KISYNSKI, J. (1968). On the generation of tight measures. Studia Math. 30, 141-151. KOLMOGOROV, A. N. (1941). Stationary sequences in Hilbert space. Bull. Math. Univ. Moscow 2, 1-40. KONHEIM, A. G. and B. WEISS (1965). Functions which operate on characteristic functions. Pacific J. Math. 15, 1279-1293. KUELBS, J. (1973). Positive definite symmetric functions on linear spaces. J. Math. Anal. Appl. 42, 413-426. LINDAHL, R. J. and P. H. MASERICK (1971). Positive-definite functions on involution semigroups. Duke Math. J. 38, 771-782. LOEVE, M. (1963). Probability Theory, 3rd ed. Princeton: van Nostrand. LUKACS, E. (1960). Characteristic Functions. London: Griffin. MARCZEWSKI, E. and C. RYLL-NARDZEWSKI (1953). Remarks on the compactness and non-direct products of measures. Fund. Math. 40, 165-170. MARSHALL, A. W. and I. OLKIN (1979). Inequalities: Theory of Majorization and its Applications. New York-London: Academic Press. MASERICK, P. H. (1975). BV-functions, positive definite functions and moment problems. Trans. Amer. Math. Soc. 214, 137-152. MASERICK, P. H. (1977). Moments of measures on convex bodies. Pacific J. Math. 68, 135-152. MASERICK, P. H. (1978). A Levy-Khinchin formula for semigroups with involutions. Math. Ann. 236, 209-216. MASERICK, P. H. (1981). Disintegration with respect to Lp-density functions and singular measures. Z. Wahrsch. verw. Gebiete 57,311-326. MASERICK, P. H. and F. H. SZAFRANIEC (1984). Equivalent definitions of positivedefiniteness. Pacific J. Math. 110, 315-324. MATHERON, G. (1975). Random Sets and Integral Geometry. New York: Wiley. MAZUR, S. and S. ULAM (1932). Sur les transformations isometriques d'espaces vectoriels normes. C. R. Acad. Sci. Paris 194, 946-948. MCGREGOR, J. L. (1980). Solvability criteria for certain n-dimensional moment problems. J. Approx. Theory 30,315-333. MERCER, J. (1909). Functions of positive and negative type, and their connection with the theory of integral equations. Philos. Trans. Royal Soc. London, Sere A, 209, 415-446. MICCHELLI, C. A. (1978). A characterization of M. W. Wilson's criterion for nonnegative expansions of orthogonal polynomials. Proc. Amer. Math. Soc. 71, 69-72. MICCHELLI, C. A. and R. A. WILLOUGHBY (1979). On functions which preserve the class of Stieltjes matrices. Linear A~qebra Appl. 23, 141-156. MLAK, W. (1978). Dilations of Hilbert space operators (general theory). Dissertationes Math. 153, 1-61. MORANDO, P. (1969). Mesures aleatoires. In Seminaire de Probabilite III. Lecture Notes in Mathematics 88, pp. 190-229. Berlin-Heidelberg-New York: Springer-Verlag. MOTZKIN, T. S. (1967). The arithmetic-geometric inequality. In Inequalities (Ed. by O. Shisha). New York-London: Academic Press. NEUMANN, M. (1983). A maximality theorem concerning extreme points. J. Math. Anal. Appl. 96, 148-152. NUSSBAUM, A. E. (1966). Quasi-analytic vectors. Ark. Mat. 6,179-191. OXTOBY, J. C. (1971). Measure and Category. New York-Heidelberg-Berlin: SpringerVerlag. PAALMAN-DE MIRANDA, A. B. (1964). Topological Semigroups. Mathematical Centre Tracts 11. Amsterdam: Mathematisch Centrum. PACHL, J. (1974). Free uniform measures. Comment. Math. Univ. Carolin. 15,541-553.
278
References
PARTHASARATHY, K. R. (1967). Probability Measures on Metric Spaces. New YorkLondon: Academic Press. PARTHASARATHY, K. R. and K. SCHMIDT (1972). Positive Definite Kernels, Continuous Tensor Products, and Central Limit Theorems of Probability Theory. Lecture Notes in Mathematics 272. Berlin-Heidelberg-New York : Springer-Verlag. PARTHASARATHY, K. R., R. R. RAO and S. R. S. VARADHAN (1963). Probability distributions on locally compact abelian groups. Illinois J. Math. 7, 337-369. PARZEN, E. (1971). Time Series Analysis Papers. San Francisco: Holden Day. PETERSEN, L. C. (1982). On the relation between the multidimensional moment problem and the one-dimensional moment problem. Math. Scand. 51, 361-366. PHELPS, R. R. (1966). Lectures on Choquet's Theorem. Princeton: van Nostrand. POLLARD, D. and F. TOPS~E (1975). A unified approach to Riesz type representation theorems. Studia Math. 54, 173-190. REDEl, L. (1965). The Theory ofFinitely Generated Commutative Semigroups. New York: Pergamon Press. REEDS, J. (1979). Widder theorem characterization of Laplace transforms of positive measures on IRk. Report from the Department of Statistics, University of California, Berkeley. RESSEL, P. (1974). Laplace-Transformation nichtnegativer und vektorwertiger Masse. M anuscripta Math. 13, 143-152. RESSEL, P. (1976). A short proof of Schoenberg's theorem. Proc. Amer. Math. Soc. 57, 66-68. RESSEL, P. (1977). Some continuity and measurability results on spaces of measures. Math. Scand. 40, 69-78. RESSEL, P. (1979). Positive definite functions on abelian semigroups without zero. In Studies in Analysis (Ed. by G.-C. Rota). Adv. Math. Suppl. Studies 4, pp. 291-310. New York-London: Academic Press. RESSEL, P. (1982a). A general Hoeffding type inequality. Z. Wahrsch. verw. Gebiete 61, 223-235. RESSEL, P. (1982b). A topological version of Slutsky's theorem. Proc. Amer. Math. Soc. 85,272-274. RESSEL, P. (1983). De Finetti-type theorems: an analytical approach. Technical report Nr. 209, Department of Statistics, Stanford University. ROBERTSON, A. P. and W. J. ROBERTSON (1964). Topological Vector Spaces. Cambridge: The University Press. ROBINSON, R. M. (1969). Some definite polynomials which are not sums of squares of real polynomials. Notices Amer. Math. Soc. 16, 554. ROBINSON, R. M. (1973). Some definite polynomials which are not sums of squares of real polynomials. In Selected Questions oj'Algebra and Logic, pp. 264-282. (A collection dedicated to the memory of A. I. Mal'cev.) Izdat. "Nauka" Sibirsk. Otdel., Novosibirsk (in Russian). RUDIN, W. (1959). Positive definite sequences and absolutely monotonic functions. Duke Math. J. 26, 617-622. RUDIN, W. (1962). Fourier Analysis on Groups. New York: Interscience Publishers. RUDIN, W. (1973). Functional Analysis. New York: McGraw-Hill. SCHAEFER, H. H. (1971). Topological Vector Spaces. New York-Heidelberg-Berlin: Springer-Verlag. SCHEMPP, W. (1977). On functions of positive type on commutative monoids. Math. Z. 156, 115-121. SCHMUDGEN, K. (1979). An example of a positive polynomial which is not a sum of squares of polynomials. A positive, but not stro: 'gly positive functional. Math. lVachr. 88, 385-390. SCHOENBERG, I. J. (1938a). Metric spaces and completely monotone functions. Ann. of Math. 39, 811-841.
References
279
SCHOENBERG, I. J. (1938b). Metric spaces and positive definite functions. Trans. Amer. Math. Soc. 44, 522-536. SCHOENBERG, I. J. (1942). Positive definite functions on spheres. Duke Math. J. 9, 96-108. SCHUR, I. (1911). Bemerkungen zur Theorie der beschrankten Bilinearformen mit unendlich vielen Veranderlichen. J. Reine Angew. Math. 140, 1-29. SCHWARTZ, L. (1973). Radon Measures on Arbitrary Topological Spaces and Cylindrical Measures. London: Oxford University Press. SHOHAT, J. A. and J. D. TAMARKIN (1943). The Problem of Moments. Math. Surveys 1. Providence: American Mathematical Society. SIBONY, D. (1967-1968). Cones de fonctions et potentiels. Cours de 3eme cycle it. la Faculte des Sciences de Paris. STEWART, J. (1976). Positive definite functions and generalizations, a historical survey. Rocky Mountain J. Math. 6, 409-434. STIELTJES, T. J. (1894). Recherches sur les fractions continues. Ann. Fac. Sci. Toulouse 8, 1-122, and 9, 1-47. (In Oeuvres Completes, Vol. II, pp. 402-567. Groningen: Noordhoff, 1918.) STOCHEL, J. (1983). The Bochner type theorem for *-definite kernels on abelian *-semigroups without neutral element. In Dilation Theory, Toeplitz Operators, and other Topics, pp. 345-362. (Ed. by G. Arsene). Basel-Boston-Stuttgart: Birkhauser Verlag. SVECOV, K. I. (1939). On Hamburger's moment problem with the supplementary requirement that masses are absent on a given interval. Commun. Soc. Math. Kharkov 16, 121-128 (in Russian). SZAFRANIEC, F. H. (1977). Dilations on involution semigroups. Proc. Amer. Math. Soc. 66,30-32. SZANKOWSKI, A. (1974). On Dvoretzky's theorem on almost spherical sections of convex bodies. Israel J. Math. 17, 325-338. SZ.-NAGY, B. (1960). Extension of Linear Transformations in Hilbert Space which extend beyond this Space. New York: Ungar. TALAGRAND, M. (1976). Quelques examples de representation integrale: Valuations, fonctions alternees d'ordre infini. Bull. Sci. Math. 100 (2), 321-329. TONEV, T. W. (1979). Positive-definite functions on discrete commutative semigroups. Semigroup Forum 17,175-183. TOPS~E, F. (1970). Topology and measure. Lecture Notes in Mathematics 133, BerlinHeidelberg-New York: Springer-Verlag. TOPS"(S) IqJ(O) = I} &:.(B) %(S) negative definite functions %'(S) lower bounded negative definite functions %b(S) bounded negative definite functions A(S) completely monotone functions A 1(S) = {qJ E A(S) IqJ(O) = I} d(S) completely alternating functions Yr(S) moment functions E+(S*) = {,u E M +(S*)IJ IXsl d,u < 00 for all s E S} E+(S*, qJ) = {,u E E+(S*)IJ XS d,u = qJ(s) for all S E S} E(S*) = {,ul - ,u2 + i(,u3 - ,u4)I,ul,···,,u4 E E+(S*)} 2 set of Levy measures
129 91, 123 110, 129, 130 103 130 130 92 96
180 181 88 88 94 94 94
96 96
153 89 99
105 130 130 179 179
180 203 105
6. Notations Related to an Abelian Semigroup (H, +) which (Possibly) has no Zero (S:= H u {O}) H' = H + H
fJ = S\ {l{o}} 9(H)
the nonmaximal elements of H
&"l(H)
positive definite functions on H quasibounded positive definite functions
96(H) %(H) %q(H)
negative definite functions on H negative definite functions, quasibounded below
%~(H)
A(H) Ao(H) d(H) do(H)
completely monotone functions on H completely alternating functions on H
252 255 253 253 254 253 257 257 264 264 264
265
284
List of Symbols
7. Miscellaneous Symbols 3
PA
seminorm determined by A supremum norm E* algebraic dual space E' topological dual space alg span(r), alg span + ( r) r-positive linear functionals
II . 1100
A:
/P
= {x E IR
'pP(Jl) LP(Jl)
=
I I IJI Ix(n)IP < oo}, 0 < p