NailTheTest
Advanced Mathematics

Functional Analysis: Spaces, Operators, and Structure

Functional analysis extends linear algebra and real analysis to infinite-dimensional spaces. It is the language of modern PDE theory, quantum mechanics, and signal processing, providing the abstract framework that unifies convergence, duality, and spectral phenomena.

Table of Contents

  1. 1. Metric Spaces and Completeness
  2. 2. Normed Spaces and Banach Spaces
  3. 3. Inner Product Spaces and Hilbert Spaces
  4. 4. Bounded Linear Operators
  5. 5. Hahn-Banach Theorem
  6. 6. Open Mapping and Closed Graph Theorems
  7. 7. Uniform Boundedness Principle
  8. 8. Dual Spaces and Reflexivity
  9. 9. Weak and Weak-Star Topologies
  10. 10. Compact Operators
  11. 11. Spectral Theory
  12. 12. Fourier Analysis on Hilbert Spaces
  13. 13. Distributions and Schwartz Space
  14. 14. Sobolev Spaces and Elliptic PDE
  15. 15. Frequently Asked Questions

1. Metric Spaces and Completeness

A metric space is the minimal structure needed to define convergence, continuity, and completeness. It consists of a set X together with a distance function d that satisfies a small number of natural axioms.

Definition of a Metric Space

A metric on a set X is a function d : X x X to [0, infinity) satisfying four axioms for all x, y, z in X:

Metric Axioms

  • (M1) Non-negativity: d(x, y) ≥ 0, with d(x, y) = 0 if and only if x = y
  • (M2) Symmetry: d(x, y) = d(y, x)
  • (M3) Triangle inequality: d(x, z) ≤ d(x, y) + d(y, z)

Standard examples include the real line with d(x, y) = |x - y|, Euclidean n-space with the Euclidean norm, and function spaces with various integral-based metrics. The discrete metric assigns distance 1 to distinct points and 0 to equal points.

Convergence and Cauchy Sequences

A sequence (x_n) in a metric space (X, d) converges to x if d(x_n, x) tends to 0 as n tends to infinity. It is a Cauchy sequence if for every epsilon greater than 0 there exists N such that d(x_m, x_n) is less than epsilon for all m, n greater than N. Every convergent sequence is Cauchy, but the converse need not hold in a general metric space.

Complete Metric Spaces

Definition: Completeness

A metric space (X, d) is complete if every Cauchy sequence in X converges to a point in X.

The real numbers are complete (this is the completeness axiom). The rationals are not: the sequence 3, 3.1, 3.14, 3.141, ... approximating pi is Cauchy in the rationals but does not converge in the rationals. Completeness is stable under closed subsets and under countable products, but not under arbitrary subsets.

An important theorem about complete metric spaces is that every metric space can be completed: there exists a complete metric space into which the original embeds isometrically as a dense subset. This completion is unique up to isometric isomorphism.

The Baire Category Theorem

Theorem (Baire Category Theorem)

Let (X, d) be a complete metric space. If X is expressed as a countable union of closed sets, then at least one of those closed sets has non-empty interior.

Equivalently: a complete metric space is not a countable union of nowhere-dense sets (sets whose closure has empty interior).

The Baire Category Theorem is one of the most powerful tools in functional analysis. A set is called meager (or of the first category) if it is a countable union of nowhere-dense sets, and non-meager (of the second category) otherwise. The theorem says complete metric spaces are always of the second category in themselves.

The three fundamental theorems of functional analysis — the Open Mapping Theorem, the Closed Graph Theorem, and the Uniform Boundedness Principle — all have proofs rooted in the Baire Category Theorem. Understanding Baire's theorem is therefore a prerequisite for understanding the architecture of the subject.

Contraction Mapping Principle

Banach Fixed-Point Theorem

If T : X to X is a contraction on a complete metric space, meaning there exists k in (0, 1) such that d(Tx, Ty) ≤ k d(x, y) for all x, y, then T has a unique fixed point x* = T(x*) and the iterates T^n(x_0) converge to x* from any starting point x_0.

This theorem has concrete applications: proving existence and uniqueness of solutions to ODEs (via the Picard iteration), to integral equations, and to various implicit function theorems. The rate of convergence is geometric with ratio k^n, giving both existence and a computable approximation.

Compactness in Metric Spaces

In metric spaces, compactness has several equivalent characterizations. A metric space is compact if and only if it is complete and totally bounded (for every epsilon greater than 0, it can be covered by finitely many balls of radius epsilon). In finite-dimensional Euclidean space, compactness is equivalent to being closed and bounded (Heine-Borel theorem). In infinite-dimensional spaces the Heine-Borel theorem fails, and the unit ball is never compact — a fact with major consequences throughout functional analysis.

2. Normed Spaces and Banach Spaces

A norm adds algebraic structure to a metric: it is a metric derived from the size of vectors, compatible with the vector space operations. This interplay between the linear structure and the topology is what makes functional analysis so rich.

Norms and the Induced Metric

Definition: Norm

A norm on a real or complex vector space X is a function from X to [0, infinity), written as two vertical bars around the element, satisfying:

  • (N1) Positive definiteness: the norm of x equals zero if and only if x = 0
  • (N2) Absolute homogeneity: the norm of (alpha times x) equals the absolute value of alpha times the norm of x
  • (N3) Triangle inequality: the norm of (x + y) is at most the norm of x plus the norm of y

Every norm induces a metric via d(x, y) = the norm of (x - y). The resulting metric is translation-invariant (the distance between x + z and y + z equals the distance between x and y) and homogeneous (scaling all points by a scalar scales all distances by its absolute value). A normed space is a vector space equipped with a norm. A Banach space is a complete normed space.

The L^p Spaces

The Lebesgue spaces L^p are among the most important examples of Banach spaces and arise naturally in analysis and applications.

L^p Spaces: Definition and Properties

  • L^p(Omega) for 1 ≤ p < infinity: equivalence classes of measurable functions f on Omega with finite p-th power integral. The norm is (the integral of |f|^p)^(1/p).
  • L^infinity(Omega): essentially bounded measurable functions, with norm equal to the essential supremum of |f|.
  • L^2 is special: it is a Hilbert space, with inner product being the integral of f times the conjugate of g.
  • Hölder's inequality: if 1/p + 1/q = 1, then the integral of |f times g| is at most the L^p norm of f times the L^q norm of g.
  • Minkowski's inequality: the L^p norm of (f + g) is at most the L^p norm of f plus the L^p norm of g (this is the triangle inequality for L^p).

All L^p spaces for 1 ≤ p ≤ infinity are complete, hence Banach spaces. The dual of L^p is L^q when p and q are conjugate exponents with 1/p + 1/q = 1, for 1 < p < infinity. The dual of L^1 is L^infinity, and the dual of L^infinity is strictly larger than L^1.

The Space C[a, b]

The space C[a, b] of all continuous real-valued (or complex-valued) functions on a closed bounded interval [a, b], equipped with the supremum norm (the maximum of |f(x)| over all x in [a, b]), is a fundamental example of a Banach space. Its completeness follows from the classical theorem that a uniform limit of continuous functions is continuous.

The Weierstrass approximation theorem states that polynomials are dense in C[a, b], making C[a, b] separable (it has a countable dense subset). The Stone-Weierstrass theorem vastly generalizes this: any subalgebra of C(K) on a compact Hausdorff space K that separates points and contains the constants is dense in C(K).

Equivalent Norms and Finite Dimensions

Two norms on a vector space are equivalent if each controls the other up to a constant factor. Equivalent norms induce the same topology (same open sets, same convergent sequences). A foundational theorem states that on a finite-dimensional vector space, all norms are equivalent. This is why finite-dimensional analysis does not require careful attention to which norm is used. In infinite dimensions, norms can be genuinely inequivalent, and the choice of norm profoundly affects the analysis.

Series in Banach Spaces

In a Banach space, absolute convergence implies convergence: if the sum of the norms of the terms is finite, then the partial sums of the series converge. This characterizes completeness among normed spaces. A Schauder basis for a Banach space X is a sequence (e_n) such that every element of X can be written uniquely as a convergent series with scalar coefficients times basis vectors. Not every Banach space has a Schauder basis (an example without one was constructed by Per Enflo in 1973), though all classical spaces do.

3. Inner Product Spaces and Hilbert Spaces

Adding an inner product to a vector space gives access to the geometric notions of angle, orthogonality, and projection. Hilbert spaces — complete inner product spaces — are the infinite- dimensional setting closest to Euclidean geometry and are fundamental to quantum mechanics and Fourier analysis.

Inner Products

Definition: Inner Product

An inner product on a complex vector space H is a function from H times H to the complex numbers, written as angle brackets around the pair of vectors, satisfying:

  • (I1) Conjugate symmetry: the inner product of (x, y) is the conjugate of the inner product of (y, x)
  • (I2) Linearity in the first argument: the inner product of (ax + by, z) = a times the inner product of (x, z) plus b times the inner product of (y, z)
  • (I3) Positive definiteness: the inner product of (x, x) is non-negative and equals zero only when x = 0

Every inner product induces a norm: the norm of x is the square root of the inner product of (x, x). A Hilbert space is an inner product space that is complete with respect to this induced norm.

Cauchy-Schwarz Inequality

Cauchy-Schwarz Inequality

For all x, y in an inner product space H:

|&langle;x, y&rangle;| ≤ &lVert;x&rVert; · &lVert;y&rVert;

Equality holds if and only if x and y are linearly dependent.

The Cauchy-Schwarz inequality is the single most used inequality in functional analysis. It implies that the inner product is a continuous function, that the angle between vectors is well-defined, and that orthogonal projections are contractions. In L^2, the Cauchy-Schwarz inequality becomes the classical integral inequality: the square of the integral of f times g is at most the integral of f squared times the integral of g squared.

Parallelogram Law and Polarization

Parallelogram Law

In any inner product space:

&lVert;x + y&rVert;² + &lVert;x - y&rVert;² = 2(&lVert;x&rVert;² + &lVert;y&rVert;²)

A normed space is an inner product space if and only if its norm satisfies the parallelogram law. The inner product can be recovered from the norm via the polarization identity.

The parallelogram law characterizes inner product spaces among normed spaces. For example, L^p is a Hilbert space only when p = 2, because for other values of p its norm fails the parallelogram law.

Orthogonality and Projections

Vectors x and y are orthogonal if their inner product is zero. The Pythagorean theorem holds: if x and y are orthogonal, then the square of the norm of (x + y) equals the sum of the squares of their individual norms. A set is orthonormal if every vector has norm 1 and distinct vectors are orthogonal.

For a closed subspace M of a Hilbert space H, every vector x in H has a unique best approximation (closest point) in M, called the orthogonal projection of x onto M. The orthogonal complement M perpendicular is the set of all vectors orthogonal to every vector in M, and H is the direct sum of M and M perpendicular. This projection theorem is one of the most important structural facts about Hilbert spaces and has no direct analogue for general Banach spaces.

Orthonormal Bases and Parseval's Theorem

Bessel's Inequality and Parseval's Theorem

Let (e_n) be an orthonormal sequence in a Hilbert space H. For any x in H:

Bessel's inequality: the sum of the squares of the Fourier coefficients of x with respect to (e_n) is at most the square of the norm of x.

Parseval's theorem: (e_n) is a complete orthonormal system (an orthonormal basis) if and only if equality holds in Bessel's inequality for every x, meaning every element can be expanded as a series in the basis with the sum of squared coefficients equal to the squared norm.

Every separable Hilbert space has a countable orthonormal basis, and all separable infinite-dimensional Hilbert spaces are isometrically isomorphic to the space l^2 of square-summable sequences. Gram-Schmidt orthogonalization converts any linearly independent sequence into an orthonormal one spanning the same closed subspace.

The Riesz Representation Theorem

Riesz Representation Theorem (for Hilbert spaces)

Every bounded linear functional f on a Hilbert space H is of the form f(x) = the inner product of (x, y) for a unique y in H, and the norm of f equals the norm of y. This establishes a conjugate-linear isometric isomorphism between H and its dual space H*.

The Riesz representation theorem shows that Hilbert spaces are self-dual: the dual space is (conjugate-linearly) isomorphic to the space itself. This is a special property not shared by general Banach spaces, and it is what makes Hilbert spaces particularly tractable.

4. Bounded Linear Operators

In infinite-dimensional spaces, linear maps need not be continuous. Those that are continuous — equivalently, bounded — form an algebra and are the morphisms of functional analysis.

Continuity and Boundedness

Theorem: Equivalence of Continuity and Boundedness

For a linear operator T from a normed space X to a normed space Y, the following are equivalent: (1) T is continuous at some point; (2) T is continuous at every point; (3) T is uniformly continuous; (4) T is bounded, meaning there exists M such that the norm of T(x) is at most M times the norm of x for all x.

The operator norm of T, written as the norm of T subscript op, is the smallest such constant M, equivalently the supremum of the norm of T(x) over all unit vectors x. The space B(X, Y) of all bounded linear operators from X to Y is itself a normed space under the operator norm, and when Y is a Banach space, B(X, Y) is a Banach space.

The Spectrum and Resolvent

For a bounded linear operator T on a Banach space X (over the complex numbers), the resolvent set consists of all complex numbers lambda for which (T minus lambda I) is a bijective bounded operator with bounded inverse. The spectrum is the complement of the resolvent set.

Parts of the Spectrum

  • Point spectrum: values of lambda for which (T - lambda I) is not injective (eigenvalues, where T(x) = lambda x for some nonzero x).
  • Continuous spectrum: lambda for which (T - lambda I) is injective with dense but non-closed range.
  • Residual spectrum: lambda for which (T - lambda I) is injective but its range is not dense.

The spectrum of any bounded operator is a non-empty compact subset of the complex numbers. The spectral radius — the supremum of the absolute values of spectral values — equals the limit of the n-th root of the n-th power norm of T as n tends to infinity.

The resolvent R(lambda) = (T minus lambda I) inverse is an analytic function of lambda on the resolvent set, and the resolvent identity R(lambda) minus R(mu) = (lambda minus mu) times R(lambda) times R(mu) holds. This analytic structure is the starting point for spectral theory.

Adjoints and Self-Adjoint Operators

For a bounded operator T on a Hilbert space H, the adjoint T* is the unique operator satisfying: the inner product of (Tx, y) equals the inner product of (x, T*y) for all x, y in H. Its existence follows from the Riesz representation theorem. An operator is self-adjoint if T = T*, normal if T and T* commute, and unitary if T* is the inverse of T.

Self-adjoint operators have real spectra, a fact proved by computing the imaginary part of the inner product of (Tx, x) for any would-be complex eigenvalue. Unitary operators have spectra on the unit circle. Normal operators satisfy a spectral theorem analogous to the finite-dimensional result for normal matrices.

5. The Hahn-Banach Theorem and Its Consequences

The Hahn-Banach theorem is the cornerstone of duality theory in functional analysis. It guarantees extensions of linear functionals and separates points from closed convex sets by hyperplanes.

The Analytic Form

Hahn-Banach Theorem (Analytic Form)

Let X be a real vector space, p a sublinear functional on X (satisfying p(x + y) ≤ p(x) + p(y) and p(tx) = t p(x) for t ≥ 0), M a subspace of X, and f a linear functional on M with f(x) ≤ p(x) for all x in M. Then there exists a linear extension F of f to all of X still satisfying F(x) ≤ p(x) for all x in X.

For normed spaces, taking p to be the norm times the norm of f yields: any bounded linear functional on a subspace extends to the whole space with the same norm. The proof uses Zorn's lemma (or transfinite induction) to extend the functional one dimension at a time — the key step being that a one-dimensional extension always exists.

The Geometric Form and Separation Theorems

The geometric version of Hahn-Banach says that a point not in a closed convex set can be separated from that set by a continuous linear functional (a closed hyperplane). Stronger separation results hold under additional assumptions (e.g., separating two disjoint convex sets, at least one of which is open).

These separation theorems underpin convex optimization, duality in linear programming, and the theory of support functions for convex bodies.

Consequences of Hahn-Banach

Key Corollaries

  • Point separation: for any nonzero x in a normed space X, there exists a bounded linear functional f with f(x) = the norm of x and norm of f = 1. In particular, if f(x) = 0 for all bounded linear functionals f, then x = 0.
  • Dense subspaces: a subspace M is dense in X if and only if every bounded linear functional that vanishes on M is identically zero.
  • Norm attainment: the norm of x equals the supremum of |f(x)| over all unit-norm bounded linear functionals.
  • Isometric embedding in the bidual: every normed space X embeds isometrically into its bidual X** via the canonical evaluation map that sends x to the functional on X* that evaluates at x.

6. The Open Mapping and Closed Graph Theorems

These two theorems use completeness in a decisive way and represent the first payoff of the Baire Category Theorem in the operator-theoretic setting.

Open Mapping Theorem

Theorem (Open Mapping / Banach-Schauder)

A surjective bounded linear operator T : X to Y between Banach spaces is an open map: for every open set U in X, T(U) is open in Y.

Proof Sketch

Since T is surjective, Y is the union over n of n times the image of the unit ball. By Baire's theorem, at least one such set has non-empty interior in Y. Using the linearity of T and a limit argument, one shows the image of the open unit ball in X contains an open ball in Y. The result follows.

Bounded Inverse Theorem

Corollary: Bounded Inverse Theorem

A bijective bounded linear operator between Banach spaces has a bounded inverse. This means that in the Banach space setting, an algebraically invertible continuous linear map is automatically topologically invertible.

In finite dimensions, linear bijections between normed spaces automatically have bounded inverses (because all norms are equivalent). In infinite dimensions this is not automatic and requires completeness: there exist bijective bounded linear operators between incomplete normed spaces whose inverse is unbounded.

Closed Graph Theorem

Theorem (Closed Graph)

Let X and Y be Banach spaces and T : X to Y a linear operator. Then T is bounded if and only if its graph (the set of pairs (x, T(x)) in X times Y) is closed in the product space X times Y.

Application

The closed graph theorem is useful for proving boundedness when it is easy to verify that if x_n converges to x and T(x_n) converges to y, then y = T(x), but hard to prove boundedness directly.

Many operators arising in practice (differential operators with natural domains, operators defined implicitly by equations) have closed graphs, making this theorem an efficient tool for establishing continuity. The closed graph theorem follows from the open mapping theorem: if T has a closed graph, consider T as an operator from the Banach space X to the graph of T (which is closed in X times Y, hence Banach), and apply the bounded inverse theorem.

7. The Uniform Boundedness Principle (Banach-Steinhaus Theorem)

The Uniform Boundedness Principle is a powerful result converting pointwise information into uniform control. It is the third of the three pillars of Banach space theory, alongside Hahn-Banach and the open mapping theorem.

Statement and Proof Idea

Theorem (Banach-Steinhaus / Uniform Boundedness Principle)

Let X be a Banach space, Y a normed space, and let (T_alpha) be a family of bounded linear operators from X to Y. If for every x in X the set of norms (the norm of T_alpha(x), over all alpha) is bounded, then the set of operator norms (the norm of T_alpha, over all alpha) is also bounded.

Proof Sketch

Define, for each natural number n, the set A_n of all x in X with the norm of T_alpha(x) at most n for every alpha. Each A_n is closed (as an intersection of preimages of closed sets under continuous maps) and their union is all of X by assumption. By Baire's theorem, some A_N has non-empty interior: there exists a ball B(x_0, r) contained in A_N. For any unit vector u, x_0 + (r/2)u lies in A_N, and from this one deduces a uniform bound on the operator norms.

Applications

The Uniform Boundedness Principle has wide-ranging applications:

  • Convergence of operator sequences: if (T_n) is a sequence of bounded operators that converges pointwise (the strong operator topology), then the sequence of norms is automatically bounded.
  • Divergence of Fourier series: the classical result that there exists a continuous function whose Fourier series diverges at a point is proved by showing the norms of the Dirichlet kernel functionals are unbounded, then applying the principle in reverse (contrapositive) to produce a function witnessing divergence.
  • Weak boundedness: a weakly bounded subset of a Banach space (bounded when tested against every bounded linear functional) is automatically norm bounded.

The Principle of Condensation of Singularities

A stronger form of the Banach-Steinhaus theorem asserts: if a sequence of bounded operators (T_n) from a Banach space X to a normed space Y is not uniformly bounded, then the set of x in X for which (T_n x) is unbounded is a residual (comeager) set — meaning its complement is meager. In other words, "bad" behavior is generic when it occurs at all.

8. Dual Spaces and Reflexivity

The dual of a Banach space consists of all bounded linear functionals on it. Duality theory provides a second perspective on every Banach space and is essential for understanding the relationship between a space and its functionals.

The Dual Space

The dual space X* of a Banach space X is the space of all bounded linear functionals from X to the scalar field (real or complex numbers), equipped with the operator norm. It is always a Banach space, even when X itself is only a normed space.

Duality of Classical Spaces

  • (L^p)* is isometrically isomorphic to L^q when 1/p + 1/q = 1 and 1 < p < infinity
  • (L^1)* is isometrically isomorphic to L^infinity
  • (l^p)* is isometrically isomorphic to l^q for 1 ≤ p < infinity
  • (c_0)* is isometrically isomorphic to l^1 (the space c_0 consists of sequences converging to zero)
  • (C[a, b])* is isometrically isomorphic to the space of regular Borel measures on [a, b] (Riesz-Markov theorem)
  • Every Hilbert space H satisfies H* is isometrically isomorphic to H (Riesz representation theorem)

The Bidual and Reflexivity

The bidual X** is the dual of X*. The canonical embedding J : X to X** defined by J(x)(f) = f(x) is always an isometric embedding. When this embedding is surjective (so X and X** are isometrically isomorphic via J), the space X is called reflexive.

Examples of Reflexive and Non-Reflexive Spaces

  • Reflexive: all Hilbert spaces; L^p for 1 < p < infinity; all finite-dimensional normed spaces
  • Non-reflexive: L^1, L^infinity, C[a, b], l^1, l^infinity, c_0

Reflexivity is important because in a reflexive Banach space, bounded sequences have weakly convergent subsequences (Alaoglu's theorem applied to the bidual, combined with reflexivity). This weak compactness is essential in the calculus of variations and optimization, where one wants to extract converging subsequences from minimizing sequences.

9. Weak and Weak-Star Topologies

In infinite-dimensional Banach spaces, norm convergence is often too strong for compactness arguments. The weak topology provides a coarser notion of convergence for which bounded sets are often compact, making it a powerful tool in analysis.

Weak Convergence

Definition: Weak Convergence

A sequence (x_n) in a Banach space X converges weakly to x, written x_n to x weakly, if f(x_n) converges to f(x) for every bounded linear functional f in X*. The weak topology is the coarsest topology on X making all functionals in X* continuous.

Norm convergence implies weak convergence, but the converse fails in infinite dimensions. The standard basis vectors in l^2 converge weakly to 0 (since (e_n, y) = the n-th coordinate of y, which tends to 0 for any l^2 element y), but their norms are all 1. Weak limits are unique, and weakly convergent sequences are bounded in norm (by the Uniform Boundedness Principle).

The Weak-Star Topology

The weak-star topology on the dual space X* is the coarsest topology making the evaluation functionals (f maps to f(x)) for each fixed x in X continuous. A net (f_alpha) converges weak-star to f if f_alpha(x) converges to f(x) for every x in X. This is weaker than (or equal to) the weak topology on X*.

Banach-Alaoglu Theorem

Theorem (Banach-Alaoglu)

The closed unit ball of the dual space X* is compact in the weak-star topology. If X is separable, then the weak-star topology on the unit ball of X* is metrizable and the ball is therefore sequentially compact.

The Banach-Alaoglu theorem is extremely useful. In conjunction with reflexivity (X** = X), it implies that the closed unit ball of a reflexive Banach space is weakly compact. This gives the fundamental compactness tool used to solve variational problems: from any bounded sequence one can extract a weakly convergent subsequence.

Weak Sequential Compactness in Hilbert Spaces

In a Hilbert space, every bounded sequence has a weakly convergent subsequence. This is used repeatedly in PDE theory: one shows a minimizing sequence is bounded, extracts a weakly convergent subsequence, and then uses lower semi-continuity of the relevant functional to show the limit is a minimizer.

10. Compact Operators

Compact operators are, in a precise sense, the closest infinite-dimensional operators are to finite-dimensional linear maps. They arise naturally in integral equations and form the setting for the most complete spectral theory available beyond finite dimensions.

Definition and Basic Properties

Definition: Compact Operator

A bounded linear operator T : X to Y between Banach spaces is compact if the image of every bounded set has compact closure, or equivalently, every bounded sequence (x_n) in X has a subsequence such that (T(x_n)) converges in Y.

Finite-rank operators (those whose range is finite-dimensional) are always compact. The norm limit of a sequence of compact operators is compact. The identity operator on an infinite-dimensional Banach space is never compact — this follows from the fact that the unit ball is not compact in infinite dimensions.

Compact operators send weakly convergent sequences to norm convergent ones. On a Hilbert space, T is compact if and only if T* is compact. Hilbert-Schmidt operators (defined via an integral kernel with square-integrable kernel) are compact.

Fredholm Theory

Let T be a compact operator on a Banach space X. The Fredholm alternative for the equation (I - T)x = y states: either (I - T) is bijective (and hence boundedly invertible), or (I - T) has a non-trivial kernel (null space) and its range has finite codimension equal to the dimension of the kernel. In the latter case, the equation (I - T)x = y has a solution if and only if y is orthogonal to the kernel of (I - T*).

This result mirrors the finite-dimensional Fredholm alternative for matrices: either Ax = b has a unique solution for every b, or the homogeneous system has nontrivial solutions and Ax = b has solutions only when b is orthogonal to the null space of A transpose.

Spectrum of a Compact Operator

Spectral Properties of Compact Operators

  • The spectrum of a compact operator on an infinite-dimensional space contains 0 (0 is always in the spectrum)
  • Every nonzero spectral value is an eigenvalue (lies in the point spectrum)
  • The nonzero eigenvalues form a sequence that either is finite or accumulates only at 0
  • Each nonzero eigenvalue has a finite-dimensional eigenspace

11. Spectral Theory for Compact Self-Adjoint Operators

The spectral theory of compact self-adjoint operators on a Hilbert space is the most complete and elegant chapter in the subject, generalizing the spectral theorem for real symmetric matrices to infinite dimensions.

The Spectral Theorem

Theorem: Spectral Theorem for Compact Self-Adjoint Operators

Let T be a compact self-adjoint operator on a Hilbert space H. Then:

  • All eigenvalues of T are real and the nonzero ones form a (finite or countably infinite) sequence tending to 0
  • Eigenspaces corresponding to distinct eigenvalues are orthogonal
  • There exists a complete orthonormal system (e_n) of H consisting of eigenvectors of T, so T = the sum of lambda_n times the projection onto the span of e_n
  • Any x in H can be expanded as x = sum of the inner products of (x, e_n) times e_n, with T(x) = sum of lambda_n times the inner products of (x, e_n) times e_n

The proof proceeds by showing T attains its operator norm on some unit vector (by the compactness and self-adjointness), showing that vector is an eigenvector, passing to the orthogonal complement and repeating. This iterative spectral decomposition produces the full orthonormal system of eigenvectors.

Functional Calculus

Once an operator T is diagonalized in an orthonormal eigenbasis (e_n) with eigenvalues (lambda_n), any reasonable function f can be applied to T: f(T) is defined as the operator sending e_n to f(lambda_n) times e_n. This functional calculus allows defining square roots, exponentials, and other functions of operators in a natural way, provided the function values are well-defined on the spectrum.

Connections to Integral Equations and PDEs

The Fredholm integral equation of the second kind, which asks for a function u satisfying u(x) minus the integral of k(x,y)u(y) dy = f(x), has as its homogeneous part the eigenvalue problem for the integral operator with kernel k. When k is symmetric and square- integrable, the integral operator is compact and self-adjoint on L^2, and the spectral theorem provides a complete description of all solutions.

For the Laplacian on a bounded domain with Dirichlet boundary conditions, the inverse is a compact self-adjoint operator on L^2, whose eigenvalues are the reciprocals of the Dirichlet eigenvalues of the Laplacian and whose eigenfunctions are the Dirichlet eigenfunctions. The spectral theorem then yields a complete orthonormal system for L^2 consisting of solutions to the eigenvalue problem for the Laplacian.

12. Fourier Analysis on Hilbert Spaces

Fourier analysis has its natural home in the Hilbert space L^2. The abstract theory of orthonormal bases in Hilbert spaces unifies and extends classical Fourier series, Fourier transforms, wavelets, and other function expansions.

Classical Fourier Series in L^2

The exponential functions e_n(x) = (1 over the square root of 2 pi) times exp(inx) for all integers n form a complete orthonormal system for L^2([minus pi, pi]). Every function f in L^2 has a Fourier expansion: f equals the sum over n of the inner product of (f, e_n) times e_n, where the sum converges in the L^2 norm. Parseval's theorem gives: the squared L^2 norm of f equals the sum of squared Fourier coefficients.

Unlike pointwise convergence of Fourier series (which requires additional regularity and is more delicate — Carleson's 1966 theorem proved L^2 Fourier series converge pointwise almost everywhere), L^2 convergence is automatic and follows directly from the Hilbert space theory of complete orthonormal systems.

The Fourier Transform on L^2

The Fourier transform of a function f in L^1 is defined pointwise. For functions in L^1 that are also in L^2, the Plancherel theorem says the Fourier transform extends uniquely to an isometric isomorphism of L^2(R) onto itself: the L^2 norm of the Fourier transform of f equals the L^2 norm of f. This is the infinite-line analogue of Parseval's theorem for Fourier series.

Key properties of the Fourier transform: it converts differentiation into multiplication by the frequency variable (up to a factor of i), and convolution into pointwise multiplication. These properties are why the Fourier transform is central to the study of PDEs with constant coefficients and to signal processing.

Abstract Harmonic Analysis

The abstract theory of Hilbert spaces allows a unified treatment of many different function expansions. For instance, Legendre polynomials form a complete orthonormal system for L^2([-1, 1]), Hermite functions for L^2(R), spherical harmonics for L^2 of the sphere, and Bessel functions for L^2 on disk sectors. All of these are special cases of the general theory of complete orthonormal systems in Hilbert spaces.

13. Distributions and Schwartz Space

Distributions (generalized functions) extend the notion of function to objects that cannot be defined pointwise but behave naturally under differentiation and other operations. They provide the rigorous foundation for the Dirac delta "function" and for weak derivatives in Sobolev spaces.

Test Functions and the Schwartz Space

The space of test functions D(Omega) consists of smooth (infinitely differentiable) functions with compact support in an open set Omega in R^n. A sequence of test functions converges in D(Omega) if all functions have support in a common compact set and all partial derivatives of all orders converge uniformly.

The Schwartz Space S(R^n)

The Schwartz space S(R^n) consists of all smooth functions on R^n that decay faster than any polynomial, along with all of their derivatives. Formally, phi is in S if for every pair of multi-indices alpha and beta, the function x^alpha times the partial derivative beta of phi is bounded.

The Schwartz space is closed under the Fourier transform (making it the natural domain for the Fourier transform) and is dense in all L^p spaces. Gaussian functions and their derivatives are in S. The space S is strictly larger than the space of compactly supported smooth functions.

Distributions

Definition: Distribution

A distribution on Omega is a continuous linear functional on D(Omega). The space of distributions is denoted D'(Omega). Continuity means: if a sequence of test functions phi_n converges to phi in D(Omega), then T(phi_n) converges to T(phi).

Every locally integrable function f defines a distribution via: the action of f on a test function phi is the integral of f(x) phi(x) dx. This embeds functions into distributions. The Dirac delta at a point a, denoted delta_a, is the distribution that evaluates phi at a: the action of delta_a on phi is phi(a). It is not representable as a function.

Differentiation of Distributions

The derivative of a distribution T is defined by transposing the integration by parts formula: the action of the derivative of T on phi equals minus the action of T on the derivative of phi. This defines a derivative for every distribution, even those that are not differentiable as functions. For example, the derivative of the Heaviside function (which is 0 for negative x and 1 for non-negative x), viewed as a distribution, is the Dirac delta at 0. Every distribution is infinitely differentiable in this sense.

Tempered Distributions and the Fourier Transform

Tempered distributions S'(R^n) are continuous linear functionals on the Schwartz space S(R^n). They include all L^p functions, all polynomials, and all Dirac deltas and their derivatives. The Fourier transform extends to tempered distributions: the Fourier transform of a tempered distribution T acts on a test function phi by: the action of the Fourier transform of T on phi equals the action of T on the Fourier transform of phi. The Fourier transform of the Dirac delta is the constant function 1, and the Fourier transform of 1 is the Dirac delta (times appropriate normalization).

14. Sobolev Spaces and Elliptic PDE

Sobolev spaces are the function spaces that arise naturally in the weak formulation of boundary value problems. They capture the idea of "having k derivatives in L^p" using weak derivatives, providing the framework for a complete PDE existence theory.

Weak Derivatives and Sobolev Spaces

Definition: Sobolev Space W^(k,p)

A function f in L^p(Omega) has a weak partial derivative of order alpha if there exists a function g in L^p(Omega) such that for every smooth compactly supported test function phi: the integral of f times the partial derivative alpha of phi equals (minus 1)^(|alpha|) times the integral of g times phi.

The Sobolev space W^(k,p)(Omega) consists of all functions in L^p(Omega) whose weak derivatives of all orders up to k also lie in L^p(Omega), normed by the sum of the L^p norms of the function and all its weak derivatives up to order k. The special case W^(k,2) is written H^k and is a Hilbert space with the natural inner product.

Sobolev spaces are Banach spaces. The space H^1_0(Omega) is the closure of smooth compactly supported functions in H^1(Omega) and is the natural space for the Dirichlet problem (imposing zero boundary conditions weakly). Its norm is equivalent to the L^2 norm of the gradient alone (by the Poincare inequality when Omega is bounded).

Sobolev Embedding Theorems

Sobolev Embedding Theorem (selected cases)

  • If kp < n (the space dimension), then W^(k,p)(Omega) embeds continuously into L^q(Omega) where 1/q = 1/p - k/n (subcritical exponent). The embedding constant depends on k, p, n and the domain.
  • If kp = n, then W^(k,p)(Omega) embeds into L^q for all finite q but not into L^infinity (borderline case).
  • If kp > n, then W^(k,p)(Omega) embeds into C^(k - n/p - 1) (classical differentiability), giving Hölder continuous functions.
  • Compact embeddings (Rellich-Kondrachov): the embedding of W^(1,p)(Omega) into L^q for q strictly less than the critical exponent is compact — this means bounded sets in W^(1,p) have precompact closures in L^q.

The Rellich-Kondrachov compactness embedding is used repeatedly: from a sequence bounded in H^1 one can extract a subsequence convergent in L^2, giving compactness for energy-bounded sequences of approximate solutions.

Weak Formulation and the Lax-Milgram Theorem

The classical Dirichlet problem — find u with minus the Laplacian of u equal to f on Omega, and u equal to 0 on the boundary of Omega — is reformulated as: find u in H^1_0(Omega) such that for every test function v in H^1_0(Omega), the integral of the gradient of u dotted with the gradient of v equals the integral of f times v. This is the weak formulation: it replaces a differential equation with an integral identity testable against all smooth compactly supported functions.

Lax-Milgram Theorem

Let H be a Hilbert space, a : H times H to the scalar field a continuous bilinear form that is coercive (meaning a(u,u) is at least c times the squared norm of u for some c > 0), and F a bounded linear functional on H. Then there exists a unique u in H such that a(u,v) = F(v) for all v in H.

For the Dirichlet problem, a(u,v) = integral of gradient u dotted with gradient v, which is coercive on H^1_0 by the Poincare inequality. Hence the weak solution exists and is unique.

The Lax-Milgram theorem is the central existence theorem for elliptic PDEs in divergence form. It extends to more general elliptic operators L = minus the sum of partial derivatives of (a_ij times partial derivative) plus the sum of b_i times partial derivative plus c, provided the coefficient matrix is uniformly elliptic (the bilinear form remains coercive after handling lower order terms, e.g., by adding a multiple of the L^2 norm).

Elliptic Regularity

Lax-Milgram gives a weak solution in H^1_0. The next question is whether this weak solution is actually smooth. Elliptic regularity theory answers this: if the right-hand side f is in H^k, then the weak solution is in H^(k+2) (interior regularity, and up to the boundary if the boundary is smooth enough). In particular, if f is smooth, the weak solution is smooth, justifying the weak formulation approach for classical problems.

Related Areas of Mathematics

Frequently Asked Questions

What is a Banach space and how does it differ from a normed space?

A normed space is a vector space equipped with a norm — a function assigning a non-negative length to each vector. A Banach space is a normed space that is also complete: every Cauchy sequence converges to a point within the space. Completeness is the key distinction. All finite-dimensional normed spaces are Banach spaces. Important examples include L^p spaces for 1 ≤ p ≤ infinity and C[a, b] with the sup-norm.

What does the Hahn-Banach theorem say and why is it important?

The Hahn-Banach theorem guarantees that bounded linear functionals on a subspace of a normed space can always be extended to the full space without increasing their norm. Its importance: it proves the dual space is non-trivial, allows separation of points from closed convex sets by hyperplanes, and shows the norm of a vector equals the supremum of its values on unit-norm functionals. It is proved using Zorn's lemma.

What is the spectrum of a bounded linear operator?

For a bounded operator T on a Banach space, the spectrum is the set of complex numbers lambda for which (T minus lambda I) lacks a bounded inverse. It decomposes into the point spectrum (eigenvalues), continuous spectrum (injective, dense non- closed range), and residual spectrum (injective, non-dense range). The spectrum is always non-empty and compact. For compact self-adjoint operators on Hilbert spaces, the spectrum consists of real eigenvalues accumulating only at 0.

What is the Open Mapping Theorem?

A surjective bounded linear operator between Banach spaces is an open map: it sends open sets to open sets. The key corollary is the Bounded Inverse Theorem: a bijective bounded linear operator between Banach spaces automatically has a bounded inverse. The proof uses the Baire Category Theorem to show the image of the unit ball contains a ball around the origin in the target space.

What are Sobolev spaces and why are they used for PDEs?

Sobolev spaces W^(k,p) consist of L^p functions whose weak partial derivatives up to order k also lie in L^p. They are the natural setting for weak solutions of elliptic PDEs: the Dirichlet problem for minus the Laplacian of u equal to f becomes a problem on H^1_0, solved by the Lax-Milgram theorem when the bilinear form is coercive. Sobolev embedding theorems then control the smoothness and integrability of the resulting solutions.

What is the Uniform Boundedness Principle?

Also called the Banach-Steinhaus theorem, it states: if a family of bounded linear operators from a Banach space is pointwise bounded (for each input, the outputs are bounded), then the family is uniformly bounded (there is a single bound on all operator norms). The proof is a direct application of the Baire Category Theorem. Applications include: pointwise convergent sequences of operators have bounded norms, and the existence of a continuous function with divergent Fourier series.

What is the difference between weak convergence and norm convergence?

A sequence converges strongly (in norm) if the norms of the differences tend to 0. It converges weakly if every bounded linear functional evaluated on the sequence converges. Norm convergence implies weak convergence but not conversely. In l^2, the standard basis vectors converge weakly to 0 but have constant norm 1. Weak convergence is important because bounded sequences in reflexive spaces have weakly convergent subsequences, giving the compactness needed for variational arguments.

What is the spectral theorem for compact self-adjoint operators?

For a compact self-adjoint operator T on a Hilbert space, all eigenvalues are real and accumulate only at 0, the eigenvectors can be chosen to form a complete orthonormal system, and T can be written as a convergent series of rank-1 projections onto eigenspaces weighted by the corresponding eigenvalues. This is the infinite-dimensional analogue of diagonalizing a real symmetric matrix and is applied to integral operators, Sturm-Liouville problems, and the Laplacian on bounded domains.

How does functional analysis appear in quantum mechanics?

In quantum mechanics, the state of a system is a unit vector in a Hilbert space (typically L-2 of configuration space). Physical observables — position, momentum, energy — are represented by self-adjoint operators on this Hilbert space. The possible measurement outcomes are the spectral values of the operator: for a compact self-adjoint observable the outcomes are its eigenvalues, and the probability of measuring eigenvalue lambda sub n is the squared modulus of the inner product of the state with the corresponding eigenvector (the Born rule). The time evolution of the state is governed by the Schrodinger equation, which is an operator differential equation. The Hamiltonian operator (representing total energy) is typically an unbounded self-adjoint operator, requiring the theory of unbounded operators and spectral measures for a rigorous treatment. Heisenberg's uncertainty principle is a consequence of the non-commutativity of the position and momentum operators: their commutator equals i times the reduced Planck constant times the identity.

What is the condition number of a linear operator and why does it matter in numerical analysis?

The condition number of a bounded linear operator T between normed spaces (when T is invertible) is the product of the operator norm of T and the operator norm of the inverse of T. It measures how much a small perturbation in the input can be amplified in the output: if the condition number is large, the problem is ill-conditioned and small errors in data or floating-point arithmetic can produce large errors in the solution. For a matrix A representing a linear system, the condition number equals the ratio of the largest to smallest singular value. In the context of iterative methods for operator equations in Banach or Hilbert spaces, the condition number controls the rate of convergence of methods such as the conjugate gradient algorithm. Preconditioning — applying an approximate inverse to reduce the condition number — is one of the main tools in scientific computing, and its analysis relies directly on the operator-theoretic framework of functional analysis. The spectral condition number of a self-adjoint positive operator is the ratio of its largest eigenvalue to its smallest positive eigenvalue, and the spectral theorem makes this geometric picture precise.

Core Theorems at a Glance

Baire Category Theorem

A complete metric space is not a countable union of nowhere-dense sets. Basis for the three big Banach theorems.

Hahn-Banach Theorem

Norm-preserving extension of bounded linear functionals from subspaces to the whole space.

Open Mapping Theorem

Surjective bounded linear operators between Banach spaces are open maps; bijections have bounded inverses.

Closed Graph Theorem

A linear operator between Banach spaces is bounded if and only if its graph is closed.

Uniform Boundedness Principle

A pointwise-bounded family of bounded operators from a Banach space is uniformly bounded.

Banach-Alaoglu Theorem

The closed unit ball of X* is weak-star compact. Gives weak compactness in reflexive spaces.

Riesz Representation (Hilbert)

Every bounded functional on a Hilbert space is an inner product with a unique vector.

Spectral Theorem (Compact Self-Adjoint)

Complete orthonormal eigenbasis with real eigenvalues accumulating only at 0.

Lax-Milgram

A coercive continuous bilinear form on a Hilbert space defines a bijective bounded operator with bounded inverse.

Sobolev Embedding

W^(k,p) embeds into L^q or C^m depending on the relationship of k, p to the space dimension n.

NailTheTest

Master Functional Analysis with NailTheTest

From normed spaces and the Hahn-Banach theorem through spectral theory and compact operators, NailTheTest gives you the practice problems, worked examples, and adaptive feedback you need to build genuine mastery of graduate-level mathematics.

Also explore: Precalculus Study Guide · Advanced Measure Theory