# Reproducing Kernel Hilbert Spaces

Monday, October 25, 2021

## Logistics

• Drop date: October 30, 2021

• My office hours tomorrow

• Tuesdays 8am-9am on BlueJeans (https://bluejeans.com/205357142)
• Come prepared!
• Midterm 2:

• Moved to Monday November 8, 2021 (gives you weekend to prepare)
• Coverage: everything since Midterm 1 (don’t forget the fundamentals though), emphasis on regression

## What’s on the agenda for today?

• Last time:
• Functionals on Hilbert spaces
• Today:
• Reproducing Kernel Hilbert Spaces
• Reading: Romberg, lecture notes 10/11

## Last time: Riesz representation theorem

• Let $F:\calF\to\bbR$ be a continuous linear functional on a (possibly infinite-dimensional) separable Hilbert space $\calF$.

Then there exists a unique $c\in\calF$ such that $F(x)=\dotp{x}{c}$ for every $x\in\calF$

• If $\set{\psi_n}_{n\geq 1}$ is an orthobasis for $\calF$, then we can construct $c$ above as $c\eqdef \sum_{n=1}^\infty F(\psi_n)\psi_n$
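The construction above can be sanity-checked in finite dimensions, where $\bbR^n$ with the standard basis is itself a Hilbert space. A minimal sketch (the functional $F$ and the dimension are made up for the demo):

```python
import numpy as np

# In R^n with the standard orthobasis e_1, ..., e_n, the Riesz representer
# of a linear functional F is c = sum_n F(e_n) e_n.
n = 5
rng = np.random.default_rng(0)
w = rng.standard_normal(n)           # hypothetical functional F(x) = <x, w>
F = lambda x: float(w @ x)

# Build c from the orthobasis, following the construction above.
basis = np.eye(n)
c = sum(F(psi) * psi for psi in basis)

# Check: F(x) = <x, c> for an arbitrary x (here, c recovers w exactly).
x = rng.standard_normal(n)
assert np.isclose(F(x), x @ c)
```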

## Reproducing Kernel Hilbert Spaces

• An RKHS is a Hilbert space $\calH$ of real-valued functions $f:\bbR^d\to\bbR$ in which the sampling operation $\calS_\bftau:\calH\to\bbR:f\mapsto f(\bftau)$ is continuous for every $\bftau\in\bbR^d$.

In other words, for each $\bftau\in\bbR^d$, there exists $k_\bftau\in\calH$ s.t. $f(\bftau) = {\dotp{f}{k_\bftau}}_\calH\text{ for all } f\in\calH$

• The kernel of an RKHS is $k:\bbR^d\times\bbR^d\to\bbR:(\bft,\bftau)\mapsto k_{\bftau}(\bft)$ where $k_\bftau$ is the element of $\calH$ that defines the sampling at $\bftau$.

• A (separable) Hilbert space with orthobasis $\set{\psi_n}_{n\geq 1}$ is an RKHS with kernel $k(\bft,\bftau)=\sum_{n=1}^\infty\psi_n(\bftau)\psi_n(\bft)$ iff $\forall \bftau\in\bbR^d$ $\sum_{n=1}^\infty\abs{\psi_{n}(\bftau)}^2<\infty$

## RKHS and non-orthogonal bases

• If $\set{\phi_n}_{n\geq 1}$ is a Riesz basis for $\calH$, we know that every $x\in\calH$ can be written $x = \sum_{n\geq 1}\alpha_n\phi_n\textsf{ with } \alpha_n\eqdef\dotp{x}{\smash{\widetilde{\phi}_n}}$ where $\set{\widetilde{\phi}_n}_{n\geq 1}$ is the dual basis.

• A (separable) Hilbert space with Riesz basis $\set{\phi_n}_{n\geq 1}$ is an RKHS with kernel $k(\bft,\bftau) =\sum_{n=1}^\infty \phi_n(\bftau)\widetilde{\phi}_n(\bft)$ iff $\forall \bftau\in\bbR^d$ $\sum_{n=1}^\infty\abs{\phi_{n}(\bftau)}^2<\infty$

## Examples

• Finite dimensional Hilbert space

• Space of $L$th order polynomial splines on the real line

• Remark

• An RKHS is more easily characterized by its kernel
• Often, we try to avoid an explicit description of the elements in the space

## Kernel regression

• Regression problem: given $n$ pairs $(\bfx_i,y_i)\in\bbR^d\times\bbR$, solve $\min_{f\in\calF}\sum_{i=1}^n\abs{y_i-f(\bfx_i)}^2+\lambda\norm[\calF]{f}^2$

• If we restrict $\calF$ to be an RKHS, the problem becomes $\min_{f\in\calF}\sum_{i=1}^n\abs{y_i-{\dotp{f}{x_i}}_{\calF}}^2+\lambda\norm[\calF]{f}^2$

where $x_i\eqdef k_{\bfx_i}$ provides the mapping between $\bbR^d$ and $\calF$: $x_i:\bbR^d\to\bbR:\bft\mapsto k_{\bfx_i}(\bft) = k(\bfx_i,\bft)$

• The solution is given by $\widehat{f} = \sum_{i=1}^n \widehat{\alpha}_i x_i\textsf{ with }\widehat{\bfalpha}\eqdef (\bfK+\lambda\bfI)^{-1}\bfy$ and $\bfK\eqdef[K_{i,j}]_{1\leq i,j\leq n}$ with $K_{i,j}=\dotp{x_i}{x_j}$
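The closed-form solution $\widehat{\bfalpha}=(\bfK+\lambda\bfI)^{-1}\bfy$ is a few lines of linear algebra. A minimal sketch, using a Gaussian RBF kernel as an illustrative choice (the data, $\sigma$, and $\lambda$ are made up for the demo):

```python
import numpy as np

# Kernel ridge regression: alpha_hat = (K + lambda I)^{-1} y,
# with K_{ij} = k(x_i, x_j). Synthetic data in R^2 for illustration.
rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(20, 2))              # n = 20 points, d = 2
y = np.sin(3 * X[:, 0]) + 0.1 * rng.standard_normal(20)

def rbf(U, V, sigma=0.5):
    # Pairwise Gaussian RBF kernel between rows of U and rows of V.
    d2 = ((U[:, None, :] - V[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * sigma ** 2))

lam = 1e-2
K = rbf(X, X)                                     # n x n Gram matrix
alpha = np.linalg.solve(K + lam * np.eye(len(y)), y)

# Evaluate f_hat at new points using only kernel evaluations.
X_new = rng.uniform(-1, 1, size=(5, 2))
f_hat = rbf(X_new, X) @ alpha
```

Solving the $n\times n$ linear system (rather than explicitly inverting) is the standard numerically stable choice.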

## Kernel regression

• Kernel magic
1. $K_{ij} = \dotp{x_i}{x_j}=\dotp{k_{\bfx_i}}{k_{\bfx_j}} = k_{\bfx_i}(\bfx_j) = k(\bfx_i,\bfx_j)$
2. $\widehat{f}(\bfx) = \dotp{\widehat{f}}{k_{\bfx}} = \sum_{i=1}^n\widehat{\alpha}_i k(\bfx_i,\bfx)$
• Remarks
• We solved an infinite dimensional problem using an $n\times n$ system of equations and linear algebra
• Our solution and the evaluation only depend on the kernel; we never need to work directly in $\calF$
• Question: can we skip $\calF$ entirely? How do we find “good” kernels?

## Aronszajn’s theorem

• An inner product kernel is a mapping $k:\bbR^d\times\bbR^d\to\bbR$ for which there exists a Hilbert space $\calH$ and a mapping $\Phi:\bbR^d\to\calH$ such that $\forall \bfu,\bfv\in\bbR^d\quad k(\bfu,\bfv)=\langle\Phi(\bfu),\Phi(\bfv)\rangle_\calH$

• A function $k:\bbR^d\times\bbR^d\to\bbR$ is a positive semidefinite kernel if
• $k$ is symmetric, i.e., $k(\bfu,\bfv)=k(\bfv,\bfu)$
• for all finite sets of points $\{\bfx_i\}_{i=1}^N\subset\bbR^d$, the Gram matrix $\bfK$ is positive semidefinite, i.e., $\bfa^\intercal\bfK\bfa\geq 0\text{ for all }\bfa\in\bbR^N,\text{ with }\bfK=[K_{i,j}]\text{ and }K_{i,j}\eqdef k(\bfx_i,\bfx_j)$
• A function $k:\bbR^d\times\bbR^d\to\bbR$ is an inner product kernel if and only if $k$ is a positive semidefinite kernel.
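The positive-semidefiniteness condition is easy to probe numerically: draw random points, form the Gram matrix, and inspect its eigenvalues. A minimal sketch with the RBF kernel (this checks the condition on one random sample, not a proof):

```python
import numpy as np

# Sanity check of the PSD condition for the Gaussian RBF kernel:
# the Gram matrix of any point set should be symmetric with
# nonnegative eigenvalues (up to floating-point round-off).
rng = np.random.default_rng(2)
X = rng.standard_normal((30, 3))                   # 30 random points in R^3

def k_rbf(u, v, sigma=1.0):
    return np.exp(-np.sum((u - v) ** 2) / (2 * sigma ** 2))

K = np.array([[k_rbf(xi, xj) for xj in X] for xi in X])
assert np.allclose(K, K.T)                         # symmetry
assert np.linalg.eigvalsh(K).min() > -1e-10        # PSD up to round-off
```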

## Examples

• Regression using linear and quadratic functions in $\bbR^d$
• Regression using Radial Basis Functions
• Examples of kernels
• Homogeneous polynomial kernel: $k(\bfu,\bfv) = (\bfu^\intercal\bfv)^m$ with $m\in\bbN^*$
• Inhomogeneous polynomial kernel: $k(\bfu,\bfv) = (\bfu^\intercal\bfv+c)^m$ with $c>0$, $m\in\bbN^*$
• Radial basis function (RBF) kernel: $k(\bfu,\bfv) = \exp\left(-\frac{\norm{\bfu-\bfv}^2}{2\sigma^2}\right)$ with $\sigma^2>0$
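The three kernels above are one-liners, and for the homogeneous polynomial kernel with $m=2$ in $\bbR^2$ an explicit feature map $\Phi$ can be written down and checked against Aronszajn's characterization. A minimal sketch ($m$, $c$, $\sigma$ are free parameters; the test points are made up):

```python
import numpy as np

# The three example kernels as plain functions of u, v in R^d.
def poly_hom(u, v, m=2):
    return (u @ v) ** m                  # homogeneous polynomial

def poly_inhom(u, v, m=2, c=1.0):
    return (u @ v + c) ** m              # inhomogeneous polynomial

def rbf(u, v, sigma=1.0):
    return np.exp(-np.sum((u - v) ** 2) / (2 * sigma ** 2))

# For poly_hom with m = 2 in R^2, the explicit feature map
# Phi(u) = (u1^2, sqrt(2) u1 u2, u2^2) satisfies k(u,v) = <Phi(u), Phi(v)>.
Phi = lambda u: np.array([u[0] ** 2, np.sqrt(2) * u[0] * u[1], u[1] ** 2])
u, v = np.array([1.0, 2.0]), np.array([3.0, -1.0])
assert np.isclose(poly_hom(u, v), Phi(u) @ Phi(v))
```

Note that $\Phi$ maps into $\bbR^3$ even though the inputs live in $\bbR^2$; for the RBF kernel no finite-dimensional feature map exists, which is exactly why working through the kernel alone is attractive.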