3 The Geometric Lemma
Lemma 3.2
Let \(U\) and \(U'\) be i.i.d. random variables taking values in a finite set \(X\), and let \(\sigma _1,\ldots ,\sigma _r \colon X \rightarrow \mathbb {R}^n\) be arbitrary functions. Then
for every \((\ell _1,\dots ,\ell _r) \in \mathbb {Z}^r\) with \(\ell _1,\dots ,\ell _r \ge 0\).
To simplify the notation, let us write
where \(\ell = \ell _1 + \cdots + \ell _r\) and \((a_1,\dots ,a_\ell )\) is such that \(\big| \big\{ i \in [\ell ] : a_i = j \big\} \big| = \ell _j\) for each \(j \in [r]\). Now set
and note that
since \(\langle u_1 \otimes \cdots \otimes u_r, v_1 \otimes \cdots \otimes v_r \rangle = \langle u_1, v_1 \rangle \cdots \langle v_r , u_r \rangle \) for any vectors \(u_i,v_i \in \mathbb {R}^n\). Finally, note that \(Z\) and \(Z'\) are independent and identically distributed random vectors, and therefore
as required, where the final inequality holds because \(\mathbb {E}[Z] = \mathbb {E}[Z']\).
Let \(g(x):= cosh(\sqrt{x})\) where \(cosh(x)\) is defined by \(cosh(\sqrt{x})=\overunderset {\infty }{n=0}{\sum }\frac{x^n}{(2n)!}\), then:
For all \(x\geq 0\), we have \(x \leq 2+cosh(\sqrt{x}) \leq 3e^{\sqrt{x}}\).
For all \(x {\lt}0\), we have \(-1 \leq cosh(\sqrt{x})\leq 1\). In particular, \(2+cosh(x)\geq 1\) for all \(x\in \mathbb {R}\).
To prove (1), let \(x\geq 0\) then:
However,
which concludes the lower bound in the first point. To conclude (1), we note the following for \(x\geq 0\):
To prove (2), note that for \(x{\lt}0\),
(2) then follows by noting that \(cos(\sqrt{-x}) \in [-1,1]\) for all \(x{\lt}0\).
Lemma 3.3
Let \(r \in \mathbb {N}\). If \(x_i \ge - 3r\) for all \(i \in [r]\), the function \(f \colon \mathbb {R}^r \rightarrow \mathbb {R}\) defined in 1 satisfies
To see (**), we write down upper bounds for all the factors of the LHS of (**). First, for every \(i \in [r]\), such that \(x_i \geq 0\), we note that \(cosh(\sqrt{x_i} )\leq cosh(\sqrt{x_i+3r})\) (which follows since all the coefficients in the taylor expansion of \(cosh(\sqrt{y}))\) are non-negative and thus \(cosh(\sqrt{y})\) is non-decreasing on the positive reals. In turn, \(2+cosh(\sqrt{x_i+3r}) \leq 3e^{\sqrt{x_i+3r}}\) by point 1 of the above remark. We conculde that if \(x_i \geq 0\), \(2 + cosh{\sqrt{x_i}} \leq 3 e^{\sqrt{x_i+3r}}\).
Next, we condiser the indices \(i \in [r]\) such that \(x_i{\lt}0\). In this case
since \(x_i \geq -3r\). It remains to show that \(\frac{x_j}{2+cosh(\sqrt{x_j})} \leq 1\) for every \(j \in [r]\). If \(x_j\geq 0\) this follows directly from point (1) in the above remark. If \(x_j{\lt}0\), note that the numerator of the fraction is negative while the denominator is a positive number between 1 and 3 by point(2) of the remark above.
Lemma 3.3
Let \(r \in \mathbb {N}\). If there exists an \(i \in [r]\) with \(x_i \le - 3r\), the function \(f \colon \mathbb {R}^r \rightarrow \mathbb {R}\) defined in 1 satisfies
Let \(i \in [r]\) satisfy \(x_i \leq -3r\). Note that if \(x\geq 0\), then \(\frac{x}{2+cosh(\sqrt{x})}\leq 1\) by point (1) of the above remark. Trivially, we have \(\frac{x}{2+cosh{\sqrt{x}}}{\lt}0\leq 1\) for \(x{\lt}0\) (note that \(2+cosh(\sqrt{x})\geq 1\).) Therefore, we have
On the other hand, we know that \(\frac{x_i}{2+cosh(\sqrt{x_i})} \leq \frac{x_i}{3}\) by part (2) in the remark above and therefore,
Finally we write
since the sum in the last term is \(\leq -1\) while the product is at least \(1\) (each factor in the product is at least 1).
The key idea in the two lemmas above was to find an entire function, \(2 + \cosh \sqrt{x}\), which \((a)\) has a Taylor expansion at \(x = 0\) with non-negative coefficients; \((b)\) is bounded on the negative real axis; and \((c)\) does not grow too quickly on the positive axis. The Phragmén–Lindelöf theorem implies that functions satisfying \((a)\) and \((b)\) must grow at least as fast as \(\exp \big( \Omega (\sqrt{x}) \big)\) on the positive real axis. Thus the bound on the growth of \(f\) given by the lemma is essentially best possible for constructions of this type.
Lemma 3.1
Let \(r, n\in \mathbb {N}\). Set \(\beta = 3^{-4r}\) and \(C = 4r^{3/2}\). Let \(U\) and \(U'\) be i.i.d. random variables taking values in a finite set \(X\), and let \(\sigma _1,\ldots ,\sigma _r \colon X \rightarrow \mathbb {R}^n\) be arbitrary functions. There exists \(\lambda \ge -1\) and \(i\in [r]\) such that
For each \(i \in [r]\), define \(Z_i = 3r\big\langle \sigma _i(U),\sigma _i(U') \big\rangle \), and let \(E\) be the event that \(Z_i \ge -3r\) for every \(i \in [r]\).
Consider two cases:
First assume \(\mathbb {P}(E) \ge \beta \). Note that
so with \(\lambda = -1\),
hence the claimed inequality holds.
Now, assume \(\mathbb {P}(E) \le \beta \) and assume for the sake of a contradiction that for all \(\lambda \geq -1\),
Observe that, since \(x \le 2 + \cosh \sqrt{x}\) (Theorem 14) and using Theorem 21 and linearity of expectation,
We hence have
where we use Theorem 23 to bound the case for \(E\), and Theorem 24 for \(E^c\). It follows that
Let \(M = \max \big\{ \big\langle \sigma _i(U),\sigma _i(U') \big\rangle : i \in [r] \big\} \). For any constant \(c \le C - 1\), we have
Given any \(c \leq C-1\) we take the expected value of both sides of the previous equation and apply Tonelli’s theorem to obtain:
where in the second inequality we used (Equation 2) and \(\mathbb {P}(E) \le \beta \); in the third we used \(c \le C - 1\); and in the last inequality we used the fact that \(\int _0^\infty \frac{1}{2\sqrt{x}} e^{-\sqrt{x}} \, \mathrm{d}x = 1\). Note that the second and third inequalities rely on the non-negativity of the integrand. In particular, applying this with \(c = \sqrt{3}r^{3/2}\), and recalling that \(C = 4r^{3/2} \ge c + 1\) and \(\beta = 3^{-4r}\), it follows that
which contradicts our assumption that \(\mathbb {P}(E) \le \beta \).
Hence, there exists \(\lambda \ge -1\) such that
and therefore there exists an \(i \in [r]\) as required.