Author: Eiko

Time: 2025-02-17 13:04:14 - 2025-02-17 13:04:14 (UTC)

High Dimensional Statistics

  • Concentration and its uses

  • Sub-Gaussian random variables

  • Sub-Exponential random variables

Concentration

Let Xi be a sequence of i.i.d random variables

Basic Concentration Inequality

P[f(|Xμ|)f(t)]minfEf(|Xμ|)f(t)

U-Statistics

Canonical statistics Xi/nEX, suppose we want to estimate E|X1X2|, we would consider using i<j|xixj|(n2).

The U-statistics are of the form

U=1(n2)i<jg(xi,xj)

where g is a symmetric function of two variables.

Imagine that we have g<b or assume |x|<b,

U(x1,,xn)U(x1,,xj,,xn)|1(n2)i,ij|g(xi,xj)g(xi,xj)|2b(n1)(n2)=4bn.

P(|UEU|t)2exp(nt28b2).