Order statistics
February 21, 2019 — March 17, 2020
For a sample of independent observations \(X_{1}, X_{2}, \ldots, X_{n}\) with common distribution \(F\) the ordered sample values
\[X_{(1)} \leq X_{(2)} \leq \cdots \leq X_{(n)}\] are called the order statistics.
Todo: connection to maximum processes, learning ranking, simplex…
Hung Chen’s notes are good.
Gwern did some fun engineering of order statistics, which edges around some general properties of joint maximal statistics of elliptical copulas.
My one-weird-trick in this domain is for sums of top-\(k\)th of \(N\) i.i.d. exponential random variables, which turn out to have a simple representation in terms of \(k\) random exponentials (Nagaraja 2006). The magic is that quantile transforms make this into a very general way of doing cheap order statistics for i.i.d. variables.