We are estimating the symmetric statistic ) with mean . Given data , , the u-statistic for is
Unlike v-statistics, u-statistics are unbiased (hence the “u”). They were introduced by Hoeffing in 1948. Notice that unlike v-statistics, we are summing only over all ordered permutations.
The sample mean and sample variance are both examples of u-statistics.
U-statistics admit various decompositions which help reason about their asymptotic behavior. Eg the Hayek projection and the Hoeffding decomposition.
U-statistics are reverse submartingales, so we can form confidence sequences for them (confidence sequences for convex functionals). Using similar machinery we can develop PAC-Bayes bounds for them.