(づ•ᴥ•)づ┬─┬

Notes

The Smallest Weight Punches Above Its Weight (in Expectation)

Screenshot 2025-03-24 at 19

I saw this nice bound on Twitter; it's proved as part of this work.

Assuming:

we want to show that for the smallest of the w's, we have:

𝔼[wnXni=1nwiXi]wni=1nwi.

It is necessary that at least one of wi0 and if wn=0 then there's nothing to prove, so let's assume that wn>0 (this also means that all wi0). It is also necessary that not all of Xi are 0 at the same time (an unlikely event as they are independent).

We can rewrite the bound as follows. First set rj:=wj/iwi, 0<ri1. Then, we want to prove:

𝔼[Xni=1nriXi]1.

A simpler result

There's a similar result that is a bit easier to show. First, set ri=1/n, then we can argue that the distribution of Xi/j=1nrjXj does not depend on i (this is called "symmetrization"), so they should all have the same expected value, let's say μ. From this, we have that 𝔼[k=1nXki=1nXi/n]=nμ and so

𝔼[Xni=1nXi/n]=1.

A search for a proof

We can't use symmetrization immediately with the weighted version, the riXi variables are not identically distributed anymore.

Here's an idea, though. Let us first define the shorthand S:=k=1nriXi and Ei:=E[Xi/S]. Then, from linearity of expectation, we have:

i=1nriEi=𝔼[S/S]=1.

As a side effect, 1=i=1nriEimaxkEkirimaxkEk, so if we show maxkEk=En, then we are done!

Instead of going for a clever two-step-Jensen proof (like the authors of the paper), I'll attempt to show this with calculus. 🤔 This assumes more stuff, but it's fine, this isn't math class.

Proof

For i:=j+1, j, 0<rirj (from the ordering of the wi) and we want to show that EiEj. Let’s be adventurous and take a derivative with respect to rj while holding rk for ki,j constant. Then because of krk=1, we get that drj=dri and

ddrj(EiEj)=𝔼[(XiXj)2/S2]0,

so the difference EiEj is increasing as rj is increasing (and ri is decreasing, to compensate)1. We just need to find the minimum of EiEj. We can bound this minimum from below by setting ri and rj to be equal to the middle point r=(ri+rj)/2 (as we are decreasing rj, this also decreases the difference EiEj)2.

But now Xi and Xj have the same weight r, so by symmetrization the lower bound is 0, i.e.,

EiEj𝔼[XiXjki,jrkXk+r(Xi+Xj)]=0

So, we have shown that Ei are increasing with i, so En is the maximum, which implies En1, and we are done!

  1. To justify swapping derivative and expectation, we bound the term f=(XiXj)2/S2. It can be shown that S2=(rkXk)2(ri(XiXj))2, relying on rk,Xk0 and the condition rjri (for i=j+1). This inequality implies f(XiXj)2(ri(XiXj))2=1ri2. Since rirn>0, f is bounded by the finite constant 1/rn2, allowing the interchange.

  2. Remember that rirj according to the original assumptions, so setting the two equal is the smallest allowed value for rj.