An often forgotten formula for the mean of a random variable is given by:
And for the continous case:
This blog post is going to illustrate how these formulas arise.
Explanation in Discrete Case
Let’s start from the well-known formula for the mean based on the probability mass function (pmf):
Obviously this sum can be extended to include as the resulting term doesn’t affect the sum at all.
Decomposing the sum we can arrange the involved terms in the form of a triangle:
To emphasize what’s going on: Each row consists of multiples of the pmf for some . Adding each element of a row gives a summand to the expected value / population mean.
Now, we know that the triangle corresponds to the expected value. It doesn’t matter how it is calculated. So, let’s switch from summing rows to summing columns and see what happens!.
And what does the alternative look like? The point to start at is the fact that the rightmost column adds to 1. The more you move to the right, the more of 1 you lose – in favor of the cumulative distribution function (cdf).
On the left-hand end and for all we have which doesn’t contribute to the sum at all. Hence, we can increase the upper bound of the sum to :
Continuous Case
Es finden sich online Beweise, die explizit für die Berechnung des Erwartungswertes kontinuierlicher Zufallsvariablen beschäftigen. Diese haben auch ihre Berechtigung, da sie mithilfe bewiesener Gesetzmäßigkeiten, die Korrektheit folgender Formel belegen:
Für das Verständnis ist es aber sicher sinnvoller, sich vorzustellen, dass man die Formel des vorangegangenen Abschnitts auf unendlich viele unendlich kleine anwendet. Denn nichts anderes als eine unendlich feine Summe ist ein Integral!
Leave a Reply