5- Covariance 

In probability theory and statistics, covariance is a measure of the joint variability of two random variables. If the greater values of one variable mainly correspond with the greater values of the other variable, and the same holds for the lesser values (i.e., the variables tend to show similar behavior), the covariance is positive. In the opposite case, when the greater values of one variable mainly correspond to the lesser values of the other (i.e., the variables tend to show opposite behavior), the covariance is negative. The sign of the covariance therefore shows the tendency in the linear relationship between the variables.

For two jointly distributed real-valued random variables X and Y, the covariance is defined as the expected value (or mean) of the product of their deviations from their individual expected values:

cov(X, Y) = E[(X − E(X))(Y − E(Y))]

where E(X) is the expected value of X, also known as the mean of X.

The covariance is also sometimes denoted σ_XY or σ(X, Y), in analogy to variance. By using the linearity property of expectations, this can be simplified to the expected value of their product minus the product of their expected values:

cov(X, Y) = E[(X − E(X))(Y − E(Y))]

= E[XY − X E(Y) − E(X) Y + E(X) E(Y)]

= E[XY] − E(X)E(Y) − E(X)E(Y) + E(X)E(Y)

= E[XY] − E(X)E(Y)

Covariance of discrete random variables:

If the random variable pair (X, Y) can take on the values (x_i, y_i) for i = 1, ..., n, with equal probabilities p_i = 1/n, then the covariance can be equivalently written in terms of the means E(X) and E(Y) as

cov(X, Y) = (1/n) Σ_{i=1}^{n} (x_i − E(X))(y_i − E(Y))

More generally, if there are n possible realizations of (X, Y), namely (x_i, y_i) but with possibly unequal probabilities p_i for i = 1, ..., n, then the covariance is

cov(X, Y) = Σ_{i=1}^{n} p_i (x_i − E(X))(y_i − E(Y))

Suppose that X and Y have the following joint probability mass function:

(x, y) ∈ S = {(5,8), (6,8), (7,8), (5,9), (6,9), (7,9)} with probabilities respectively {0, 0.4, 0.1, 0.3, 0, 0.2}

Then we can deduce that X can take on three values (5, 6 and 7) with probabilities respectively (0.3, 0.4, 0.3), and Y can take on two (8 and 9) with probabilities respectively (0.5, 0.5).

So

E(X) = 5(0.3) + 6(0.4) + 7(0.1 + 0.2) = 6

and

E(Y) = 8(0.4 + 0.1) + 9(0.3 + 0.2) = 8.5

Then,

cov(X, Y) = (0)(5 − 6)(8 − 8.5) + (0.4)(6 − 6)(8 − 8.5)
+ (0.1)(7 − 6)(8 − 8.5) + (0.3)(5 − 6)(9 − 8.5)
+ (0)(6 − 6)(9 − 8.5) + (0.2)(7 − 6)(9 − 8.5) = −0.1
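The same arithmetic can be checked in a few lines of Python, using the joint pmf from this example:

```python
# The joint pmf from the worked example: P(X=x, Y=y) for each pair.
pmf = {(5, 8): 0.0, (6, 8): 0.4, (7, 8): 0.1,
       (5, 9): 0.3, (6, 9): 0.0, (7, 9): 0.2}

ex = sum(p * x for (x, _), p in pmf.items())   # E(X)
ey = sum(p * y for (_, y), p in pmf.items())   # E(Y)
cov = sum(p * (x - ex) * (y - ey) for (x, y), p in pmf.items())

assert abs(ex - 6.0) < 1e-12    # E(X) = 6
assert abs(ey - 8.5) < 1e-12    # E(Y) = 8.5
assert abs(cov + 0.1) < 1e-12   # cov(X, Y) = -0.1
```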

The variance is a special case of the covariance in which the two variables are identical (that is, in which one variable always takes the same value as the other):

cov(X, X) = Var(X) = E[(X − E(X))^2]

Covariance of linear combinations

If 𝑋,π‘Œ,𝑉,π‘Žπ‘›π‘‘ π‘Š are  real-valued  random  variables  and π‘Ž,𝑏,𝑐,𝑑  are real-valued constants,

then the following facts are a consequence of the definition of covariance:

cov(X, a) = 0

cov(X, X) = Var(X)

cov(X, Y) = cov(Y, X)

cov(aX, bY) = ab cov(X, Y)

cov(X + a, Y + b) = cov(X, Y)

π‘π‘œπ‘£(π‘Žπ‘‹+π‘π‘Œ,𝑐𝑋+𝑑𝑉)=π‘Žπ‘ π‘π‘œπ‘£(𝑋,π‘Š)+π‘Žπ‘‘ π‘π‘œπ‘£(𝑋,𝑉)+𝑏𝑐 π‘π‘œπ‘£(π‘Œ,π‘Š)+𝑏𝑑 π‘π‘œπ‘£(π‘Œ,𝑉)

For a sequence X_1, ..., X_n of real-valued random variables and constants a_1, ..., a_n, we have

Var(Σ_{i=1}^{n} a_i X_i) = Σ_{i=1}^{n} a_i^2 Var(X_i) + 2 Σ_{i<j} a_i a_j cov(X_i, X_j)
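The variance of a linear combination expands as Var(Σ a_i X_i) = Σ_{i,j} a_i a_j cov(X_i, X_j), since the variance of the sum is the covariance of the sum with itself. A numerical sketch of this identity, again on empirical covariances with arbitrary data:

```python
# Check Var(sum_i a_i X_i) = sum_{i,j} a_i a_j cov(X_i, X_j)
# on empirical (population-style) covariances; data and coefficients are arbitrary.
import random

def cov(u, v):
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    return sum((s - mu) * (t - mv) for s, t in zip(u, v)) / n

random.seed(2)
m, n = 3, 500
xs = [[random.gauss(0, 1) for _ in range(n)] for _ in range(m)]
a = [1.0, -2.0, 0.5]

combo = [sum(a[i] * xs[i][k] for i in range(m)) for k in range(n)]
var_combo = cov(combo, combo)  # Var of the linear combination
expanded = sum(a[i] * a[j] * cov(xs[i], xs[j])
               for i in range(m) for j in range(m))
assert abs(var_combo - expanded) < 1e-9
```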