If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

Main content

Mean and standard deviation of a discrete random variable

Practice calculating and interpreting the mean and standard deviation of a discrete random variable.

Example: Ticket sales for a concert

Organizers of a concert are limiting tickets sales to a maximum of 4 tickets per customer. Let T be the number of tickets purchased by a random customer. Here is the probability distribution of T:
T=# of tickets1234
P(T)0.10.30.20.4
Question 1
Calculate the expected value of T.
μT=
  • Your answer should be
  • an integer, like 6
  • a simplified proper fraction, like 3/5
  • a simplified improper fraction, like 7/4
  • a mixed number, like 1 3/4
  • an exact decimal, like 0.75
  • a multiple of pi, like 12 pi or 2/3 pi
tickets

Question 2
Choose the best interpretation of the mean (or expected value) that you found in the previous question.
Choose 1 answer:

Question 3
Calculate the standard deviation of T.
Round your answer to three decimal places.
σT=
  • Your answer should be
  • an integer, like 6
  • a simplified proper fraction, like 3/5
  • a simplified improper fraction, like 7/4
  • a mixed number, like 1 3/4
  • an exact decimal, like 0.75
  • a multiple of pi, like 12 pi or 2/3 pi
tickets
For reference, here is the distribution again:
T=# of tickets1234
P(T)0.10.30.20.4

What do you think is the best interpretation of the standard deviation that you found in the previous question?
Feel free to discuss this in the comments! Keep in mind that standard deviation describes how far away from the mean data points tend to fall.

Want to join the conversation?

  • purple pi teal style avatar for user Nguyen Nguyen
    Standard Deviation gives you a sense of how spread out the data is from the mean. In other words, it shows you the variation of the sample which you are analyzing.
    (44 votes)
    Default Khan Academy avatar avatar for user
  • leaf red style avatar for user Naoya Okamoto
    Most of the data points will fall within 1.044 away from the mean value of 2.9 tickets.
    (23 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user katieleonard032
    What is the difference between variance and standard deviation? Why is standard deviation the square root of the variance? Thanks!
    (5 votes)
    Default Khan Academy avatar avatar for user
  • aqualine ultimate style avatar for user mirahbob
    It seems like the standard deviation is similar to the mean difference from the mean--mean average deviation--but since you square everything and then put it through a square root, it comes out slightly differently. What is the purpose behind that? Why is it better to use standard deviation with the mean as opposed to MAD?
    (4 votes)
    Default Khan Academy avatar avatar for user
    • starky ultimate style avatar for user KLaudano
      Taking the square root of the sum of squares for the standard deviation results in the following behavior; a few points that are very far from the mean will increase the standard deviation more than many points deviating slightly from the mean.

      Example: The mean average deviations for both of the sets {2, 2, 6, 6} and {0, 8, 4, 4} equal 2. However, the standard deviation for the first set is 2 and the standard deviation for the second set is 2.828. In the first set, all of the points deviate slightly from the mean. In the second set, a couple of points deviate largely from the mean. This results in the second standard deviation being larger than the first, even though they share the same mean average deviation.
      (7 votes)
  • blobby green style avatar for user elliots
    If many many tickets are purchased, the amount of tickets purchased will typically vary by about 1.044 tickets from the mean of 2.9 tickets.
    (6 votes)
    Default Khan Academy avatar avatar for user
  • primosaur ultimate style avatar for user Sandra Malaquias
    If we look at a large number of customers, then 68% customer, will have purchased between 1.856 and 3.94 tickets (corresponding 1.044 bellow and above the mean)
    (4 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user Aleksandr Nokhrin
    Number of tickets bought by the most of customers (about 68% of total customers) will fall between <mean>-<st dev> and <mean>+<st dev>.
    Or 2.9-1.044 <= number of tickets by customer <= 2.9+1.044
    (2 votes)
    Default Khan Academy avatar avatar for user
    • cacteye blue style avatar for user Jerry Nilsson
      Remember that 68% would be if the distribution is normal, which it isn't.

      I'm not sure if my method is correct, but I think that if we say that a person buying 𝑛 tickets is actually buying at least (𝑛 − 0.5) tickets and less than (𝑛 + 0.5) tickets, then we can view the distribution as if it were continuous.

      The fraction of customers that are within 1 standard deviation of the mean is equal to the area under the distribution curve between 𝑛 = 1.856 and 𝑛 = 3.944,
      which would be (2.5 − 1.854)⋅0.3 + 0.2 + (3.944 − 3.5)⋅0.4 = 0.5714
      or about 57%

      Then again, this would only make sense if customers could actually buy fractions of tickets.
      (3 votes)
  • duskpin ultimate style avatar for user BeeGee
    I used the 1-Var Stats function of the TI-84 Plus CE, and it gave me 0.1104 for the standard deviation. My List was {0.1, 0.3, 0.2, 0.4}, and my FreqList was just {1, 2, 3, 4}.
    Did I do something wrong, or am I supposed to multiply σx by 10 to get the correct answer?

    EDIT:

    7 months later: I found out that the FreqList option is used only for denoting multiple occurrences of whatever value it maps to in the List option. TI-84 users beware!

    Also, leaving the FreqList option blank is the equivalent to having a FreqList that has the same dimensions as the List but filled with 1's.
    (2 votes)
    Default Khan Academy avatar avatar for user
    • leaf grey style avatar for user Peterson Torio
      You can still use it but instead of leaving it blank or writing the probability (i.e. 0.84, 0.16), you have to input the product of the probability (the numbers with decimal points to show percentages) and the number of discrete random variables given. I do that and I think that this technique is the fastest way to get the standard deviation and mean.

      Note: This only works for problems where the given infos are discrete random variables and their probabilities.
      (1 vote)
  • blobby green style avatar for user Abdullah S.
    I would say that if we take many, many customers, the number of tickets purchased would vary by 1.044 from the mean of 2.9 tickets
    (2 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user Saugat Bhattarai
    The pattern of data spread from mean value.
    (2 votes)
    Default Khan Academy avatar avatar for user