Data and statistics FAQ

Frequently asked questions about data and statistics

What is a statistical question?

A statistical question is a question that we can answer by collecting and analyzing data from many different things or people. For example, "How tall are the students in our class?" is a statistical question, because we can measure the heights of all the students and look at how they vary. "How tall is the teacher?" is not a statistical question, because it only involves one thing or person, and we don't need data to answer it.

What are measures of center and why do we need them?

Sometimes we have a lot of data, like test scores, heights, weights, or temperatures, and we want to summarize them with one number that represents the whole group. This number is called a measure of center, because it is supposed to be close to the middle of the data. There are different ways to find the measure of center, depending on what kind of data we have and what we want to know.

The most common measures of center are the mean, the median, and the mode. The mean is the average of all the data values, which we find by adding them all up and dividing by how many there are. The median is the middle value of the data, which we find by putting the values in order from smallest to largest and picking the one in the middle (or the average of the two in the middle, if there is an even number of values). The mode is the most frequent value of the data, which we find by counting how many times each value appears and picking the one that appears the most. A data set can have more than one mode if several values tie for most frequent.
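
As a rough illustration, the three measures of center can be computed with Python's built-in `statistics` module. The quiz scores below are made up for this sketch and do not come from the lesson.

```python
from statistics import mean, median, multimode

# Hypothetical quiz scores, invented for illustration only.
scores = [7, 9, 8, 10, 7, 6, 9, 7]

average = mean(scores)            # add all values, then divide by how many there are
middle = median(scores)           # sort, then take the middle (or average the two middle values)
most_common = multimode(scores)   # list of every value tied for most frequent

print(average, middle, most_common)  # prints: 7.875 7.5 [7]
```

Here the eight scores add up to 63, so the mean is 63 ÷ 8 = 7.875. The two middle values after sorting are 7 and 8, so the median is 7.5, and 7 appears most often, so it is the mode.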

We can use measures of center to compare different groups of data, to see which one has higher or lower values overall, or to see how the data is distributed around the center. For example, we can compare the mean test scores of different classes, or the median heights of different sports teams, or the mode of the favorite colors of different groups of friends.

What are measures of variation and why do we need them?

Measures of center are useful, but they don't tell us everything about the data. Sometimes we also want to know how spread out the data is, or how much the values differ from each other and from the center. This is called variation, and we can measure it with different numbers, too.

Some of the most common measures of variation are the range, the interquartile range, and the mean absolute deviation. The range is the difference between the highest and the lowest values of the data, which we find by subtracting the minimum from the maximum. The interquartile range is the spread of the middle 50% of the data, which we find by splitting the ordered data into four equal parts (the cut points are called quartiles) and subtracting the first quartile from the third quartile. The mean absolute deviation is the average of how far each value is from the mean, which we find by subtracting the mean from each value, taking the absolute value of each difference (which means ignoring the negative sign), adding them all up, and dividing by how many there are.
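
The sketch below shows one way these three measures might be computed in Python. The temperatures are invented, and the helper names `interquartile_range` and `mean_absolute_deviation` are hypothetical. The quartiles here use the "median of each half" rule, which is an assumption: other tools may use slightly different quartile rules and give slightly different answers.

```python
from statistics import mean, median

def interquartile_range(values):
    """IQR using the 'median of each half' rule (assumed; needs at least 2 values)."""
    ordered = sorted(values)
    half = len(ordered) // 2
    lower = ordered[:half]     # lower half (middle value left out when the count is odd)
    upper = ordered[-half:]    # upper half
    return median(upper) - median(lower)   # third quartile minus first quartile

def mean_absolute_deviation(values):
    """Average distance of each value from the mean."""
    center = mean(values)
    return mean(abs(v - center) for v in values)

# Hypothetical daily temperatures, invented for illustration only.
temps = [2, 4, 4, 5, 7, 9, 11]

data_range = max(temps) - min(temps)   # 11 - 2 = 9
iqr = interquartile_range(temps)       # 9 - 4 = 5
mad = mean_absolute_deviation(temps)   # 18 / 7, about 2.57

print(data_range, iqr, round(mad, 2))  # prints: 9 5 2.57
```

For this data the mean is 6, the distances from the mean are 4, 2, 2, 1, 1, 3, and 5, and their average is 18 ÷ 7.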

We can use measures of variation to compare different groups of data, to see which one has more or less variability, or to see how the data is shaped around the center. For example, we can compare the range of temperatures in different seasons, or the interquartile range of incomes in different neighborhoods, or the mean absolute deviation of ages in different families.

How do we choose the best measure of center and variation for our data?

There is no single best measure of center and variation for all data, because different measures have different advantages and disadvantages, depending on the situation. We have to think about what kind of data we have, what we want to learn from it, and what we want to communicate to others.

Some things to consider are:

- Is the data numerical or categorical? Numerical data can be measured with numbers, like heights, weights, or scores. Categorical data can be grouped into categories, like colors, animals, or genres. We can use mean, median, and mode for numerical data, but only mode for categorical data. We can use range, interquartile range, and mean absolute deviation for numerical data, but not for categorical data.
- Is the data symmetrical or skewed? Symmetrical data has values that are evenly distributed around the center, like a bell-shaped curve. Skewed data has values that are more clustered on one side of the center and more spread out on the other side, like a tail. We can use mean, median, and mode for symmetrical data, but median and mode are more reliable for skewed data, because they are less affected by extreme values. Similarly, the interquartile range is less affected by extreme values than the range is.
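
To see why the median holds up better than the mean when the data has an extreme value, here is a small sketch with made-up weekly allowances. The numbers are invented for illustration.

```python
from statistics import mean, median

# Hypothetical weekly allowances in dollars, invented for illustration only.
typical = [8, 9, 10, 11, 12]
with_outlier = typical + [100]   # one extreme value added

mean_before, median_before = mean(typical), median(typical)
mean_after, median_after = mean(with_outlier), median(with_outlier)

print(mean_before, median_before)  # prints: 10 10
print(mean_after, median_after)    # prints: 25 10.5
```

One extreme value pulls the mean from 10 up to 25, while the median only moves from 10 to 10.5.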

How do we pick an appropriate data display?

There is no one right answer to how we pick an appropriate data display, but there are some things we can consider to help us decide. Some of the factors we can think about are:

- The type and the size of the data. For example, if we have categorical data, such as favorite colors or types of animals, we might use a frequency table or a bar graph to show the data. If we have numerical data, such as heights or weights, we might use a histogram, a box plot, or a scatter plot to show the data. If we have a lot of data, we might use a graph to make it easier to see the patterns and trends in the data. If we have a small amount of data, we might use a table to show the exact values and frequencies of the data.
- The purpose and the audience of the data display. For example, if we want to compare the data across different groups or categories, we might use a dot plot, a histogram, or a box plot to show the similarities and differences of the data. If we want to show the relationship between two variables, we might use a scatter plot or a line graph to show the correlation or the trend of the data. If we want to show the distribution or the shape of the data, we might use a histogram or a box plot to show the center, the spread, and the outliers of the data.

Whatever display type we choose, we would want it to be clear, easy to read, and attractive, with a title, labels, and scales. That way, our audience can read and interpret our display.
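
As one possible illustration of a display for categorical data, the sketch below builds a frequency table and prints it as a simple text bar graph. The survey answers are made up, and the layout is just one choice among many.

```python
from collections import Counter

# Hypothetical survey answers, invented for illustration only.
favorite_colors = ["blue", "red", "blue", "green", "blue", "red"]

# Frequency table: how many times each category appears.
counts = Counter(favorite_colors)

print("Favorite colors in our class")       # title
for color, count in counts.most_common():   # most frequent category first
    print(f"{color:<6} {'#' * count} ({count})")  # label, bar, exact count
```

Each row has a label, a bar made of one `#` per answer, and the exact count, so the reader can compare categories at a glance and still read the values.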

Want to join the conversation?

Santos
Posted 4 months ago.
Why do we need to use data? What is the point of it?
(3 votes)
- GoodGuy613
  Posted 3 months ago.
  To be able to properly organize and categorize any information.
  (3 votes)
zizhenpeng.jessica
Posted a year ago.
Why exactly do we need data, and why do we have to learn it?
(1 vote)
- kjanani30
  Posted a year ago.
  We have to show different ways to represent info. We have to learn it because it is basically all info.
  (3 votes)
🌟Adrienne🌟
Posted 7 days ago.
How do we use this in real life?
(1 vote)
- joshua
  Posted 6 days ago.
  When you do a survey, you use this to display and summarize the data appropriately.
  (2 votes)
kjanani30
Posted a year ago.
What does a "whisker" represent in a box and whisker plot?
(0 votes)
- johnnyjazzman
  Posted 5 months ago.
  If it is on the left, it is the line between the minimum and the middle of the first half of the data (the first quartile). It is the opposite on the right: the line between the middle of the second half (the third quartile) and the maximum.
  (2 votes)
Jordyn Gray
Posted 19 days ago.
Hey, why do we use histograms when we could just use box and whisker plots?
(0 votes)
- Anya P Smith
  Posted 18 days ago.
  Because it is just another useful tool for organizing data.
  (1 vote)
ja07320
Posted 2 months ago.
Yes, I do not understand. Help?
(0 votes)
jr05020
Posted 2 months ago.
Why exactly do we need data, and why do we have to learn it?
(0 votes)
lillie
Posted a year ago.
ELA is easier and math is getting harder. Is it just me?
(0 votes)
- @$ White Liger
  Posted a year ago.
  In my opinion, it is just you. Math is actually very easy once you can relate to the content.
  (3 votes)
Lauryn B.
Posted a month ago.
What does this mean, and why do we need to learn it if we don't know?
(0 votes)
- Anya P Smith
  Posted 20 days ago.
  Because this can relate to real-life problems. If you were a vet and you were trying to figure out how much a typical dog weighs, you would need that.
  (3 votes)
liu Leon
Posted a year ago.
What is "correlation" or "trend"?
(0 votes)
- Maggie (Mesgana Tadesse)
  Posted a year ago.
  A trend is when two variables form a pattern in a set of results displayed in a graph. Correlation describes the relationship between variables. It can be described as either strong or weak, and as either positive or negative. So, when two variables are trending, a correlation analysis will often show there is a significant relationship simply because of the trend, not necessarily because there is a cause-and-effect relationship between the two variables.
  (6 votes)