Thinking about Data
Become wise in the ways of statistical thinking with this approachable yet hearty introduction.
StartStatistical Thinking
Lesson 1 of 1
- 1Let’s imagine we are working for the city government of the fictional city of Melody Metropolis. The mayor of Melody Metropolis wants to know more about the musicians who currently live in the city…
- 2To start our summary for the mayor, let’s describe some of the categorical variables in the musician dataset — those variables that contain qualitative information on the city’s musicians. Fi…
- 3Now that we’ve learned about some of the categorical variables in our musician dataset, it’s time to explore some numeric variables — those with quantitative data. There are a lot of ways we …
- 4Let’s return to our visualization of the distribution of musician ages. ![Bar plot showing the distribution of ages. Ages range from about 15 to 70. The frequencies determine the heights of the ba…
- 5As we’re moving through the numeric variables in our musician dataset, we come across some interesting details when we inspect the income variable. ![Plot showing the distribution of the income va…
- 6Let’s find an alternative measure to the mean. We want to find a value that represents the typical musician income, but we don’t want to use the actual values in the computation because the data ar…
- 7For our income data, the difference between the mean ($34,795) and median ($32,978) was only about $2,000. You may be wondering: Is the difference ever larger? Let’s imagine some very famous celeb…
- 8One measure that we haven’t covered that is usually talked about alongside the mean and median is the mode. The mode is defined as the value with the highest frequency, but we can also think of the…
- 9Aggregating data is a way of exploring variable relationships. We specifically looked at relationships between a numeric variable and a categorical variable, but we should also examine relationship…
How you'll master it
Stress-test your knowledge with quizzes that help commit syntax to memory