Let’s review! In this lesson, you learned how to use NumPy to analyze different distributions and generate random numbers to produce datasets. Here’s what we covered:
- What is a histogram and how to map one using Matplotlib
- How to identify different dataset shapes, depending on peaks or distribution of data
- The definition of a normal distribution and how to use NumPy to generate one using NumPy’s random number functions
- The relationships between normal distributions and standard deviations
- The definition of a binomial distribution
Now you can use NumPy to analyze and graph your own datasets! You should practice building your intuition about not only what the data says, but what conclusions can be drawn from your observations.
Instructions
Practice what you’ve just learned with a dataset on sunflower heights! Imagine that you work for a botanical garden and you want to see how the sunflowers you planted last year did to see if you want to plant more of them.
Calculate the mean and standard deviation of this dataset. Save the mean to sunflowers_mean
and the standard deviation to sunflowers_std
.
We can see from the histogram that our data isn’t normally distributed. Let’s create a normally distributed sample to compare against what we observed.
Generate 5,000 random samples with the same mean and standard deviation as sunflowers
. Save these to sunflowers_normal
.
Now that you generated sunflowers_normal
, uncomment (remove all of the #
) the second plt.hist
statement. Press run to see your normal distribution and your observed distribution.
Generally, 10% of sunflowers that are planted fail to bloom. We planted 200, and want to know the probability that fewer than 20 will fail to bloom.
First, generate 5,000 binomial random numbers that represent our situation. Save them to experiments
.
What percent of experiments
had fewer than 20 sunflowers fail to bloom?
Save your answer to the variable prob
. This is the approximate probability that fewer than 20 of our sunflowers will fail to bloom.
Print prob
. Is it likely that fewer than 20 of our sunflowers will fail to bloom?