In this lesson you will learn about aggregates in R using dplyr. An aggregate statistic is a way of creating a single number that describes a group of numbers. Common aggregate statistics include mean, median, and standard deviation.
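As a minimal sketch of what an aggregate statistic looks like in practice, the snippet below reduces a small vector to single numbers using base R functions. The `prices` values are invented for illustration and are not taken from the ShoeFly.com data.

```r
# Hypothetical shoe prices, made up for this example
prices <- c(20, 30, 40, 50, 110)

# Each call collapses the whole vector into one descriptive number
avg_price <- mean(prices)      # arithmetic average: 50
mid_price <- median(prices)    # middle value when sorted: 40
spread    <- sd(prices)        # sample standard deviation

print(avg_price)
print(mid_price)
print(spread)
```

Note that the single large value (110) pulls the mean above the median, which is one reason to look at more than one aggregate.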
Additionally, you will learn how you can group data into different subsets based on column values. This can help narrow the focus of a summary statistic to a subset of a dataset. R you ready to get started?
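Grouping might look roughly like the following with dplyr's `group_by()` and `summarize()`. This is only a sketch: the `orders` data frame, its column names (`shoe_type`, `price`), and its values are assumptions made up here, not the actual ShoeFly.com data loaded in the workspace.

```r
library(dplyr)

# Hypothetical orders table; names and values are illustrative assumptions
orders <- data.frame(
  shoe_type = c("sandals", "boots", "sandals", "boots", "clogs"),
  price     = c(20, 60, 30, 80, 45)
)

# Split the rows by shoe_type, then compute one summary row per group
price_by_type <- orders %>%
  group_by(shoe_type) %>%
  summarize(
    mean_price = mean(price),  # average price within each group
    n_orders   = n()           # number of rows in each group
  )

print(price_by_type)
```

The result has one row per distinct `shoe_type`, so each aggregate describes only that subset rather than the whole dataset.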
Instructions
In the workspace to the right, we have loaded data from ShoeFly.com, a fictional e-commerce shoe store. The data includes information about customer orders as well as the sources of page visits to ShoeFly.com’s website.
Review the code and the output. What calculations are made, and on which subsets of the data do they occur?
Throughout the rest of this lesson, you will dig into ShoeFly.com’s data to learn more about its customers’ orders and how users are discovering the website. Proceed to the next exercise to get started!