Learn

In this lesson you will learn about aggregates in R using dplyr. An aggregate statistic is a way of creating a single number that describes a group of numbers. Common aggregate statistics include mean, median, and standard deviation.

Additionally, you will learn how you can group data into different subsets based on column values. This can help narrow the focus of a summary statistic to a subset of a dataset. R you ready to get started?

Instructions

In the workspace to the right we have loaded data from ShoeFly.com, a fictional e-commerce shoe store. The data includes information regarding customer orders as well as the source of page visits to ShoeFly.com’s website.

Review the code and the output. What calculations are made, and on which subsets of the data do they occur?

Throughout the rest of this lesson you will dig into ShoeFly.com’s data to learn more about its customer’s orders and how users are discovering the website. Proceed to the next exercise to get started!

Take this course for free

Mini Info Outline Icon
By signing up for Codecademy, you agree to Codecademy's Terms of Service & Privacy Policy.

Or sign up using:

Already have an account?