Learn
Learn Seaborn Introduction
Calculating Different Aggregates

In most cases, we’ll want to plot the mean of our data, but sometimes, we’ll want something different:

  • If our data has many outliers, we may want to plot the median.
  • If our data is categorical, we might want to count how many times each category appears (such as in the case of survey responses).

Seaborn is flexible and can calculate any aggregate you want. To do so, you’ll need to use the keyword argument estimator, which accepts any function that works on a list.

For example, to calculate the median, you can pass in np.median to the estimator keyword:

sns.barplot(data=df, x="x-values", y="y-values", estimator=np.median)

Consider the data in results.csv. To calculate the number of times a particular value appears in the Response column , we pass in len:

sns.barplot(data=df, x="Patient ID", y="Response", estimator=len)

Instructions

1.

Consider our hospital satisfaction survey data, which is loaded into the Pandas DataFrame df. Use print to examine the data.

2.

We’d like to know how many men and women answered the survey. Use sns.barplot() with:

  • data equal to df
  • x equal to Gender
  • y equal to Response
  • estimator equal to len
3.

Use plt.show() to display the graph.

4.

Change sns.barplot() to graph the median Response aggregated by Gender using estimator=np.median.

Folder Icon

Sign up to start coding

Already have an account?