## Histogram

The histogram groups data into bins and plots the binned data as bars.

## Uses

Histograms are used to plot the density of a data distribution.

(read more at Wikipedia)

## Code: Histogram in R

set.seed(11) df.histogram_dummy <- data.frame( x_var = rnorm(2000) ) # DEFAULT BINWIDTH (range/30) ggplot(data=df.histogram_dummy, aes(x=x_var)) + geom_histogram()

## Results

## Explanation

The code to create a histogram in R is very straight forward.

The fundamentals of building a histogram in R (using ggplot2) are basically the same as those for building any data visualization. We:

1. Specify our dataset

2. Create a "mapping" (i.e., a relationship) between variables in our dataset and aesthetic attributes in our plot

3. Specify the exact geometric objects we want to plot

(This deep underlying structure is essentially the same for all visualizations. We've seen a similar process for building a bar chart, scatterplot, and line chart. This deep structure will also remain as we create more sophisticated visualizations.)

Let's take a look at the code:

ggplot(data=df.histogram_dummy, aes(x=x_var))

Here we start by calling the

Next, with the

The typical process of creating a histogram, requires that we "bin" the data; that is, we need to divide the numeric variable into intervals (analysts frequently call these "bins", or sometimes "buckets"). After creating the numeric intervals, we count the number of records that fall within each interval.

Let's take a look at

geom_histogram()

Here, this line tells the

*wider*than the default.

# Wider binwidth ggplot(data=df.histogram_dummy, aes(x=x_var)) + geom_histogram(binwidth=.5)

We can also narrow the binwidth, which gives us a slightly different view of the data distribution.

# Narrower binwidth ggplot(data=df.histogram_dummy, aes(x=x_var)) + geom_histogram(binwidth=.1)

I just googled ‘small multiples r histogram’ and found your link halfway down the page. The two most recent posts seem to be exactly what I need. I am going to give it a try. Thanks! (btw, it looks like you forgot the comment section on the small multiples page.)

Awesome to hear that it’s exactly what you need!

I’ll be publishing another post on Dec 11 that you might find useful as well.

It’s about data manipulation, but I’ll also show how to create small multiples histograms based on

subsetsof your data. Check back in a few days!(also, thanks for the heads up on the comments section. fixed.)

Sir, I am a big fan of all your blogs .Unfortunately I cannot find an option to register.There is no sign up option in the member login page..Do I have to pay an amount to be a member?

We have a couple of courses for mastering data science …. one for Python and one for R.

These courses open for enrollment only a few times per year.

We announce enrollment opportunities only to people on our email list, so you need to join our email list to hear when the courses open up!