Creating visualizations of your data can help extract vital information. It can let you see the bigger picture and understand your data more deeply.
With the help of visualization tools like charts, graphs, maps, etc., you can easily understand the dynamics, trends, and relationships among data items and draw essential inferences.
A histogram is one such helpful visualization tool that helps you understand the distribution of your data.
In this tutorial, I will show you how to make a histogram in Google Sheets and customize it.
Table of Contents
To make a histogram, follow these steps:
You can download example template to follow along with this tutorial.
A histogram is a chart showing how a variable is distributed. It divides the range of your data into intervals, displaying how many of the data values fall into each interval. Basically, this means that a histogram shows the frequency of how often a value appears in a range or data. It looks similar to a bar graph, but there are a lot of differences between the two. The vertical axis often represents the frequency.
For example, in a census with a range of age values from 18 to 35, you can use a histogram to show often the ages appeared. It can also be used with a data range, for example, how often the ages 10-20 appeared in the census.
Each of these intervals is displayed in the form of ‘bins’ or ‘buckets’. Visually, the bins may look like bars of a bar graph, but a histogram is actually quite different from a bar graph.
A histogram is usually used for summarizing continuous data. You can use it to show the frequencies in the data. They help identify patterns such as skew, symmetry, and multimodality in the data.
Histograms are also used in quality control processes to monitor variations in manufacturing processes or product characteristics. You can use them to identify deviations from the standard.
A Google Spreadsheet histogram is primarily different from a bar graph in terms of the application.
A Google Sheets histogram is used to understand the data distribution, while a bar graph is used to compare variables. The kind of data plotted by histograms and Bar Graphs is also different.
Histograms mainly plot quantitative data. So you plot how data of a single category is distributed. Bar graphs, on the other hand, plot categorical data. So you plot the quantity or frequency of data in different categories.
Histograms in Google Sheets work the same way as all other histograms. In this tutorial, we will show you how to create a histogram in Google Sheets to visualize your data and how to further customize the histogram according to your requirement. You can also use a line of best fit together with a histogram.
To understand how to create a Google Sheet histogram, we will use the data shown in the image below:
This dataset contains scores of students in an exam. We want to know how to create histograms in Google Sheets to understand how the student scores in the exams were distributed.
To make the histogram for the above data, follow these steps:
You should now see a histogram on your worksheet.
Google Sheets performs its own calculations on your data and displays what it believes to be the optimal number of bins for your histogram.
Its calculations, however, are usually far from perfect. As such, you will usually feel the need to customize the histogram to give it the look and functionality you want after creating a histogram in Google Sheets.
Now that you know how to make a histogram in Google Sheets, you customize it to your liking. Usually, the Chart editor has a ‘Customize’ tab that lets you enter all your specifications.
However, sometimes the Chart editor goes away after your Google Sheet histogram has been created.
To make it appear again and to customize your histogram, do the following:
The Chart style category in the Chart editor lets you set the background color, border color, font style, and size of your chart.
In our example, we changed the background color to “light green 3”, and allowed the other settings to remain the same.
You can use the Histogram category of the Chart editor to adjust the bin sizes to your requirement. For example, the intervals of scores displayed along the x-axis have very arbitrary sizes.
Distributing exam scores into these intervals does not really make much sense in practice.
So it would be better if the distributions were in intervals of 10.
For this, we need to change the ‘Bucket sizes’ to 10, as shown below:
Your chart should then display student score distributions in intervals of 10:
The outlier percentile drop-down lets you group data outliers with the closest relevant bucket. Besides this, the Show item dividers checkbox lets you add a line between each item in the chart.
This could sometimes help make the histogram easier to read and understand.
This category lets you provide the text and formatting for the chart title and subtitle and the titles for both the x and y axes.
For example, you can use it to give a title for the vertical axis by selecting the “Vertical axis title” option from the dropdown menu and then setting the title as “Student Count”.
Your histogram would then look like this:
This category lets you choose the colors for the bars (or bins) of your histogram. For example, you can use it to give your bins a “light red berry” color.
Your histogram would then look like this:
This becomes even more helpful when you want to compare different variable distributions in one histogram. Then you could have different colors for different series.
For example, if you had to compare the distribution of marks for two different classes, you could use one color for grade 6 and another for grade 7.
The “Legend” category, as its name suggests, lets you provide settings and formatting for the histogram legend. Using this, you can provide the following settings for the legend:
In our example, we don’t really need a legend, since there’s just one variable. So we can set the legend position to none.
You can use this category to change the range of the histogram. For example, you might want to reduce the range of values within which you want the bins to be distributed.
In our example, it would make sense to distribute the scores between 0 and 100.
For this, you will need to change the min and max values for the Horizontal axis category to 0 and 100, respectively.
Adjusting the min and max inputs really helps you provide context to your histogram.
Some other settings available under these categories include:
Your histogram would then look like this:
Finally, you can format the histogram to contain major and/or minor gridlines. You can also set what colors you want the gridlines to be, or choose to not have them at all.
This category also lets you set and format major and/or minor ticks on your histogram’s vertical and horizontal axes. As before, you can choose not to have any ticks at all.
If it isn’t automatically added, there should be a space at the top of your chart you can click and add text. Or, you can use the Chart menu under Title.
Relative frequency histograms charts use percentages in the y-axis to represent the frequency while the horizontal axis shows the class. It is an easy way to analyze data.
The first step will be to make a regular data histogram by highlighting your data points and then go to Insert > Chart and in Chart type, choose a Histogram chart in the options. You can customize your histogram however you wish in the chart format window and a chart title if you wish.
From there, we can create a relative frequency table first. To get the frequency of the data,
We use the formula
=FREQUENCY(data,class)
Here is an example:
=FREQUENCY(B2:B20,F2:F5)
To get the relative frequency, we convert the frequencies into percentages by dividing them by the sum of the values. To do this in our example, we input the formula:
=G2/SUM($G$2:$G$5)
Now we can convert the relative frequencies into percentages. Select the column, and on the toolbar, click Format as percentages.
Now we have a relative frequency table that we can use to create our relative frequency histogram. Select the class column then hold the ctrl button and select the Relative frequency table.
Go to Insert > Chart > Column chart
Now you have your relative frequency histogram in Google Sheets. It has the bar chart format, but the information is represented and can be used as a histogram. You can use the normal histogram to compare with the relative frequency histogram to ensure the data is accurate. You can do a lot with this histogram, like adding error bars.
If you’re having some trouble with this, you can:
Although you can’ tmake a combo chart with a histogram you can make a double histogram. A double histogram contains two data distributions that are compared together. For example, if you have data on the sports that male students and female students participate in and you would like to compare the two, you can use a double histogram. Creating a double histogram in Google Sheets is a very simple process.
To make a double histogram on Google Sheets, simply select your data set and go to Insert > Chart. In the charts tab choose the column chart and customize it as you see fit. With that you have your double histogram. That’s all on how to make histogram on Google Sheets with two data sets.
If you would like to use our example table, you can access it here.
With that, we end this tutorial. We showed you why and how you could use a histogram.
We also showed you how to make a histogram in Google Sheets and customize its various components to gain full control over its format and settings.
We hope this tutorial has been helpful to you. You can also check out how to make a bell curve in Google Sheets.
Other Google Sheets tutorials you may like: