Box Plot Explained: How to Read and Use One for Thesis Group Comparisons
6 min read
A box plot packs five key statistics - minimum, Q1, median, Q3, and maximum - into one diagram. Once you know how to read the IQR, whiskers, and outlier dots, you can instantly assess skewness, spread, and extreme values. This is exactly why supervisors expect box plots alongside t-tests and ANOVA: they show the distributions being compared and make your thesis data analysis more transparent.
What a Box Plot Shows: The Five-Number Summary
A box and whisker plot visualises five statistics simultaneously: the minimum, the first quartile (Q1), the median, the third quartile (Q3), and the maximum.
The box covers the middle 50% of your data - from Q1 to Q3. This range is called the interquartile range (IQR). The line inside the box is the median (the value that splits your data in half). The whiskers extend from the box to the smallest and largest values within 1.5 × IQR of the box edges. Points beyond the whiskers are plotted individually as outliers.
How to Read the Box: Median, IQR, and Skewness
The median line divides the box into two sections. If the median line sits in the centre of the box, the middle 50% of values are roughly symmetrically distributed. If the median is closer to Q1 (the bottom of the box), more values are packed into the lower half - the distribution is right-skewed. If closer to Q3, it is left-skewed.
A tall box means high variability in the middle 50% of the data. A short box means values are clustered tightly around the median.
How to Read the Whiskers and Identify Outliers in Your Thesis Data
Each whisker extends to the most extreme data point that is still within 1.5 × IQR from the box. Any point beyond that distance is considered a mild outlier and plotted as an individual dot.
Points beyond 3 × IQR are extreme outliers (sometimes shown as a different symbol).
Outliers in a box plot are not necessarily errors - they may be real extreme observations. Investigate each one: is it a data entry error, a legitimate extreme case, or an influential observation that warrants separate analysis?
Using Box Plots to Compare Groups Side by Side in Your Thesis
Box plots are most powerful when placed side by side to compare groups. Aligning boxes for two or more groups on the same y-axis makes differences in median, spread, and outlier patterns immediately visible.
For example: comparing exam scores across three study groups shows not just which group scored highest on average, but whether one group had far more variability, or whether outliers in one group are pulling the mean away from the median.
This is why box plots are often required alongside t-tests and ANOVA: they visualise the distributions the test is comparing, making your analysis more transparent.
When to Use a Box Plot vs. a Histogram in Thesis Data Analysis
Use a box plot when comparing multiple groups side by side - it is compact and easy to read across groups.
Use a histogram when you want to show the detailed shape of a single distribution (e.g., to check normality visually before running a parametric test).
For thesis results sections, a combination of both is common: a histogram or Q-Q plot in the assumptions section, and box plots in the main results for group comparisons.
Box Plot Anatomy: Quick Reference Guide
- Box (rectangle): the interquartile range (Q1 to Q3), middle 50% of data
- Median line: the value splitting the dataset in half
- Whiskers: extend to the most extreme values within 1.5 × IQR of the box
- Outlier dots: individual values beyond the whisker endpoints
- IQR (interquartile range): Q3 − Q1, measures spread of the middle 50%
- Skewness indicator: median closer to Q1 → right skew; closer to Q3 → left skew
Frequently asked questions
What does IQR mean and how is it calculated?
▾
How many outliers in a box plot is too many?
▾
Can I use a box plot for Likert scale data?
▾
Do I need to include box plots in APA format?
▾
How do I create a box plot in SPSS?
▾
Should I report the median or the mean when I show a box plot in my thesis?
▾
Further reading
Histogram Explained: How to Create, Read, and Use One for Thesis Normality Checks
· Data visualisationWhich Statistical Test to Use for Your Thesis: A Complete Decision Guide
· Test selectionThesis Data Analysis: The 5 Critical Steps Students Skip (With Checklist)
· Data analysisOne-Way ANOVA for Your Thesis: Complete Guide with Post-Hoc Tests, Effect Size, and APA Templates
· Statistical tests
Free tool
Not sure which statistical test to use?
Answer 5 quick questions about your research design and get the right test - with an explanation of why - in under two minutes.
Statoria Team
Statistics educators & software developers
We build Statoria to help bachelor and master students get through their thesis data analysis without stress. Our guides are written by researchers with experience in social science statistics and student supervision.
Related guides
Histogram Explained: How to Create, Read, and Use One for Thesis Normality Checks
Apr 2026 · 3 min read
Which Statistical Test to Use for Your Thesis: A Complete Decision Guide
Feb 2026 · 3 min read

