Univariate, Bivariate & Multivariate Analysis in Data Visualisation

Chanchala Gorale
3 min readJun 24, 2024

--

In data visualization, different types of analysis are used to explore and present data, each serving specific purposes based on the complexity and number of variables involved. Here, we will delve into univariate, bivariate, and multivariate analysis.

Univariate Analysis

Definition: Univariate analysis involves the examination of a single variable. The purpose is to understand the distribution, central tendency, and dispersion of the variable.

Techniques:

Descriptive Statistics:

  • Measures of Central Tendency: Mean, median, mode.
  • Measures of Dispersion: Range, variance, standard deviation, interquartile range (IQR).

Visualization Methods:

  • Histograms: Show the frequency distribution of a variable.
  • Box Plots: Display the distribution of data based on a five-number summary (minimum, first quartile, median, third quartile, and maximum).
  • Bar Charts: Useful for categorical data to show the frequency of categories.
  • Pie Charts: Represent the proportion of categories in a whole.

Examples:

  • Examining the age distribution of a population.
  • Analyzing the sales figures for a single product over time.

Bivariate Analysis

Definition: Bivariate analysis involves the examination of two variables simultaneously to understand the relationship between them.

Techniques:

Descriptive Statistics:

  • Correlation Coefficient: Measures the strength and direction of the linear relationship between two variables (e.g., Pearson’s correlation).
  • Covariance: Indicates the direction of the linear relationship between variables but not the strength.

Visualization Methods:

  • Scatter Plots: Display the relationship between two continuous variables. Points represent pairs of values.
  • Line Graphs: Show trends over time for two variables.
  • Box Plots: Can compare the distributions of a continuous variable across different categories of another variable.
  • Bar Charts: Compare two categorical variables.
  • Heatmaps: Represent data in matrix form where colors represent values, often used to show correlation matrices.

Examples:

  • Analyzing the relationship between hours studied and exam scores.
  • Examining the correlation between height and weight.

Multivariate Analysis

Definition: Multivariate analysis involves the examination of more than two variables simultaneously. It is used to understand complex relationships and patterns within the data.

Techniques:

Descriptive Statistics:

  • Multivariate Regression: Models the relationship between a dependent variable and multiple independent variables.
  • Principal Component Analysis (PCA): Reduces the dimensionality of the data while retaining most of the variation.
  • Factor Analysis: Identifies underlying relationships between variables.

Visualization Methods:

  • Scatter Plot Matrix: Displays pairwise scatter plots for multiple variables.
  • Parallel Coordinates Plot: Represents each variable as a parallel axis, and each data point as a line across these axes.
  • Heatmaps: Show correlations or interactions among multiple variables.
  • 3D Scatter Plots: Visualize relationships among three variables.
  • Bubble Plots: Similar to scatter plots but add a third variable represented by the size of the points (bubbles).

Examples:

  • Analyzing the impact of multiple factors (age, income, education level) on spending behavior.
  • Studying the relationship between multiple health indicators (blood pressure, cholesterol, BMI) and the risk of heart disease.

Summary

  • Univariate Analysis: Focuses on understanding the characteristics of a single variable using histograms, box plots, bar charts, etc.
  • Bivariate Analysis: Examines the relationship between two variables using scatter plots, line graphs, correlation coefficients, etc.
  • Multivariate Analysis: Explores complex relationships among more than two variables using scatter plot matrices, PCA, regression models, etc.

Each type of analysis serves a unique purpose and is chosen based on the complexity of the data and the insights required. Visualization techniques are crucial in each case to effectively communicate findings and patterns.

--

--

Chanchala Gorale
Chanchala Gorale

Written by Chanchala Gorale

Founder | Product Manager | Software Developer

No responses yet