The Intraclass Correlation Coefficient in Mixed Models

The Analysis Factor

Published May 28, 2025

The Intraclass Correlation Coefficient, or ICC, can be very useful in many statistical situations, but especially so in Linear Mixed Models.

Linear Mixed Models are used when there is some sort of clustering in the data.

Two common examples of clustered data include:

Individuals sampled within sites (hospitals, companies, community centers, schools, etc.). The site is the cluster.
Repeated measures or longitudinal data, where you collect multiple observations from the same individual. The individual is the cluster in which multiple observations are grouped.

Observations from the same cluster are usually more similar to each other than observations from different clusters. If they are, you can’t analyze these data with statistical methods that assume independence. If you do, estimates of variance, and therefore p-values, will be incorrect.

Mixed models not only account for the correlations among observations in the same cluster, they also give you an estimate of that correlation.

What is the Intraclass Correlation Coefficient?

Consider a very simple linear mixed model, also known as a random intercept model: the response Y is modeled from a single fixed independent variable, X, and a single random effect, u. For simplicity, I’m going to assume that I’ve already centered X on its mean.
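Written out, a model of this kind would look like the following. Only Y, X, u, β0 and the variance of u are named in the text; the slope β1, the residual ε, its variance σ², and the normality of both random terms are standard assumptions added here for illustration.

```latex
Y_{ij} = \beta_0 + \beta_1 X_{ij} + u_i + \varepsilon_{ij},
\qquad u_i \sim N(0, \sigma_0^2),
\qquad \varepsilon_{ij} \sim N(0, \sigma^2)
```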

The subscripts i and j on the Y indicate that each observation j is nested within cluster i.

The u represents the random intercept for each cluster. It works the same way as a residual term, and measures the distance from each cluster’s intercept to the overall intercept β₀. Rather than calculate an estimate for every one of those distances, the model estimates a single variance, σ₀².

That variance parameter estimate is the between-cluster variance. The variance of the residuals is the within-cluster variance. Their sum is the total variance in Y that is not explained by X.
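A small simulation can make this decomposition concrete. The sketch below is an illustration only: it leaves X out entirely (so all of the variance in Y is unexplained), and the variance values, intercept, and sample sizes are made-up assumptions rather than anything from a real data set.

```python
import random
import statistics

random.seed(1)

# Hypothetical values chosen for illustration only.
BETWEEN_VAR = 4.0     # variance of the random intercepts u_i
WITHIN_VAR = 1.0      # variance of the residuals
BETA_0 = 10.0         # overall intercept
N_CLUSTERS = 2000
N_PER_CLUSTER = 5

y = []
for cluster in range(N_CLUSTERS):
    # One random intercept per cluster, shared by all its observations.
    u = random.gauss(0.0, BETWEEN_VAR ** 0.5)
    for obs in range(N_PER_CLUSTER):
        # One residual per observation.
        e = random.gauss(0.0, WITHIN_VAR ** 0.5)
        y.append(BETA_0 + u + e)

total_var = statistics.pvariance(y)
print(f"total variance of Y:       {total_var:.2f}")
print(f"between + within variance: {BETWEEN_VAR + WITHIN_VAR:.2f}")
```

With this many clusters, the two printed numbers should land close to each other, though not exactly equal, because the simulated sample is finite.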

If there is no real correlation among observations within a cluster, the cluster means won’t differ beyond what chance alone would produce.

It’s only when some clusters have generally high values and others have relatively low values that the values within a cluster are correlated.

In the graph on the right, each cluster has its own trajectory of a different color. The thick black line represents the overall trajectory, averaged across all clusters.

Some clusters, like the magenta one, have all three values above the overall (black) mean. Those values will be correlated, because they’re all relatively high. Simultaneously, those three points have a high mean.

Likewise, the turquoise cluster has all three values below the overall (black) mean. Again, those values will be correlated, because they’re all relatively low. And the turquoise mean is quite low.

And so it goes. When some clusters have generally high values and others have generally low values (in other words, when a cluster’s responses are consistent with one another), there is variation among the clusters’ means. This is the between-cluster variance.

The within-cluster variance represents how far each point is from its cluster-specific mean. In other words, what is the variation of the magenta points around the magenta trajectory?

In this graph, it’s pretty small. Because those magenta points are all pretty high, they are quite close to their trajectory, and there is not a lot of within-cluster variation.

The ratio of the between-cluster variance to the total variance is the Intraclass Correlation. It tells you the proportion of the total variance in Y that the clustering accounts for.
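As a formula, with σ₀² for the between-cluster variance and σ² as an assumed label for the within-cluster (residual) variance, the ratio can be written as below. The numbers in the second line are hypothetical, chosen only to show the arithmetic.

```latex
\mathrm{ICC} = \frac{\sigma_0^2}{\sigma_0^2 + \sigma^2}
\qquad \text{e.g.} \qquad
\frac{3}{3 + 1} = 0.75
```

In that made-up case, 75% of the variance in Y not explained by X would be attributable to differences among clusters.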

You can also interpret it as the correlation among observations within the same cluster.
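One way to check this second interpretation is by simulation. The sketch below assumes a random intercept model with no X, two observations per cluster, and made-up variance values; it compares the ordinary Pearson correlation between the first and second observation in each cluster with the variance ratio.

```python
import random

random.seed(2)

# Hypothetical variance components, for illustration only.
BETWEEN_VAR = 3.0
WITHIN_VAR = 1.0
icc = BETWEEN_VAR / (BETWEEN_VAR + WITHIN_VAR)

# Two observations per cluster that share the same random intercept u.
first, second = [], []
for cluster in range(5000):
    u = random.gauss(0.0, BETWEEN_VAR ** 0.5)
    first.append(u + random.gauss(0.0, WITHIN_VAR ** 0.5))
    second.append(u + random.gauss(0.0, WITHIN_VAR ** 0.5))


def pearson(a, b):
    """Plain Pearson correlation between two equal-length lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    ss_a = sum((x - mean_a) ** 2 for x in a)
    ss_b = sum((y - mean_b) ** 2 for y in b)
    return cov / (ss_a * ss_b) ** 0.5


r = pearson(first, second)
print(f"variance ratio (ICC):          {icc:.3f}")
print(f"correlation within clusters:   {r:.3f}")
```

The two printed values should be close: the only thing the paired observations have in common is the cluster's intercept, so their correlation is driven entirely by the between-cluster share of the variance.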

Why ICC is useful

1. It can help you determine whether or not a linear mixed model is even necessary. If you find that the correlation is essentially zero, that means the observations within clusters are no more similar than observations from different clusters. Go ahead and use a simpler analysis technique.

2. It can be theoretically meaningful to understand how much of the overall variation in the response is explained simply by clustering. For example, in a repeated measures psychological study you can tell to what extent mood is a trait (varies among people, but not within a person on different occasions) or a state (varies little on average among people, but varies a lot across occasions).

3. It can also be meaningful to see how the ICC (as well as the between- and within-cluster variances) changes as you add variables to the model.
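As a rough illustration of the first point, here is a minimal sketch of how an ICC might be estimated before deciding on a model. It does not fit a mixed model; it swaps in the simpler one-way ANOVA (method-of-moments) estimator, which assumes no predictors and equal cluster sizes. The function name `anova_icc` and the simulated data are hypothetical.

```python
import random


def anova_icc(clusters):
    """One-way ANOVA estimate of the ICC for equally sized clusters."""
    n = len(clusters)              # number of clusters
    k = len(clusters[0])           # observations per cluster
    if any(len(c) != k for c in clusters):
        raise ValueError("this sketch assumes equal cluster sizes")
    grand_mean = sum(sum(c) for c in clusters) / (n * k)
    means = [sum(c) / k for c in clusters]
    # Mean squares between and within clusters.
    ms_between = k * sum((m - grand_mean) ** 2 for m in means) / (n - 1)
    ms_within = sum(
        (v - m) ** 2 for c, m in zip(clusters, means) for v in c
    ) / (n * (k - 1))
    # Method-of-moments estimate of the between-cluster variance;
    # it can come out slightly negative by chance when the true ICC is near 0.
    between_var = (ms_between - ms_within) / k
    return between_var / (between_var + ms_within)


def simulate(between_sd, within_sd, n_clusters=1000, k=4):
    """Simulated clustered data with made-up parameters."""
    data = []
    for cluster in range(n_clusters):
        u = random.gauss(0.0, between_sd)
        data.append([u + random.gauss(0.0, within_sd) for obs in range(k)])
    return data


random.seed(3)
icc_none = anova_icc(simulate(between_sd=0.0, within_sd=1.0))
icc_half = anova_icc(simulate(between_sd=1.0, within_sd=1.0))
print(f"no clustering effect: ICC estimate = {icc_none:.3f}")
print(f"equal variances:      ICC estimate = {icc_half:.3f}")
```

In the first simulated data set the estimate should sit near zero, the situation where a simpler analysis may be adequate; in the second it should sit near 0.5. For unbalanced data or models with predictors, the variance components from a fitted mixed model would be the more appropriate source.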
