Ordinal regression: modelling rating/Likert scales

Learn how to fit ordinal regressions for rating data

Author

Stefano Coretta

Published

September 29, 2024

Prerequisites

Regression: binary outcome variables.

1 Rating data and ordinal regression

Rating scale data, including Likert scales, is a common type of variable in linguistics. Any study using a questionnaire or linguistic judgements will very likely have at least some rating data.

Likert scales are usually structured so that the two extremes are opposite values and then you move in steps from one extreme to the other through a neutral option (these are called bipolar scales): for example, strongly disagree, disagree, somewhat disagree, neither agree nor disagree, somewhat agree, agree and strongly agree. This is a 7-point Likert scale, although 5-point scales are also common.

Rating scales can also just be monopolar scales: for example, from “none” to “a lot” on the scale of “excitement”.

Rating data is categorical and discrete! Alas, a lot of researcher have been taught to treat these as numeric (for example transforming the scales to numbers from 1 to 7 for a 7-point Likert scale) and apply numeric methods for analysis. This is conceptually mistaken, since these scales are not numeric and don’t have the same properties as numeric variables have.

2 Plotting ordinal data

Given the common, but mistaken practice, to treat rating data as numeric, studies tend visualise the data using plot types meant for numeric data, like box plots (on why you shouldn’t use box plots ever, see this entry)

A very useful type of plot for ordinal data are the so-called divergent stacked bar chart. You can learn how to create these in R here.

3 Ordinal regression models

To learn how to fit ordinal regression models in R with brms, I refer you to this great (provocatively named) tutorial by João Verissimo: Analysis of rating scales: A pervasive problem in bilingualism research and a solution with Bayesian ordinal models.