Statistical variables

Learn about the type of statistical variables
Author

Stefano Coretta

Published

September 12, 2023

1 Statistical variables

A (statistical) variable is any characteristics, number, or quantity that can be measured or counted.

  • When you observe or measure something, you are taking note of the values generated by the variable.
  • It’s called variable because it varies (ha!).
  • The opposite of a variable is a constant.

Some examples:

  • Token number of telic verbs and atelic verbs in written Sanskrit.
  • Voice Onset Time of stops in Mapudungun.
  • Friendliness ratings of synthetic speech.

Try and think of more!

2 Types of variables

You will find that some statistics textbooks overcomplicate things when it come to types of variables. From an applied statistics perspective you only need to be able to identify numeric vs categorical variables and continuous vs discrete variables.

2.1 Numeric vs categorical variables

This distinction is quite self-explanatory:

  • Numeric variables are variables that are numbers.

  • Categorical variables are variables that correspond to categories, groups or levels on a scale.

Examples

Numeric variables

  • Number of multi-verb predicates in a book.
  • Duration of stressed vowels.
  • Rating score between 0-100.

Categorical variables

  • Gender (non-binary, female, male, …).
  • First vs second language users.
  • Ejective vs non-ejective consonant.

Learning how to recognise variables is a fundamental skill in quantitative data analysis, since the type of variables determines the type of analyses you can carry out.

2.2 Continuous vs discrete variables

Orthogonal to the numeric/categorical distinction, there is the continuous vs discrete distinction.

This one can be at times less straightforward.

  • A continuous variable is a variable that can take on any value between any two numbers.

  • A discrete variable is a variable that can only take on a set of values, and no value in between.

Numeric variables can be either be continuous or discrete, while categorical variables can only be discrete.

Note

Numeric continuous variable: between any two values there is an infinite number of values.

  • The variable can take on any positive and negative number, including 0.

  • The variable can take on any positive number only.

  • Proportions and percentages: The variable can take on any number between 0 and 1.

Numeric discrete variable: between any two consecutive values there are no other values.

  • Counts: The variable can take only on any positive integer number.

Categorical (discrete) variable. There are three main subtypes.

  • Binary or dichotomous: The variable can take only one of two values.

  • The variable can take any of three of more values (sometimes called a multinomial variable).

  • Ordinal: The variable can take any of three of more values and the values have a natural order.

3 Operationalisation

The action of operationalisation consists in choosing how to measure something: as a numeric or as a categorical variable.

In some cases, the choice is obvious, but in most cases something could be operationalised either way and it will be a matter of study design that helps you choose.

For example, age can be operationalised as years (numeric) or as age groups, like young vs old (categorical).

Think of ways to operationalise the following variables:

  • Voice Onset Time.
  • Friendliness of speech.
  • Lexical frequency.