Statistical variables
University of Edinburgh
When we ask a research question, we collect a sample \(y\) from a population.
\(y\) is a sample of values (\(y_1, y_2, y_3, ..., y_n\)
).
Sample of values can be e.g.:
Number of telic and atelic verbs in a historical corpus of Sanskrit.
Voice Onset Time of stops from 50 speakers Mapudungun.
Friendliness ratings of synthetic speech as indicated by 300 participants.
…
\(y\) is a sample of values (\(y_1, y_2, y_3, ..., y_n\)
).
We say that the values in the sample \(y\) were generated by a (random) variable \(Y\).
\(Y\) is a (random) variable that generates the values in the sample \(y\).
A (statistical) variable is any characteristics, number, or quantity that can be measured or counted
When you observe or measure something, you are taking note of the values generated by the variable.
It’s called variable because it varies (ha!).
The opposite of a variable is a constant.
\(Y\) is a (random) variable that generates the values in the sample \(y\).
Variables can be e.g.:
Token number of telic verbs and atelic verbs in written Sanskrit.
Voice Onset Time of stops in Mapudungun.
Friendliness ratings of synthetic speech.
…
Numeric continuous variable: between any two values there is an infinite number of values.
The variable can take on any positive and negative number, including 0.
The variable can take on any positive number only.
Proportions and percentages: The variable can take on any number between 0 and 1.
Numeric discrete variable: between any two consecutive values there are no other values.
Categorical (discrete) variable.
Binary or dichotomous: The variable can take only one of two values.
The variable can take any of three of more values.
Ordinal: The variable can take any of three of more values and the values have a natural order.
We can operationalise something as a numeric or a categorical variable.
Think of ways to operationalise the following:
Voice Onset Time.
Friendliness of speech.
Lexical frequency.
…
The sample \(y\) is generated by a (random) variable \(Y\).
A (statistical) variable is any characteristics, number, or quantity that can be measured or counted.
Variables can be numeric or categorical.
We operationalise a measure/observation as a numeric or a categorical variable.