Statistical modelling with regression
1 Instructions
For this challenge you have to practice statistical modelling using regression models with the brms package.
The following sections give some more specific instructions for three data frames. To learn more about the data frames, check the “Description” links.
2 Albanian VOT
DescriptionThe file coretta2021/alb-vot.csv
has acoustic measurements related to Voice Onset Time (VOT) in Albanian stops.
Use regression to answer the following question: What is the average VOT of voiceless stops in Albanian?
3 Massive Auditory Lexical Decision (MALD)
DescriptionIn tucker2019/mald_1_1.rds
there is data from a lexical decision task: participants listened to a word and had to say if the word was a real word or not.
Answer the following question: What is the effect of phonetic Levinstein distance PhonLev
on logged reaction times RT_log
?
4 Scalar inferences and scalar diversity
DescriptionThe data in pankratz2021/si.csv
is from a study that looked at scalar inferences. A scalar inference happens when you encounter a sentence like in (1) and you infer that Fatima didn’t eat all of the cookies.
- Fatima ate some of the cookies.
In particular, the study looks at the phenomenon of scalar diversity: the observation that scalar inferences are made at different rates for different words. For example, for “The food was cheap” (where “cheap” is a weaker scalar word), people do often infer that the food wasn’t free (a stronger word on the same scale of price). But, for “The village was pretty”, people don’t often infer that the village wasn’t beautiful.
Use a regression model to answer the following question: What is the relationship between the logged co-occurrence frequency of weak/strong adjective and their semantic distance?