QML - Week 8

Bernoulli regression models

Stefano Coretta

Bouba and kiki

Holland and Wertheimer (1964), Chen et al. (2016)

Cross-modality

There are links between the auditory and visual modalities.
A lot of research this concept, in fact originated with Köhler (1929) who used “takete” and “maluma” (not the singer).
Koppensteiner, Stephan, and Jäschke (2016) ask if not only shape, but also motion patterns enter in the cross-modality link.

Motion patterns

Koppensteiner, Stephan, and Jäschke (2016)

Motion patterns

Koppensteiner, Stephan, and Jäschke (2016)

46 students (24 females and 22 males; age M = 25.1 years, SD = 3.6) of the University of Vienna.
They saw a word (takete or maluma) and two stick-figures moving.
They had to pick the stick figure they thought was described by the word.

The data

kopper <- read_delim("data/koppensteiner2016/takete_maluma.txt")
kopper

# A tibble: 460 × 5
   Tak_Mal_Stim Answer  Corr_1_Wrong_0 Rater                   Female_0
   <chr>        <chr>            <dbl> <chr>                      <dbl>
 1 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
 2 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
 3 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
 4 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
 5 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
 6 Maluma       CORRECT              1 10MacI_12_10_11_59_.txt        0
 7 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
 8 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
 9 Maluma       CORRECT              1 10MacI_12_10_11_59_.txt        0
10 Takete       CORRECT              1 10MacI_12_10_11_59_.txt        0
# ℹ 450 more rows

Accuracy

Figure 1: Accuracy by stimulus type.

Modelling accuracy

We want to model the proportion of correct responses by stimulus type: both types should elicit the same level of accuracy.
We can use a Bernoulli regression model.

kopper <- kopper |> 
  mutate(
    Answer_f = factor(Answer, levels = c("WRONG", "CORRECT"))
  )
levels(kopper$Answer_f)

[1] "WRONG"   "CORRECT"

Accuracy: Bernoulli regression

kop_bm <- brm(
  Answer_f ~ Tak_Mal_Stim,
  data = kopper,
  family = bernoulli,
  cores = 4,
  seed = 9128,
  file = "cache/kop_bm"
)

Accuracy: model summary

summary(kop_bm)

 Family: bernoulli 
  Links: mu = logit 
Formula: Answer_f ~ Tak_Mal_Stim 
   Data: kopper (Number of observations: 460) 
  Draws: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
         total post-warmup draws = 4000

Regression Coefficients:
                   Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
Intercept              0.81      0.15     0.52     1.11 1.00     3978     2479
Tak_Mal_StimTakete     0.15      0.21    -0.25     0.56 1.00     4026     2442

Draws were sampled using sampling(NUTS). For each parameter, Bulk_ESS
and Tail_ESS are effective sample size measures, and Rhat is the potential
scale reduction factor on split chains (at convergence, Rhat = 1).

Expected predictions of accuracy

Credible intervals of the difference

kop_bm_draws <- as_draws_df(kop_bm)

kop_bm_draws |> 
  summarise(
    `90%` = paste0("[", paste(quantile2(b_Tak_Mal_StimTakete, c(0.05, 0.95)) |> round(2), collapse = ", "), "]"),
    `80%` = paste0("[", paste(quantile2(b_Tak_Mal_StimTakete, c(0.1, 0.9)) |> round(2), collapse = ", "), "]"),
    `70%` = paste0("[", paste(quantile2(b_Tak_Mal_StimTakete, c(0.15, 0.85)) |> round(2), collapse = ", "), "]"),
    `60%` = paste0("[", paste(quantile2(b_Tak_Mal_StimTakete, c(0.2, 0.8)) |> round(2), collapse = ", "), "]"),
    `40%` = paste0("[", paste(quantile2(b_Tak_Mal_StimTakete, c(0.3, 0.7)) |> round(2), collapse = ", "), "]")
  ) |> 
  knitr::kable(align = c("ccccc"))

90%	80%	70%	60%	40%
[-0.18, 0.49]	[-0.11, 0.42]	[-0.06, 0.37]	[-0.02, 0.33]	[0.04, 0.27]

Is there a difference in accuracy by stimulus type?

According to the model, the difference in log-odds is [-0.25, 0.56] at 95% probability.
Only at 40% probability we can argue for an increase in log-odds for takete stimuli between 0.04 and 0.27.
We can’t argue for or against a difference. We just do not know.

References

Chen, Yi-Chuan, Pi-Chun Huang, Andy Woods, and Charles Spence. 2016. “When “Bouba” Equals “Kiki”: Cultural Commonalities and Cultural Differences in Sound-Shape Correspondences.” Scientific Reports 6 (1): 26681. https://doi.org/10.1038/srep26681.

Holland, Morris K., and Michael Wertheimer. 1964. “Some Physiognomic Aspects of Naming, or, Maluma and Takete Revisited.” Perceptual and Motor Skills 19 (1): 111–17. https://doi.org/10.2466/pms.1964.19.1.111.

Köhler, Wolfgang. 1929. Gestalt Psychology. New York: Liveright.

Koppensteiner, Markus, Pia Stephan, and Johannes Paul Michael Jäschke. 2016. “Shaking Takete and Flowing Maluma. Non-Sense Words Are Associated with Motion Patterns.” PLOS ONE 11 (3). https://doi.org/10.1371/journal.pone.0150610.