Open Research
Case studies:
Amoruso et al. (2025) published in Nature in November: “Multilingualism protects against accelerated ageing”
They argue that being multilingual reduces the effects of ageing.
Specifically, speaking one foreign language were 1.3 times less likely (OR = 0.77, 95% CI: 0.71–0.83), two foreign languages 1.96 times less likely (OR = 0.51, 95% CI: 0.47–0.55), and three or more foreign languages 1.56 times less likely (OR = 0.64, 95% CI: 0.59–0.69). These findings underscore the protective effect of multilingualism on aging trajectories.
While this approach provides evidence of temporal precedence, it does not establish causality. Proper causal inference would require experimental, quasi-experimental or intervention-based designs.
Problematic for two reasons:
“protects” is causal.
Causal Inference can be achieved with observational studies when using causal inference approaches, like Directed Acyclic Graphs.
Doing non-causal inference (and being explicit about it), yet using a causal word as second word in the title. If you pay Nature € 10.690, they will publish this in Nature Ageing. I can tell you what I think of that for free. www.nature.com/articles/s43...
— Casper Albers 🟥 (@casperalbers.nl) 11 November 2025 at 07:58
[image or embed]
This slide unfortunately generalizes well 🥲
— Julia M. Rohrer (@dingdingpeng.the100.ci) 11 November 2025 at 09:25
[image or embed]
I personally think this is a "harder problem" than we care to admit. So, are you a (social) scientist struggling with this situation? A break between what you _want_ to study (a causal process) and what you feel you _can_ credibly study (a correlation)? Here are some readings that might help. 👇
— Dan de Kadt (@dandekadt.bsky.social) 11 November 2025 at 11:24
[image or embed]
Intrinsic vowel duration: high vowels are shorter and low vowels are longer (Lieberman and Kubaska 1979).
Three explanatory models:
Duration as articulatory distance (Turk et al. 1994).
Duration as a vowel-specific target (Toivonen et al. 2015; Bermúdez-Otero 2010).
Both articulatory distance and vowel-specific target (Coretta 2025).
Figure 1: DAGs representing three explanatory models of intrinsic vowel duration.
F1 is only a proxy for articulation: it conflates articulatory distance and peak velocity.
There might be other explanations than “vowel-specific category”. For example, optimisation process in the XT/3C model (Turk and Shattuck-Hufnagel 2020a, 2020b; Elie, Lee, and Turk 2023; Elie, Šimko, and Turk 2024; Elie, Simko, and Turk 2024).
Direct articulatory data.
Revised Directed Acyclic Graphs.
Framing in light of alternative explanations.
Duration as articulatory distance.
Duration as other than articulatory distance (different possible processes: categorical specification or optimisation process).
Both articulatory distance and other than articulatory distance (different possible processes: categorical specification or optimisation process).
DAGitty demo.
To estimate the direct causal effect of vowel and speech rate on distance:
To estimate the direct causal effect of distance on peak velocity:
To estimate the direct causal effect of distance on duration:
\[ \begin{aligned} V_p & = k \cdot D^\alpha\\ ln(V_p) & = ln(k) + \alpha \cdot ln(D) \end{aligned} \]
Peak velocity \(V_p\) is equal to the product of a constant \(k\) and distance \(D\) raised to the power of \(\alpha\).
Equivalently, logged peak velocity is equal to logged \(k\) plus the product of \(\alpha\) and logged distance.
\[ y = \beta_0 + \beta_1 \cdot x \]
In R: log(dur) ~ 1 + log(dist)
You learned a lot, but there is a lot more to learn!