QML Data
  • Home

Data for Quantitative Methods in Linguistics

This website contains a curated list of data tables (aka tabular, rectangular, or spreadsheet data) from linguistic studies, which can be used in courses of statistics, quantitative methods and data analysis.

Help us!

In an effort to improve the Equality, Diversity and Inclusiveness (EDI) of this collection, I am looking for data tables on minoritised and Global South languages and from research by minoritised researchers.

If you have or know of data that qualifies, feel free to get in touch with me (Stefano) at s.coretta@ed.ac.uk.

Download

year title author languages population
2009 The KEY to the ROCK: Near-homophony in nonnative visual word recognition Mitsuhiko Ota, Robert J. Hartsuiker, Sara L. Haywood English Japanese
2012 The phonetic profile of Korean formal and informal speech registers Bodo Winter, Sven Grawunder Korean Korean
2016 Shaking Takete and Flowing Maluma. Non-Sense Words Are Associated with Motion Patterns Markus Koppensteiner, Pia Stephan, Johannes Paul, Michael Jäschke   English
2016 Taste and smell words form an affectively loaded and emotionally flexible part of the English lexicon Bodo Winter English English
2018 Acoustics and articulatory durational measures of Italian and Polish Stefano Coretta Italian, Polish  
2018 Data on vowel formants and duration in Italian Stefano Coretta Italian Verbano-Cusio-Ossola
2018 Electroglottographic data on Italian Stefano Coretta Italian Verbano-Cusio-Ossola
2018 Formant trajectories in Italian and Polish Stefano Coretta Italian, Polish  
2018 Ultrasound tongue imaging data of Italian and Polish speakers Stefano Coretta Italian, Polish  
2020 A Cross-Cultural Analysis of Early Prelinguistic Gesture Development and Its Relationship to Language Development Thea Cameron-Faulkner, Nivedita Malik, Circle Steele, Stefano Coretta, Ludovica Serratrice, Elena Lieven   Cantonese, Bangladeshi, British
2020 Language practices of Emilian and Esperanto communities: spaces of use, explicit language attitudes and self-reported competence Jessica Hampton, Stefano Coretta Emilian, Esperanto  
2020 Mixean Basque Voice Onset Time Ander Egurtzegi, Chris Carignan Mixean Basque  
2020 Morphology and dialectology in the Linguistic Survey of Scotland Pavel Iosad, Will Lamb Gaelic Scottish
2020 Revisiting the Suffixing Preference: Native-Language Affixation Patterns Influence Perception of Sequences Martin, Alexander, Culbertson, Jennifer English, Kîîtharaka English, Kîîtharaka
2020 Second language users exhibit shallow morphological processing Song, Y., Do, Y., Thompson, A., Waegemaekers, E., Lee, J. English English, Chinese
2021 Albanian Voice Onset Time Stefano Coretta, Josiane Riverin-Coutlée, Enkeleida Kapia, Stephen Nichols Northern Tosk Albanian Albanian
2021 Albanian formant data Stefano Coretta, Josiane Riverin-Coutlée, Enkeleida Kapia, Stephen Nichols Northern Tosk Albanian Albanian
2021 Massive Auditory Lexical Decision 1.1 Tucker, Benjamin V. English English
2021 The role of relevance for scalar diversity Elizabeth Pankratz, Bob van Tiel English English
2022 ‘Everywhere here can say this’: The English locative impersonal Sluckin, Benjamin L., Itamar Kastner English British
2022 Glottolog 4.6 data: Agglomerated Endangerment Status Stefano Coretta, Harald Hammarström, Robert Forkel, Martin Haspelmath, Sebastian Bank    
2023 V2-Relatives in Old English Bettelou Los, Stefano Coretta Old English  
2024 Sibilant mergers in 18th-century Basque: A quantitative study Dorota Krajewska, Eneko Zuloaga, Ander Egurtzegi Basque (18th-century)  
2024 The organization of verb meaning in Lengua de Señas Nicaragüense (LSN): Sequential or simultaneous structures? Diane Brentari, Susan Goldin-Meadow, Laura Horton, Ann Senghas, Marie Coppola Lengua de Señas Nicaragüense (LSN) Nicaraguan
No matching items