Quantitative Methods for LEL

Welcome to the main site of the course Quantitative Methods for Linguistics and English Language (Semester 1).

This website is your go-to place throughout the semester for any info related to the course.

You can find the data for the course in the QML Data website.

1 Course description

This course is an introduction to quantitative data analysis (including data wrangling, visualisation and modelling) as commonly employed in linguistics, using the R software.

This course has a “choose your own adventure” format. Learn more in Schedule.

Among the topics that you can learn there are the following:

  • The basics of quantitative data analysis.
  • Data preparation.
  • Data summaries.
  • Principles of data visualisation.
  • Statistical modelling with linear models.
  • Statistical inference using Bayesian inference (posterior probability distributions and credible intervals) and frequentist inference (p-values and confidence intervals).

At completion of the course you will have gained the following skills:

  • Import common data formats, tidy and transform data.
  • Choosing and reporting appropriate summary measures.
  • Using compelling visualisations to communicate a specific message about patterns in the data.
  • Master linear models for different types of data (continuous measures and binary outcomes).
  • Using Bayesian inference to answer research questions and avoid common interpretation pitfalls of frequentist techniques.

Examples from different branches of linguistics will be used to provide you with hands-on experience in quantitative data analysis and Open Research practices.

2 Course rationale

This course is designed to help you develop the necessary skills for conducting and interpreting analyses of data as commonly employed in linguistics.

The content and objectives of the course are in response to recent advances in our understanding of the theory behind research methods.

Recent meta-scientific research has identified three important aspects of research: the reproducibility, replicability and generalisability.

  • A result is reproducible when the same analysis steps performed on the same dataset consistently produces the same answer.
  • A result is replicable when the same analysis performed on different datasets produces qualitatively similar answers.
  • A result is generalisable when a different analysis workflow performed on different data sets produces qualitatively similar answers.

See Definitions for a more detailed explanation.

However, based on surveys from different disciplines, we are currently facing the three research crises (reproducibility, replicability and generalisability crises) by which most results are neither reproducible, nor replicable, nor generalisable (Munafò et al. 2017, Simmons et al. 2011, Ioannidis 2005, Yarkoni 2022).

The Open Research movement (also known as Open Science or Open Scholarship, Crüwell et al. 2019) was developed with the aim of improving our understanding of these crises and with the objective of providing researchers with guidelines and tools to produce reproducible, replicable and generalisable research.

The statistical philosophy adopted in the course is that of the New Statistics (Cumming 2014, Kruschke and Liddell 2018. The main goal of the New Statistics is to shift the attention from statistical significance to estimation with quantification of uncertainty.

3 Support

Ask for help

If at any point during the course you don’t feel comfortable with any aspect of the course, you are unsure about anything that has been covered in class or in your own time, you are struggling to keep up with the course workload, you are experiencing mental of physical distress due to a pre-existing or new illness, medical condition or disability, or you find yourself unable to access basic needs like food or housing, please do get in touch with me and/or the PPLS support (go to the PPLS UG or MSc Hub on SharePoint > Support for students > Health & Wellbeing).

We are humans first and the rapidly-changing new world we are living in now can put us under pressure. What is most important to me is that you are first and foremost healthy and able to participate to the course, and that you succeed and get the most out of the course.

It is OK not to be OK, and remember that you are not alone. Other people, teachers and students, might be struggling right now or have struggled before and might have gone through what you are going through now. Remember that support exists for you, so please do reach out. If you see somebody close to you struggling, please let them know about the available support network and encourage them to reach out.

4 Contacts

You can reach me (Stefano) at or on Teams.

If you want to book office hours with me, you can do so here: https://bit.ly/33BH84L.

Here’s the list of course tutors:

  • Alice Marikan

  • Anna Laoide-Kemp

  • Carine Abraham

  • Georges Sakr

  • Odysseas Myresiotis Alivertis

  • Ponrawee Prasertsom