Domestic and International Common Language (DICL) Database
The database contains 11 index measures of linguistic similarity between 242 countries, both domestically and internationally. The domestic measures capture linguistic similarities present among populations within a single country while the international indexes capture language similarities between two different countries. The indexes, which are based on 6,674 languages, reflect three different dimensions of language: common official languages, common native and acquired spoken languages, and linguistic proximity across different languages. This database has many uses, such as in the study of bilateral flows—including FDI, migration, and international trade—as well as in regional or country level analyses.
1 Variables
iso3_i
-
ISO 3-letter code for country i.
country_i
-
Full name of country i.
iso3_j
-
ISO 3-letter code for country j.
country_j
-
Full name of country j.
col
-
Common official language indicator [
1
if countries i and j share at least one official language (national or provincial, statutory or de facto),0
otherwise]. cor
-
Restricted official language indicator [
1
if countries share at least one national statutory or de facto official language,0
otherwise]. cnl
-
Common native language index: it measures overlap in native language speakers.
cal
-
Common acquired language index: it measures overlap in non-native or learned languages.
csl
-
Common spoken language index: it measures overlap in all spoken languages (native + acquired).
lpn
-
Linguistic proximity for different native languages: it measures how closely related the native languages of the two countries are.
lpa
-
Linguistic proximity for different acquired languages: same as
lpn
but for acquired languages. lps
-
Linguistic proximity for all spoken languages.
bpn
-
Branch proximity for native languages.
bpa
-
Branch proximity for acquired languages.
bps
-
Branch proximity for all spoken languages.