Religiousness worldwide: translation of the Duke University Religion Index into 20 languages and validation across 27 nations

Toscanelli, Cecilia; Shino, Elizabeth; Robinson, Sarah L.; Thalmayer, Amber Gayle

doi:10.1186/s42409-022-00041-2

Validation of Measurement Instruments
Open access
Published: 10 October 2022

Religiousness worldwide: translation of the Duke University Religion Index into 20 languages and validation across 27 nations

Measurement Instruments for the Social Sciences volume 4, Article number: 13 (2022) Cite this article

3680 Accesses
2 Citations
1 Altmetric
Metrics details

Abstract

Religiousness and spirituality are important in the study of psychology for several reasons: They are central to identity and values; they have been reported as being positively associated with health and well-being; and they capture (and perhaps lead to) the largest measurable psychological differences between societies. At five items, the Duke University Religion Index (DUREL) is an efficient measure, which advantageously distinguishes between religious sentiment and activity, and between formal versus private involvement. This project extends its internal validation throughout the world, with formal tests of measurement invariance in three languages in Namibia (Study 1) and in a global sample of 26 countries (Study 2). Results confirmed a two-subscale factorial structure of Religious Activity (combining organizational and non-organizational activities) and Intrinsic Religiosity in Namibia and in half of the 26-country samples. In 13 other countries, fit was best for a one-factor model. Fit was problematic where there was too little intra-national variance: in China and Japan, where religious involvement is universally low, and in Tanzania, where it is universally high. Scalar measurement invariance was found for the one-factor structure across 13 samples and for the two-factor structure across 11 samples. External validation of the scale is examined using psychological and sociodemographic variables. This validation of the DUREL supports its use across contexts, facilitating increased attention to this important aspect of both personality and culture.

Religion is a key aspect of human psychology, which has played an important role in shaping human societies and values (Schulz et al., 2019). Although religious involvement may be decreasing in the USA (Jones, 2021), the majority of people in the world still report being affiliated with a religion (Pew Research Center, 2018), and it is often a central part of their identies (Tarakeshwar et al., 2003). Important to cultural psychology, religiosity and associated values, such as family and gender roles, also demonstrate significant cross-national differences, even larger in effect than those of popular variables such as individualism and collectivism (Saucier et al., 2015). Furthermore, religious involvement has emerged as a source of resilience against mental disorders (De Berardis et al., 2020), suggesting its potential relevance for understanding national differences in the prevalence of such disorders (e.g., Berkessel et al., 2021; Dückers et al., 2016). Thus, religious sentiment and involvement are important to psychology and cross-cultural research (Tarakeshwar et al., 2003), although these topics have been relatively underrepresented in psychological science, perhaps related to the underrepresentation of researchers from majority world contexts (Thalmayer, Toscanelli, & Arnett, 2021) where religion is more central to daily life (Pew Research Center, 2018).

There are many ways to measure religiousness (Hall et al., 2008; Remizova et al., 2022). The focus of this study is the popular and highly efficient inventory, the five-item Duke University Religion Index (DUREL), which includes three components: organizational religious involvement, non-organizational involvement (independent prayer, meditation, or study), and intrinsic or subjective religiosity (Koenig & Büssing, 2010). The DUREL has been shown to converge with other commonly used inventories, for example the Santa Clara Strength of Religious Faith Questionnaire (SCSRFQ; Plante et al., 2002), a five-item unidimensional measure emphasizing the force of religious faith (r with DUREL total score .79, Saffari et al., 2013; .86, Storch et al., 2004), and the Personal Religious Inventory (PRI; Lipsmeyer, 1984), with 45 items and subscales for prayer, ritual attendance, and the integration of religion in cognition, affect, and behavior (r_{ORA, ritual attendance} =.84; r_{NORA, personal prayer} = .76; Lace & Handal, 2018).

The DUREL has been used in diverse contexts including Muslim Iran (Saffari et al., 2013), Catholic Portugal (Lucchetti et al., 2012) and in China where multiple religions are practiced (Wang et al., 2014), though without comprehensive validation. In addition to capturing differences between societies in the typical degree of religious involvement (Saucier et al., 2015), the distinctions made by the DUREL between external behavior (practicing religion) and internal sentiment (spiritual feelings) create a potential bridge between cultural and personality psychology, capturing both between-nation and between-person differences (Saucier, 2019). The current project aims to support better inclusion of religious involvement and sentiment in cross-cultural psychological research by introducing 19 total new translations of the DUREL and extending its validation to three languages in Namibia (Study 1), and to 26 countries around the world (Study 2), considering both internal (psychometric properties and cross-cultural measurement invariance) and external (associations with other variables) validation.

Religiousness and culture

Religiousness may be a key variable in the study of cross-cultural differences. In most nations, a majority of people report being affiliated with a religion, but this varies from lows of 13% in China, 28% in the Czech Republic, and 36% in Japan to near 100% in the Middle East, Sub-Saharan Africa, and other parts of Asia (e.g., Malaysia and Indonesia; Pew Research Center, 2018). The five DUREL items had the largest effect sizes for discriminating between 30 national groups among 50 psychological scales hypothesized to distinguish between cultural groups (281 items), including values (e.g., classic contrasts such as individualism and collectivism), world views, behavioral practices, and personality characteristics (Saucier et al., 2015). The variables that followed the DUREL in terms of effect size included beliefs about family and gender roles, which tend to associate with religious values. In the big picture, using data from the World Values Survey, Schulz et al. (2019) show how the family practice policies of the Western Church may have set in motion lifestyle changes that shaped the psychological changes (individualism, analytic thinking, impersonal pro-sociality) that now define Western culture.

The contemporary significance of inter-cultural differences in religiosity is supported by a study of 106 countries on the importance of religion in peoples’ lives (Pew Research Center, 2018). Note that while cross-cultural comparisons must be interpreted with caution, the rating of importance was determined to be the most invariant aspect in a later study, making it the most suitable of those measured for cross-national assessment (Remizova et al., 2022). The world’s highest religious importance was reported in Sub-Saharan Africa, where between 71% (Botswana) and 98% (Ethiopia) of respondents reported religion as being “very important” to them. Some countries in Latin America (Honduras and Brazil) had rates nearly as high, while in others half the population or less reported high importance. Middle Eastern countries varied from 36% in Israel to 78% in Iran. For Western countries, the highest rates were in the USA (53%) and Greece (56%), and the lowest in the UK and Germany (10%). Asia-Pacific had the largest within-region contrasts, with high importance varying from 3% in China and 10% in Japan, to 80% in India and 90% in Indonesia. Thus, while religion plays a role in virtually every society, contrasts within and across regions are substantial, defining meaningful differences between groups. It is also noteworthy that the importance of religion tends to be lower in Western, industrialized contexts and higher in the “majority world,” e.g., Asia-Pacific, Africa, the Middle East, and Latin America, where almost 90% of humans live, but who are consistently underrepresented in mainstream psychology (e.g., Kagitcibasi, 2002; Thalmayer, Toscanelli, & Arnett, 2021). Better incorporation of religiosity into psychological studies could mean better representing the experiences and perspectives of the global human population.

Since its appearance in the 1990s, the DUREL has been used in many contexts. We identified 16 translations in prior published studies, detailed in Table 1. These include four translations to languages in Asia, three to the Middle East, three to Europe, three to the Americas, and one to Austronesia. None were identified in African languages. Perhaps partly due to the variety of disciplines represented, none of the validation procedures in these studies meet currently accepted standards for the adaptation of psychological measures (e.g., Byrne & van de Vijver, 2010; Fischer & Karl, 2019). Six come from a project which tested translations on samples of 20 to 55 individuals, three studies only tested internal consistency, one assessed no psychometric properties, and none tested measurement invariance. Furthermore, while the DUREL was initially presented strictly as a three-component measure, most later researchers have used it as a unidimensional construct (Chen et al., 2014; Hafizi et al., 2014; Saffari et al., 2013). The internal validation of the DUREL and its proposed structure have thus not been assessed systematically for applicability across contexts. Measurement invariance, in particular, is crucial in cross-cultural research to ensure that survey scores are appropriately comparable (Fischer & Karl, 2019). This is especially relevant in the study of religiosity where the construct may have highly varying and even culture-specific meanings (Remizova et al., 2022). Establishing the extent of the cross-cultural suitability of this practical inventory and making it available in many languages could allow for better comparisons within and between groups, facilitating exploration of religiousness directly and also making it practical to include as a covariate in studies of other psychological phenomena.

Table 1 Published translations of the Duke Religion Index by region and language

Full size table

Associations between religiousness and other psychological variables

The association of religious sentiment and/or engagement with other psychological or sociodemographic variables is important to understand, but has so far mainly been studied cross-sectionally, with measures that have not been tested for cross-cultural measurement invariance. The existing literature is briefly reviewed with this caveat in mind, to provide a summary of current assumptions about these associations, and to form loose hypotheses that can be tested in the Namibian and (where possible) global contexts.

Prior research has generally indicated a positive association for religiousness with well-being and health, for example, between religious practice and life satisfaction in a sample of over 20,000 participants (Berthold & Ruch, 2014). Measures of social commitment in religious activities have also indicated a positive association with physical health (Koenig & Larson, 2001; Seybold & Hill, 2001). Meta-analyses associate religiousness with reduced mortality (Chida et al., 2009; McCullough et al., 2000), and better health and longevity (Seybold & Hill, 2001). However, this assumes a “healthy” way of living one’s religion; commitment of a dogmatic or authoritarian type has been linked to intergroup conflict and child abuse (Seybold & Hill, 2001). Associations can also depend on which dimension of religiosity is taken into account: while depressive symptoms are generally lower among the religious (Smith et al., 2003), an “extrinsic” religious attitude (engaging in religious activities for self-serving ends or to avoid dealing with problems) or negative religious coping (blaming God) instead associate with depressive symptoms (Smith et al., 2003).

Studies among Muslims in Farsi (Hafizi et al., 2014) and Catholics in Portuguese (Lucchetti et al., 2012) have reported lower religious engagement among people with more education, using the DUREL total score. A large multi-country study reported that people with more education were less religious in 18 of 39 contexts studied, but that the opposite was true in nine nations, and there was no effect in 12 (Schwadel, 2015). They reported that association between higher education and lower religiosity was strongest in more religious nations (Schwadel, 2015), but as 26 of the 39 countries were European or closely related Western contexts, the four Asian countries include the wealthiest contexts in the region, and the only African country was South Africa, it is not clear how this finding might generalize to majority world contexts, specifically to Africa, the most consistently religious region in the world. In the only study identified that compared religious involvement to income, scores on DUREL Intrinsic Religiosity were negatively associated with income (Lucchetti et al., 2012).

Women have been assumed to be more religious than men (Argyle & Beit-Hallahmi, 2013), but the religion and the activity may play a role (Vardy et al., 2022). Women identify as more religious in Christian, Muslim, Hindu, and Buddhist, but not Jewish contexts, and engage in more religious activities in Christian, Hindu, and Buddhist, but not Muslim or Jewish contexts (Sullins, 2006). Ultimately, neither gender nor age was found to be a consistent predictor of religiosity in a meta-analysis of 63 studies from 19 countries (Saroglou, 2010).

Goals for the current study

In two studies, we examine the internal and external validation of the DUREL (Koenig & Büssing, 2010) to facilitate its use in the study of psychology within and across cultures. In terms of internal validation, including psychometric and structural validity, in Study 1 we report on the DUREL’s applicability in two African languages, Oshiwambo and Khoekhoegowab, and in English in Namibia. These are the first published translations of the DUREL into African languages, using a multi-step process including expert panels, administered to large samples of community adults. With nearly 98% of the population in Namibia identifying as Christian, and with religious involvement very high throughout Sub-Saharan Africa (Pew Research Center, 2012), this is an important variable to include in local psychological studies. In Study 2, we assess the psychometric and structural properties of the DUREL in the Survey of World Views data, a global sample of university students from 33 countries (described in detail in Saucier et al., 2015; 26 samples used for analysis). As the nations in Study 2 have diverse predominant religions (16 majority Christian, 4 majority Muslim, 1 majority Buddhist, 2 majority Hindu, 2 majority unaffiliated, many with great diversity; detailed percentages are provided in Supplemental Table S1), this allows for the assessment of the instrument in a variety of religious, as well as national and linguistic, contexts. Based on prior research and as reported in our pre-registered analysis plan, we expected the DUREL subscales to have good internal consistency across contexts.

To assess external validation of the DUREL scale, in both studies, for 30 total samples, we examine the association of the DUREL to gender. In the Namibian samples, we additionally examine associations of the DUREL and its subcomponents with age, life satisfaction, physical health, education, income, and employment status. Based on prior literature, we loosely hypothesized finding positive associations between DUREL scores with well-being and physical health, small gender differences in favor of women being more religious, and higher DUREL scores to associate with lower educational level, without a priori expectations regarding the DUREL subcomponents. As prior literature on the relation of religiosity with other aspects of socioeconomic status and age are minimal, we address these associations in an exploratory way.

Together, these studies allow us to assess the internal validity of the DUREL in 30 total contexts, including in 20 total languages, validating 17 new translations, with some additional assessment of external validity, particularly in Namibia. To the extent permissible based on our assessment of cross-sample measurement invariance, we then compare scores across nations. Based on international surveys of religiosity, we expected average scores on the DUREL and/or its subcomponents to be higher in African countries, in India and in Muslim-majority Asian and Middle Eastern countries, and in South and Latin America, and lower in Western contexts to make a cross-cultural assessment of the DUREL’s reliability and validity and to draw cross-national contrasts on religious involvement.

Study 1: the DUREL in three languages in Namibia

Method

Participants

Participants were adult native speakers of Oshiwambo in northern Namibia (n = 678), native speakers of Khoekhoegowab from villages and towns throughout the country (n = 645), and speakers of English (non-native) from the capital city Windhoek and surrounding areas (n = 589). Oshiwambo, spoken by nearly 50% of the population, and Khoekhoegowab, spoken by about 12%, are the two most commonly spoken African languages in Namibia (Frydman, 2011). English is the official language since independence in 1990, when it replaced Afrikaans. For this reason, English speakers were recruited in and around the capital where it is more commonly spoken than in rural areas. (Note that while there is a small population of white Namibians of German, English, and Afrikaner heritage, none were included in our samples.) Demographic information collected included age, gender, home language, participant and parents’ level of schooling, household income, employment status, and location of survey-interview. A summary is provided in Table 2; full details are in Supplemental Table S2.

Table 2 Sample characteristics by language group, in percentages

Full size table

Chi-square tests indicated some significant differences among the samples. Educational attainment, measured by asking to the participants to indicate what is the highest level of education that they completed, was highest among the English-speaking and lowest among the Oshiwambo-speaking sample, χ²(14) = 391.43, p < .001. The same pattern was observed for mother’s, χ²(14) = 186.11, p < .001, and father’s education, χ²(14) = 173.64, p<.001, and for monthly income, χ²(12) = 317.17, p < .001. To assess employment or engagement in education, participants were asked to choose one or more from seven options. The proportion that reported being a student did not vary by sample, χ²(2) = 2.90, p = .23. However, the English-speaking sample were most likely to have regular part-time, χ²(2) = 38.41, p < .001, or full-time work, χ²(2) = 49.69, p<.001, and the Oshiwambo language sample was least likely. The categories of “currently not working,” χ²(2) = 81.34, p < .001, “working at home, or other unpaid work,” χ²(2) = 13.26, p = .001, “seeking paid work,” χ²(2) = 36.15, p < .001, and “occasional paid work,” χ²(2) = 45.26, p < .001, were endorsed most often by Khoekhoegowab speakers and least often by the English language sample.

Procedure

Ethical review of the study plan was made by the University of Namibia’s Research Ethics Committee (UREC) and data was collected from July to September, 2019. The English language survey data was collected in a paper and pencil format, as English is the national language and the language of instruction in most Windhoek schools and is both spoken and written in local work settings.

Many potential participants in the Khoekhoegowab and Oshiwambo language samples, however, were expected to lack confidence in reading and writing their mother tongue, because many attended school in Afrikaans or in English despite speaking an African language at home. For this reason, in those samples, the survey was filled out by an interviewer based on the oral responses of participants.

Teams of eight to 15 interviewers/data collectors for each of the three language contexts were graduates of the sociology or psychology programs of the University of Namibia (BA or MA degree), primary- or secondary-school teachers of the language in question, or experienced data collectors, having worked on previous academic studies and/or for the national census survey. Each team met for a weekend-long training led by the second and fourth authors of this report. Data collection occurred in a 4- to 8-week period following the training and was coordinated with local leaders. Participants were recruited by interviewers in their home communities from among neighbors, church members, colleagues, the parents of students, and strangers from nearby villages and neighborhoods. Interviewers asked participants their age and their home language, and they noted gender, the location of the survey, and notes about how the participant was recruited and how the interview went. The interview and the preliminaries regarding informed consent and instructions were in the same language as the survey. Written informed consent was obtained from all participants. The interview typically lasted approximately 40 min.

Materials

Results from three inventories are reported here. The full survey also included inventories on mental health and personality traits, described in other projects and in the pre-registered analysis plan, https://osf.io/y8d4z/?view_only=347d245ea9544b09a4111a8650059af4. All study materials are available at https://osf.io/6d8gs/?view_only=a32710f4784e447f890aa077ff6b5bc9.

The inventories were translated into Khoekhoegowab and into Oshiwambo (Oshikwanyama dialect) following a process defined by the World Health Organization. This is described in detail here, as potential guide for other researchers. While such a process is probably always ideal, it is arguably especially relevant when working in smaller-scale languages, for example as there is no published English to Oshiwambo dictionary. (1) Forward translation was completed for Khoekhoegowab by a native speaker and PhD candidate in African linguistics, and for Oshiwambo by a native speaker and professional translator referred to us by the African Languages department at the University of Namibia. (2) Expert panels were held for each language, including the initial translators, subject matter experts and language experts all of whom were mother-tongue speakers, and the last author of this report who was the principal investigator of the study. For Khoekhoegowab, the additional experts included a clinical psychologist, a social worker, and a senior lecturer of the Khoekhoegowab at the University of Namibia. For Oshiwambo, in addition to the original translator, they included four Oshiwambo-speaking faculty members of the University of Namibia: a clinical psychologist and senior lecturer of psychology (the second author of this report), a lecturer of psychology, a lecturer of education, and a senior lecturer of Oshiwambo. At these day-long panel meetings, the initial forward translation next to the original English was projected onto a screen. The group discussed every line of translation, ultimately coming to consensus on a refined translation. (3) Back-translation of the refined translation was conducted by a professional translator, in each case, who had not been present at the panel meeting and had no prior knowledge of the surveys. This version was reviewed together with the expert, the initial translator, and the third and final authors of this report, and adjustments were made. (4) Pre-testing and cognitive interviewing were conducted by a research assistant in each of the languages, who piloted the survey with members of the relevant communities. These translations are made freely available to other researchers in the online materials posted for this study.

Note that because the English language version of the survey was completed on paper, items were worded in the first person. The Khoekhoegowab and Oshiwambo surveys were planned to be read aloud to participants, so items were worded in the second person.

Duke University Religion Index (DUREL)

The five items measure three aspects of religiosity: Organizational Religious Activity (ORA; one item), Non-Organizational Religious Activity (NORA; one item), and Intrinsic Religiosity (IR; three items; Koenig & Büssing, 2010). Responses on a 6-point Likert-type scale are linked to different terms depending on item (respectively: “never” to “more than once a week;” “rarely or never” to “more than once a day;” “definitely not true of me” to “definitely true of me”). Though the authors recommend against a total score, EFA and CFA support for one has been reported in samples from Iran (Saffari et al., 2013) and China (Chen et al., 2014), and many other studies have used one. The mean, standard deviation, Cronbach’s alpha, and Omega of the DUREL in the three languages are reported in Table 3. To provide comparability with prior research and facilitate reproducibility for future researchers, and with the justification of high alpha and Bollen’s omega coefficients, we report all raw item, subscale, and total scores.

Table 3 Descriptive statistics and psychometric properties for DUREL items and components

Full size table

Satisfaction with Life Scale (SWLS)

This 5-item measure assesses well-being in terms of global cognitive judgment (Diener & Emmons, 1985). The scale showed acceptable reliability in all languages (English α = .74; Khoekhoegowab α = .74; Oshiwambo α = .70). The measurement invariance of the SWLS across these samples is described in detail in a separate report (CITATION REDACTED FOR BLIND REVIEW). Support was found for a 4-item version, excluding the fifth item, which is more abstract than the others and has been found to be problematic in many studies. The analyses here rely on the first four items only.

General Self-Reported Health (GSRH)

Physical health was rated on a 5-point scale from poor to excellent. A meta-analysis of 22 studies has shown this one-item self-assessment to correlate highly with longer and more invasive measures of health status and to be strongly associated with risk of death over 5 years (DeSalvo et al., 2006; DeSalvo & Muntner, 2011).

Analyses

Data exclusions

Based on criteria described in the pre-registered analysis plan, 125 of the 2037 cases collected were excluded from the analyses. This included those marked for exclusion by the interviewer (either because it was not completed or because the interviewer did not believe it was reliably completed; n =15); from participants under age 18 (n = 2); missing more than 15% of total item responses (n = 21); or where problems were found in assessment of the longer questionnaires, for example where these was almost no variation in response option usage across 30 or more items (n = 87).

Missing data

The MissMech package in R (Jamshidian et al., 2014) was used to evaluate the homoscedasticity, multivariate normality, and missing completely at random (MCAR) status of the data in preparation for analysis (Jamshidian & Jalal, 2010). While multivariate normality was rejected, there was no reason a priori to expect normality given the skewed ordinal distribution of the DUREL. The nonparametric test of homoscedasticity was not rejected (p = .24), and there was not sufficient evidence to reject MCAR at p < .05. The fraction of missing information (fmi), e.g., how much estimation is affected by nonresponses (Savalei & Rhemtulla, 2012), for item means and variances was less than .01, indicating little to no impact of missing data on obtained estimates. Together, these analyses gave us confidence in our assessment that missing data were likely missing at random, if not missing completely at random, and the impact of missing values would likely be negligible. Multiple imputation or estimates relying on FIML were therefore utilized. Multivariate nonnormality, however, led us adopt distribution-free methods for CFA and SEM modeling, specifically WLSMV (Blunch, 2012).

Psychometric properties and structure

We tested the factor structure of the DUREL for each language group using exploratory factor analysis (EFA) to confirmatory factor analysis (CFA) with a random split training/testing design. Parallel analysis (O’Connor, 2000), Velicer’s minimum average partial (MAP) test (Velicer, 1976), VSS complexity 1 and 2, empirical BIC, and sample size-adjusted BIC were used to assess the optimal number of factors. For CFA, the criteria used to compare the models followed standard practice by considering multiple indices of acceptable fit (Bowen & Masa, 2015; McDonald & Ho, 2002; Vandenberg & Lance, 2000), including a decrease in chi-square value; Tucker-Lewis index (TLI; Tucker & Lewis, 1973) and comparative fit index (CFI) values over .95 (Cheung & Rensvold, 2002; Hu & Bentler, 1999); McDonald’s noncentrality index (MFI; McDonald, 1989; McDonald & Marsh, 1990) over .90; and root mean square error of approximation (RMSEA; Steiger, 1990) and standard root mean square residual (SRMR; Bentler, 1995) below .08 (Cheung & Rensvold, 2002; Hu & Bentler, 1999). Because chi-square and RMSEA can be biased by degrees of freedom or sample size (Chen et al., 2008; Shi et al., 2020), we interpret them in light of other indices. It is recommended that when degrees of freedom are low, as here, higher RMSEA should not be taken as indicative of problematic fit in the absence of problems with other fit indices (Kenny et al., 2015).

Measurement invariance and group differences in religiosity

We tested the cross-group invariance of the DUREL using multi-group confirmatory factor analysis (MG CFA) with lavaan (Rosseel, 2012) and associated packages in R (R Core Team, 2020). Following standard procedures with recommended modifications for ordinal data (Bowen & Masa, 2015; Byrne & van de Vijver, 2010; Fischer & Karl, 2019; Putnick & Bornstein, 2016), we assessed for three levels of invariance: (1) configural, with parameters free to vary for each group; (2) metric, constraining factor loadings to be equal; and (3) scalar, constraining both item loadings and thresholds to equality across groups. The location and scale of each latent item response underlying the five ordinal indicators were identified using delta parameterization and identification constraints recommended by Wu and Estabrook (2016). We used polychoric correlation and weighted least squares estimation (WLVSV) with robust standard errors and a mean and variance adjusted test statistic (Satterthwaite approach; Rosseel, 2012). Again, multiple indices were used to judge model comparison (Bowen & Masa, 2015; McDonald & Ho, 2002; Vandenberg & Lance, 2000), and the chi-square test, which is known to be sensitive to sample size and can lead to over-rejecting invariance, is interpreted in light of TLI, CFI, MFI, RMSEA, and SRMR. The difference in fit across levels of invariance was assessed based on a difference in CFI < .01 and a change in RMSEA ≤ .01 (Svetina et al., 2020). Group latent means can be compared using ANOVA with LSD post hoc tests where group equivalence at the scalar level is established.

Associations with other variables

For each language group, we were interested in how sociodemographic variables predict religiosity, and in how religiosity predicts physical and mental health. We thus report standardized coefficients from a regression of DUREL subcomponents on sociodemographic variables (age, gender, education, and income on), and of SWLS and GSRH scores on DUREL subcomponents, accounting for the sociodemographic variables. For these analyses, education was recoded into a continuous variable indicating years of study: did not finish primary school = 0, grade 7 = 7, grade 10 = 10, grade 12 = 12, vocational or other diplomas = 14, bachelor’s degree = 16, master or post-graduate = 18. Similarly, the categories used to assess income were recoded into a continuous variable using the mid-point of the ranges in Namibian dollars that were provided (see Supplemental Table S3; the new coding includes values of 0, 250, 1000, 2250, 4000, 7500, and 10,000).

To evaluate associations with the categorical variable of employment, we employed ANOVA with Tukey’s post hoc tests. From among several options to describe employment, four mutually exclusive categories were created: student; working part time; working full time; unemployed (including those seeking work or working only occasionally). Those who selected only “work at home or other unpaid work” were excluded from this analysis. Due to making multiple comparisons, we avoid the standard p < .05 criteria and instead only interpret ANOVA differences and Tukey’s HSD post hoc tests where p < .01.

Results

Exploratory to confirmatory factor analyses

For all three samples, Bartlett’s sphericity measures and Kaiser, Meyer, Olkin (KMO) measures of sampling adequacy confirmed the data to be appropriate for factor analysis: English χ² (10) = 637.54, p < .001, KMO = 0.72; Khoekhoegowab χ² (10) = 861.56, p < .001, KMO = 0.76; Oshiwambo χ² (10) = 1640.22, p < .001, KMO = 0.70. Parallel analysis, VSS complexity 1 and 2, empirical BIC, sample size-adjusted BIC, and Velicer’s MAP test indicated a two-factor configuration for all three samples, with results reported in Supplemental Table S2. We considered retaining a model of three components based on the theoretical foundation of the DUREL and face validity (Wang et al., 2014), but CFA with two single-item factors was undermined by lack of identifiability, and EFA results suggested that the religious activity items (ORA and NORA) optimally loaded together as one factor, leading us to rely on a two-factor model for CFA. CFA confirmed this, with improvement in all fit indexes, as shown in Table 4 (the same analyses using scaled values are reported in Supplemental Table S5). The two-factor configural models with item loadings and factor correlations are displayed graphically in Fig. 1.

Table 4 DUREL configural fits: one vs two factors in three languages in Namibia

Full size table

Measurement invariance

Results of measurement invariance analysis among the three language groups are reported in Table 5. With increasing constraints, goodness-of-fit indices remained stable, with RMSEAs at .06, CFI and TFI > .99, MFI > .97, and SRMR < .04. Decrease in CFI and TFI and other fit indices were negligible with increasing parameter constraints (Cheung & Rensvold, 2009). From these results, we conclude that invariance was supported at the scalar level: factor location and scale as well as item loadings and thresholds are equivalent between groups, allowing direct comparison of scores. Item thresholds established for Namibia across language groups for each item, grouped by scale, are shown in Fig. 2.

Table 5 Model indices for measurement invariance of the DUREL in three languages in Namibia

Full size table

Group differences in religiosity

Because scalar level invariance was established for the two-component model, we compared mean scores across groups using latent scores using ANOVA, with results reported in Supplemental Table S4. No significant differences in either religious activity or intrinsic religiosity were seen between the three language groups.

Associations with other variables

In Table 6, we report the standardized coefficients from a regression of latent scores on the DUREL subcomponents on sociodemographic variables (age, gender, education, and income on), and of SWLS and GSRH scores on DUREL subcomponents (latent scores), accounting for the sociodemographic variables. The most consistent predictor of religious engagement was gender: Women scored higher in all three groups, on both subscales for English and Khoekhoegowab language groups, and for religious activity for the Oshiwambo group. Older participants in the Khoekhoegowab and Oshiwambo samples were more religious, but this was not true in the English language sample. Higher education was associated with higher religiosity for the English language group, but not for the other groups. There were no significant associations with income.

Table 6 Standardized coefficients for associations of latent scores on DUREL scales with life satisfaction, health, and demographic variables

Full size table

Satisfaction with Life was predicted by Religious Activity for Khoekhoegowab speakers only, and by Intrinsic Religiosity for Oshiwambo speakers only. Self-reported health was predicted by Religious Activity for Oshiwambo speakers only. Intrinsic Religiosity predicted better self-reported health for Khoekhoegowab speakers but worse health for Oshiwambo speakers. Results were more consistent for the sociodemographic variables in the model, with higher age predicting worse health in all three samples, and higher education and income generally associating with better satisfaction and health.

Differences in religiosity based on level of employment (student, working part time, working full time, or unemployed) were tested using ANOVA, interpreting differences with post hoc tests only where significant at p < .01. For both English and Oshiwambo speakers, there were no significant differences at this level. For Khoekhoegowab speakers, there were significant but small effects indicating more religious involvement among those who are employed full-time versus those who are unemployed, for RA F(1, 4) = 4.19, p < .01, η² = .026, and IR, F(1, 4) = 5.57, p < .05, η² = .034 .

Discussion

In Study 1, we tested the DUREL’s psychometric properties, structure, and association with other variables for three language groups in Namibia, including the two most-spoken African languages and the national language of English. These two new translations, the first into African languages, were developed using a multi-step translation process including expert panels and were tested on large samples of community adults representative of the local populations. They are now freely available to the scientific community. Given that Africa may be the most religious region in the world (e.g., Pew Research Center, 2018), this study of the DUREL in an African context, to our knowledge the first and only, is overdue.

Consistent with prior research but contrary to the recommendations of the original authors, we found the DUREL total score to have good internal consistency. However, our analysis with multi-group CFA better supported a two-factor structure. The three-item factor of Intrinsic Religiosity was well supported, while the single-item scales for Organizational and Non-Organizational religious activity loaded together as one factor, which we termed “Religious Activity.” In Namibia, at least, it appears that those who attend church also tend to spend time in private religious activities. We were able to establish scalar level measurement invariance for this two-factor model across the three groups, indicating that the scale is used similarly, and scores can be compared, across these groups in Namibia. We did not find differences in Religious Activity or Intrinsic Religiosity among the groups.

Association of the DUREL with life satisfaction, health, and sociodemographic variables partially confirmed hypotheses based on prior literature. As has been seen in other contexts, more religious Namibians reported higher life satisfaction and health in some regards, though these results were not consistent across DUREL subcomponents or samples, with more non-significant than significant associations. Contrary to prior findings (e.g., Hafizi et al., 2014; Lucchetti et al., 2012; Schwadel, 2015), more educated people were more rather than less religious in the English language Namibian sample, though there was no significant association in the other samples, after controlling for other sociodemographic variables. Given the generally high religious engagement in Namibia, this finding in particular contradicts the finding of Schwadel (2015) that the role of education in increasing secularization is especially strong in highly religious contexts. This would need to be replicated in other samples before conclusions can be drawn, but suggests that the role of education may differ in under-studied, majority world contexts. There were no significant associations with income.

As has been seen in other Christian samples, women reported more religious involvement than men, both in terms of religious activity and private sentiment. Exploratory analyses found that older participants were more religious in some regards. Being employed full-time, a variable not hitherto tested in relation to the DUREL, was associated with higher religious involvement only among Khoekhoe speakers.