Risk factors relate to the variability of health outcomes as well as the mean: A GAMLSS tutorial

Version of Record

Accepted for publication after peer review and revision.

Download
Cite
Share
CommentOpen annotations (there are currently 0 annotations on this page).

Version of Record published: January 26, 2022 (This version)
Accepted Manuscript published: January 5, 2022 (Go to version)
Accepted: January 4, 2022
Received: July 20, 2021
Preprint posted: March 31, 2021 (Go to version)

1. Of interest
A retrospective cohort study of Paxlovid efficacy depending on treatment time in hospitalized COVID-19 patients

Zhanwei Du, Lin Wang ... Lauren A Meyers

Short Report Apr 16, 2024
Further reading

Abstract
Editor's evaluation
Introduction
Methods
Results
Discussion
Data availability
References
Article and author information
Metrics

Abstract

Background:

Risk factors or interventions may affect the variability as well as the mean of health outcomes. Understanding this can aid aetiological understanding and public health translation, in that interventions which shift the outcome mean and reduce variability are typically preferable to those which affect only the mean. However, most commonly used statistical tools do not test for differences in variability. Tools that do have few epidemiological applications to date, and fewer applications still have attempted to explain their resulting findings. We thus provide a tutorial for investigating this using GAMLSS (Generalised Additive Models for Location, Scale and Shape).

Methods:

The 1970 British birth cohort study was used, with body mass index (BMI; N = 6007) and mental wellbeing (Warwick-Edinburgh Mental Wellbeing Scale; N = 7104) measured in midlife (42–46 years) as outcomes. We used GAMLSS to investigate how multiple risk factors (sex, childhood social class, and midlife physical inactivity) related to differences in health outcome mean and variability.

Results:

Risk factors were related to sizable differences in outcome variability—for example males had marginally higher mean BMI yet 28% lower variability; lower social class and physical inactivity were each associated with higher mean and higher variability (6.1% and 13.5% higher variability, respectively). For mental wellbeing, gender was not associated with the mean while males had lower variability (–3.9%); lower social class and physical inactivity were each associated with lower mean yet higher variability (7.2% and 10.9% higher variability, respectively).

Conclusions:

The results highlight how GAMLSS can be used to investigate how risk factors or interventions may influence the variability in health outcomes. This underutilised approach to the analysis of continuously distributed outcomes may have broader utility in epidemiologic, medical, and psychological sciences. A tutorial and replication syntax is provided online to facilitate this (https://osf.io/5tvz6/).

Funding:

DB is supported by the Economic and Social Research Council (grant number ES/M001660/1), The Academy of Medical Sciences / Wellcome Trust (“Springboard Health of the Public in 2040” award: HOP001/1025); DB and LW are supported by the Medical Research Council (MR/V002147/1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Editor's evaluation

Using data from the 1970 British Birth Cohort study, the authors demonstrated the utility of Generalized Additive Models for Location, Scale and Shape (GAMLSS) to investigate the association of three risk factors (sex, socioeconomic circumstances, and physical inactivity) with body mass index and mental wellbeing. This work provides empirical evidence for why we should consider how risk factors influence the variability and not just the mean of outcomes. From the perspective of developing personalized medicine, it is important to know whether interventions have response heterogeneity as the first step. If such heterogeneity is identified, the next step will be to identify the factors associated with the heterogeneity (or those who will be benefitted from the intervention). Therefore, this study contributes to the first step by investigating the possibility of response heterogeneity.

https://doi.org/10.7554/eLife.72357.sa0

Introduction

What is health? Contrary to simplistic notions of its being defined as the absence of disease, it is now increasingly understood that most outcomes of public health significance are continuous in nature (Keyes and Galea, 2016). This applies to both physical and mental health outcomes (Plomin et al., 2009; Keyes, 2002). The use of binary endpoints, while having utility in clinical applications, should not hinder investigation of the influences of health outcomes which are ultimately continuous. Further, analysing the determinants of health using continuous rather than binary outcomes is beneficial both practically (with more statistical power and less information loss) and substantively (greater aetiological understanding). Indeed, those at high risk of a developing an illness may comprise a minority of those who ultimately succumb (Rose, 2001).

Studies into the effect on continuous outcomes of exposures, be they risk factors in observational studies or interventions in randomised trials, typically focus on mean differences in the outcome, using linear regression. However linear regression assumes homoscedasticity, that is that the variability of the outcome is unrelated to the exposure, and often this is not the case. It is possible to extend regression analysis to model the variability as well as the mean, and this has benefits in terms of not only the model’s fit but also its interpretation. If for example the intervention in a trial can be shown to reduce variability in the outcome, this could reasonably be viewed as evidence of intervention success (Subramanian et al., 2018) independent of the intervention’s effect on the mean. Treatment for refractive vision errors—glasses, contact lenses, and/or corrective surgery—seeks to improve vision by shifting individuals towards a specified standard (e.g. 20/20 vision) (Vitale et al., 2006). Successful treatments alter the mean refraction, but they are even more successful if they also reduce the substantial variability in refraction arising from the mix of short- and long-sighted individuals.

Similarly, obesity interventions aim to reduce body mass index (BMI) and shift treated individuals from overweight (25–30 kg/m²), obese ( > 30 kg/m²), or severely obese ( > 45 kg/m²) to the normal range (20–25 kg/m²). However, here the effect of the intervention on variability is often to increase it. Even if not formally tested, visual comparisons of outcome distributions of some influential trials suggest that weight loss interventions increase rather than reduce BMI variability, (Truby et al., 2006) presumably since they are effective in some but not all participants.

Understanding if and how risk factors influence variability in health outcomes has aetiological significance, consistent with the goal of epidemiological science to understand the distribution of health (Porta, 2008). Risk factors could feasibly affect outcome variability yet not affect the mean—for example, one study found that breastfeeding was not related to mean childhood BMI, yet was related to lower childhood BMI variability (Beyerlein et al., 2008b). Similarly, sex may affect variability and/or average levels of an outcome—for instance, males may have greater variability than females in some cognitive traits (Hyde, 2014) and brain structures (Wierenga et al., 2022).

Identifying associations between risk factors and outcome variability may also be useful to identify the absence or presence of heterogeneity in susceptibility to interventions or risk factors and thus aid aetiological understanding. Indeed, the finding that substantial increases in mean BMI in recent decades have been matched by increases in BMI variability indicates that there may be differential susceptibility to the obesogenic environment (Flegal and Troiano, 2000; Johnson et al., 2015). In the context of randomised controlled trials, the finding of variability in treatment effects between individuals has been used to justify individualised approaches to treatment (personalised medicine). Reflecting the challenges of empirically testing this, however, five separate meta-analyses have tested heterogeneity in response to antidepressant therapy; despite using the same dataset, different methods and divergent conclusions were drawn (Luedtke and Kessler, 2021).

Another advantage of modelling varability arises in common situations where the outcome under study is non-linearly related to other outcomes of interest. For instance, BMI influences mortality and morbidity rates, but the relationship between BMI and mortality is thought to be J-shaped ( Bhaskaran et al., 2018) compared with those in the normal range, mortality risks are greater for those who are under- or overweight. In this case, the total effect of an intervention to reduce BMI on these wider outcomes is not fully captured by its average BMI effect. Rather, understanding the total distributional effect on BMI is required.

Figure 1 shows three hypothetical scenarios for an intervention to affect the distribution of an outcome. In the first case (Panel A), the intervention has an impact that is consistent across the population: all individuals are affected and to the same extent. In the second case (Panel B), the intervention has the same mean impact, but variability is also increased: some are positively affected, others negatively. In the third case (Panel C), the mean is again increased, but so is skewness. There is heterogeneity in response, with some seeing more positive responses than others. The policy implications may be different in each case. In the second and third scenarios, efforts could be directed to identify those who are (more) positively impacted, so as to increase the net benefit or cost-effectiveness of the intervention. Indeed, in a choice between interventions, an intervention generating lower expected benefits but smaller variability in outcomes may be chosen, in so far as reducing inequalities is seen as a policy goal in itself.

Figure 1

Download asset Open asset

Simulated data for three interventions each having the same effect on the mean, but different effects on the variability (middle panel) and skewness (bottom panel).

Recent studies in biological (Sun et al., 2020; Nakagawa et al., 2014), environmental (Pitt et al., 2020), and economic science (Hohberg et al., 2020; Silbersdorff and Schneider, 2019; Silbersdorff et al., 2018) have begun to examine how risk factors relate to the distribution of the outcome of interest. However, there have been few epidemiological applications of this approach to date; (Beyerlein et al., 2008a) and fewer still that provide explanations for such findings, which are essential if such methods are to have utility. Indeed, one recent study which investigated the association between mental health symptoms and lower income explicitly avoided interpretation of its findings on variability, focusing instead on issues relating to the application of such methods (Silbersdorff and Schneider, 2019).

Regression methods that allow variability to be modelled are uncommon. One particular method, Generalised Additive Models for Location, Scale and Shape (GAMLSS) (Rigby and Stasinopoulos, 2005) has become the standard for constructing growth reference centiles, (Cole et al., 2009) where the aim is to model the outcome’s distribution as a function of age. It defines the distribution in terms of distribution moments, i.e. the mean, variance, and optionally skewness and kurtosis. This allows for factors influencing the higher moments to be identified in just the same way as for the mean, and it provides a simple and elegant interface for modelling variability in epidemiology.

Another arguably underutilised (Beyerlein, 2014) and related statistical approach to investigating risk factors for continuous outcomes is quantile regression. Recent epidemiological studies using this method have found that risk factors for higher BMI—particularly lower social class and physical inactivity—have sizably larger effect sizes at higher BMI centiles (Bann et al., 2020; Green and Rowe, 2020). This has potentially important policy implications—risk factors which have larger effects amongst those at highest health risk are likely to have a more favourable effect on population health than alternatives which do not (Bann et al., 2020). However, the reason for this phenomenon is not yet understood—it is likely to be logically consistent with results of GAMLSS analyses in which risk factors influence outcome means, variability and/or skewness.

In this paper, we provide a worked example of the use and interpretation of GAMLSS. Accompanying this is an online tutorial and full replication syntax for running GAMLSS in R (https://osf.io/5tvz6/). We investigate whether and how several established risk factors—sex, childhood socioeconomic circumstances, and physical inactivity (Stringhini et al., 2017)—relate to differences in outcome mean and variability. We choose two different continuous outcomes, an indicator of adiposity (body mass index, BMI) and mental wellbeing. These are two weakly correlated health outcomes, each of independent importance to population health. Each risk factor-outcome combination is the subject of previous (separate) literature which focuses largely on mean differences only. For instance, low socioeconomic position in childhood has been repeatedly related to higher BMI (Bann et al., 2018; Senese et al., 2009) and worse mental wellbeing in adulthood; (Wood et al., 2021; Simanek et al., 2021; Wood et al., 2017) greater physical activity has notable likely bi-directional links with lower BMI ; (Jakicic et al., 2019) and higher wellbeing; (Black et al., 2015; Choi et al., 2019; Pinto Pereira et al., 2014) while males and females seemingly have similar mean BMI and wellbeing, (Wood et al., 2017) this may mask differences in variability or skewness, as suggested in the sizable sex differences in overweight and obesity rates (Conolly et al., 2017).

The further investigation of differences in variability and skewness in these outcomes is therefore arguably of substantive interest, providing further motivation to the tutorial content. We highlight the contribution of GAMLSS by contrasting results with the more commonly used linear regression and (less commonly used) quantile regression models.

Methods

Study sample

The 1970 British birth cohort study consists of all 17,196 babies born in Britain during one week of March 1970, with 9 subsequent waves of follow-up from childhood to midlife (Elliott and Shepherd, 2006) At the most recent wave (46 years), 12,368 eligible participants (those alive and not lost to follow-up) were invited to be interviewed at home by trained research staff—8581 participants provided at least some data in this wave. At all waves, informed consent was provided and ethical approval granted.

Health outcomes

We selected two outcomes in midlife which capture different dimensions of health and are continuously distributed: adiposity (BMI), and mental wellbeing (Warwick-Edinburgh Mental Wellbeing Scale (WEMWBS)). BMI was measured at 46 years, and wellbeing at 42 years (Wood et al., 2021) WEMWBS consists of 14 positively worded items—such as “I’ve been feeling optimistic about the future” and “…feeling cheerful”—measured on a five-point Likert scale, which are summed to give a total well-being score ranging from 14 to 70 (highest well-being) (Tennant et al., 2007).

Risk factors

We chose three risk factors across different domains—each of them likely to independently influence health outcomes (Stringhini et al., 2017). They were coded as binary variables to simplify comparison of descriptive and GAMLSS results: sex (female/male), socioeconomic position (social class at birth; coded as non-manual/manual), and a behavioural risk factor (reported physical activity at 42 years; reported days in which the participant took part in exercise for 30 min or more in a typical week ‘working hard enough to raise your heart rate and break into a sweat’, coded as active ( ≥ 1 days)/inactive (0 days)). We examined if the binary split of risk factors influenced the inferences drawn—additional analyses were conducted with them coded instead as categorical variables (social class in six categories and physical inactivity from 0 to 7 days).

Analytical strategy

To visually inspect the outcome distributions and their differences across risk factor groups, we first plotted separate kernel density estimates alongside relevant descriptive statistics (mean, standard deviation, and coefficient of variation [CoV = SD/mean]). This enables a descriptive depiction of variability, with unadjusted GAMLSS results corresponding to each descriptive statistic. We then used GAMLSS (Rigby and Stasinopoulos, 2005) separately with each outcome, to formally investigate whether risk factors were associated with (1) differences in mean outcome, (2) differences in outcome variability, and (3) differences in outcome skewness. Linear regression analysis, in contrast, only enables mean differences in outcomes to be investigated.

GAMLSS is a form of regression analysis that estimates different ‘moments’ of the outcome distribution. The first moment is the location (see mean in Figure 1 panel a), the second is variance, which specifies the scale or spread (SD in Figure 1 panel b) the third is skewness which quantifies the relative size of the distribution tails (Figure 1 panel c). As in linear regression analyses covariates can optionally be included, and appropriate link functions can be chosen for use.

GAMLSS requires that the distribution is specified at the outset. In this tutorial we use two distributions which we recommend for use in epidemiological research of continuous outcomes. First, the normal distribution (called NO in GAMLSS), where location is measured by the mean and scale by the standard deviation (SD). The normal distribution has no ‘shape’ moments, as there is no skewness and kurtosis is fixed.

Second, a more complex distribution which enables skewness to be investigated: the Box-Cox Cole and Green (BCCG). Here location is the median, scale is the generalised coefficient of variation (CoV), which is calculated in the normal case as SD/mean, and shape is skewness as defined by the Box-Cox power required to transform the outcome distribution to normality. The transformation requires the outcome to be on the positive line, so zero or negative values are excluded. BCCG is effectively NO with added skewness, though parameterised differently. A Box-Cox power of 1 indicates that the distribution is normal, 0 is log-normal and –1 inverse normal, so a smaller (i.e. more negative) power corresponds to more right skewness.

After choosing a distribution, linear models are used to specify the relationship between the independent variables and the different moments of the outcome distribution. As with other regression models, GAMLSS provides a standard error for each estimated coefficient, from which 95% confidence intervals can be calculated. We note that more experienced users may wish to use alternative distributions which GAMLSS facilitates (Rigby et al., 2019).

In our primary analyses we used the NO and BCCG families. Differences in variability are modelled with a log link, and can be multiplied by 100 and interpreted as percentage differences in variability to aid interpretation (Lewontin, 1966). Differences in the mean and median were also analysed as percentages, to aid comparability across outcomes and model estimates. To aid comparison of descriptive statistics and model estimation results, we first conducted analyses adjusting for each risk factor alone. We then adjusted for the risk factors jointly.

Separately we fitted conditional quantile regression models to estimate risk factor and BMI associations at the lower, middle and upper quartiles of the outcome distribution, that is the 25th, 50th, and 75th centiles.

All analyses were conducted using R v4.1.1. We used the gamlss package version 5.3–4 to produce gamlss models (Stasinopoulos and Rigby, 2007). Syntax to replicate all analyses is presented online (https://osf.io/5tvz6/).

Results

A total of 6007 participants had valid data for BMI and all risk factors, and 7104 for WEMWBS. Mean BMI was 28.4 (SD = 5.5), and mean WEMWBS 49.2 (8.3). Higher BMI was weakly associated with lower wellbeing (r = –0.07, p < 0.01). BMI was moderately right-skewed (Figure 2, left panel) and WEMWBS left-skewed (Figure 2, right panel). Visual and descriptive comparisons of the BMI and wellbeing distributions by risk factor suggest that differences in the outcome mean and variability are not always in the same direction.

Figure 2

Download asset Open asset

Kernel density plots for body mass index and mental wellbeing, stratified by risk factor group.

Note: CoV = coefficient of variation (SD/mean).

GAMLSS results for the binary risk factors are shown in Tables 1 and 2, with the results using the extra risk factor categories inSupplementary file 1. Associations were similar in the unadjusted and mutually adjusted analyses, so the former are described below.

Table 1

Risk factors in relation to body mass index: differences in mean, variability and skewness estimated by GAMLSS (n = 6007).

Risk factor	%	NO distribution		BCCG distribution
Risk factor	%	Mean	SD	Median	CoV	Skewness*
Female (ref)	52.4%	28.1	6.1	26.9	0.22	1.10
Male	47.6%	28.7	4.6	28.2	0.16	0.75
Unadjusted difference, % (SE)		1.9 (0.5)	–27.6 (1.8)	4.1 (0.4)	–23 (1.8)	0.48 (0.11)
Adjusted† difference, % (SE)		2.2 (0.5)	–27.4 (1.8)	4.4 (0.4)	–22.6 (1.8)	0.54 (0.11)

Non-manual (ref)	36.3%	27.7	5.2	27	0.19	1.15
Manual social class	63.7%	28.8	5.5	28	0.19	0.90
Unadjusted difference, % (SE)		4.0 (0.5)	6.1 (1.9)	4.4 (0.5)	6 (1.9)	0.39 (0.11)
Adjusted† difference, % (SE)		3.8 (0.5)	5.5 (1.9)	4.3 (0.4)	5.6 (1.9)	0.40 (0.12)

Physically active (ref)	73%	28.1	5.2	27.4	0.19	0.97
Inactive	27%	29.1	6.0	28.3	0.21	0.94
Unadjusted difference, % (SE)		3.3 (0.6)	13.5 (2.1)	2.9 (0.5)	10.4 (2.1)	0.08 (0.12)
Adjusted† difference, % (SE)		3.3 (0.6)	12.1 (2.1)	3.1 (0.5)	9.3 (2.1)	0.12 (0.12)

*

Skewness is estimated as the Box-Cox power (that is, the power required to transform the outcome to a normal distribution); differences are the absolute difference in Box-Cox power in each subgroup estimated by GAMLSS. GAMLSS estimates multiple distribution moments simultaneously; thus, differences may not exactly correspond to descriptive comparisons reported above.
†

Estimates mutually adjusted for sex, social class and physical inactivity.
NO: normal distribution; BCCG: Box-Cox Cole and Green distribution: SD: standard deviation; CoV: coefficient of variation; GAMLSS: Generalized Additive Models for Location, Scale and Shape; SE, standard error.

Table 2

Risk factors in relation to mental wellbeing (WEMWBS): differences in mean, variability and skewness estimated by GAMLSS (n = 7,104).

Risk factor	%	NO distribution		BCCG distribution
Risk factor	%	Mean	SD	Median	COV	Skewness*
Female (ref)	52.8%	49.2	8.5	50	0.17	–0.41
Male	47.2%	49.1	8.2	50	0.17	–0.40
Unadjusted difference, % (SE)		–0.2 (0.4)	–3.9 (1.7)	–0.3 (0.4)	–3.5 (1.7)	0.02 (0.11)
Adjusted† difference, % (SE)		–0.6 (0.4)	–3.6 (1.7)	–0.7 (0.4)	–2.6 (1.7)	0.00 (0.11)

Non-manual (ref)	34.8%	50.1	7.9	51	0.16	–0.45
Manual social class	65.2%	48.7	8.5	49	0.17	–0.37
Unadjusted difference, % (SE)		–2.8 (0.4)	7.2 (1.8)	–2.9 (0.4)	10.9 (1.8)	–0.20 (0.12)
Adjusted† difference, % (SE)		–2.5 (0.4)	6.0 (1.8)	–2.7 (0.4)	9.8 (1.8)	–0.24 (0.12)

Physically active (ref)	72.4%	49.9	8.0	51	0.16	–0.38
Inactive	27.6%	47.3	8.9	48	0.19	–0.36
Unadjusted difference, % (SE)		–5.3 (0.5)	10.9 (1.9)	–5.2 (0.4)	16.2 (1.9)	–0.12 (0.12)
Adjusted† difference, % (SE)		–5.3 (0.5)	9.9 (1.9)	–5.1 (0.4)	15.2 (1.9)	–0.10 (0.12)

*

Skewness is estimated as the Box-Cox power (that is, the power required to transform the outcome to a normal distribution); differences are the absolute difference in Box-Cox power in each subgroup estimated by GAMLSS. GAMLSS estimates multiple distribution moments simultaneously; thus, differences may not exactly correspond to descriptive comparisons reported above.
†

Estimates mutually adjusted for sex, social class and physical inactivity.
NO: normal distribution; BCCG: Box-Cox Cole and Green distribution: SD: standard deviation; CoV: coefficient of variation; GAMLSS: Generalized Additive Models for Location, Scale and Shape; SE, standard error.

Body mass index

Males had higher mean BMI yet lower variability than females—see Figure 2 and Table 1. The SD for BMI was lower in males (4.6) than females (6.1) that is a 28% difference (difference in log(SD) *100). This matches the estimate obtained from GAMLSS—males had 27.6% (SE: 1.8%) less variability than females (Table 1).

In contrast, lower social class and physical inactivity were both associated with higher mean BMI and higher BMI variability (Figure 2 and Table 1). Those from lower social class households had 4% (SE 0.5%) higher mean BMI than those from non-manual classes, and 6.1% (1.9%) more variability. Physically inactive participants had 3.3% (0.6%) higher mean BMI and 13.5% (2.1%) more variability.

The GAMLSS results were similar with the BCCG distribution rather than NO (Table 1). That is, risk factors associated with higher mean BMI and higher SD were also associated with higher median BMI and higher CoV. Male sex and lower social class were both associated with less right skewness of the BMI distribution; the Box-Cox power was 0.5 (0.1) higher in males and 0.4 (0.1) higher for manual social class. Physical activity was not associated with outcome skewness.

Mental wellbeing – Warwick-Edinburgh mental wellbeing scale

There was little evidence of sex differences in mean wellbeing, while males had marginally less variability than females by 3.9% (1.7%). Lower social class and physical inactivity were both associated with lower mean yet higher variability (Figure 2 and Table 2). Those from lower social class households had a 2.8% (0.4%) lower mean yet 7.2% (1.8%) higher variability. Physically inactive participants had 5.3% (0.5%) lower mean yet 10.9% (1.9%) higher variability. These findings were similar in mutually adjusted analyses (Table 2).

The results were similar with the BCCG distribution (Table 2). There was evidence suggesting that lower social class was associated with less skewness in the wellbeing distribution; sex and physical activity were not associated with outcome skewness.

Comparison with quantile regression findings

For BMI, the associations of lower social class and physical inactivity were stronger at upper quantiles (Table 3; e.g., manual social class had 3.7 (0.6) higher BMI at the the median, and 4.9 (0.7) at the 75th); estimates at higher centiles were also estimated less precisely than at lower centiles (larger SE). In contrast sex differences were present at lower centiles but absent at the 75th centile. These findings corresponded with those from GAMLSS using BCCG, with all BMI centiles plotted by risk factor group (Figure 3). This comparison highlights the utility of GAMLSS—risk factor differences in the mean, variability, and skewness can each be quantified and thus visually depicted.

Table 3

Risk factors in relation to body mass index (BMI) and mental wellbeing (WEMWBS): percentage differences at multiple points of the outcome distribution estimated by quantile regression.

Outcome	Risk factor	25th centile	50th centile	75th centile
BMI @ Age 46	Male vs female	6.8 (0.5)	4.5 (0.6)	–0.8 (0.7)
	Father’s Class	3.7 (0.6)	3.7 (0.6)	4.9 (0.7)
	Exercise Level	1 (0.7)	3 (0.7)	4.3 (0.8)
WEMWBS @ Age 42	Sex	0 (0.7)	0 (0.5)	0 (0.3)
	Father’s Class	–4.5 (0.7)	–4 (0.5)	–1.8 (0.3)
	Exercise Level	–6.9 (0.5)	–6.1 (0.5)	–1.8 (0.5)

Note: results show the percentage difference (log-transformed x 100) in BMI or mental wellbeing (WEMWEBS; standard errors in parenthesis) at different centiles of the outcome distribution; estimates are mutually adjusted.

Figure 3

Download asset Open asset

Association between risk factors and BMI by BMI centile.

Plotted lines are calculated using GAMLSS estimation results of the entire outcome distribution; points at the 25th, 50th, and 75th centiles are estimated using quantile regression models. Marginal effects show the differences in outcome between each risk group across the outcome distribution.

For WEMWBS, the associations of lower social class and physical inactivity were also stronger at lower quantiles (Table 3), yet had larger standard errors. Sex was not associated with WEMWBS at any centile. These findings corresponded with those from GAMLSS (Figure 4).

Figure 4

Download asset Open asset

Association between risk factors and mental wellbeing (WEMWBS) by centile.

Plotted lines are calculated using GAMLSS estimation results of the entire outcome distribution; points at the 25th, 50th, and 75th centiles are estimated using quantile regression models. Marginal effects show the differences in outcome between each risk group across the outcome distribution.

Discussion

Using an underutilised analytical approach (GAMLSS), we present empirical evidence to support the idea that risk factors can relate to sizable differences in outcome variability, and even outcome skewness, in addition to differences in the outcome mean. Females had higher variability in BMI and mental wellbeing than males; lower social class and physical inactivity were each associated with higher variability in both BMI and mental wellbeing, despite having different directions of association with the mean (higher BMI yet lower mental wellbeing).

Our findings add to an emerging literature which has investigated associations between risk factors and outcome variability. Studies (Sun et al., 2020; Nakagawa et al., 2014; Pitt et al., 2020; Hohberg et al., 2020; Silbersdorff and Schneider, 2019; Silbersdorff et al., 2018; Beyerlein et al., 2008a) have reported that risk factors associated with higher means are also associated with higher outcome variability. For example, (Beyerlein et al., 2008a) found that multiple risk factors for high childhood BMI (such as more frequent television viewing and greater rapid infant weight gain) were related to both higher mean BMI and greater variability in BMI. However, previous studies have not utilised multiple outcomes or nationally representative samples, and have not systematically considered explanations for such findings or their implications.

Our findings help to reconcile findings from GAMLSS with those using quantile regression (Beyerlein et al., 2008a; Bann et al., 2020; Green and Rowe, 2020) which have reported stronger effect sizes for BMI risk factors at higher BMI centiles. This finding is both consistent with and helps explain the GAMLSS findings. For instance, lower social class and physical inactivity are related to higher BMI mean and variability, yet less BMI skewness; the net result is higher effect estimates at upper centiles which are less precisely estimated, as seen in quantile regression. While both analytical approaches have merit, GAMLSS has a number of attractive features for use in aetiological research: it enables each distribution moment to be separately investigated, and uses predetermined distribution families which enable computation of sparsely distributed variables.

Why are risk factors associated with differences in outcome variability? There are multiple possible explanations. First, risk factors may not be sufficient for an outcome to occur but rather only have a causal effect in the presence of other factors, for instance as posited in models such as the stress-diathesis model of mental health (Zuckerman, 1999). Such additional factors could also operate as effect modifiers which increase the strength of the risk factor. Factors such as genetic propensity to weight gain may for example modify the effect on weight gain of exposure to adverse socioeconomic circumstances (Tyrrell et al., 2017). Other environmental factors could operate similarly—such that the association between lower social class and higher BMI is weaker amongst those living in a local environment which is less ‘obesogenic’ (i.e. less conducive to physical inactivity and lower energy intake) (Drewnowski et al., 2007; Stafford et al., 2007). The net result of such divergent effects would be increased variability since the effects would range from zero to the upper bound of the effect. This explanation may also apply to mental wellbeing, given evidence for the myriad environmental (Ludwig et al., 2012; Wood et al., 2021) and genetic determinants (Luciano et al., 2018; de Moor et al., 2015) which could modify the effects observed in the current study.

Alternatively, between-person differences in confounding and/or measurement error may also lead to risk factors being associated with outcome variability. For example, in the present study physical activity was measured via a single item capturing reported activity of a moderate-vigorous intensity for at least 30 min per day; this is an imperfect reflection of the underlying exposure which may have a causal effect (e.g. total energy expenditure [across all intensities of activity] in the case of adiposity; (Bann et al., 2014) or time spent in specific activities conducive to wellbeing in the case of mental wellbeing [Black et al., 2015]). The net result would be higher variability in those reporting higher physical activity levels. A related issue is the extent to which the exposure captures the same ‘dose’ across participants in a given study. The physical activity measure used here counted the number of days that bouts of activity lasted at least 30 min; this likely reflects substantial variability in the level of exercise actually undertaken, thus leading to greater differences in outcome variability. This could partly explain the associations of lower social class with greater outcome variability, since social class is one dimension of socioeconomic position, such that there may be substantial between-person variation in other dimensions (e.g. parental education, income and/or wealth [Moulton et al., 2021; Galobardes et al., 2006]) which may each influence outcomes, leading to greater variability.

The study highlights the fact that analyses by GAMLSS and quantile regression lead to similar results at the selected quantiles of the outcome distribution—see Figures 3 and 4. However GAMLSS, by analysing the whole distribution, can in some cases provide more efficient estimates of the quantiles. Compare for example the standard errors of the median as obtained by the BCCG distribution (Tables 2 and 3) and quantile regression (Table 3); the GAMLSS standard errors are smaller.

Strengths and limitations

Strengths of this study include the analytical approach used (GAMLSS) to empirically investigate differences in outcome variability. While differences in variability can be informed by descriptive comparison (e.g. comparing standard deviations), GAMLSS additionally enables computation of estimates of precision and incorporates multivariable specifications (e.g. confounder or mediator adjustment; and inclusion of interaction terms). The use of the 1970 birth cohort data is an additional strength, enabling investigation of multiple risk factors and two largely orthogonal yet important continuous health outcomes. The national representation of this cohort is also advantageous—highly distorted sample selection can bias conventional epidemiological results (i.e. mean differences in outcomes) (Munafò et al., 2018), and may also bias comparisons of outcome variability.

The study also has limitations. As in all observational studies, causal inference is challenging despite the use of longitudinal data. Associations of social class at birth with outcomes for example could be explained by unmeasured confounding—this may include factors such as parental mental health. This is challenging to falsify empirically owing to a lack of such data collected before birth. In contrast, sex is randomly assigned at birth, and thus its associations with outcomes are unlikely to be confounded. However, sex differences in reporting may bias associations with mental wellbeing. Physical activity and mental wellbeing were ascertained at broadly the same age, so that associations between the two could be explained by reverse causality; existing evidence appears to suggest bi-directionality of links between physical activity and both outcomes (Pinto Pereira et al., 2014; Barone Gibbs et al., 2020). Finally, attrition led to lower power to precisely estimate smaller effect sizes (e.g. gender differences in mental wellbeing) or confirm null effects. Such attribution could potentially bias associations—those in worse health and adverse socioeconomic circumstances are disproportionately lost to follow-up (Mostafa and Wiggins, 2015; Mostafa et al., 2021). The focus of principled approaches to handle missing data in epidemiology has been on the main parameter of interest—typically beta coefficients in linear regression models—and further empirical work is required to investigate the potential implications of (non-random) missingness for the variability and other moments of the outcome distribution.

Potential implications

This study used an underutilised approach to empirically investigate associations between risk factors and outcome variability in a single cohort study. Thus, our findings require replication and extension in other datasets across other risk factors and health outcomes. Future studies should also seek to explain their findings, and where possible falsify potential explanations. Understanding how risk factors relate to and/or cause differences in outcome variability is not a standard part of epidemiological training, and it entails additional analytical and conceptual complexity. Thus, with greater application of these tools an emerging consensus on best practice should develop. In the first instance, we recommend both descriptive and formal investigation, and that analysts carefully consider the use of both absolute (e.g. SD) and relative (e.g. CoV) differences in variability. Since the CoV is fractional standard deviation (e.g. SD/mean or log SD), its suitability of use depends on the a priori anticipated relationship between the mean and variance.

In the context of randomised controlled trials, the finding of variability in treatment effects between individuals has been used to justify individualised approaches to treatment (personalised medicine). It is beyond the scope of the current article to discuss the tractability of this for complex outcomes in which treatment effects are unpredictable (Davey Smith, 2011). Trials are designed typically to detect only mean differences in outcomes (Senn, 2016); nevertheless, additionally presenting outcome variability before and after treatment would be helpful to better appraise intervention effects (Subramanian et al., 2018). GAMLSS provides a useful framework with which to formally investigate this, even where the homoscedasticity assumption does not hold (i.e. where risk factors or treatment groups differ in their outcome variance). Where there are multiple potential efficacious interventions, further studies could meta-analyse existing trials to identify the types of intervention which additionally reduce outcome variability.

Conclusion

We provide empirical support for the notion that risk factors or interventions can either reduce or increase variability in health outcomes. This finding is consistent with results from quantile regression analysis where a risk factor vs outcome association is stronger (or weaker) at higher outcome centiles. Such findings may be explained by heterogeneity in the causal effect of each exposure, by the influence of other (typically unmeasured) variables, and/or by measurement error. This underutilised approach to the analysis of continuously distributed outcomes may have broader utility in epidemiological, medical, and psychological sciences. Our tutorial and syntax content is designed to facilitate this.

Data availability

All data are available to download from the UK Data Archive: https://beta.ukdataservice.ac.uk/datacatalogue/series/series?id=200001.

References

(2014) Physical activity across adulthood in relation to fat and lean body mass in early old age: findings from the Medical Research Council National Survey of Health and Development, 1946-2010
American Journal of Epidemiology 179:1197–1207.
https://doi.org/10.1093/aje/kwu033
- PubMed
- Google Scholar
1. Bann D
2. Johnson W
3. Li L
4. Kuh D
5. Hardy R
(2018) Socioeconomic inequalities in childhood and adolescent body-mass index, weight, and height from 1953 to 2015: an analysis of four longitudinal, observational, British birth cohort studies
The Lancet. Public Health 3:e194–e203.
https://doi.org/10.1016/S2468-2667(18)30045-8
- PubMed
- Google Scholar
(2020) Determinants of the population health distribution: an illustration examining body mass index
International Journal of Epidemiology 49:731–737.
https://doi.org/10.1093/ije/dyz245
- PubMed
- Google Scholar
(2020) Bidirectional 10-year associations of accelerometer-measured sedentary behavior and activity categories with weight among middle-aged adults
International Journal of Obesity 44:559–567.
https://doi.org/10.1038/s41366-019-0443-8
- PubMed
- Google Scholar
(2008a) Alternative regression models to assess increase in childhood BMI
BMC Medical Research Methodology 8:59.
https://doi.org/10.1186/1471-2288-8-59
- PubMed
- Google Scholar
(2008b) Breastfeeding and childhood obesity: shift of the entire BMI distribution or only the upper parts?
Obesity 16:2730–2733.
https://doi.org/10.1038/oby.2008.432
- PubMed
- Google Scholar
1. Beyerlein A
(2014) Quantile regression-opportunities and challenges from a user’s perspective
American Journal of Epidemiology 180:330–331.
https://doi.org/10.1093/aje/kwu178
- PubMed
- Google Scholar
(2018) Association of BMI with overall and cause-specific mortality: a population-based cohort study of 3·6 million adults in the UK
The Lancet. Diabetes & Endocrinology 6:944–953.
https://doi.org/10.1016/S2213-8587(18)30288-2
- PubMed
- Google Scholar
1. Black SV
2. Cooper R
3. Martin KR
4. Brage S
5. Kuh D
6. Stafford M
(2015) Physical Activity and Mental Well-being in a Cohort Aged 60-64 Years
American Journal of Preventive Medicine 49:172–180.
https://doi.org/10.1016/j.amepre.2015.03.009
- PubMed
- Google Scholar
(2019) Assessment of Bidirectional Relationships Between Physical Activity and Depression Among Adults: A 2-Sample Mendelian Randomization Study
JAMA Psychiatry 76:399–408.
https://doi.org/10.1001/jamapsychiatry.2018.4175
- PubMed
- Google Scholar
1. Cole TJ
2. Stanojevic S
3. Stocks J
4. Coates AL
5. Hankinson JL
6. Wade AM
(2009) Age- and size-related reference ranges: a case study of spirometry through childhood and adulthood
Statistics in Medicine 28:880–898.
https://doi.org/10.1002/sim.3504
- PubMed
- Google Scholar
Report
(2017)
Health Survey for England 2016: Adult overweight and obesity

Health and Social Care Information Centre.
- Google Scholar
1. Davey Smith G
(2011) Epidemiology, epigenetics and the “Gloomy Prospect”: embracing randomness in population health research and practice
International Journal of Epidemiology 40:537–562.
https://doi.org/10.1093/ije/dyr117
- PubMed
- Google Scholar
1. de Moor MHM
2. van den Berg SM
3. Verweij KJH
4. Krueger RF
5. Luciano M
6. Arias Vasquez A
7. Matteson LK
8. Derringer J
9. Esko T
10. Amin N
11. Gordon SD
12. Hansell NK
13. Hart AB
14. Seppälä I
15. Huffman JE
16. Konte B
17. Lahti J
18. Lee M
19. Miller M
20. Nutile T
21. Tanaka T
22. Teumer A
23. Viktorin A
24. Wedenoja J
25. Abecasis GR
26. Adkins DE
27. Agrawal A
28. Allik J
29. Appel K
30. Bigdeli TB
31. Busonero F
32. Campbell H
33. Costa PT
34. Davey Smith G
35. Davies G
36. de Wit H
37. Ding J
38. Engelhardt BE
39. Eriksson JG
40. Fedko IO
41. Ferrucci L
42. Franke B
43. Giegling I
44. Grucza R
45. Hartmann AM
46. Heath AC
47. Heinonen K
48. Henders AK
49. Homuth G
50. Hottenga JJ
51. Iacono WG
52. Janzing J
53. Jokela M
54. Karlsson R
55. Kemp JP
56. Kirkpatrick MG
57. Latvala A
58. Lehtimäki T
59. Liewald DC
60. Madden PAF
61. Magri C
62. Magnusson PKE
63. Marten J
64. Maschio A
65. Medland SE
66. Mihailov E
67. Milaneschi Y
68. Montgomery GW
69. Nauck M
70. Ouwens KG
71. Palotie A
72. Pettersson E
73. Polasek O
74. Qian Y
75. Pulkki-Råback L
76. Raitakari OT
77. Realo A
78. Rose RJ
79. Ruggiero D
80. Schmidt CO
81. Slutske WS
82. Sorice R
83. Starr JM
84. St Pourcain B
85. Sutin AR
86. Timpson NJ
87. Trochet H
88. Vermeulen S
89. Vuoksimaa E
90. Widen E
91. Wouda J
92. Wright MJ
93. Zgaga L
94. Porteous D
95. Minelli A
96. Palmer AA
97. Rujescu D
98. Ciullo M
99. Hayward C
100. Rudan I
101. Metspalu A
102. Kaprio J
103. Deary IJ
104. Räikkönen K
105. Wilson JF
106. Keltikangas-Järvinen L
107. Bierut LJ
108. Hettema JM
109. Grabe HJ
110. van Duijn CM
111. Evans DM
112. Schlessinger D
113. Pedersen NL
114. Terracciano A
115. McGue M
116. Penninx B
117. Martin NG
118. Boomsma DI
119. Genetics of Personality Consortium
(2015) Meta-analysis of Genome-wide Association Studies for Neuroticism, and the Polygenic Association With Major Depressive Disorder
JAMA Psychiatry 72:642–650.
https://doi.org/10.1001/jamapsychiatry.2015.0554
- PubMed
- Google Scholar
(2007) Disparities in obesity rates: analysis by ZIP code area
Social Science & Medicine 65:2458–2463.
https://doi.org/10.1016/j.socscimed.2007.07.001
- PubMed
- Google Scholar
1. Elliott J
2. Shepherd P
(2006) Cohort profile: 1970 British Birth Cohort (BCS70)
International Journal of Epidemiology 35:836–843.
https://doi.org/10.1093/ije/dyl174
- PubMed
- Google Scholar
1. Flegal KM
2. Troiano RP
(2000) Changes in the distribution of body mass index of adults and children in the US population
International Journal of Obesity and Related Metabolic Disorders 24:807–818.
https://doi.org/10.1038/sj.ijo.0801232
- PubMed
- Google Scholar
(2006) Indicators of socioeconomic position (part 1)
Journal of Epidemiology and Community Health 60:7–12.
https://doi.org/10.1136/jech.2004.023531
- PubMed
- Google Scholar
1. Green MA
2. Rowe F
(2020) Explaining the widening distribution of Body Mass Index: A decomposition analysis of trends for England, 2002–2004 and 2012–2014
Area 53:362–372.
https://doi.org/10.1111/area.12675
- Google Scholar
(2020) Treatment effects beyond the mean using distributional regression: Methods and guidance
PLOS ONE 15:e0226514.
https://doi.org/10.1371/journal.pone.0226514
- PubMed
- Google Scholar
1. Hyde JS
(2014) Gender similarities and differences
Annual Review of Psychology 65:373–398.
https://doi.org/10.1146/annurev-psych-010213-115057
- PubMed
- Google Scholar
(2019) Physical Activity and the Prevention of Weight Gain in Adults: A Systematic Review
Medicine and Science in Sports and Exercise 51:1262–1269.
https://doi.org/10.1249/MSS.0000000000001938
- PubMed
- Google Scholar
1. Johnson W
2. Li L
3. Kuh D
4. Hardy R
(2015) How Has the Age-Related Process of Overweight or Obesity Development Changed over Time? Co-ordinated Analyses of Individual Participant Data from Five United Kingdom Birth Cohorts
PLOS Medicine 12:e1001828.
https://doi.org/10.1371/journal.pmed.1001828
- PubMed
- Google Scholar
1. Keyes CLM
(2002) The Mental Health Continuum: From Languishing to Flourishing in Life
Journal of Health and Social Behavior 43:207.
https://doi.org/10.2307/3090197
- Google Scholar
Book
1. Keyes KM
2. Galea S
(2016) Population Health Science
Oxford University Press.
https://doi.org/10.1093/med/9780190459376.001.0001
- Google Scholar
1. Lewontin RC
(1966) On the Measurement of Relative Variability
Systematic Zoology 15:141.
https://doi.org/10.2307/2411632
- Google Scholar
1. Luciano M
2. Hagenaars SP
3. Davies G
4. Hill WD
5. Clarke T-K
6. Shirali M
7. Harris SE
8. Marioni RE
9. Liewald DC
10. Fawns-Ritchie C
11. Adams MJ
12. Howard DM
13. Lewis CM
14. Gale CR
15. McIntosh AM
16. Deary IJ
(2018) Association analysis in over 329,000 individuals identifies 116 independent variants influencing neuroticism
Nature Genetics 50:6–11.
https://doi.org/10.1038/s41588-017-0013-8
- PubMed
- Google Scholar
1. Ludwig J
2. Duncan GJ
3. Gennetian LA
4. Katz LF
5. Kessler RC
6. Kling JR
7. Sanbonmatsu L
(2012) Neighborhood effects on the long-term well-being of low-income adults
Science 337:1505–1510.
https://doi.org/10.1126/science.1224648
- PubMed
- Google Scholar
1. Luedtke A
2. Kessler RC
(2021) New Directions in Research on Heterogeneity of Treatment Effects for Major Depression
JAMA Psychiatry 78:478–480.
https://doi.org/10.1001/jamapsychiatry.2020.4489
- PubMed
- Google Scholar
1. Mostafa T
2. Wiggins RD
(2015) The impact of attrition and non-response in birth cohort studies: a need to incorporate missingness strategies
Longitudinal and Life Course Studies 6:131–146.
https://doi.org/10.14301/llcs.v6i2.312
- Google Scholar
(2021) Missing at random assumption made more plausible: evidence from the 1958 British birth cohort
Journal of Clinical Epidemiology 136:44–54.
https://doi.org/10.1016/j.jclinepi.2021.02.019
- PubMed
- Google Scholar
(2021) Parental Wealth and Children’s Cognitive Ability, Mental, and Physical Health: Evidence From the UK Millennium Cohort Study
Child Development 92:115–123.
https://doi.org/10.1111/cdev.13413
- PubMed
- Google Scholar
(2018) Collider scope: when selection bias can substantially influence observed associations
International Journal of Epidemiology 47:226–235.
https://doi.org/10.1093/ije/dyx206
- PubMed
- Google Scholar
1. Nakagawa S
2. Poulin R
3. Mengersen K
4. Reinhold K
5. Engqvist L
6. Lagisz M
7. Senior AM
8. O’Hara RB
(2014) Meta‐analysis of variation: ecological and evolutionary applications and beyond
Methods in Ecology and Evolution 6:143–152.
https://doi.org/10.1111/2041-210X.12309
- Google Scholar
(2014) Depressive symptoms and physical activity during 3 decades in adult life: bidirectional associations in a prospective cohort study
JAMA Psychiatry 71:1373–1380.
https://doi.org/10.1001/jamapsychiatry.2014.1240
- PubMed
- Google Scholar
(2020) Modeling risks from natural hazards with generalized additive models for location, scale and shape
Journal of Environmental Management 275:111075.
https://doi.org/10.1016/j.jenvman.2020.111075
- PubMed
- Google Scholar
(2009) Common disorders are quantitative traits
Nature Reviews. Genetics 10:872–878.
https://doi.org/10.1038/nrg2670
- PubMed
- Google Scholar
Book
1. Porta M
(2008)
A Dictionary of Epidemiology

Oxford, UK: Oxford University Press.
- Google Scholar
1. Rigby RA
2. Stasinopoulos DM
(2005) Generalized additive models for location, scale and shape (with discussion)
Journal of the Royal Statistical Society 54:507–554.
https://doi.org/10.1111/j.1467-9876.2005.00510.x
- Google Scholar
Book
(2019) Distributions for Modeling Location, Scale, and Shape
Taylor Francis Group.
https://doi.org/10.1201/9780429298547
- Google Scholar
1. Rose G
(2001) Sick individuals and sick populations
International Journal of Epidemiology 30:427–432.
https://doi.org/10.1093/ije/30.3.427
- PubMed
- Google Scholar
1. Senese LC
2. Almeida ND
3. Fath AK
4. Smith BT
5. Loucks EB
(2009) Associations between childhood socioeconomic position and adulthood obesity
Epidemiologic Reviews 31:21–51.
https://doi.org/10.1093/epirev/mxp006
- PubMed
- Google Scholar
1. Senn S
(2016) Mastering variation: variance components and personalised medicine
Statistics in Medicine 35:966–977.
https://doi.org/10.1002/sim.6739
- PubMed
- Google Scholar
(2018) Reconsidering the income-health relationship using distributional regression
Health Economics 27:1074–1088.
https://doi.org/10.1002/hec.3656
- PubMed
- Google Scholar
1. Silbersdorff A
2. Schneider KS
(2019) Distributional Regression Techniques in Socioeconomic Research on the Inequality of Health with an Application on the Relationship between Mental Health and Income
International Journal of Environmental Research and Public Health 16:20.
https://doi.org/10.3390/ijerph16204009
- PubMed
- Google Scholar
(2021) Objective and subjective childhood socioeconomic disadvantage and incident depression in adulthood: a longitudinal analysis in the Sister Study
Social Psychiatry and Psychiatric Epidemiology 56:1201–1210.
https://doi.org/10.1007/s00127-020-02013-5
- PubMed
- Google Scholar
(2007) Pathways to obesity: identifying local, modifiable determinants of physical activity and diet
Social Science & Medicine 65:1882–1897.
https://doi.org/10.1016/j.socscimed.2007.05.042
- PubMed
- Google Scholar
1. Stasinopoulos DM
2. Rigby RA
(2007) Generalized Additive Models for Location Scale and Shape (GAMLSS) in R
Journal of Statistical Software 23:1–46.
https://doi.org/10.18637/jss.v023.i07
- Google Scholar
1. Stringhini S
2. Carmeli C
3. Jokela M
4. Avendaño M
5. Muennig P
6. Guida F
7. Ricceri F
8. d’Errico A
9. Barros H
10. Bochud M
11. Chadeau-Hyam M
12. Clavel-Chapelon F
13. Costa G
14. Delpierre C
15. Fraga S
16. Goldberg M
17. Giles GG
18. Krogh V
19. Kelly-Irving M
20. Layte R
21. Lasserre AM
22. Marmot MG
23. Preisig M
24. Shipley MJ
25. Vollenweider P
26. Zins M
27. Kawachi I
28. Steptoe A
29. Mackenbach JP
30. Vineis P
31. Kivimäki M
32. LIFEPATH consortium
(2017) Socioeconomic status and the 25 × 25 risk factors as determinants of premature mortality: a multicohort study and meta-analysis of 1·7 million men and women
Lancet 389:1229–1237.
https://doi.org/10.1016/S0140-6736(16)32380-7
- PubMed
- Google Scholar
(2018) The “average” treatment effect: A construct ripe for retirement. A commentary on Deaton and Cartwright
Social Science & Medicine 210:77–82.
https://doi.org/10.1016/j.socscimed.2018.04.027
- PubMed
- Google Scholar
1. Sun J
2. Covaci A
3. Bustnes JO
4. Jaspers VLB
5. Helander B
6. Bårdsen BJ
7. Boertmann D
8. Dietz R
9. Labansen AL
10. Lepoint G
11. Schulz R
12. Malarvannan G
13. Sonne C
14. Thorup K
15. Tøttrup AP
16. Zubrod JP
17. Eens M
18. Eulaers I
(2020) Temporal trends of legacy organochlorines in different white-tailed eagle (Haliaeetus albicilla) subpopulations: A retrospective investigation using archived feathers
Environment International 138:105618.
https://doi.org/10.1016/j.envint.2020.105618
- PubMed
- Google Scholar
1. Tennant R
2. Hiller L
3. Fishwick R
4. Platt S
5. Joseph S
6. Weich S
7. Parkinson J
8. Secker J
9. Stewart-Brown S
(2007) The Warwick-Edinburgh Mental Well-being Scale (WEMWBS): development and UK validation
Health and Quality of Life Outcomes 5:63.
https://doi.org/10.1186/1477-7525-5-63
- PubMed
- Google Scholar
1. Truby H
2. Baic S
3. deLooy A
4. Fox KR
5. Livingstone MBE
6. Logan CM
7. Macdonald IA
8. Morgan LM
9. Taylor MA
10. Millward DJ
(2006) Randomised controlled trial of four commercial weight loss programmes in the UK: initial findings from the BBC “diet trials.”
BMJ 332:1309–1314.
https://doi.org/10.1136/bmj.38833.411204.80
- PubMed
- Google Scholar
1. Tyrrell J
2. Wood AR
3. Ames RM
4. Yaghootkar H
5. Beaumont RN
6. Jones SE
7. Tuke MA
8. Ruth KS
9. Freathy RM
10. Davey Smith G
11. Joost S
12. Guessous I
13. Murray A
14. Strachan DP
15. Kutalik Z
16. Weedon MN
17. Frayling TM
(2017) Gene-obesogenic environment interactions in the UK Biobank study
International Journal of Epidemiology 46:559–575.
https://doi.org/10.1093/ije/dyw337
- PubMed
- Google Scholar
(2006) Prevalence of visual impairment in the United States
JAMA 295:2158–2163.
https://doi.org/10.1001/jama.295.18.2158
- PubMed
- Google Scholar
1. Wierenga LM
2. Doucet GE
3. Dima D
4. Agartz I
5. Aghajani M
6. Akudjedu TN
7. Albajes-Eizagirre A
8. Alnaes D
9. Alpert KI
10. Andreassen OA
11. Anticevic A
12. Asherson P
13. Banaschewski T
14. Bargallo N
15. Baumeister S
16. Baur-Streubel R
17. Bertolino A
18. Bonvino A
19. Boomsma DI
20. Borgwardt S
21. Bourque J
22. den Braber A
23. Brandeis D
24. Breier A
25. Brodaty H
26. Brouwer RM
27. Buitelaar JK
28. Busatto GF
29. Calhoun VD
30. Canales-Rodríguez EJ
31. Cannon DM
32. Caseras X
33. Castellanos FX
34. Chaim-Avancini TM
35. Ching CR
36. Clark VP
37. Conrod PJ
38. Conzelmann A
39. Crivello F
40. Davey CG
41. Dickie EW
42. Ehrlich S
43. Van’t Ent D
44. Fisher SE
45. Fouche J-P
46. Franke B
47. Fuentes-Claramonte P
48. de Geus EJ
49. Di Giorgio A
50. Glahn DC
51. Gotlib IH
52. Grabe HJ
53. Gruber O
54. Gruner P
55. Gur RE
56. Gur RC
57. Gurholt TP
58. de Haan L
59. Haatveit B
60. Harrison BJ
61. Hartman CA
62. Hatton SN
63. Heslenfeld DJ
64. van den Heuvel OA
65. Hickie IB
66. Hoekstra PJ
67. Hohmann S
68. Holmes AJ
69. Hoogman M
70. Hosten N
71. Howells FM
72. Hulshoff Pol HE
73. Huyser C
74. Jahanshad N
75. James AC
76. Jiang J
77. Jönsson EG
78. Joska JA
79. Kalnin AJ
80. Klein M
81. Koenders L
82. Kolskår KK
83. Krämer B
84. Kuntsi J
85. Lagopoulos J
86. Lazaro L
87. Lebedeva IS
88. Lee PH
89. Lochner C
90. Machielsen MW
91. Maingault S
92. Martin NG
93. Martínez-Zalacaín I
94. Mataix-Cols D
95. Mazoyer B
96. McDonald BC
97. McDonald C
98. McIntosh AM
99. McMahon KL
100. McPhilemy G
101. van der Meer D
102. Menchón JM
103. Naaijen J
104. Nyberg L
105. Oosterlaan J
106. Paloyelis Y
107. Pauli P
108. Pergola G
109. Pomarol-Clotet E
110. Portella MJ
111. Radua J
112. Reif A
113. Richard G
114. Roffman JL
115. Rosa PG
116. Sacchet MD
117. Sachdev PS
118. Salvador R
119. Sarró S
120. Satterthwaite TD
121. Saykin AJ
122. Serpa MH
123. Sim K
124. Simmons A
125. Smoller JW
126. Sommer IE
127. Soriano-Mas C
128. Stein DJ
129. Strike LT
130. Szeszko PR
131. Temmingh HS
132. Thomopoulos SI
133. Tomyshev AS
134. Trollor JN
135. Uhlmann A
136. Veer IM
137. Veltman DJ
138. Voineskos A
139. Völzke H
140. Walter H
141. Wang L
142. Wang Y
143. Weber B
144. Wen W
145. West JD
146. Westlye LT
147. Whalley HC
148. Williams SC
149. Wittfeld K
150. Wolf DH
151. Wright MJ
152. Yoncheva YN
153. Zanetti MV
154. Ziegler GC
155. de Zubicaray GI
156. Thompson PM
157. Crone EA
158. Frangou S
159. Tamnes CK
160. Karolinska Schizophrenia Project (KaSP) Consortium
(2022) Greater male than female variability in regional brain structure across the lifespan
Human Brain Mapping 43:470–499.
https://doi.org/10.1002/hbm.25204
- PubMed
- Google Scholar
1. Wood N
2. Bann D
3. Hardy R
4. Gale C
5. Goodman A
6. Crawford C
7. Stafford M
(2017) Childhood socioeconomic position and adult mental wellbeing: Evidence from four British birth cohort studies
PLOS ONE 12:e0185798.
https://doi.org/10.1371/journal.pone.0185798
- PubMed
- Google Scholar
1. Wood N
2. Hardy R
3. Bann D
4. Gale C
5. Stafford M
(2021) Childhood correlates of adult positive mental well-being in three British longitudinal studies
Journal of Epidemiology and Community Health 75:177–184.
https://doi.org/10.1136/jech-2019-213709
- PubMed
- Google Scholar
Book
1. Zuckerman M
(1999) Diathesis-Stress Models
American Psychological Association.
https://doi.org/10.1037/10316-000
- Google Scholar

Article and author information

Author details

David Bann

Centre for Longitudinal Studies, Social Research Institute, University College London, London, United Kingdom

Contribution
Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Visualization, Writing - original draft, Writing - review and editing

For correspondence
david.bann@ucl.ac.uk

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6454-626X
Liam Wright

Centre for Longitudinal Studies, Social Research Institute, University College London, London, United Kingdom

Contribution
Formal analysis, Investigation, Methodology, Resources, Software, Visualization, Writing - review and editing

Competing interests
No competing interests declared
Tim J Cole

Great Ormond Street Institute of Child Health, University College London, London, United Kingdom

Contribution
Conceptualization, Investigation, Methodology, Visualization, Writing - review and editing

Competing interests
No competing interests declared

Funding

Medical Research Council (MR/V002147/1)

David Bann
Liam Wright

Economic and Social Research Council (ES/M001660/1)

David Bann

Wellcome Trust (HOP001/1025)

David Bann

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Human subjects: This paper uses secondary data analysis using data from a cohort study which has been followed-up since birth in 1970. Cohort members provided informed consent, and the study received full ethical approval - most recently from the NRES Committee South East Coast-Brighton and Sussex.

Version history

Preprint posted: March 31, 2021 (view preprint)
Received: July 20, 2021
Accepted: January 4, 2022
Accepted Manuscript published: January 5, 2022 (version 1)
Version of Record published: January 26, 2022 (version 2)

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

1,344

views
179

downloads
7

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Article PDF

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

David Bann
Liam Wright
Tim J Cole

(2022)

Risk factors relate to the variability of health outcomes as well as the mean: A GAMLSS tutorial

eLife 11:e72357.

https://doi.org/10.7554/eLife.72357

Categories and tags

Research organism

Human

Share this article

Cite this article

Simulated data for three interventions each having the same effect on the mean, but different effects on the variability (middle panel) and skewness (bottom panel).

Kernel density plots for body mass index and mental wellbeing, stratified by risk factor group.

Risk factors in relation to body mass index: differences in mean, variability and skewness estimated by GAMLSS (n = 6007).

Risk factors in relation to mental wellbeing (WEMWBS): differences in mean, variability and skewness estimated by GAMLSS (n = 7,104).

Risk factors in relation to body mass index (BMI) and mental wellbeing (WEMWBS): percentage differences at multiple points of the outcome distribution estimated by quantile regression.

Association between risk factors and BMI by BMI centile.

Association between risk factors and mental wellbeing (WEMWBS) by centile.

Author details

David Bann

Contribution

For correspondence

Competing interests

Liam Wright

Contribution

Competing interests

Tim J Cole

Contribution

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

Further reading