<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>My, how you've grown: A practical guide to modeling size transitions for integral projection model ( &lt;scp&gt;IPM&lt;/scp&gt; ) applications</title></titleStmt>
			<publicationStmt>
				<publisher>Ecological Society of America</publisher>
				<date>05/01/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10609729</idno>
					<idno type="doi">10.1002/ecy.70088</idno>
					<title level='j'>Ecology</title>
<idno>0012-9658</idno>
<biblScope unit="volume">106</biblScope>
<biblScope unit="issue">5</biblScope>					

					<author>Tom_E X Miller</author><author>Stephen P Ellner</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<title>Abstract</title> <p>Integral projection models (IPMs) are widely used for studying continuously size‐structured populations. IPMs require a growth sub‐model that describes the probability of future size conditional on current size and any covariates. Most IPM studies assume that this distribution is Gaussian, despite calls for non‐Gaussian models that accommodate skewness and excess kurtosis. We provide a general workflow for accommodating non‐Gaussian growth patterns while retaining important covariates and random effects. Our approach emphasizes visual diagnostics from pilot Gaussian models and quantile‐based metrics of skewness and kurtosis that guide selection of a non‐Gaussian alternative, if necessary. Across six case studies, skewness and excess kurtosis were common features of growth data, and non‐Gaussian models consistently generated simulated data that were more consistent with real data than pilot Gaussian models. However, effects of “improved” growth modeling on IPM results were moderate to weak and differed in direction or magnitude between different outputs from the same model. Using tools not available when IPMs were first developed, it is now possible to fit non‐Gaussian models to growth data without sacrificing ecological complexity. Doing so, as guided by careful interrogation of the data, will result in models that better represent the populations for which they are intended.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>INTRODUCTION</head><p>Structured demographic models-matrix and integral projection models (MPMs and IPMs)-are powerful tools for data-driven modeling of population and community dynamics. In contrast to MPMs for populations with discrete structure (life stage, age class, etc.), IPMs <ref type="bibr">(Easterling et al., 2000)</ref> accommodate populations structured by continuous state variables, most commonly size. A related innovation of the IPM framework is its emphasis on regression-based modeling for parameter estimation, which often carries important advantages for making the most of hard-won data <ref type="bibr">(Ellner et al., 2022)</ref>.</p><p>A standard workflow allows ecologists to assemble an IPM from data using familiar regression tools to describe growth, survival, reproduction, and other demographic transitions as functions of size <ref type="bibr">(Coulson, 2012;</ref><ref type="bibr">Ellner et al., 2016)</ref>. The relative ease of regression analyses, accommodating covariates (e.g., environmental factors, experimental treatments) and complex variance structures (e.g., random effects, correlated errors), has facilitated a growing IPM literature that examines how biotic or abiotic factors affect population dynamics (e.g., <ref type="bibr">Louthan et al., 2022;</ref><ref type="bibr">Ozgul et al., 2010)</ref> and explores the consequences of demographic heterogeneity associated with spatial, temporal, and individual variation (e.g., <ref type="bibr">Compagnoni et al., 2016;</ref><ref type="bibr">Crone, 2016;</ref><ref type="bibr">Plard et al., 2018)</ref>. The vital rate regressions (or "sub-models") are the bridge between the individual-level data and the population-level model and its predictions; it is important to get those right.</p><p>Compared with other vital rates, growth is special. The survival and reproduction sub-models only need to provide a single predicted value as functions of size (we use "size" as the name for whatever continuous variable defines the population structure). But the growth model must specify the full probability distribution of subsequent size conditional on initial size, defining the growth "kernel" G z 0 , z &#240; &#222; that gives the probability density of future size z 0 at time t + 1 conditional on current size z at time t. Whenever survival and reproduction are size-dependent, the entire distribution of size transitions can strongly influence IPM predictions because it governs how frequently size changes are much greater or much lower than average. <ref type="bibr">Easterling et al. (2000)</ref> provided the original template for modeling size transitions in IPMs. They first tried simple linear regression, assuming normally distributed size changes with constant variance. Because the residuals from this regression exhibited nonconstant variance, they used a two-step approach to estimate the size dependence in mean squared residuals (better options soon became available, such as the lme function in R). However, even after accounting for nonconstant variance, growth data may still be non-normal. Size transitions are often skewed such that large decreases are more common than large increases <ref type="bibr">(Peterson et al., 2019;</ref><ref type="bibr">Salguero-G omez &amp; Casper, 2010)</ref>, or vice versa <ref type="bibr">(Stubberud et al., 2019)</ref>. Size transitions may also exhibit excess kurtosis ("fat tails"), where extreme growth or shrinkage is more common than predicted by the tails of the Normal distribution <ref type="bibr">(H&#233;rault et al., 2011)</ref>.</p><p>The observation that the Normal (or Gaussian) distribution may poorly describe size transitions in real organisms has been made before, and several studies have emphasized that alternative distributions should be explored <ref type="bibr">(Easterling et al., 2000;</ref><ref type="bibr">Peterson et al., 2019;</ref><ref type="bibr">Rees et al., 2014;</ref><ref type="bibr">Williams et al., 2012)</ref>. For example, <ref type="bibr">Peterson et al. (2019)</ref> showed that skewness in size transitions could be modeled through beta regression on transformed data (for reasons we describe below, this approach also has some drawbacks), or by fitting a skewed Normal distribution. They showed that incorporating skew could have important consequences for model-based inferences, and concluded that "testing of alternative distributions for growth &#8230; [should] become standard in the construction of size-structured population models." Nonetheless, default use of Gaussian growth distributions (often with nonconstant variance) remains the standard practice. The general state-of-the-art in the literature appears to remain where it was 20 or so years ago, using the default Gaussian model without examining critically whether or not it actually describes the data well. We are guilty of this, ourselves.</p><p>The persistence of Gaussian growth models is understandable. Popular packages such as lme4 <ref type="bibr">(Bates et al., 2015)</ref>, mgcv <ref type="bibr">(Wood, 2017)</ref>, and MCMCglmm <ref type="bibr">(Hadfield, 2010)</ref> make it easy to fit growth models with potentially complex fixed-and random-effect structures, but the possible distributions of continuous responses are limited and default to Gaussian. Abandoning these convenient tools for the sake of more flexible growth modeling means, it may seem, sacrificing the flexibility to model diverse sources of demographic variation, some of which may be the motivation driving the study in the first place.</p><p>Our goal here is to present and illustrate a practical "recipe" that moves growth modeling past the standards set over 20 years ago. Using software tools that are now readily accessible, ecologists can escape the apparent trade-off between realistically modeling non-Gaussian size transitions and flexibly including multiple covariates and random effects. As with any recipe, users may need to make substitutions or add ingredients to suit their needs. We emphasize graphical diagnostics for developing and evaluating growth models, rather than a process centered on statistical tests or model selection. Through empirical case studies, we demonstrate how tools that were nonexistent or not readily available when IPMs first came into use now make it straightforward and relatively easy to identify when the default model is a poor fit to the data, and to then choose and fit a better growth model that is no harder to use in practice. We illustrate our approach by revisiting three published case studies (and three additional case studies in Appendix S3), including examples from our own previous work. In each case, the Gaussian assumption does not stand up to close scrutiny. We illustrate how we could have done better, and the consequences of "doing better" for our ecological inferences. All analyses were carried out in R (R Core Team, 2022) version 4.0 or higher and may be reproduced from publicly available code and data (see Data availability statement).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>FLEXIBLE GROWTH MODELING</head><p>The modeling process that we suggest runs as follows (Figure <ref type="figure">1</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fit a "pilot" model assuming a Gaussian distribution, but allowing for nonconstant variance</head><p>This step is familiar to most IPM users, as it is the start and end of the standard approach. It may include model selection to identify which treatment effects or environmental drivers affect the mean and/or variance of future size. Nonconstant variance is often fitted in a two-stage process, first fitting mean growth assuming constant variance, then doing a regression relating the squared residuals to initial size or the fitted mean of subsequent size. Fitting mean and variance simultaneously as functions of initial size, as can be done with R packages mgcv and nlme, is advantageous when possible because incorrectly assuming constant variance can affect model selection for the mean. We illustrate both one-step and two-step approaches in the case studies below.</p><p>Allowing nonconstant variance removes the need for transforming the data to stabilize growth variance. Transformation may still be useful if it does not create new problems such as making some state-fate relationships highly nonlinear. In particular, log-transformation often reduces or eliminates heteroskedasticity in growth data <ref type="bibr">(Ellner et al., 2016)</ref> and also helps avoid eviction at small sizes <ref type="bibr">(Williams et al., 2012)</ref>.</p><p>The fitted mean and variance functions should be checked before going any further. If they are perfectly correct, standardized residuals (residuals scaled by the SD) will have zero mean and unit variance overall, and will exhibit no trends in mean or variance with initial size or fitted mean value. However, estimates of the mean and variance functions are somewhat smoothed because of the inescapable bias-variance trade-off, so scaled residuals will retain some variation in location and scale. Given enough data, statistical tests will detect that variation. So instead, we take for granted the presence of F I G U R E 1 Recommended steps in growth modeling (left) and guide to common non-Gaussian distributions of size x for x &#8477; that can accommodate different combinations of skewness and kurtosis (right). All of these distributions (often including multiple versions or parameterizations of each) are available in the R package gamlss.dist, except for the skewed generalized t, which is available in the package sgt <ref type="bibr">(Davis, 2015)</ref>.</p><p>trends and assess their importance by fitting nonparametric (NP) spline regression models for residuals (trend in mean) and absolute residuals (trend in variance) as a function of initial size or fitted value. The mean and variance functions can be accepted if the regression curves for the scaled residuals are nearly flat.</p><p>Use graphical diagnostics to identify if and how the standardized residuals deviate from Gaussian, and to choose a more appropriate distribution</p><p>If the Gaussian growth model is valid, the standardized residuals should be Gaussian with zero skewness or excess kurtosis. Growth data may deviate from this in many ways, and the nature of the deviations can guide the search for a better distribution. Tests such as the D'Agostino test of skewness <ref type="bibr">(D'Agostino, 1970)</ref> and the Anscombe-Glynn test of kurtosis <ref type="bibr">(Anscombe &amp; Glynn, 1983</ref>) can be used to diagnose whether the standardized residuals, in aggregate, deviate from normality <ref type="bibr">(Komsta &amp; Novomestky, 2015)</ref>. However, the aggregate distribution may be misleading if skewness or kurtosis vary with size or other covariates. Skewness changing from positive at small sizes to negative at large sizes might produce zero overall skewness, but really requires a distribution that can allow both positive and negative skew, such as the skewed Normal or Johnson S U distributions. Alternatively, growth data may exhibit leptokurtosis (in which case the t distribution may be a good choice) or may shift from platykurtosis to leptokurtosis depending on initial size (in which case the power exponential distribution may be a good choice). It is therefore essential to visualize trends in distribution properties with respect to either initial size, or expected future size for models with multiple covariates. Figure <ref type="figure">1</ref> includes guidance on how the skew and kurtosis properties of the standardized residuals suggest options for an appropriate growth distribution. In our case studies we exploit the many distributions in the gamlss R package <ref type="bibr">(Stasinopoulos &amp; Rigby, 2007)</ref>, but other distribution families can be used.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Refit the growth model using the chosen distribution</head><p>In models with multiple covariates and/or random effects, each potentially affecting several distribution parameters, "refit the model" could entail a massive model selection process to identify the "best" non-Gaussian model. With so many options, model uncertainty may be overwhelming and over-fitting becomes a significant risk even when precautions against it are taken.</p><p>We therefore argue for adopting a more modest goal: remedy the defects evident in the standardized residuals of the Gaussian model. This recommendation is based on the finding that parameter estimation using Gaussian regression models is generally robust to deviations from normality of the residuals <ref type="bibr">(Schielzeth et al., 2020)</ref>. That is, the fitted mean of the Gaussian model (as a function of covariates) is probably a very good approximation for the fitted mean in the corresponding non-Gaussian model-and if it is not, the next step in the modeling process will catch that. The functional forms for skew and kurtosis of the non-Gaussian model can be guided by the qualitative features of the graphical diagnostics (e.g., that skewness switches from positive to negative with increasing size). As we demonstrate below, the mean and SD functions can often be carried over exactly from the pilot Gaussian model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Evaluate the final growth model through graphical diagnostics comparing simulated and real growth data</head><p>A good model will generate simulated data that look like the real data. Again, it is important to inspect the properties of simulated data as a function of initial size, fitted mean, or other covariates rather than examining the aggregate distribution. We again suggest below graphical diagnostics, based mainly on quantiles, that can be used to compare simulated with real growth data. If the simulated data do not correspond well with the real data, alternative or more flexible distribution families should be considered, or more complex functions relating distribution parameters to size and other covariates.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>HOW SHOULD SKEWNESS AND KURTOSIS BE MEASURED?</head><p>Non-Gaussian growth modeling requires scrutinizing the skewness and kurtosis of standardized residuals, so measurement of these properties warrants attention. The standard measures are based on the third and fourth central moments, respectively, of the distribution: skewness = m 3 =&#963; 3 , excess kurtosis = m 4 =&#963; 4 -3 where m k &#188; &#58628; X -X &#192; &#193; k is the kth central moment of a random variable X and &#963; 2 is the variance (second central moment). A Gaussian distribution has zero skewness and zero excess kurtosis. The standard measures are simple and easy to use, but they have poor sampling properties. Because the measures involve high powers of data values, a few outliers can produce very inaccurate estimates. Figure <ref type="figure">2</ref> shows a simulated example, where the underlying data are samples of 200 values from a t distribution with 8 degrees of freedom, repeated 5000 times; the true skew is 0, and the true excess kurtosis is 1.5. The distance between the largest and smallest estimates (indicated by the dotted red vertical lines), relative to the distance between the 5th and 95th percentiles (dashed blue lines), shows the broad extent of extreme values that can occur even with a large sample, especially for kurtosis.</p><p>We therefore recommend nonparametric (NP) measures of skewness and kurtosis that are based on quantiles and thus are less sensitive to a few extreme values. Let q &#945; denote the &#945; quantile of a distribution or sample (e.g., q 0:05 is the 5th percentile). For any 0 &lt; &#945; &lt; 0:5, a quantile-based measure of skewness is given by <ref type="bibr">(McGillivray, 1986</ref>)</p><p>NP Skewness measures the asymmetry between the tails of the distribution above and below the median. The size of the upper tail can be measured (for any 0 &lt; &#945; &lt; 0:5) by &#964; U &#188; q 1 -&#945;q 0:5 ; for &#945; &#188; 0:05 this is the difference between the 95th percentile and the median. The lower tail size is &#964; L &#188; q 0:5q &#945; . The definition above is equivalent to</p><p>An NP Skewness of AE0:2 says that the difference in tail sizes is 20% of their total. The range of possible values is -1 to 1. Both &#945; &#188; 0:25 (sometimes called "Kelly's skewness") and &#945; &#188; 0:1 ("Bowley's skewness") are common choices. We used &#945; &#188; 0:1. An analogous quantile-based measure of kurtosis <ref type="bibr">(Jones et al., 2011)</ref> is</p><p>For &#945; &#188; 0:05, NP Kurtosis is the difference between the 95th and 5th percentiles, relative to the interquartile range. To facilitate interpretation, we scale NP Kurtosis relative to its value for a Gaussian distribution, and subtract 1 so that the value for a Gaussian is zero. We call this "NP Excess Kurtosis." A value of AE0:2 means that the tails are on average 20% heavier than those of a Gaussian with the same interquartile range. We calculate NP Kurtosis using &#945; &#188; 0:05, to focus on the tail edges, but again this is somewhat arbitrary. Figure <ref type="figure">2C</ref>,D illustrate how, applied to the same simulated samples, the NP measures produce a smaller fraction of highly inaccurate estimates caused by a few extreme values. Also note that, in contrast to the moment-based measures, numerically small values of the NP measures (e.g., 0.1 or 0.2) should not be disregarded, because both measures are scaled so that a value of 1 indicates extremely large departures from a Gaussian distribution.</p><p>Using quantile-based measures carries the added value that quantile regression can be used to estimate how they vary with initial size or expected future size. In the examples below, we use the qgam package <ref type="bibr">(Fasiolo et al., 2020)</ref> to fit spline quantile regression models, which accommodate nonlinear size-dependence in skewness and kurtosis. One risk of spline regression is that fitted quantiles may be excessively "wiggly" without constraints on their complexity; with realistic amounts of data, we can hope to estimate broad trends in distribution shape, but not fine-scale variation. In the examples below, we limit complexity by fitting splines with k &#188; 4 basis functions unless otherwise noted. Parametric quantile regression is also an option.</p><p>For consistency, we also use quantile-based measures of mean and SD when comparing real and simulated data, and use quantile regression to visualize their trends. Specifically, following <ref type="bibr">Wan et al. (2014)</ref>, NP mean &#188; q 0:25 + q 0:5 + q 0:75 3 , NP SD&#188; q 0:75q 0:25 1:35 : &#240;4&#222;</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CASE STUDY: LICHEN, VULPICIDA PINASTRI</head><p>We begin with a simple example where current size is the only predictor of future size. Growth data for the epiphytic lichen Vulpicida pinastri were analyzed first by <ref type="bibr">Shriver et al. (2012)</ref> and again by <ref type="bibr">Peterson et al. (2019)</ref> in their study of skewed growth distributions. We therefore had an a priori expectation of deviation from normality.</p><p>The data set includes 1542 interannual transitions in thallus area (in square centimeters) observed from 2004 to 2009 in Kennicott Valley, AK. Shriver et al., 2012 used a mixture distribution that separated "normal growth or shrinkage" from "extreme shrinkage." We aimed to fit a single growth model that could realistically accommodate both types of size transition without requiring ad hoc decisions about which observations of shrinkage were "extreme" or not.</p><p>With initial size as the only predictor, a convenient way to fit a Gaussian model with nonconstant variance is the gam function in the mgcv library <ref type="bibr">(Wood, 2017)</ref> using the gaulss family. Following a bit of model selection, we fit the mean and SD of future size as second-order polynomials of current size, then calculated the scaled residuals from the fitted mean and SD. (gam() is most commonly used to fit smooth splines (s()) for predictor variables, but it can also fit parametric regressions.) Here, the first argument to gam() is a two-element list that defines the linear predictors for mean and SD:</p><p>The data and fitted mean and SD are shown in Figure <ref type="figure">3A</ref>, and the corresponding diagnostic plots are in Figure <ref type="figure">4A</ref>,B. Our diagnostic plots are similar to plots made by R's plot.lm function, except that we use spline regression to allow data-driven choice of curve smoothness, and use absolute residuals (rather than their square roots) so that the SD of the regression curve is on the same scale as the residuals. The spline curves are not exactly flat-their SDs, given above each panel, are positive-but the trends are much too small to be worth fixing.</p><p>Quantile regression on the scaled residuals generates the skewness and kurtosis diagnostics shown in Figure <ref type="figure">3B</ref>. As expected based on previous analyses, the graphical analysis of the standardized residuals indicates negative skew, especially at larger sizes (Figure <ref type="figure">3B</ref>). We also find positive excess kurtosis for all sizes.</p><p>We turned to the Johnson's S-U (JSU) distribution for improvement. The JSU is a four-parameter leptokurtic distribution allowing positive or negative skew, with the convenient property that its location and scale parameters mu and sigma are the mean and SD, respectively, which greatly facilitates the transition from a pilot Gaussian model. JSU is not available in any standard linear or additive modeling packages, to our knowledge. But that is not a barrier because we can write a likelihood function using the dJSU() function in the gamlss.dist package. Following the best fit Gaussian model, we defined mu and sigma of the JSU as quadratic polynomials of initial size and, based on Figure <ref type="figure">3B</ref> we define the skewness parameter nu as a linear function of size and kurtosis parameter tau as a positive constant. The likelihood function therefore has nine parameters to estimate. We fit the model using the maxLik package with starting coefficient values for mu and sigma based on the pilot Gaussian model. We chose maxLik because it offers the BHHH optimization method, which works well for non-Gaussian likelihoods in our experience.</p><p>Simulating data from the fitted JSU model indicates a compelling improvement over the best Gaussian model, not only in skewness and kurtosis (Figure <ref type="figure">5C</ref>,D) but also the NP SD (Figure <ref type="figure">5B</ref>). Note, in these data simulation figures Gaussian and non-Gaussian data are offset by an arbitrary amount to more easily visualize their correspondence to the real data (black lines in Figure <ref type="figure">5</ref>).</p><p>To understand the practical consequences of improved growth modeling, we assembled the remainder of the lichen IPM following <ref type="bibr">Shriver et al. (2012)</ref>. The asymptotic population growth rate &#955; based on Gaussian growth differs from the JSU growth model by about 1% annual population growth (Table <ref type="table">1</ref>), in line with results of <ref type="bibr">Peterson et al. (2019)</ref>. However, even this modest difference can lead to biased estimates of extinction risk from the Gaussian model, particularly over longer time horizons (Figure <ref type="figure">6</ref>). We also explored differences in other life history metrics (Table <ref type="table">1</ref>) using functions from Hern andez et al. <ref type="bibr">(2024)</ref>. For example, the JSU growth model predicts values for mean lifespan, mean lifetime reproductive success, and generation time that are 15%-25% lower than the Gaussian growth model. In this case study, properly modeling non-normal size transitions-which was easy to do with a few extra lines of code-can influence ecological inferences, at least based on point estimates. However, Table 1 also provides bias-corrected, bootstrapped CIs <ref type="bibr">(Diciccio &amp; Efron, 1996)</ref>, and these are heavily overlapping between the Gaussian and JSU models for all life history traits, suggesting that effects of "improved" growth modeling are small relative to our uncertainty in model parameters.</p><p>One could argue that this example was a convenient "straw man" to disqualify Gaussian growth, because it was recognized by the original and subsequent analysts that size transitions are strongly skewed <ref type="bibr">(Peterson et al., 2019;</ref><ref type="bibr">Shriver et al., 2012)</ref>. In all remaining case studies, including those in Appendix S3, we reexamine growth data that were modeled as Gaussian in the original published analysis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CASE STUDY: TREE CHOLLA CACTUS, CYLINDROPUNTIA IMBRICATA</head><p>The next case study, focused on the tree cholla cactus Cylindropuntia imbricata at the Sevilleta Long-Term Ecological Research site in central New Mexico, adds a new feature to the simple size-dependent regressions in the previous study: random effects associated with temporal (year) and spatial (plot) environmental heterogeneity. This long-term study was initiated in 2004, and different subsets of the data have been analyzed in various IPM studies, all using Gaussian growth kernels <ref type="bibr">(Compagnoni et al., 2016;</ref><ref type="bibr">Czachura &amp; Miller, 2020;</ref><ref type="bibr">Elderd &amp; Miller, 2016;</ref><ref type="bibr">Miller et al., 2009;</ref><ref type="bibr">Ohm &amp; Miller, 2014)</ref>. In fact, <ref type="bibr">Elderd and Miller (2016)</ref> presented a Gaussian growth model as an example of a well-fit growth function, based on an overall distribution of residuals that appeared Gaussian and posterior predictive checks (PPCs) of a Bayesian model that suggested consistency between the real data and data simulated from the fitted model (fig. <ref type="figure">4</ref> in <ref type="bibr">Elderd &amp; Miller, 2016)</ref>. While PPCs and the associated "Bayesian p-value" are popular diagnostic tools, they are often too conservative <ref type="bibr">(Conn et al., 2018;</ref><ref type="bibr">Zhang, 2014)</ref>, failing to reject marginally bad   <ref type="bibr">Miller (2020)</ref>. Following previous studies, we quantified size as the natural logarithm of plant volume (in cubic centimeters), derived from height and width measurements.</p><p>We begin growth modeling, as above, with a generalized additive model with the mean and SD of size in year  t + 1 modeled as smooth function of size in year t, with random intercepts for year and plot and assuming normally distributed residuals:</p><p>Note that here we fitted the SD function with k &#188; 6 basis functions rather than our default of k &#188; 4 because, in a preliminary analysis, we found a moderate variance trend in the standardized residuals using k &#188; 4, suggesting a need for greater flexibility. With k &#188; 6, spline regression detected essentially no trend in the mean of the resulting standardized residuals (Figure <ref type="figure">4C</ref>,<ref type="figure">D</ref>).</p><p>The growth variance is estimated to peak at small to medium sizes (Figure <ref type="figure">3C</ref>). The standardized residuals show clear signals of negative skew and positive excess kurtosis across most of the size distribution, but strongest in the middle (Figure <ref type="figure">3D</ref>). We therefore need a distribution family allowing negative skew and positive excess kurtosis, both of which may be negligible at some sizes. We first tried Johnson's S U and then the skewed t distributions, which provided some improvements but there were still visible discrepancies between simulated and real data. We next turned to the SHASH distribution, which allows a greater range of kurtosis for a given amount of skew, and vice versa <ref type="bibr">(Jones and</ref>  <ref type="table">Pewsey 2009;</ref>  T A B L E 1 Life history attributes derived from integral projection model (IPM) kernels that included Gaussian or "improved" growth sub-models for six case studies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Species</head><p>Growth model &#955; Lifespan Lifetime reproductive output Age at reproduction Generation time Lichen (V. pinastri) Gaussian 1.01 (0.99, 1.04) 6.4 (3.6, 11.1) 1.4 (0.5, 3.1) 6.5 (5.7, 7.3) 40.8 (30.5,57.4) Improved 1.00 (0.98, 1.03) 5.4 (3.1, 9.7) 1 (0.4, 2.4) 6.4 (5.4, 7.3) 36.6 (27.5, 48.6) Cactus (C. imbricata) Gaussian 0.994 (0.99, 0.996) 6.11 (3.66, 8.63) 21.8 (8.27, 49.4) 17.6 (1.75, 22.7) 189 (131, 266) Improved 0.993 (0.991, 0.998) 5.38 (3.34, 16.3) 13.4 (5.72, 251) 20.3 (1.21, 22.2) 179 (133, 298) Orchid (O. purpurea) Gaussian 1.09 (1.08, 1.1) 1.08 (1.06, 1.11) 20.0 (12.6, 31.0) 5.07 (4.78, 5.31) 104 (73.1, 150) Improved 1.09 (1.08, 1.1) 1.08 (1.06, 1.1) 19.3 (12.0, 29.9) 5.03 (4.75, 5.3) 100.7 (71.0, 145.0) Pike (E. Lucius) Gaussian 1.62 (1.35, 1.89) 1.2 (1.09, 1.35) 5.75 (2.9, 9.7) 1.09 (1.03, 1.18) 4.96 (4.26, 5.84) Improved 1.62 (1.35, 1.88) 1.2 (1.09, 1.35) 5.76 (2.91, 9.73) 1.09 (1.03, 1.18) 4.94 (4.30, 5.84) Creosote (L. tridentata) Gaussian 1.033 (1.029, 1.04) 4.52 &#215; 10 6 (2.14 &#215; 10 5 , 1.82 &#215; 10 8 ) 3.19 &#215; 10 5 (1.27 &#215; 10 4 , 1.24 &#215; 10 7 ) 32.7 (29.2, 36.0) 5.27 &#215; 10 6 (2.50 &#215; 10 5 , 1.95 &#215; 10 8 ) Improved 1.034 (1.03, 1.04) 3.26 &#215; 10 5 (1.98 &#215; 10 3 , 1.66 &#215; 10 7 ) 2.31 &#215; 10 4 (5.83 &#215; 10 2 , 1.27 &#215; 10 6 ) 32.8 (29.3,36.0) 3.7 &#215; 10 5 (2.63 &#215; 10 3 , 1.93 &#215; 10 7 ) Coral (G. ventalina) Gaussian &#8230; 17.3 (11.9, 24.3) &#8230; 10.5 (9.3, 11.8) 31.6 (28.3, 36.7) Improved &#8230; 17.5 (12.1, 24.3) &#8230; 10.7 (9.4, 12.2) 30.9 (27.4, 35.3) Note: The improved distributions were Johnson's S-U (JSU) (lichen, creosote), SHASH (cactus, pike, coral), and skewed t (orchid). Pike, creosote, and coral case studies are presented in Appendix S3. The original coral case study assumed an open population with constant recruitment from a large source region, so some life history attributes cannot be computed from the published model. Values in parenthesis are 95% bootstrap CIs, specifically the bias-corrected (BC) bootstrap CIs <ref type="bibr">(Diciccio &amp; Efron, 1996)</ref>. Table can be reproduced from scripts crossspp_growth.R, Vulpicida_boot.R, Akumal_corals_boot.R <ref type="bibr">(Ellner &amp; Miller, 2025)</ref>. Appendix S1). This flexibility proved necessary to generate simulated data that compared favorably with the real data, so we proceeded with the SHASH. Conveniently, SHASH is available as an mgcv family, allowing for flexible size-dependence in skewness and kurtosis without having to select specific size-dependent functions.</p><p>Here, the first argument to gam() is now a four-element list specifying the linear predictors for the four parameters of the SHASH distribution.</p><p>Data simulated from the SHASH model compared favorably with the real data (Appendix S4: Figure <ref type="figure">S1</ref>). Similar to the lichen case study, we see that correctly modeling skewness and kurtosis improved estimation of the NP mean and SD (Appendix S4: Figure <ref type="figure">S1A</ref>,<ref type="figure">B</ref>), yielding a growth model that is truer to the data.</p><p>We next explored how improved growth modeling influenced IPM results. The &#955; values predicted by Gaussian and SHASH growth functions, corresponding to the average plot and year, were nearly identical (Table <ref type="table">1</ref>) but we could also leverage structure of the study design to quantify demographic variance associated with temporal and spatial heterogeneity. We used the fitted random effects from the vital rate models to estimate the asymptotic growth rate for each year (&#955; t ), centered on the average plot, and for each plot (&#955; p ), centered on the average year. Estimates of &#955; t from the Gaussian growth model were often greater than estimates from the SHASH growth model, particularly in some of the harshest years (Figure <ref type="figure">7A</ref>), and therefore the Gaussian model predicted lower temporal variance in fitness (SD</p><p>Plot-to-plot variation was more similar between the two models (SD</p><p>, although spatial variation in fitness was much lower than temporal variation (Figure <ref type="figure">7B</ref>). The difference in temporal variance would suggest that Gaussian growth modeling would predict a higher stochastic growth rate &#955; S , because temporal variance has a negative effect on &#955; S . However, the stochastic growth rate from the Gaussian growth model (&#955; S &#188; 0:992) was nearly identical to that of the SHASH growth model (&#955; S &#188; 0:991). This is likely because temporal fluctuations in vital rates, which is where the SHASH growth model would make a difference, have a weaker influence on &#955; S than the temporal fluctuations in size structure that they generate <ref type="bibr">(Compagnoni et al., 2016;</ref><ref type="bibr">Ellis &amp; Crone, 2013)</ref>. The SHASH and Gaussian growth models predicted differences in other life history traits but, as in the lichen case study, these differences were small relative to the uncertainty captured by bootstrapped CIs (Table <ref type="table">1</ref>). Interestingly, the SHASH model had wider uncertainty intervals, particularly for lifespan and lifetime reproductive output, presumably because the additional parameters it requires introduce additional sources of uncertainty in these estimates. Thus, in this case study, modeling non-Gaussian size transitions with a Gaussian growth model may or may not influence IPM results depending on the target of the analysis, and whether the emphasis is on point estimates or uncertainty intervals.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CASE STUDY: LADY ORCHID, ORCHIS PURPUREA</head><p>Our final case study examines selection on life history strategies in the lady orchid Orchis purpurea. In a prior study, <ref type="bibr">Miller et al. (2012)</ref> analyzed how costs of reproduction (flowering or not in year t) affected growth from year t to t + 1. The two growth kernels for flowering and nonflowering were then used in an IPM to quantify the optimal flowering size that balances the benefits of waiting to flower at larger sizes against the greater risk of death before flowering. The original study assumed Gaussian size transitions with nonconstant variance depending on initial size. Here we revisit that analysis to derive improved growth kernels. We use this case study to illustrate several new elements and challenges, including modeling skewness and kurtosis as functions of expected future size.</p><p>The data, originated by Dr. Hans Jacquemyn, come from 368 plants in a Belgian population censused annually from 2003 through 2011. Here we use data only from the "light" habitat in the original study. We used the natural logarithm of total leaf area as the size variable in the IPM.</p><p>As a variation on software, we fitted the pilot Gaussian model using the lmer function in the lme4 package, as in the original study. We fit three candidate linear models that included fixed effects of size in year t (model 1), additive effects of size and flowering status in year t (model 2), or an interaction between size and flowering (model 3), all including random intercepts for year. The interaction model was strongly favored (difference in AIC between the best and next-best models [&#916;AIC] = 10.5). Unlike our previous case studies, here we have multiple fixed effects (initial size and flowering status) that may influence the variance of future size. In cases such as this it is convenient to model variance as a function of expected future size, rather than initial size as we did with the lichens and cacti. The expected (or "fitted") values reflect the combined influence of all fixed and random effects, and therefore implicitly account for multiple sources of variation in the variance.</p><p>Models where error variance is a function of fitted values cannot be fitted directly with lme4 (nor in the mgcv functions for generalized additive models). But it can still be done with lmer through an iterative re-weighting approach, as follows. In lmer, weights w i can be used to indicate that the observations y i have error variance proportional to 1=w 2 i . The iterative steps are as follows, and code that executes these steps is in orchid_growth_modeling.R.</p><p>1. Fit the expected value assuming Gaussian-distributed residuals with constant variance. 2. Fit the SD of the residuals as a function of the corresponding fitted value. 3. Refit the model, with weights equal to the inverse of the SD estimated in step 2.</p><p>We iterated steps 2 and 3 until the root mean square change in weights was below 10 -6 . This is not elegant, but it works and converges quickly. In step 2, we modeled the log of the SD (because SDs cannot be negative) as a quadratic polynomial in the fitted mean. In exploratory analyses we found that the quadratic term was necessary to fit the SD. We did this for all candidate models and, for a fair Akaike information criterion (AIC) comparison, we then refit all candidate models with the weights estimated from the top model.</p><p>The updated model selection continued to favor the size &#215; flowering interaction model (3), but now with a weaker improvement over the next best model (&#916;AIC = 6.7). The fitted mean (a function of initial size and flowering status) and fitted SD (a function of the fitted mean) are shown in Figure <ref type="figure">3E</ref>. Spline regression found no trend in the mean of the resulting standardized residuals, and only small variation in the variance (Figure <ref type="figure">4E</ref>,<ref type="figure">F</ref>).</p><p>The best Gaussian model indicated a growth cost associated with flowering at the start of the census interval and a decline in growth variance with increasing expected values (Figure <ref type="figure">3E</ref>). The standardized residuals indicated negative skewness (10%-20% difference in tail weight) and excess kurtosis (10%-40% fatter than Gaussian) across much of the size distribution but both were negligible at large expected sizes (Figure <ref type="figure">3F</ref>).</p><p>As possible improvements, we explored the skewed t and JSU distributions, both leptokurtic distributions with flexible skewness. Based on comparisons between real and simulated data we were happier with the skewed t, which we fit with a custom likelihood function similar to the JSU growth model for the lichen data. However, rather than refitting all parameters of the skewed t model, as we did with the lichen JSU, we built a "hybrid" likelihood function that uses the fitted mean and SD from the best Gaussian model, and estimates parameters  <ref type="bibr">Ellner &amp; Miller, 2025)</ref>.</p><p>that control skewness and kurtosis as linear functions of expected future size. This is easy because the gamlss.dist package provides a parameterization of the skewed t in which the location parameter &#956; is the mean and scale parameter &#963; is the SD <ref type="bibr">(Rigby et al., 2019)</ref>. The hybrid likelihood looks like this:</p><p>Based on diagnostics of the standardized residuals, parameters that control skewness and kurtosis are defined as linear functions of the mean (note that the tau parameter uses a log x -2 &#240; &#222;link function). This approach relies on the robustness of fitted Gaussian models to deviations from normality, which implies that the fitted mean and variance from a Gaussian model are good approximations for the mean and variance of the corresponding non-Gaussian model. If one is skeptical of this approach, it is possible to simultaneously refit all parameters of the skewed t. However, recall that unlike the lichen case study, the pilot Gaussian model here includes random year effects, and the expected values getting passed into dSST account for this source of variation. Estimating random effects "from scratch" with a custom likelihood model is possible (we provide guidance on doing this with a "shrinkage" approach, in Appendix S2), but generally should not be necessary. Instead, a key advantage of the hybrid approach is retention of the fitted random effects and associated variance components, which get shuttled from the Gaussian model into the non-Gaussian model without any fuss (though it was critical to use a parameterization of the skewed t for which mu is the mean and sigma is the SD). And, if this approach does not "work" (i.e., deviations from normality biased the fitted values of the Gaussian model) one would quickly find out when comparing simulated with real data. In this case, size transition data simulated from this model corresponded favorably to the real data, much better than the pilot Gaussian model, including improvements in the SD, skewness, and kurtosis of future size (Appendix S4: Figure <ref type="figure">S2</ref>).</p><p>Finally, we used the improved growth model to revisit key results of the original study. <ref type="bibr">Miller et al. (2012)</ref> used the orchid IPM to estimate the evolutionarily stable strategy (ESS) as the mean size at flowering that maximizes lifetime reproductive success (R 0 ), given the constraint that flowering when small reduces growth and thus elevates mortality risk. Repeating that analysis here, we found that improved growth modeling has virtually no influence on predictions for optimal life history strategies (Figure <ref type="figure">8</ref>). ESS flowering sizes were nearly identical between IPMs with Gaussian versus skewed t growth models, and both aligned well with the observed mean flowering size (dashed vertical line in Figure <ref type="figure">8</ref>). Similarly, there were very small differences between growth functions in other metrics of orchid life history and, again, these differences were overwhelmed by uncertainty associated with parameter estimation (Table <ref type="table">1</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DISCUSSION</head><p>Much of the appeal of IPMs has stemmed from their embrace of continuous size structure through regression-based approaches, and the potentially complex fixed-and random-effect structures that those approaches allow. Using familiar statistical tools and with relatively few parameters to estimate, IPM users can incorporate important sources of variation in demography and interrogate their influence on ecological and evolutionary dynamics. With this opportunity comes the burden of getting it right: an IPM is only as good as the statistical sub-models for the underlying data. The growth sub-model is the trickiest part because it defines a distribution of future size conditional on current size. Distributions have many properties-"moments"-and a good growth model should recapitulate the properties of real size transitions. The default assumption of Normally distributed size transitions, employed overwhelmingly across 20+ years of IPM studies, is an arbitrary historical precedent. In our case studies and, we suspect, more broadly, skewness and excess kurtosis were common features of size transitions. Our most important message is that the assumption of normally distributed size transitions can easily be abandoned, and a more inquisitive process of growth modeling should take its place.</p><p>We have attempted to lay out what that process should look like, emphasizing visual diagnostics to characterize how data deviate from Gaussian. One implication of relying on visual diagnostics is that goodness of fit is in the eye of the beholder. This empowers IPM users to make informed choices, but it is not very prescriptive; we have not suggested any hard rules for choosing among distributions, only that a good growth model should generate data that look like the real thing. Alternatively, model selection could be used to identify best fitting growth distributions and best fitting functions for higher moments. However, model selection among growth distributions with 3-5 parameters, each of which may be functions of multiple state variables or fitted values, can quickly explode in complexity, and we are not convinced it is worth the trouble.</p><p>Our work follows the important contribution of <ref type="bibr">Peterson et al. (2019)</ref>, who were similarly motivated by inadequacy of the Gaussian model but arrived at different recommendations. These authors developed a creative approach in which size data are transformed onto a 0, 1 &#189; scale and size transitions on that scale are modeled using beta regression. The beta distribution can accommodate positive, negative, or zero skew, potentially varying with size, so the Peterson et al. approach is a flexible option for skewed growth data. However, beta regression also has some limitations: common beta regression packages do not fit random effects (e.g., betareg; Cribari-Neto &amp; Zeileis, 2010) or do not do so reliably (in our experience gamlss regressions with random effects are numerically unstable); and the two-parameter beta distribution does not allow skewness and kurtosis to be fitted independently. Additionally, the initial transformation onto 0, 1 &#189; scale requires estimating extreme quantiles of the growth distribution (e.g., 0.01 and 0.99) as a function of initial size. In our experience those quantile estimates can be very sensitive to how size dependence is modeled, and model selection is challenging for extreme quantiles where data are (by definition) very sparse. Rather than picking one distribution as a new default, users can leverage the vast arsenal of continuous probability distributions-all at one's fingertips with a few lines of code-so that the data and their particular deviations from normality can guide the choice of a better distribution. It is also possible to use mixtures of multiple growth distributions, as done by <ref type="bibr">Shriver et al., 2012</ref> to model "normal" and "extreme" types of lichen shrinkage. In reanalyzing that data set, we found that a single, flexible distribution (Johnson's S-U) could recapitulate the observed size transitions, but there may be other cases where mixture distributions are preferable or necessary.</p><p>In all of our case studies, non-Gaussian growth models always yielded more satisfying fits to size transition data than the Gaussian models published in those papers. However, to our relief, none of these reanalyses yielded a "gotcha" result that overturned results of the original study. In fact, in this small sampling of case studies, improved growth modeling had weak to modest effects on IPM results, similar in magnitude to the results of <ref type="bibr">Peterson et al. (2019)</ref>, and for most species and life history metrics, these effects were overwhelmed by the uncertainty associated with parameter estimation (Table <ref type="table">1</ref>). For some case studies, one might argue that non-Gaussian modeling was not worth the trouble-only it was almost no trouble at all, and we could not have known whether or not a non-Gaussian model would have made a difference before fitting it. Even where Gaussian and improved growth models differed in IPM results, we do not know "true" values to compare them against, only that one describes the data better than another, and a closer match to the data does not necessarily translate to better predictive ability due to the bias-variance trade-off.</p><p>We caution against taking too much comfort in weak effects of "improved" growth modeling; in other scenarios, the choice of growth distribution could be more consequential. We focused on life history metrics such as mean lifespan and mean lifetime reproductive success (Table <ref type="table">1</ref>). It is possible that higher moments of those traits (e.g., variance or skewness of lifespan and lifetime reproduction) are more sensitive to the tails of the growth distribution. It is also worth noting that most of our case studies focused on perennial life histories (perennial plants and lichens) characterized by relatively slow growth, heavy losses during recruitment, and high survival once established, and these species all had mean lifespans between one and six years and generation times on the order of decades. Life histories such as these may be relatively robust to subtle features of the growth kernel. In Appendix S3 we present three additional case studies that broaden our life history coverage, including pike (Esox lucius), a fish with a generation time of four to five years, and creosote bush (Larrea tridentata), a desert shrub that is virtually immortal once established. Life history metrics from these "fast" and "slow" populations were no more sensitive to improved growth modeling than those of the perennial plants and lichens (Table <ref type="table">1</ref>). More systematic comparative analyses may provide insight into which types of species and life histories are more likely to exhibit strong skewness and kurtosis, and which demographic quantities are more or less sensitive to these features of size transition.</p><p>Our case studies illustrate a diversity of software packages and computational approaches to reflect the diversity of preferences and habits that the community of IPM analysts bring to their own problems. We like spline generalized additive models (gams) for their flexibility and for mgcv's numerous options for distribution families and overall speed and reliability. However, there are some applications for which classical parametric regression would be preferable because the coefficients carry biological meaning. For example, regression coefficients may be targets of natural selection <ref type="bibr">(Rees &amp; Ellner, 2016)</ref> and may combine to influence traits of interest such as the expected size at flowering (e.g., in Figure <ref type="figure">8A</ref>), a function of the intercept and slope of the size-dependent flowering function <ref type="bibr">(Metcalf et al., 2003)</ref>. Some potentially useful distributions are not available in linear modeling software packages, but that should not be a barrier to their use: as in several of our case studies, custom likelihood functions allow non-Gaussian models without sacrificing the complex, multilevel features that one might be accustomed to fitting in lme4, for example. Bayesian analysis may further broaden the options for non-Gaussian candidate distributions and may help estimate hard-to-fit parameters through the brute force of sampling algorithms. Bayesian analysis also provides a natural way to propagate uncertainty from vital rate sub-models to full model predictions <ref type="bibr">(Elderd &amp; Miller, 2016)</ref>, as an alternative to our bootstrapping approach. However, as of this writing, most of the non-Gaussian distributions that we have discussed are not available in popular Bayesian software packages such as Stan or JAGS. While user-defined distributions can always be coded from scratch, this may be a significant technical barrier for many IPM analysts.</p><p>From the outset there have been concerns about "how well these methods [IPM growth kernels] can deal with different patterns of growth, stasis, and shrinkage" <ref type="bibr">(Morris &amp; Doak, 2002, p. 200</ref>), compared with "binning" methods that use observed transition frequencies between user-defined size classes as the transition probabilities in a (possibly large) matrix model <ref type="bibr">(Doak et al., 2021)</ref>. The non-Gaussian models that we have considered here are not a panacea. For example, none of them allow bimodal growth, such as might occur if herbivore-or pathogen-attached individuals experience rapid tissue loss. When the shape of the growth distribution is nearly the same for all initial sizes, a nonparametric IPM growth kernel can be defined from a kernel density estimate for scaled residuals <ref type="bibr">(Ellner et al., 2016, p. 288)</ref>. Outside that special situation, nonparametric approaches require choosing multiple smoothing parameters, which is very challenging. We are currently exploring whether "targeted learning" approaches developed for causal inference (van der Laan &amp; Rose, 2011) can be used to circumvent smoothing parameter selection. Targeted learning starts with a pilot model and updates it iteratively to achieve unbiased estimates and valid CIs for a particular "target" quantity, such as &#955; or mean lifespan. Preliminary results suggest that targeted learning with a deliberately under-smoothed pilot model works well for complex growth patterns <ref type="bibr">(Zhou &amp; Hooker, 2024)</ref>. But nonparametric methods are data-hungry, so when departures from Gaussian are quantitative rather than qualitative, parametric modeling as developed here will make more efficient use of limited data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CONCLUSION</head><p>Gaussian-distributed size transitions are probably the exception in nature, not the rule, yet two decades of IPM studies have relied overwhelmingly on Gaussian growth models. Using tools not available when IPMs were first developed, it should often be possible now to make major improvements over a Gaussian model, without worrying about finding the "best" alternative. By generating predicted size transitions that are truer to the data, IPM analysts can narrow the gap between model and nature.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>19399170, 2025, 5, Downloaded from https://esajournals.onlinelibrary.wiley.com/doi/10.1002/ecy.70088 by Rice University, Wiley Online Library on [26/06/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License</p></note>
		</body>
		</text>
</TEI>
