Methods |
Corresponding author: Steven F. Railsback ( steve@langrailsback.com ) Academic editor: Hazel Parry
© 2025 Steven F. Railsback, Bret C. Harvey.
This is an open access article distributed under the terms of the CC0 Public Domain Dedication.
Citation:
Railsback SF, Harvey BC (2025) Representing mortality risk in mechanistic models. Individual-based Ecology 1: e141005. https://doi.org/10.3897/ibe.1.141005
|
Mortality risk is a critical and complex component of individual fitness and individual-based ecology, especially when risk-avoidance behaviors are considered. Organisms are subject to multiple kinds of risk that can vary with habitat, time, individual state, individual activity and behavior, and population status. Yet risk is often represented very simply in models and there is little literature on practical ways to model its variation. In our experience, desirable characteristics of risk models include: (a) survival probability can vary with multiple variables of individuals, habitat, and other entities; (b) relations between survival and specific variables can be added or removed or modified without re-fitting the entire model; (c) relations between variables and survival can take different forms, including continuous and nonlinear functions and discrete values; (d) relations between variables and survival are easy to understand and fit to many kinds of data or assumptions; and (e) they can be calibrated by adjusting only one parameter. We review the terminology and conventions that ecologists often use to model risk, and provide a mathematical framework for modeling risk. For complex risks, we describe and illustrate a method with “survival increase functions” that each relate survival probability to one variable. These functions can have a different form for each variable and can each be based on different information. The multiple functions are combined into a single survival probability value that is easily calibrated. We discuss methods for evaluating survival increase functions, ranging from general field observations to controlled field experiments, knowledge and data on mechanisms driving survival, and even conceptual models of those mechanisms. This approach has proven practical for representing complex effects of multiple variables on survival probability in models that represent how individual behavior and fitness depend on risk.
Modeling Mortality Predation Risk Survival
One of the greatest promises of individual-based ecology is the ability to incorporate more realistic mechanisms in our models, thereby making them more general and more capable. Mortality is, of course, an extremely important mechanism in ecology. Mortality is important not only because it is a fundamental driver of population dynamics but also because, as ecologists now widely acknowledge (e.g.,
Representing how mortality risks vary (over space or time, or among individuals) is important for at least two kinds of model. First are models of the effects of changes that strongly affect risk; examples include land use changes and reintroduction of predators (both illustrated by
To address such purposes, individual-based ecology has the challenge of developing methods for quantifying how risk varies and for representing that variation in our models.
How individual-based models (IBMs) can represent behavior as fitness-seeking tradeoffs among risk, growth, and other elements of fitness has been addressed to some extent (e.g., chapter 5 of
While little of it addresses general, mechanistic modeling, there is extensive empirical literature on spatial and temporal variation in predation mortality and risk, especially in wildlife and livestock subject to predation by large carnivores (reviewed, e.g., by
Such observational study methods and literature have limited value for the individual-based modeler. Studies that examine only mortality events cannot distinguish risk (as we define it below) from the confounding effect of prey exposure (the number of prey killed by predators is a function of both risk and number of prey). The literature and methods are generally limited to large terrestrial predators and prey, which are particularly easy to observe. Because these studies are purely empirical, their results are of questionable value for modeling other systems or novel future conditions. Observational studies also tend to have limited temporal resolution because it is generally difficult to determine precisely (e.g., between day, night, or twilight) when a predation event occurred. And these studies are of course expensive and time-consuming. However, observational studies provide valuable clues about general mechanisms (e.g., differences between wolves and cougars in predation success in dense vs. open habitat, and in willingness to hunt near humans;
Our objectives are to review concepts and mathematical methods that can be used to model how mortality risks vary over multiple dimensions in IBMs and other mechanistic ecological models. We specifically address models that treat one or more risks as explicit functions of multiple characteristics of habitat, individuals, or time. However, we do not address models in which mortality emerges from direct interactions among individuals, e.g., by representing adaptive predators or contagious disease spread.
We also discuss ways in which such methods can be supported by multiple kinds of information, from natural history knowledge, mechanistic understanding, and empirical experiments of several kinds. Our methods are especially applicable to systems and species for which reliable empirical information on risks is difficult to obtain, and for models that benefit from being more mechanistic and general because those characteristics make them useful for predicting responses to novel conditions under which strictly empirical representation of risk would be unreliable.
We focus, but not exclusively, on predation risk to mobile animals that use risk avoidance behaviors such as selecting when and where to feed. Our experience is primarily with modeling salmonid fish, specifically the “InSTREAM” family of IBMs for predicting effects of river management on trout and salmon populations (e.g.,
We begin by reviewing concepts we find important for understanding and modeling risk.
Traditional equation-based ecological models (e.g., the Lotka-Volterra equations) represent mortality via a rate parameter d, the fraction of a population that dies within a specified unit of time (Haefner 2012). However, in an IBM mortality is not a population-level rate but an individual-level event: every time step, each individual either survives or dies. A very simple way to model mortality as an event is using the variable “risk” R as a probability of mortality within a specific time period. We can set the value of R by assuming it equal to an observed mortality rate, but it is important to remember that when we model mortality of individuals in this way, R is a probability, not a rate.
(Here, we generally assume 1 day as the time period. We also use “risk” as a general term for mechanisms that could cause mortality, e.g., the risk of heart attack. The context should make it clear when “risk” is used in this general sense or specifically as a probability of death, R.)
The term “mortality” also has general meanings—the phenomenon or rate of death among individuals—and a specific meaning, the death of an individual. Mortality of an individual is typically represented in IBMs as an event that occurs randomly with probability R of occurring within one time period.
Studies and models that explicitly represent both predators and prey often follow the convention (e.g.,
Models can be simpler to describe and implement when mortality is represented via survival S, the probability of surviving a specific time period, so S = 1.0 – R. Under the very common (yet often unstated) assumption that mortality risk in any time period is independent of risk in adjacent periods, the probability of surviving any t time periods is simply St, which is especially convenient for models with time steps of variable lengths. Our trout model simulates four time steps representing the light phases of each day (
Modeling risk as a survival probability is also convenient because the probability of surviving multiple, independent, kinds of mortality (discussed below) is calculated simply as the product of the survival probabilities for each mortality type.
Even though we typically evaluate risk and survival as daily probabilities, understanding and making good modeling decisions requires thinking about risk over longer time periods. Seemingly minor differences in daily survival result in large differences over meaningful future periods.
The widespread misuse of the so-called “µ/f rule” (or “µ/g”) provides a good example of the importance of time for understanding risk.
When estimating survival parameters for models, it helps to think about survival over a meaningful time period and then calculate the corresponding daily probabilities. For example, we might estimate that an animal in a particularly risky habitat has a 20% chance of surviving for a month. The corresponding daily survival is: . Another approach is to estimate a median survival time (tm) and calculate daily survival probability from it: .
In some models it is important to distinguish among risks that individuals do vs. do not perceive and respond to. Human harvest, vehicle collisions, disease, natural toxins, pollutants, and introduced predators are example risks that a modeler might assume animals are naïve about and do not avoid via behavior. Such risks can be represented by modeling them exactly as the other risks while assuming that the individuals do not consider them in risk-driven behaviors. For example,
Here we list some of the “dimensions” in which mortality risks can vary, ending with a summary of those dimensions in our salmonid models. These dimensions are variables or groups of variables that risks often vary over.
Risk types can include predation (including multiple types of predator, as considered below), disease, extreme weather or other habitat conditions, starvation or dehydration, and human-induced risks such as harvest.
Many risks are affected by habitat conditions at various scales. For example, risk of wolf and cougar predation on ungulates can differ between open and forested areas (
Some risks vary over time cycles from daily to seasonal and even multi-year. Temporal variation in risks is generally driven by lower-level mechanisms such as predator physiology and life cycles, and daily and seasonal light cycles, but it can be convenient to model such variation as time-driven.
Risks commonly vary dramatically with characteristics of the model individuals. If classical ecological models address variation in risk, they typically do so by assuming predation risk decreases as prey size increases. In many real systems, however, the kinds of predators and risk change as prey proceed through their life cycles, and the relations between size and risk may not be simple. Often, different life stages are subject to quite different risks, in which case life stage can be the individual state variable with the most effect on risk.
Models that represent risk-avoidance behavior must represent the effects of behavior on risk. For example, a model that represents how individuals choose between feeding during day vs. night as an adaptive tradeoff between risk and food intake must represent how risk differs between feeding and alternative activities, during both day and night. (
Some risks can be subject to direct feedbacks from the local or global population status. Mechanisms of such feedbacks include (1) competition for predator avoidance habitat (hiding places), (2) attraction or prey switching by predators, and (3) cannibalism.
We illustrate these axes of variation in risk via Table
Axes of risk variation in the trout model of
Life stage | Type of risk | Risk dimensions and variables |
---|---|---|
Eggs | Thermal stress and disease | Habitat: temperature |
Dewatering | Habitat: water depth | |
Scour | Habitat: river flow | |
Superimposition (displacement by another spawning female) | Habitat: area of spawning habitat | |
Population status: number of spawners | ||
Juvenile and adult trout | Thermal stress | Habitat: temperature |
Stranding | Habitat: water depth | |
Starvation and disease | Individual state: length, weight | |
Predation by terrestrial animals | Habitat: depth, velocity, light intensitya, distance to escape cover | |
Time: light intensitya | ||
Individual state: length | ||
Individual activity: feeding vs. hiding | ||
Population status: availability of hiding cover | ||
Predation by fish | Habitat: depth, temperature, light intensitya | |
Time: light intensitya | ||
Individual state: length | ||
Individual activity: feeding vs. hiding | ||
Population status: density of piscivorous adult trout |
In this section we present one general framework for modeling variation in risk. It includes the steps of (1) identifying types of risk; (2) for each type, representing how survival depends on each of multiple variables; and (3) combining the effects of multiple variables into one survival probability value.
This framework differs from conventional approaches by letting us develop models of risk that are realistically complex, general instead of specific to a site or time, and supported by multiple kinds of information.
The first step is determining which types of risk to model separately. We assume that the modeler has carefully determined what risks need to be in a model to meet its intended purpose. “Pattern-oriented modeling” is a powerful strategy for doing so, thoroughly illustrated by
There are of course costs of using more types of risk: more assumptions and parameters, and more computations. While traditional modelers associate more parameters with higher uncertainty, a more-resolved representation of risk could reduce uncertainty by supporting more realistic behavior and results and by making more empirical evidence useful for representing risk.
Next, for each type of risk we need to model how survival probability S varies with one or more selected variables. Selecting the variables that drive each risk is similar and related to the process of selecting types of risk: variables should be included if they are considered essential to the model’s purpose or—via pattern-oriented modeling—essential for establishing the model’s credibility. But a variable might be considered essential for a model only because it is a critical driver of risk.
We discuss four ways to model how S depends on selected variables, but focus on the third and fourth, which we find most useful and novel.
Above (Conventional approaches to variation in risk) we identify examples of studies that modeled risk statistically, from several kinds of data, and discuss limitations of such approaches. Logistic regression is an appealing analysis framework because it estimates how the probability of an event—mortality—depends on multiple variables. However, modelers must be aware of differences between observed mortality rates and the survival probability variables in their models, especially for models containing risk-avoidance behavior. A model parameter representing S in the absence of risk avoidance behavior might be poorly estimated by data on actual mortality events, because actual mortality rates depend on prey abundance and behavior as well as the underlying survival probability.
If there are few types of risk and they vary over few variables that can all be treated categorically, variation in S can be modeled as a matrix (Table
Risks assumed to depend on only one variable can be modeled as continuous functions that are either estimated from observations (as in Option 1) or designed by considering a variety of information. Such univariate functions could have any shape as long as they produce values between 0.0 and 1.0. We find two forms especially useful.
Logistic curves are useful for representing nonlinear effects of a variable on survival (Fig.
Example logistic curve for S, representing trout survival of high-temperature stress. The round symbols represent observations from two laboratory studies of daily survival at temperatures of 24, 26, and 28 °C. The solid curve is a logistic function fit to the three observations, the model of daily S. The square symbols indicate the two parameters that define the logistic curve: X0.1 and X0.9 are 30.2 and 25.8 °C. The dotted curve shows 10-day survival probability, equal to S10.
The equation for a logistic function is:
Eq. 1
where Z is a function of the X value:
. Eq. 2
We find it convenient to define A and B, and therefore the shape of the logistic curve, via two model parameters, X0.1 and X0.9, which are the X values that produce Y values of 0.1 and 0.9 (Fig.
Because of the exp functions in equations 1 and 2, logistic functions are particularly vulnerable to variable overflow/underflow errors, which occur when the computer tries to produce a floating-point number larger or smaller than it can handle. The ranges of values causing errors depend on the software platform and sometimes the hardware. These errors can be avoided while providing the ability to evaluate logistic functions over wide ranges by adding code that sets Y to 1.0 when Z is greater than (e.g.) 35 (corresponding to Y = 0.999999999999999 in 64-bit Excel) and to 0.0 when Z is less than -200 (corresponding to Y = 1.4E-87).
Linear models of survival can be useful for chronic risks that persist over long times and cannot be alleviated rapidly. We use a linear model of how survival of starvation and disease depends on an individual’s energy reserves, in models in which energy reserves can change relatively little in one time step. If we define an individual variable energy-deficit as the fraction by which energy reserves are below those of a healthy individual (so, e.g., energy-deficit = 0.2 means the individual’s energy is 20% below a healthy value), we can model the daily probability of not starving SS as linear: SS = 1.0 – (PS×energy-deficit) where PS is a parameter. Higher values of PS cause survival to decrease more rapidly as energy decreases.
This kind of linear model produces a gradually decreasing probability of surviving prolonged periods of risk, e.g., when energy-deficit is constantly positive or increasing steadily due to constant weight loss (Fig.
Example linear survival model, representing survival of starvation and disease SS as a function of energy deficit. The solid lines indicate the value of SS when PS is 0.2, when energy-deficit (left) is constant at 0.2, and (right) starts at 0.0 and increases by 0.01 on each of 30 days. The dotted lines indicate the cumulative probability of surviving from day 0.
Univariate models can be fit to data, including data from multiple sources, or even to a set of assumed values. Both linear and logistic functions can be fit via regression when suitable data are available, and via other techniques when data are not suitable for regression. For example, the logistic parameters X0.1 and X0.9 illustrated in Fig.
None of the previous three methods for modeling S have all of the following characteristics that we find essential for modeling some kinds of risk, especially predation:
Simply fitting a multivariate equation for S would not provide these characteristics, so we developed the following method for modeling complex survival probabilities. It uses separate functions to represent the effect of each variable, with those effects then combined into a survival probability. As we discuss below, this approach assumes that variables driving S have independent effects.
First, we identify a minimum survival probability Smin, which is a model parameter. The value of Smin represents survival under the least-safe conditions.
Second, we estimate “survival increase functions” (SIFs) for each variable (of model individuals, their habitat, or anything else) that affects survival. The SIFs produce “survival increase” values that range from 0.0 to 1.0 and represent the degree to which the variable increases survival probability: a SIF value of 0.0 provides no increase in survival and a value of 1.0 makes the individual completely safe (S = 1.0). However, SIFs have no other limitations on their form or origin: each can be a different function type developed from different information.
The function types described above as univariate models of S are also useful as SIFs. Most of the SIFs we use are logistic curves, but we also use discrete functions to represent the effects of boolean (true-false) or categorical variables. For example, a SIF for use of hiding behavior can simply be: survival increase is 0.8 if the individual is hiding and 0.0 if not. A SIF for the effect of light on risk could simply be: survival increase is 0.0 in daytime, 0.6 during twilight, and 0.9 at night.
A SIF can also be modified so that its value never reaches 1.0—no values of its variable make individuals completely safe. For example, a model of predation survival could include a variable representing vegetation density, and assume that survival probability is higher when vegetation is denser but some predators can be successful in even the densest vegetation. We can model that effect with a SIF that uses a logistic curve limited to values less than 1.0; such a curve only requires one additional parameter for the maximum survival increase, which all logistic curve values are multiplied by.
Example SIFs (for the risk of terrestrial predators on trout) are illustrated in Fig.
Example survival increase functions, for variables affecting terrestrial predation risk to trout in Table
The third step is to combine the SIFs into a value of S, using a method (
. Eq. 3
Using the example survival increase values of Fig.
One limitation of this method is that it does not represent interactions among the ecological factors affecting risk, e.g., the relation between depth and S depending on the value of light intensity. Such interactions could of course be added to a model, if the benefits of additional realism appear to outweigh the costs of substantial additional model complexity.
The final step in using this method is to estimate a value for Smin. We typically do this via calibration of the full model, searching for Smin values that produce reasonable model results. In many models, values of such survival parameters are found by fitting results to observed or assumed values of abundance or survival rate. However, models with tradeoff behaviors that relate (e.g.) growth to risk may also produce growth and size results that also respond strongly to Smin. For such models, it may be necessary to calibrate Smin simultaneously with parameters that drive growth.
If a model includes more than one type of risk that uses this method for S (e.g., predation by terrestrial animals and fish, in Table
The main advantages of the SIF approach is that it allows models of survival to make use of many kinds of data and mechanistic understanding. Each SIF can be based on the best information available for the relation it represents, whether that information is from field observations, controlled field or laboratory experiments, or only conceptual models. Observations of realized predation, such as those we reference in the Introduction, can inform estimates of habitat effects on predation risk after confounding factors are taken into account. However, here we focus on (a) field experiments designed to evaluate how risk varies along specific gradients and (b) use of information on mechanisms driving risk.
Field observations that contrast survival across relevant gradients in habitat or in the characteristics of individuals—on spatial and temporal scales relevant to the model in use—provide ideal information for building survival increase functions. Such observations are not common in the scientific literature, although the most feasible spatial and temporal scales for field experiments (e.g., a few square meters with trials of one day or less) are likely to align with the scales of many individual-based models. However, animals generally do not voluntarily occupy high-risk habitat, so enclosures or tethering (e.g.,
An alternative to directly measuring risk is to quantify the prey’s perception of risk or antipredator behavior. For a variety of reasons, the perception of, and responses to, predation risk by prey may not match actual risk (
In addition to direct measures of relative survival or the perception of risk by prey, we can use basic life history information and fundamental principles of physiology and ecology to identify specific mechanisms through which variables affect risk and then to evaluate SIFs that represent those mechanisms. This approach is highly dependent on the system being modeled; we illustrate it with examples from the trout predation relations identified in Table
Survival is clearly a key ecological mechanism, not only because it directly affects abundance but also because risk-avoidance behaviors can have strong indirect effects on individual fitness. IBMs and other mechanistic models designed to address problems driven in part by survival therefore are likely to need realistic representations of how mortality risks vary. This need is especially strong for models that represent behaviors that trade off risk and other elements of fitness. We previously (
Understanding and modeling risk and its effects are challenging because animals (and plants) are subject to a variety of risks, and each kind of risk can vary with characteristics of the individuals, their habitat, their behavior, and interactions such as competition and cooperation. Risk is also challenging to model because observed mortality rates are not necessarily directly related to intrinsic predation risk. Observed mortality is instead often a complex outcome of intrinsic risk (e.g., predator density), risk reduction behavior, and population status. Disease mortality rates, for example, could be a function of pathogen distributions (the intrinsic risk), the energy available to individuals to fight infection, and the frequency of infectious interactions among individuals. Predation mortality rates can be a function of predator densities, predator avoidance behavior, and prey densities. Consequently, observed mortality rates—even if we know the cause of mortality—do not necessarily provide the information we need to model risk.
In our experience, the criteria for a useful multivariate model of survival probability listed above (Multivariate models using “survival increase functions”) are very important: to produce IBMs mechanistic enough to be reliable under diverse and novel conditions, without undue effort, we need approaches that let us model the effects of multiple variables, using different information and function forms for each variable, while the effect of each variable is easy to see and understand. The survival increase function method we propose meets those criteria and we expect it to be generally useful in a variety of models. However, we also expect that other modelers will develop useful alternatives.
The authors have declared that no competing interests exist.
No ethical statement was reported.
No funding was reported.
Conceptualization: SFR, BCH. Writing – original draft: SFR, BCH.
Steven F. Railsback https://orcid.org/0000-0002-5923-9847
Bret C. Harvey https://orcid.org/0000-0003-0439-9656
All of the data that support the findings of this study are available in the main text.