CADDIS Volume 4: Data Analysis

## Predicting Environmental Conditions from Biological Observations (PECBO) Appendix

- Introduction

- Using Existing Taxon-Environment

Relationships - Estimating Taxon-Environment

Relationships - Computing Inferences

- R Scripts

##### Topics in Estimating Taxon-Environment Relationships

### Parametric Regressions

Parametric regressions assume that taxon-environment relationships follow a pre-specified form. A common assumption for taxon-environment relationships is that the distribution of a particular taxon is unimodal with respect to environmental gradients. Then, in the case of presence/absence data, a convenient model for the probability of observing a particular taxon, following ter Braak and Looman (1986), is as follows:

where *p* is the probability of observing the taxon, and the left side of the equation is the logit transformation of this probability. Additionally, *x* is the value of the environmental variable, *u* is the species optimum (the point along the environmental gradient where the probability of observing the species is maximized), *t* is a measure of the niche breadth, and *a* is related to the maximum probability of observation. The constants *b _{0}*,

*b*, and

_{1}*b*can be determined using generalized linear models (GLMs). Then the parameters

_{2}*u*,

*t*, and

*a*can be determined as follows:

Examples of taxon-environment relationships estimated using Equation 3 for two genera, *Heterlimnius* and *Malenka*, with respect to stream temperature are shown in Figure 5. The computed curves closely track the observed capture probabilities for both genera. Confidence limits broaden for *Heterlimnius* as temperatures approach the minimum value, because data are sparse in that region.

Regression models can use multiple environmental variables to predict taxon occurrences. An example model equation is shown below.

The effect of each environmental variable (e.g, *x, y*) on taxon occurrence is modeled as a quadratic function. Also, the possibility that the two variables interact can be modeled. In the case of Equation 7, the interaction between the two variables is modeled as the product of *x* and *y*. Terms can be included or excluded depending upon our understanding of the variables and their effects upon taxon persistence. For example, you might choose to omit the interaction term, xy, if you knew that the two environmental variables did not interact.

Simultaneously modeling several variables often provides a more accurate means of distinguishing between the effects of different correlated stressors (Yuan 2007b).

The use of parametric functions to describe the taxon-environment relationship is both a strength and a weakness of the parametric approach. On the one hand, these functions allow the investigator to summarize the taxon-environment relationship using a short list of pre-defined parameters. On the other hand, the *a priori* assumption of a functional form may restrict the taxon-environment relationship to a shape that is not well supported by field observations. Plots of observed data and modeled functional fits can be inspected to help establish whether the assumed functional forms are appropriate.

Statistical scripts for computing single variable parametric regressions are available under the R scripts tab of this section.

#### Biological inference

Once taxon-environment relationships have been estimated using parametric regression, the most appropriate method to use for computing inferences is a maximum likelihood approach.

#### Abundance data

It also is possible to develop regression relationships that model the abundance of different taxa, rather than their presence or absence.

#### References

**Estimating Taxon-Environment Relationships:** Overview Previous Next