Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Logistic regression for clustered data from environmental monitoring programs
Umeå University, Faculty of Social Sciences, Umeå School of Business and Economics (USBE), Statistics.
Umeå University, Faculty of Science and Technology, Department of Ecology and Environmental Sciences.
Show others and affiliations
2018 (English)In: Ecological Informatics, ISSN 1574-9541, E-ISSN 1878-0512, Vol. 43, p. 165-173Article in journal (Refereed) Published
Abstract [en]

Large-scale surveys, such as national forest inventories and vegetation monitoring programs, usually have complex sampling designs that include geographical stratification and units organized in clusters. When models are developed using data from such programs, a key question is whether or not to utilize design information when analyzing the relationship between a response variable and a set of covariates. Standard statistical regression methods often fail to account for complex sampling designs, which may lead to severely biased estimators of model coefficients. Furthermore, ignoring that data are spatially correlated within clusters may underestimate the standard errors of regression coefficient estimates, with a risk for drawing wrong conclusions. We first review general approaches that account for complex sampling designs, e.g. methods using probability weighting, and stress the need to explore the effects of the sampling design when applying logistic regression models. We then use Monte Carlo simulation to compare the performance of the standard logistic regression model with two approaches to model correlated binary responses, i.e. cluster-specific and population-averaged logistic regression models. As an example, we analyze the occurrence of epiphytic hair lichens in the genus Bryoria; an indicator of forest ecosystem integrity. Based on data from the National Forest Inventory (NFI) for the period 1993-2014 we generated a data set on hair lichen occurrence on > 100,000 Picea abies trees distributed throughout Sweden. The NFI data included ten covariates representing forest structure and climate variables potentially affecting lichen occurrence. Our analyses show the importance of taking complex sampling designs and correlated binary responses into account in logistic regression modeling to avoid the risk of obtaining notably biased parameter estimators and standard errors, and erroneous interpretations about factors affecting e.g. hair lichen occurrence. We recommend comparisons of unweighted and weighted logistic regression analyses as an essential step in development of models based on data from large-scale surveys.

Place, publisher, year, edition, pages
Elsevier, 2018. Vol. 43, p. 165-173
Keywords [en]
Bryoria, Cluster-specific model, Complex sampling design, Correlated data, Logistic regression, tional forest inventory, Population-averaged model
National Category
Environmental Sciences
Identifiers
URN: urn:nbn:se:umu:diva-145390DOI: 10.1016/j.ecoinf.2017.10.006ISI: 000424721000015OAI: oai:DiVA.org:umu-145390DiVA, id: diva2:1186835
Available from: 2018-03-01 Created: 2018-03-01 Last updated: 2018-06-09Bibliographically approved

Open Access in DiVA

fulltext(2255 kB)73 downloads
File information
File name FULLTEXT01.pdfFile size 2255 kBChecksum SHA-512
9adf0b7735d3e53d75f537540a45ba4c223217fdb060d9b6312d659f2f8b1db85e123120ffc598f13fd384352e749be897a13e28666cfa05e743d34555762b5e
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Ekström, MagnusEsseen, Per-Anders
By organisation
StatisticsDepartment of Ecology and Environmental Sciences
In the same journal
Ecological Informatics
Environmental Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 73 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 126 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf