Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. EPFL thesis
  4. A Quasi-Likelihood Approach to Zero-Inflated Spatial Count Data
 
doctoral thesis

A Quasi-Likelihood Approach to Zero-Inflated Spatial Count Data

Monod, Anthea  
2012

The increased accessibility of data that are geographically referenced and correlated increases the demand for techniques of spatial data analysis. The subset of such data comprised of discrete counts exhibit particular difficulties and the challenges further increase when a large proportion (typically 50% or more) of the counts are zero-valued. Such scenarios arise in many applications in numerous fields of research and it is often desirable to infer on subtleties of the process, despite the lack of substantive information obscuring the underlying stochastic mechanism generating the data. An ecological example provides the impetus for the research in this thesis: when observations for a species are recorded over a spatial region, and many of the counts are zero-valued, are the abundant zeros due to bad luck, or are aspects of the region making it unsuitable for the survival of the species? In the framework of generalized linear models, we first develop a zero-inflated Poisson generalized linear regression model, which explains the variability of the responses given a set of measured covariates, and additionally allows for the distinction of two kinds of zeros: sampling ("bad luck" zeros), and structural (zeros that provide insight into the data-generating process). We then adapt this model to the spatial setting by incorporating dependence within the model via a general, leniently-defined quasi-likelihood strategy, which provides consistent, efficient and asymptotically normal estimators, even under erroneous assumptions of the covariance structure. In addition to this advantage of robustness to dependence misspecification, our quasi-likelihood model overcomes the need for the complete specification of a probability model, thus rendering it very general and relevant to many settings. To complement the developed regression model, we further propose methods for the simulation of zero-inflated spatial stochastic processes. This is done by deconstructing the entire process into a mixed, marked spatial point process: we augment existing algorithms for the simulation of spatial marked point processes to comprise a stochastic mechanism to generate zero-abundant marks (counts) at each location. We propose several such mechanisms, and consider interaction and dependence processes for random locations as well as over a lattice.

  • Files
  • Details
  • Metrics
Type
doctoral thesis
DOI
10.5075/epfl-thesis-5442
Author(s)
Monod, Anthea  
Advisors
Morgenthaler, Stephan  
Date Issued

2012

Publisher

EPFL

Publisher place

Lausanne

Thesis number

5442

Total of pages

198

Subjects

Generalized linearmodels (GLM)

•

Generalized estimating equations (GEE)

•

Zero-inflated Poisson (ZIP) models

•

Spatial analysis

•

Marked point processes

•

Modèles linéaires généralisés (GLM)

•

équations d'estimation généralisées (GEE)

•

modèles de Poisson modifiés en zéro (ZIP)

•

analyse spatiale

•

processus ponctuels marqués

Note

AMS Subject Classification: 60G55, 62J12, 62M3, 82B20, 92D40

EPFL units
STAP  
Faculty
SB  
School
MATHAA  
Doctoral School
EDMA  
Available on Infoscience
July 23, 2012
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/84141
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés