Modelling count data with excess zeros: An application to health care utilisation data

Peter Moffatt, Shamzaeffa Samsudin

Research output: Contribution to journal › Article › peer-review



This study is concerned with the estimation of microeconometric models of health care utilisation. The data set consists of 14,706 individuals from the General Household Survey for Great Britain, and the dependent variable is the number of General Practitioner (GP) consultations over a 2-week period. A clear feature of this count variable is excess zeros, and it is essential to incorporate this feature in the modelling strategy. Accordingly, in addition to standard Poisson and negative binomial models, zero-inflated, two-part, and latent class models are estimated. The zero-inflated negative binomial (ZINB) model proves, on the basis of several selection criteria, to be superior to the other models for this data set. As anticipated, health-related variables show significant effects in determining health care utilisation, while socioeconomic variables appear to be less important, according to the results from the preferred model. Some effects differ quite markedly between models, underlining the importance of a systematic selection process, of the kind used here, for identifying the best-fitting model.
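
As a rough illustration of how a zero-inflated negative binomial model accommodates excess zeros, the sketch below mixes a point mass at zero with a negative binomial count distribution. It is not the authors' code or specification: the NB2 parameterisation (mean `mu`, dispersion `alpha`), the mixing weight `pi`, and all numeric values are assumptions chosen only for illustration.

```python
import math


def nb_pmf(y, mu, alpha):
    """Negative binomial (NB2) pmf with mean mu > 0 and dispersion alpha > 0.

    Under this parameterisation the variance is mu + alpha * mu**2.
    """
    r = 1.0 / alpha
    log_p = (
        math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
        + r * math.log(r / (r + mu))
        + y * math.log(mu / (r + mu))
    )
    return math.exp(log_p)


def zinb_pmf(y, mu, alpha, pi):
    """Zero-inflated NB pmf: with probability pi the count is a structural zero,
    otherwise it is drawn from the NB2 distribution above."""
    base = (1.0 - pi) * nb_pmf(y, mu, alpha)
    return pi + base if y == 0 else base


# Hypothetical parameter values, not estimates from the paper.
mu, alpha, pi = 0.5, 1.0, 0.3

p_zero_nb = nb_pmf(0, mu, alpha)          # zeros implied by the plain NB model
p_zero_zinb = zinb_pmf(0, mu, alpha, pi)  # zeros after adding zero inflation
total = sum(zinb_pmf(y, mu, alpha, pi) for y in range(200))

print(f"P(Y=0) under NB:   {p_zero_nb:.4f}")
print(f"P(Y=0) under ZINB: {p_zero_zinb:.4f}")
print(f"Sum of ZINB pmf over 0..199: {total:.6f}")
```

The extra mass `pi` at zero is what lets the model fit a share of non-users larger than the count distribution alone would predict. In applied work both `mu` and `pi` would typically depend on covariates and be estimated by maximum likelihood; that step is omitted here.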
Original language: English
Pages (from-to): 201-215
Number of pages: 15
Journal: Malaysian Journal of Economic Studies
Issue number: 2
Publication status: Published - 2014


  • count data
  • excess zeros
  • health care utilization
  • zero-inflated Poisson
  • negative binomial
