On Zero Truncated Under-dispersed Count Data

Main Article Content

H. B. Lawal

Abstract

In this paper, we employ two and three parameter distributions to model the under-dispersed zero truncated data presented in a recent article. We illustrate with a large frequency data having many categories that, the zero-truncated negative binomial and its mixture distributions always fail to converge and consequently producing estimated probabilities that sum more than 1 within the range of the data. The two-parameter zero-truncated versions of the following distributions are considered here: the generalized Poisson (ZTGP), the new logarithmic distribution (ZTNLD), the new geometric discrete Pareto distribution (ZTNGDP), the generalized Poisson-Lindley (ZTGPLD), the zero-truncated Poisson-Exponential-Gamma (ZTPEGD) and the Quasi-Negative Binomial (ZTQNBD2). The three-parameter distributions similarly considered include the zero-truncated Quasi-negative binomial (ZTQNBD), the zero-truncated Inverse Trinomial (ZTIT), the zero-truncated Delaporte (ZTDLPD), the zero-truncated Negative binomialErlang distribution (ZTNB-ELD), the New three parameter Poisson-Lindley distribution (ZTNTPLD) and the New three-parameter size-biased Poisson- Lindley distribution (NTPSBPLD) zero-truncated versions. Three frequency data sets were employed and we further extend our analyses to zero-truncated count data having co-variates (GLM) by utilizing the Nigerian Health Insurance Survey (NHIS) data. Among the models, the ZTPEGD and the ZTDLPD perform better than all the other models for the example datasets employed in this study. The Delaporte is particularly attractive and easier to program in R since there is a Delaporte package in R that can be appropriately used. All models in
this study were implemented with SAS NLMIXED and corresponding written R codes. Two of the R codes are presented in Appendices I and II.

Downloads

Download data is not yet available.

Article Details

Section

Articles

Author Biography

H. B. Lawal, College of Postgraduate Studies Summit University, Offa, Kwara State, Nigeria

   

How to Cite

Lawal, H. (2025). On Zero Truncated Under-dispersed Count Data. Benin Journal of Statistics, 8(1), 13-35. https://bjs-uniben.org/index.php/home/article/view/40

References

Abouammoh, A.M, Alshangiti, A.M. and Ragab IE (2015). A new Generalized Lindley Distribution. Journal of Statistical Computation and Simulation, 85(10),3662-3678.

Adesina, O.S., Agunbiade, D.A. and Oguntunde, P. E. (2021). Flexible Bayesian Dirichlet mixtures of generalized linear mixed models for count data. Scientific African, 13,1–11.

Atikankul, Y. (2023). A New Generalized Lindley Regression Model. Austrian Journal of Statistics, 52, 39-50.

Awad, Y.S., Bar-Lev,S.K. and Makov, U. (2016). A new class counting distributions embedded in the Lee-Carter model for mortality projections. A Bayesian approach. Technical report, No 146. Actuarial Research Center, University of Haifa. Israel.

Bhati,D. and H.S. Bakouch (2019). A new infinitelydivisible discrete distribution with applications to count data modelling. Communications in Statistics-Theory and Methods, 48 (6), 140–16.

Bodhisuwan,W., Pudprommarat, C., Bodhisuwan, R. and Saothayanun, L. (2017). Zerotruncated Negative Binomial-Erlang Distribution. AIP conference Proceedings, 1905, 050011, American Institute of Physics.

Consul, P.C, and Famoye, F. (1992). Generalized Poisson Regression Model. Comm. Statist. Theory Methods, 21(1), 89–109.

Das, K.K., Ahmad, J, and Bhattachajee, S. (2018). A new three-parameter Poisson-Lindley distribution for modeling over dispersed count data. Internal Journal of Applied Engineering Research, 13(25), 16468–16477.

Delaporte, Pierre J. (1960). ”Quelques problmes de statistiques mathmatiques poses parl’Assurance Automobile et le Bonus pour non sinistre” [Some problems of mathematical statistics as related to automobile insurance and no-claims bonus]. Bulletin Trimestriel de l’Institut des Actuaires Franais (in French). 227: 87102.

El-Morshedy,M, Eliwa,M.S., and Nagy, H. (2020). A new two-parameter exponentiated discrete Lindley distribution: Properties, estimation and applications. Journal of Applied Statistics, 47 (2): 354–75.

G➫omez-D➫eniz, Sarabia, J.M, and Calderin-Ojeda,E. (2011). A new discrete distribution with actuarial applications. Insurance: Mathematics and Economics, 48 (3), 406–12.

G➫omez-D➫eniz, E., Hern➫andez-Bastida, A., and Fern➫andez-S➫anchez, M.P.A. (2016). A Suitable Discrete Distribution for Modelling Automobile Claim Frequencies. Bulletin of the Malaysian Mathe- matical Sciences Society, 39, 633–647.

Hassan, A. and Bilal, S. (2008). On some properties of Quasi-Binomial Distribution and Its Applications. Journal of Modern Applied Statistical Methods, 7(2), 616–63

Janardan, K. G. (1975). Markov-Polya urn-model with predetermined strategies. Gujarat Statist. Rev., 2(1),17–32.

Konggrod,S., Bodhisuwan,W. and Payakkapong, P. (2014). The Negative binomial-Erlang distribution with Applications. International Journal of Pure and Applied Mathematics, 92(3), 380–401.

Lawal, H.B. (1980). Tables of percentage points of Pearson’s goodness-of-fit statistic for use with small expectations. Applied Statistics, 29, 292–298.

Lawal, H.B. (2018). Correcting for Non-Sum to 1 Estimated Probabilities in Applications of Discrete Probability Models to Count Data. International Journal of Statistics and Probability, 6, 119–131.

Lawal, H. B. (2019). Quasi-Negative Binomial, Inverse Trinomial and Negative Binomial Generalized Exponential Distributions and their Applications. Journal of Probability and Statistical Science- JPSS, 17(2), 205–222.

Li, S., Yang,F., Famoye,F., Lee, C., Black, D. (2011). Quasi-negative binomial distribution: Properties and applications. Computational Statistics and Data Analysis, 55(20), 2363-2371.

Mahmoudi, E. and Zakerzadeh,H. (2010). Generalized Poisson-Lindley Distribution. Communication in Statistics: Theory and Methods, 39(10), 1785–1798.

Phang, Y.N. and Loh,E.F. (2014). Statistical Analysis for overdispersed medical Count Data.International Scholarly and Scientific Research, 8(2), 292–294.

Pinheiro, J.C. and Bates,D.M.(1995). Approximations to the Log-Likelihood Function in the Non-linear Mixed-Effects Model Journal of Computational and Graomegacal Statistics, 4, 12–35.

Sankaran, M. (1970). The discrete Poisson-Lindley distribution, Biometrics, 26, 145–149.

Shanker, R., and Shukla, K. K.(2013).On a size-based quasi poisson-lindley distribution. International Journal of probability and Statistics, 2(21), 28–34.

Shanker, R., Shukla, K. K.(2020). A new three-parameter sized-biased Poisson-Lindley distribution with properties and applications. Biometrics and Biostatistics International Journal, 9(1), 1–14.

Shanker,R., and Mishra,A. (2017). On size-based two-parameter poisson-lindley distribution and its applications. American Journal of Mathematics and Statistics, 7(3), 99–107.

Shimizu, K. and Yanagimoto, T. (1991). The inverse trinomial Distribution. Japan Journal of Applied Statistics, 20, 89–96.

Shmueli, G., Minka, T., Borle, J., and Boatwright, P. (2005). A useful distribution for fitting discrete data: revival of the Conway-Maxwell-Poisson distribution. J. R. Stat. Soc., Series C, 54:127–142.

Shoukri, M. M. and Aleid, M.M. (2022). Quasi-Negative Binomial: Properties, Parametric Estimation, Regression Model and Application to RNA-SEQ Data. Open Journal of Statistics, 12. 216–237. DOI:10.4236/ojs.2022.122016

Singh, S.N. and Yadav, R.c. (1971). trends in rural out-migration at household level. Rural Demography, 8(1), 53–61.

Umar, M. A. (2019). A zero-truncated Poisson-Exponential-Gamma Distribution and its Applications. M.Sc. Dissertation, University of Ilorin, Ilorin. Nigeria.

Umar, M.A., Jimoh,K., and Yahya, W.B. (2019). A note on the applications of some zero truncated distributions. Edited Proceedings of the 3rd International Conference of the PSSN, 3, 499–505.

Zakerzadeh, H. and Dolati, A. (2009). Generalized Lindley Distribution. Journal of Mathematical Extension, 3, 13–25.

Zhang, H., Tan, K. and Li, B. (2018). COM-negative binomial distribution:modeling overdispersion and utrahigh zero-inflated count data. Front. Math. China, 13(4), 967–998