Influence diagnostics for the Gamma-Pareto regression: a DFFITS-based comparison of residuals

Document Type : Original Article

Authors
1 Department of Statistics, Bahauddin Zakariya university, Multan Pakistan.
2 Department of Statistics, Bahauddin Zakariya University (BZU), Multan, Pakistan,60800
3 Department of Statistics Bahauddin Zakariya University (BZU), Multan, Pakistan,60800
Abstract
The generalized linear models (GLMs) use Gamma-Pareto regression Model (G-PRM) to address the sensitivity of influential observations. Difference of Fits (DFFITS) is a popular technique for identifying influential observations. We apply DFFITS to the G-PRM with various residuals. We present illustrative real and simulated data. One class of adjusted Pearson residuals is more effective in detecting influential observations, considering small or large dispersion parameters. We calculate detection percentages to evaluate the proposed procedure's performance, replicating the process 10,000 times.

The generalized linear models (GLMs) use Gamma-Pareto regression Model (G-PRM) to address the sensitivity of influential observations. Difference of Fits (DFFITS) is a popular technique for identifying influential observations. We apply DFFITS to the G-PRM with various residuals. We present illustrative real and simulated data. One class of adjusted Pearson residuals is more effective in detecting influential observations, considering small or large dispersion parameters. We calculate detection percentages to evaluate the proposed procedure's performance, replicating the process 10,000 times.
Keywords
Subjects

Alzaatreh A, Famoye F, Lee C. Gamma-Pareto distribution and its applications. Journal of Modern Applied Statistical Methods. 2012;11(1):7.
Alzaatreh A, Ghosh I. A study of the Gamma-Pareto (IV) distribution and its applications. Communications in Statistics-Theory and Methods. 2016;45(3):636–654.
Alzaghal A. The exponentiated gamma-Pareto distribution with application to bladder cancer susceptibility. J Stat Appl Probab Lett. 2020;7:39–56.
Amin M, Afzal S, Akram MN, Muse AH, Tolba AH, Abushal TA. Outlier detection in gamma regression using Pearson residuals: Simulation and an application. AIMS
MATHEMATICS. 2022;7(8):15331–15347.
Amin M, Amanullah M, Aslam M. Empirical evaluation of the inverse Gaussian regression residuals for the assessment of influential points. Journal of Chemometrics.
2016;30(7):394–404
Amin M, Amanullah M, Aslam M, Qasim M. Influence diagnostics in gamma ridge regression model. Journal of Statistical Computation and Simulation. 2019;89(3):536–
556.
Amin M, Amanullah M, Cordeiro GM. Influence diagnostics in the gamma regression model with adjusted deviance residuals. Communications in Statistics-Simulation
and Computation. 2017;46(9):6959–6973.
Amin M, Faisal M, Akram MN. Influence diagnostics in the inverse Gaussian ridge regression model: Applications in chemometrics. Journal of Chemometrics.
2021;35(6):e3342.
Amin M, Qasim M, Amanullah M, Afzal S. Performance of some ridge estimators for the gamma regression model. Statistical papers. 2020;61:997–1026.
Anholeto T, Sandoval MC, Botter DA. Adjusted Pearson residuals in beta regression models. Journal of Statistical Computation and Simulation. 2014;84(5):999–1014.
Ashour SK, Said DR, Fahim MAA. The Log-Gamma-Pareto Distribution. International Journal of Sciences: Basic and Applied Research. 2014; 357-374.
Atkinson AC. Two graphical displays for outlying and influential observations in regression. Biometrika. 1981;68(1):13–20.
Atkinson AC. Plots, transformations, and regression: an introduction to graphical methods of diagnostic regression analysis; 1985.
Barnard TE, Booksh KS, Brereton RG, Coomans DH, Deming SN, Hayashi Y, et al. Chemometrics in environmental chemistry-statistical methods, vol. 2. Springer; 1995.
Belsley DA, Kuh E, Welsch R. Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. New York: Wiley; 1980.
Chatterjee S, Hadi AS. Sensitivity Analysis in Linear Regression. Wiley: New York;1988.
Cook RD. Detection of influential observation in linear regression. Technometrics. 1977;19(1):15–18.
Cook RD. Assessment of local influence. Journal of the Royal Statistical Society Series B: Statistical Methodology. 1986;48(2):133–155.
Cook RD, Weisberg S. Residuals and influence in regression; 1982.
Cordeiro GM. On Pearson’s residuals in generalized linear models. Statistics & Probability Letters. 2004;66(3):213–219.
Cox DR, Snell EJ. A general definition of residuals. Journal of the Royal Statistical Society: Series B (Methodological). 1968;30(2):248–265.
Dar AA, Ahmed A, Reshi JA. weighted gamma-pareto distribution and its application. Pakistan Journal of Statistics. 2020;36(4).
De Andrade TA, Fernandez LMZ, Silva FG, Cordeiro GM. The gamma generalized pareto distribution with applications in survival analysis. 2017;.
Dobson AJ, Barnett AG. An introduction to generalized linear models. CRC press; 2002.
Evans DL, Drew JH, Leemis LM. The distribution of the Kolmogorov–Smirnov, Cramer–von Mises, and Anderson–Darling test statistics for exponential populations with estimated parameters. In: Computational probability applications.; 2008.
p. 165–190.
Hadi AS. A new measure of overall potential influence in linear regression. Computational Statistics & Data Analysis. 1992;14(1):1–27.
Hanum H. Modeling Extreme Rainfall with Gamma. Applied Mathematical Sciences. 2015;9(121):6029–6039.
Hanum H, Wigena AH, Djuraidah A, Mangku IW. Developing generalized linear model of Gamma-Pareto Distribution. Far east Mathematical Sciences Journal. 2016;.
Hardin JW, Hilbe JM. Generalized Linear Models and Extensions. 3rd ed. USA: Stata Press Publication; 2012.
Khan JA, Akbar A. Empirical performance of nonparametric regression over LRM and IGRM addressing influential observations. Journal of Chemometrics.
2019;33(7):e3143.
Lawless JF. Statistical Models and Methods for Lifetime Data. New York: Wiley; 2003.
Lee AH. Assessing partial influence in generalized linear models. Biometrics. 1986;44:71–77.
McCullagh P, Nelder JA. Generalized Linear Models. London: Chapman & Hall; 1989.
Nelder J, Wedderburn R. Generalized Linear Models. Journal of the Royal Statistical Society, Series A (General). 1972;135(3):370–384.
Pierce DA, Schafer DW. Residuals in generalized linear models. Journal of the American Statistical Association. 1986;81(396):977–986.
Pregibon D. Logistic regression diagnostics. The Annals of Statistics. 1981;9(4):705–724.
Preisser JS, Qaqish BF. Deletion diagnostics for generalised estimating equations. Biometrika. 1996;83(3):551–562.
Saleem N, Akbar A. Empirical evaluation of the gamma-Pareto regression residuals for influence diagnostics. Journal of Statistical Computation and Simulation.
2024;94(16):3573–3598.
Simas AB, Cordeiro GM. Adjusted Pearson residuals in exponential family nonlinear models. Journal of Statistical Computation and Simulation. 2009;79(4):411–425.
Thomas W, Cook RD. Assessing influence on regression coefficients in generalized linear models. Biometrika. 1989;76(4):741–749.
Vanegas LH, Rondón LM, Cordeiro GM. Diagnostic tools in generalized Weibull linear regression models. Journal of Statistical Computation and Simulation.
2013;83(12):2315–2338.
Williams D. Generalized linear model diagnostics using the deviance and single case deletions. Journal of the Royal Statistical Society: Series C (Applied Statistics).
1987;36(2):181–191.
Zhang J. Powerful goodness-of-fit and multi-sample tests. PhD thesis, Toronto: York University; 2001.
Zheng L, Ismail K, Meng X. Shifted Gamma-Generalized Pareto Distribution model to map the safety continuum and estimate crashes. Safety Science. 2014;64:155–162.
Volume 24, Issue 2
December 2025
Pages 69-90

  • Receive Date 30 October 2025
  • Revise Date 05 January 2026
  • Accept Date 31 January 2026