Random forests in the zero to k inflated Power series populations

  • Hadi Saboori Ferdowsi University of Mashhad
  • Mahdi Doostparast
Keywords: Random forest; Regression tree; zero to k inflated Power series model; Regression

Abstract

Tree-based algorithms are a class of useful, versatile, and popular tools in data mining and machine learning. Indeed, tree aggregation methods, such as random forests, are among the most powerful approaches to boost the performance of predictions. In this article, we apply tree-based methods to model and predict discrete data, using a highly flexible model. Inflation may occur in discrete data at some points, such as zero, one, or another value, and may even occur at two or more points. We use models for inflated data sets based on a common discrete family, the Power series models, which is one of the most famous families used in such models. This family includes common discrete models such as the Poisson, Negative Binomial, Multinomial, and Logarithmic series models.

The main idea of this article is to use zero to k (k = 0, 1, ...) inflated regression models based on the Power series family to fit regression trees and random forests. An important point of these models is that they can be used not only for inflated discrete data but also for non-inflated discrete data. Indeed, this model can be used for a wide range of discrete data sets.
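To make the idea of inflation at the points 0, 1, ..., k concrete, the following is a minimal sketch of a zero to k inflated probability mass function with a Poisson base, one member of the Power series family. It assumes the usual mixture form, in which each point j in 0..k receives an extra mass pi_j and the base distribution is scaled by 1 - (pi_0 + ... + pi_k); this parameterization and the names `poisson_pmf`, `zki_poisson_pmf`, and `pis` are illustrative assumptions, not taken from the article.

```python
import math


def poisson_pmf(y, lam):
    """Poisson probability mass at the non-negative integer y."""
    return math.exp(-lam) * lam ** y / math.factorial(y)


def zki_poisson_pmf(y, lam, pis):
    """Zero to k inflated Poisson mass at y (assumed mixture form).

    pis = (pi_0, ..., pi_k) are the extra masses placed on 0, ..., k.
    An empty pis gives the ordinary, non-inflated Poisson model.
    """
    if any(p < 0 for p in pis) or sum(pis) > 1:
        raise ValueError("inflation masses must be non-negative and sum to at most 1")
    if y < 0:
        return 0.0
    base_weight = 1.0 - sum(pis)                # weight left for the base model
    extra = pis[y] if y < len(pis) else 0.0     # inflation applies only at 0..k
    return extra + base_weight * poisson_pmf(y, lam)


# Hypothetical parameters: inflation at 0 and 1 (k = 1), Poisson mean 2.
pis = (0.2, 0.1)
total = sum(zki_poisson_pmf(y, 2.0, pis) for y in range(60))
```

With `pis = ()` the function reduces to the plain Poisson mass, which mirrors the abstract's point that the same model covers both inflated and non-inflated discrete data.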
Published
2023-08-03
How to Cite
Saboori, H., & Doostparast, M. (2023). Random forests in the zero to k inflated Power series populations. Statistics, Optimization & Information Computing, 11(4), 865-875. https://doi.org/10.19139/soic-2310-5070-1773
Section
Research Articles