Model Selection Criteria CV, UBR, and GCV for a Mixed Truncated Spline–Gaussian Kernel Estimator in Health Modeling
DOI:
https://doi.org/10.19139/soic-2310-5070-3472Keywords:
Health Modeling, Cross-Validation, Unbiased Risk, Generalized Cross-Validation, Mixed EstimatorsAbstract
Nonparametric regression modeling has commonly applied a single estimator to all predictor variables. Although this approach is straightforward, it can be overly restrictive because predictors often exhibit heterogeneous relationships with the response variable, including linear trends, smooth nonlinear patterns, abrupt changes, or localized variations. Using a uniform estimator may therefore limit model flexibility and reduce predictive accuracy. To overcome this limitation, this study investigates a Mixed Truncated Spline–Gaussian Kernel Estimator, which allows each predictor to be modeled using the estimation technique most appropriate to its underlying data structure. The main objective of this research is to compare the performance of three smoothing parameter selection criteria, namely Cross-Validation (CV), Unbiased Risk (UBR), and Generalized Cross-Validation (GCV). These criteria are employed to determine optimal smoothing parameters, including the number and locations of knots in the truncated spline component and the bandwidth in the Gaussian kernel component. The empirical analysis is conducted using health-related data on heart disease risk factors, a domain characterized by complex and potentially nonlinear relationships. The results indicate that models incorporating three knot points consistently outperform alternative specifications. This superior performance is reflected in lower values of Mean Squared Error (MSE) and Mean Absolute Percentage Error (MAPE), as well as higher coefficients of determination (R²). Among the selection criteria examined, GCV yields the most accurate and stable model, outperforming both CV and UBR. From a methodological perspective, this study contributes to nonparametric regression by providing a systematic evaluation of smoothing parameter selection within a mixed estimator framework. From an applied standpoint, the proposed approach enhances the modeling of heart disease risk factors by offering greater flexibility and precision. Furthermore, the findings support Sustainable Development Goal (SDG) 3: Good Health and Well-Being by promoting robust, data-driven methods for evidence-based health policy formulation.Downloads
Published
2026-04-13
Issue
Section
Research Articles
License
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
How to Cite
Model Selection Criteria CV, UBR, and GCV for a Mixed Truncated Spline–Gaussian Kernel Estimator in Health Modeling. (2026). Statistics, Optimization & Information Computing. https://doi.org/10.19139/soic-2310-5070-3472