Preview

Vestnik of Samara State University of Economics

Advanced search

Forecasting and interpreting factors of labor productivity using ensemble machine learning methods

https://doi.org/10.46554/1993-0453-2026-6-260-137-148

Abstract

This study explores the application of nonlinear machine learning models to forecasting regional labor productivity and identifying its key determinants. Several ensemble methods were employed, with the random forest model achieving the best predictive performance. To account for interregional heterogeneity, clustering of labor productivity time series was performed and a categorical label indicating each region’s cluster membership was introduced, which captured structural differences and enhanced the sustainability and interpretability of the forecasts. The forecasting accuracy was compared with that of traditional linear models commonly used in domestic research, revealing a consistent superiority of nonlinear approaches. Interpretation of the final random forest model using the Mean Decrease in Gini criterion indicated that the most influential factors affecting labor productivity are the growth rate of real wages and the dynamics of investment in fixed capital, which aligns with previous empirical findings. Furthermore, the use of SHAP values enabled a more detailed assessment of the direction and magnitude of each feature’s contribution, enhancing the interpretability and transparency of the model. The findings provide evidence supporting the transition from conventional econometric approaches to machine learning methods for forecasting macroeconomic indicators.

About the Author

I. A. Labutkin
ITMO University
Russian Federation

Ivan A. Labutkin – engineer

Saint-Petersburg



References

1. Shumilina V.E., Tsvil M.M. Statistical modeling and forecasting of the labor productivity index in the Russian Federation // Bulletin of Eurasian Science. 2019. Vol. 11, No. 1. P. 46.

2. Shumilina V.E., Tsvil M.M. Building a regression model for time series to forecast the labor productivity index in the Russian Federation // Bulletin of Eurasian Science. 2020. Vol. 12, No. 1. P. 73.

3. Tsvil M.M., Nesterova A.V. Forecasting the labor productivity index for the Central Federal District // Engineering Bulletin of the Don. 2021. No. 3. Pp. 120–129.

4. Dyachkova A.V., Karass V.O. Assessing the impact of wages on labor productivity in Russia: econometric analysis // Managerial Accounting. 2022. No. 10-3. Pp. 710–715. doi:10.25806/uu10-32022710-715.

5. Bakanach O.V., Lopoukhova Ya.S. Statistical modeling of the labor productivity index in the Russian Federation // Science of the 21st Century: Current Directions of Development. 2020. No. 1-1. Pp. 234-238.

6. Bakhtizin A.R., Sulakshin S.S., Kolesnik I.Yu. Wages as a factor in increasing labor productivity // Contours of Global Transformations: Politics, Economics, Law. 2009. Vol. 2, No. 1. Pp. 79–87.

7. Pechura O.V., Polygalova N.Yu. Wages as a factor of labor productivity growth // Alley of Science. 2020. Vol. 1, No. 5. Pp. 168–172.

8. Burtseva T.A. Forecasting regional labor productivity growth // Russian Journal of Labor Economics. 2023. Vol. 10, No. 3. doi:10.18334/et.10.3.117464.

9. Towards more timely measures of labour productivity growth / Y. Dorville [et al.]. OECD Publishing, 2025. doi:10.1787/436ecbb5-en.

10. Bukina T.V., Kashin D.V. Forecasting regional inflation: econometric models or machine learning methods? // HSE Economic Journal. 2024. Vol. 28, No. 1. Pp. 81–107. doi:10.17323/1813-8691-2024-28-1-81-107.

11. Pavlov E. Forecasting inflation in Russia using neural networks // Russian Journal of Money and Finance. 2020. Vol. 79, No. 1. Pp. 57–73. doi:10.31477/rjmf.202001.57.

12. Chakraborty C., Joseph A. Machine learning at central banks. 2017. doi:10.2139/ssrn.3031796.

13. How is machine learning useful for macroeconomic forecasting? / Goulet Coulombe P. [et al.] // Journal of Applied Econometrics. 2022. Vol. 37, No. 5. Pp. 920-964. DOI: 10.1002/jae.2910.

14. Yurtsever M. Unemployment rate forecasting: LSTM-GRU hybrid approach // Journal for Labour Market Research. 2023. Vol. 57, No. 1. P. 18. doi:10.1186/s12651-023-00345-8.

15. Unemployment rate prediction using a hybrid model of recurrent neural networks and genetic algorithms / K. Mero [et al]. // Applied Sciences. 2024. Vol. 14, No. 8. P. 3174. doi:10.3390/app14083174.

16. Breiman L. Random forests // Machine learning. 2001. Vol. 45, No. 1. Pp. 5-32. doi:10.1023/A:1010933404324.

17. Friedman J.H. Greedy function approximation: a gradient boosting machine // Annals of statistics. 2001. Pp. 1189–1232. doi:10.1214/aos/1013203451.

18. Ribeiro M.T., Singh S., Guestrin C. "Why should i trust you?" Explaining the predictions of any classifier // Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 2016. Pp. 1135–1144. doi:10.1145/2939672.2939778.

19. Lundberg S.M., Lee S.I. A unified approach to interpreting model predictions // Advances in neural information processing systems. 2017. Vol. 30. doi:10.5555/3295222.3295230.

20. Federal State Statistics Service : official website. URL: https://rosstat.gov.ru/ (date of access: 05.10.2025).

21. Scikit-learn: machine learning in Python / F. Pedregosa [et al.] // The Journal of machine Learning research. 2011. Vol. 12. Pp. 2825–2830.

22. CatBoost: unbiased boosting with categorical features / L. Prokhorenkova [et al.] // Advances in neural information processing systems. 2018. Vol. 31.


Review

For citations:


Labutkin I.A. Forecasting and interpreting factors of labor productivity using ensemble machine learning methods. Vestnik of Samara State University of Economics. 2026;(6):137-148. (In Russ.) https://doi.org/10.46554/1993-0453-2026-6-260-137-148

Views: 35

JATS XML


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1993-0453 (Print)