МОДЕЛЮВАННЯ ТА ОПТИМІЗАЦІЯ ДИНАМІЧНОГО КОНТРОЛЮ В СТОХАСТИЧНИХ СИСТЕМАХ НА ОСНОВІ МАШИННОГО НАВЧАННЯ

S. I. Zhyr; G. A. Shyshkanova; T. A. Zaytseva; T. I. Sliusarova

doi:10.32782/2521-6643-2026-2-72.8

S. I. Zhyr University of Customs and Finance https://orcid.org/0009-0006-2410-6792
G. A. Shyshkanova Zaporizhia Polytechnic National University https://orcid.org/0000-0002-0336-2803
T. A. Zaytseva Oles Honchar Dnipro National University https://orcid.org/0000-0002-6346-3390
T. I. Sliusarova Zaporizhia Polytechnic National University https://orcid.org/0000-0001-6655-0492

DOI: https://doi.org/10.32782/2521-6643-2026-2-72.8

Keywords: modeling, optimization methods, reinforcement learning, software, Poisson distribution, probability, intelligent logistics systems, mass service

Abstract

The problem of optimizing the functioning of the entry group of a container terminal operating under conditions of stochastic uncertainty of transport flows caused by global logistics trends and random external factors is considered. The relevance of the chosen direction is due to the need to reduce truck waiting time, minimize operating costs and level the negative environmental impact from excess emissions during transport downtime in queues. Obviously, a simple expansion of the physical infrastructure is often economically impractical. A transition to an intelligent hybrid dynamic control model based on the reinforcement learning (RL) paradigm is proposed, which allows the system to adaptively regulate the number of active service channels. The developed model is based on a Markov decision-making process. To adequately reproduce the real dynamics of truck arrivals, a Poisson distribution is used. The entry group is represented through a discrete approximation of the classical mass service model. The use of Q-learning ensures finding the optimal control policy even in the absence of exhaustive a priori information, allowing the agent to «learn» directly during interaction with the environment. The study illustrates the evolution of agent learning and confirms its convergence to a theoretically justified optimal strategy. The modeling results show that the implementation of RL methods contributes to effective smoothing of peak loads, a significant reduction in queue length and an overall increase in terminal throughput. The possibilities of scaling the model through the integration of deep neural networks are considered, which allows operating with large data sets and complex state spaces. The Hamilton–Jacobi–Bellman equation, which determines the optimality limits in continuous control problems, is a theoretical verification of the obtained strategies. The proposed approach has practical significance for the development of logistics systems, as it allows integrating hybrid intelligent algorithms into infrastructure management and ensures optimization of economic indicators.

References

1. Chargui, K., Zouadi, T., Sreedharan, V. R., Fallahi, A. & Reghioui, M. (2023). A novel robust exact decomposition algorithm for berth and quay crane allocation and scheduling problem considering uncertainty and energy efficiency. Omega, 118, 102868, https://doi.org/10.1016/j.omega.2023.102868
2. Cheng, T. T. (2014). Queuing Model of Container Terminal Logistics System in Event Scheduling. Advanced Materials Research, 971–973, 2358–2360. https://doi.org/10.4028/www.scientific.net/amr.971-973.2358
3. Hall, R. W. (2003). Transportation Queueing. In: Hall, R. W. (eds) Handbook of Transportation Science. International Series in Operations Research & Management Science, vol 56. Springer, Boston, MA. https://doi.org/10.1007/0-306-48058-1_5
4. Wang, T., Tian, X. & Wang, Y. (2020). Container slot allocation and dynamic pricing of time-sensitive cargoes considering port congestion and uncertain demand. Transportation Research Part E: Logistics and Transportation Review, 144, 102149, https://doi.org/10.1016/j.tre.2020.102149
5. Dvigun, A., Datsii, O., Levchenko, N., Shyshkanova, G., Platonov, O., & Zalizniuk, V. (2022). Increasing Ambition to Reduce the Carbon Trace of Multimodal Transportation in the Conditions of Ukraine’s Economy Transformation Towards Climate Neutrality. Science and Innovation, 18(1), 96–111. https://doi.org/10.15407/scine18.01.096
6. Bouyahia, F., Belaqziz, S., Meliani, Y., Lissane Elhaq, S. & Boukachour, J. (2025). A Novel Truck Appointment System for Container Terminals. Sustainability, 17(13), 5740. https://doi.org/10.3390/su17135740
7. Silva, M. R. F., Agostino, I. R. S. & Frazzon, E. M. (2023). Integration of machine learning and simulation for dynamic rescheduling in truck appointment systems. Simulation Modelling Practice and Theory, 125, 102747. https://doi.org/10.1016/j.simpat.2023.102747
8. Abeysooriya, H., Weerasinghe, B. A. & Perera, H. N. (2024). Optimizing Gate Queuing at Container Terminals to Facilitate Green Operations. IFAC-PapersOnLine, 58(19), 307-312. https://doi.org/10.1016/j.ifacol.2024.09.201
9. Datsii, O., Levchenko, N., Shyshkanova, G., Platonov, O. & Abuselidze, G. (2021). Creating a Regulatory Framework for the ESG-investment in the Multimodal Transportation Development. Rural Sustainability Research, 46(341), 39-52. https://doi.org/10.2478/plua-2021-0016
10. Zhang, L., Zeng, Q. & Wang, L. (2024). How to Achieve Comprehensive Carbon Emission Reduction in Ports? A Systematic Review. Journal of Marine Science and Engineering, 12(5), 715. https://doi.org/10.3390/jmse12050715
11. Çolak, M., Heilig, L. & Voß, S. (2025). Reinforcement learning in the context of container terminals. Flexible Services and Manufacturing Journal. https://doi.org/10.1007/s10696-025-09643-4
12. Kiseleva, E.M., Prytomanova, O.M., Hart, L.L., Zaytseva, T.A. & Kuzenkov O.O. (2024). Аpplication of mathematical methods of artificial intelligence to solve problems of optimal set partitioning. Питання прикладної математики та математичного моделювання, 27, 89-98. https://doi.org/10.15421/32242401
13. Cheng, S., Liu, Q., Jin, H., Zhang, R., Ma, L. & Kwong, C. F. (2025). Collaborative optimization of truck scheduling in container terminals using graph theory and DDQN. Scientific reports, 15(1), 6950. https://doi.org/10.1038/s41598-025-91140-7
14. Yan, Y., Chow, A.H.F., Ho, C. P., Kuo, Y.-H., Wu, Q. & Ying Ch. (2022). Reinforcement learning for logistics and supply chain management: Methodologies, state of the art, and future opportunities, Transportation Research Part E: Logistics and Transportation Review, 162, 102712, https://doi.org/10.1016/j.tre.2022.102712

MODELING AND OPTIMIZATION OF DYNAMIC CONTROL IN STOCHASTIC SYSTEMS BASED ON MACHINE LEARNING

Abstract

References

Most read articles by the same author(s)