Optimization of the frequency of memory testing of computing systems
Abstract
One of the effective measures to increase readiness for error—free fault-tolerant operation of computer systems is their periodic test control combined with operational control. At the same time, the problem arises of resolving a technical contradiction caused by the fact that an increase in the frequency of testing activation leads to an increase in downtime, and a decrease leads to an increase in the likelihood of dangerous operating conditions in conditions of undetected failures. The paper proposes an approach for finding the optimal frequency of testing based on Markov models for systems with dedicated computing and memory nodes. The practical significance of the study lies in the possibility of using the model to increase the system's error-free availability while reducing downtime during testing.
Full Text:
PDF (Russian)References
Decree of the Government of the Russian Federation dated December 28, 2022 No. 4261-r "On approval of the Strategy for the Development of the automotive industry of the Russian Federation until 2035": [Electronic resource]. –– URL: https://www.garant.ru/products/ipo/prime/doc/405963861 /
Polovko A.M., Gurov S.V. Fundamentals of reliability theory. St. Petersburg: BHV-Petersburg, 2006. 702 p.
Kazarin O.V., Shubinsky I.B. Reliability and security of software. Moscow: Yurayt, 2018. 342 p.
Astakhova T.N., Verzun N.A., Kasatkin V.V., Kolbanev M.O., Shamin A.A. Sensor network connectivity models. Informatsionno-upravliaiushchie sistemy [Information and Control Systems], 2019. N 5. P. 38–50. doi:10.31799/16848853-2019-5-38-50. https://doi.org/10.31799/1684-8853-2019-5-38-50
Burkov A. A., Rachugin R. O., Turlikov A. M. The impact of the number of unique preambles on the stability region of the ALOHA algorithm with early feedback. Information and Control Systems, 2024, No. 6, pp. 58-65. doi:10.31799/1684-8853-2024-6-58-65
Tatarnikova T.M., Arkhiptsev E.D., Karmanovsky N.S. Determining the cluster size and the number of replicas of highly loaded information systems Izvestia of Higher Educational Institutions. Instrument engineering. 2023. Vol. 66. No. 8. pp. 646-651
Krylov D. R., Poymanova E. D., Tyurlikov A.M. A model of a replicated data storage system using the average age of information as an indicator of data relevance. Information and Control Systems, 2024, No. 3, pp. 11-23. doi:10.31799/1684-8853-2024-3-11-23
Sorin, Daniel. (2009). Fault Tolerant Computer Architecture. 10.2200/S00192ED1V01Y200904CAC005. URL: https://www.researchgate.net/publication/220696325_Fault_Tolerant_Computer_Architecture
Bogatyrev V.A., Bogatyrev S.V. Reliability of multicluster systems with the redistribution of query flows // Izv. vuzov: Instrument Engineering. 2017. Vol. 60. No. 2. pp. 171-177.
Shubinsky I.B. Reliable fault-tolerant information systems. Synthesis methods. Ulyanovsk: Printing Yard, 2016. 544 p.
Klimenko A.B. "Efficient allocation of computing resources in geo-distributed heterogeneous dynamic computing environments." MODELING, OPTIMIZATION AND INFORMATION TECHNOLOGY (2024)
Ananyev, A.V. et al. "Reliability models of complex state control in spatially distributed information security systems." Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics (2022)
Yarmolik, V. N. Control and diagnostics of computing systems / B. N. Yarmolik. Minsk : Bestprint, 2019. 387 p.: ill. - ISBN 978-985 -90509-5-4 .
Platonov A., V. I. Timofeev. "Integrity control of dynamic objects of computing systems using metric standards." (2015).
Ruban I. V., Martovitsky V. A., Lukova-Chuiko N. V.
Development of a monitoring model for cluster supercomputers. URL: https://media.neliti.com/media/publications/306922-designing-a-monitoring-model-for-cluster-972bdd39.pdf
Goncharenko V.A., Khomonenko A.D., Khalil M.M. Study of load balancing in three-channel cluster systems with adaptive dispatching //High-tech technologies in space exploration of the Earth. – 2025. – Vol. 17. No. 2. – pp.19-31.
Khomonenko A.D., Blagoveshchenskaya E.A., Prourzin O.V., Andruk A.A. Forecasting the reliability of a cluster computing system using a semi-Markov model of alternating processes and monitoring // High-tech technologies in Earth space research, 2018, vol. 10, No. 4, pp. 72-82.
Bogatyrev, V. A. Optimization of intervals for checking information security of systems / V. A. Bogatyrev, A.V. Bogatyrev, S. V. Bogatyrev // Scientific and Technical Bulletin of Information Technologies, Mechanics and Optics. – 2014. – № 5(93). – Pp. 119-125. – EDN QWOGHR.
Bogatyrev V.A., Vinokurova M.S., Petrov P.A., Nazarova M.L., Shabakov R.V. Control and safety of functioning of duplicated computer systems // Scientific and Technical Bulletin of Information Technologies, Mechanics and Optics. 2017. Vol. 17. No. 2. pp. 368-372. doi: 10.17586/2226-1494-2017-17-2-368-372
In A. Bogatyrev, D E. Lisichkin. "Optimization of the frequency of control initialization based on duplicated calculations" // Software products and systems. 2019. №2. URL: https://cyberleninka.ru/article/n/optimizatsiya-periodichnosti-initsializatsii-kontrolya-na-osnove-dublirovannyh-vychisleniy.
Bogatyrev V.A., Bogatyrev A.V. Optimization of the frequency of monitoring the security of computer systems // Scientific and Technical Bulletin of Information Technologies, Mechanics and Optics. 2015. Volume 15. No. 2. pp. 300-304.
Refbacks
- There are currently no refbacks.
Abava Кибербезопасность Monetec 2026 СНЭ
ISSN: 2307-8162