Is Machine Learning Necessary for Cloud Resource Usage Forecasting?

Esta página aún no ha sido traducida. A continuación se muestra la página en inglés.

martes 14 de noviembre de 2023

11:00am 302-Mountain View and Zoom4 (https://zoom.us/j/4911012202, password:@s3)

Georgia Christofidi, PhD Student, IMDEA Software

Is Machine Learning Necessary for Cloud Resource Usage Forecasting?

Abstract:

Robust forecasts of future resource usage in cloud computing environments enable high efficiency in resource management solutions, such as autoscaling and overcommitment policies. Production-level systems use lightweight combinations of historical information to enable practical deployments. Recently, Machine Learning (ML) models, in particular Long Short Term Memory (LSTM) neural networks, have been proposed by various works, for their improved predictive capabilities. Following this trend, we train LSTM models and observe high levels of prediction accuracy, even on unseen data. Upon meticulous visual inspection of the results, we notice that although the predicted values seem highly accurate, they are nothing but versions of the original data shifted by one time step into the future. Yet, this clear shift seems to be enough to produce a robust forecast, because the values are highly correlated across time. We investigate time series data of various resource usage metrics (CPU, memory, network, disk I/O) across different cloud providers and levels, such as at the physical or virtual machine-level and at the application job-level. We observe that resource utilisation displays very small variations in consecutive time steps. This insight can enable very simple solutions, such as data shifts, to be used for cloud resource forecasting and deliver highly accurate predictions. This is the reason why we ask whether complex machine learning models are even necessary to use. We envision that practical resource management systems need to first identify the extent to which simple solutions can be effective, and resort to using machine learning to the extent that enables its practical use. This talk will be based on work that has been presented in the 14th edition of the annual ACM Symposium on Cloud Computing (SoCC ‘23).