The Forecast of Streamflow through Göksu Stream Using Machine Learning and Statistical Methods


Ciner M. N., Güler M., Namlı E., Samastı M., Ulu M., Peker İ. B., ...Daha Fazla

WATER (SWITZERLAND), cilt.16, sa.8, ss.1-16, 2024 (SCI-Expanded, Scopus) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 16 Sayı: 8
  • Basım Tarihi: 2024
  • Doi Numarası: 10.3390/w16081125
  • Dergi Adı: WATER (SWITZERLAND)
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Agricultural & Environmental Science Database, Aqualine, Aquatic Science & Fisheries Abstracts (ASFA), CAB Abstracts, Compendex, Environment Index, Food Science & Technology Abstracts, Geobase, INSPEC, Pollution Abstracts, Veterinary Science Database, Directory of Open Access Journals
  • Sayfa Sayıları: ss.1-16
  • İstanbul Üniversitesi-Cerrahpaşa Adresli: Evet

Özet

Forecasting streamflow in stream basin systems plays a crucial role in facilitating effective urban planning to mitigate floods. In addition to employing intricate hydrological modeling systems, machine learning and statistical techniques offer an alternative means for streamflow forecasts. Nonetheless, the precision and dependability of these methods are subjects of paramount importance. This study rigorously investigates the effectiveness of three distinct machine learning techniques and two statistical approaches when applied to streamflow data from the G & ouml;ksu Stream in the Marmara Region of Turkey, spanning from 1984 to 2022. Through a comparative analysis of these methodologies, this examination aims to contribute innovative advancements to the existing methodologies used in the prediction of streamflow data. The methodologies employed include machine learning methods such as Extreme Gradient Boosting (XGBoost), Random Forest (RF), and Support Vector Machine (SVM) and statistical methods such as Simple Exponential Smoothing (SES) and Autoregressive Integrated Moving Average (ARIMA) model. In the study, 444 data points between 1984 and 2020 were used as training data, and the remaining data points for the period 2021-2022 were used for streamflow forecasting in the test validation period. The results were evaluated using various metrics, such as the correlation coefficient (r), mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), coefficient of determination (R2), and Nash-Sutcliffe efficiency (NSE). Upon analyzing the results, it was found that the model generated using the XGBoost algorithm outperformed other machine learning and statistical techniques. Consequently, the models implemented in this study demonstrate a high level of accuracy in predicting potential streamflow in the river basin system.