Accurate prediction of groundwater levels is essential for effective water resource management, especially in regions facing water scarcity or overexploitation of aquifers. The Upper Bhavani River Basin in Coimbatore, Tamil Nadu, is one area where sustainable groundwater management is vital for agriculture and urban water supply. Traditional methods often struggle to capture the complex interactions between various environmental factors and groundwater dynamics. Machine learning approaches have shown promise in addressing these challenges in recent years. This study explores the application of Random Forest Regression (RFR) in predicting groundwater levels with high precision in this critical watershed.
2. Study Area:
The Upper Bhavani River Basin is located in the western part of Coimbatore district, Tamil Nadu, India. The Upper Bhavani River Basin, an integral part of the larger Bhavani River system and a significant tributary of the Cauvery River, is situated in the Western Ghats. The basin is predominantly agricultural, with Horticulture crops being the primary cultivation. Hydro geologically, the area features a description of the aquifer system, rock types, and groundwater availability. The Upper Bhavani River Basin is crucial for irrigation, domestic water supply, and industrial use in the region, making accurate groundwater level predictions essential for sustainable water resource management. The Upper Bhavani Basin faces challenges such as specific water-related issues in the area, e.g., seasonal water scarcity, overexploitation of groundwater, making accurate groundwater level prediction essential for sustainable water management.
Groundwater level forecasting has become increasingly important for sustainable water resource management, especially in regions facing water scarcity. Over the past decade, machine learning techniques have gained prominence due to their ability to handle complex, non-linear relationships in hydrological systems. Random Forest Regression (RFR) has emerged as a powerful tool for groundwater level prediction. Rajaee et al. (2019) compared various machine learning techniques and found that RFR often outperforms other methods in terms of accuracy and robustness. They attributed this to RFR's ability to handle high-dimensional data and its resistance to overfitting. In a study focused on semi-arid regions, Sahoo et al. (2017) demonstrated the effectiveness of RFR in predicting groundwater levels under varying climatic conditions. They highlighted the importance of feature selection in improving model performance. To address data scarcity, Naghibi et al. (2020) proposed a hybrid approach combining RFR with other data-driven methods. Their results showed improved prediction accuracy, especially in areas with limited historical data. Raghavendra et al. (2014) utilized multiple linear regression (MLR) for predicting pest incidence in cotton crops. "Raghavendra et al. (2014) also used MLR for predicting pest incidence of cotton. "For the specific context of river basins, Chen et al. (2020) applied RFR to forecast groundwater levels in a complex river-aquifer system. They found that incorporating river stage data significantly enhanced the model's predictive power. In agricultural watersheds, similar to the Upper Bhavani River Basin, Nair and Kumar (2018) demonstrated the superiority of RFR over traditional time series models. They emphasized the importance of including land use and irrigation data as input features. Recent work by Prasad et al., (2022) has focused on integrating remote sensing data with RFR models. Their approach showed promise in improving long-term groundwater level forecasts, particularly in data-scarce regions.