REMOTE SENSING, cilt.17, sa.14, ss.2500-2520, 2025 (SCI-Expanded)
Accurate crop yield prediction is essential for stabilizing food supply chains and reducing the uncertainties in financial risks related to agricultural production. Yet, it is even more essential to understand how crop yield models make predictions depending on their relationship to Earth Observation (EO) indicators. This study presents a state-of-the-art explainable artificial intelligence (XAI) method to estimate corn yield prediction over the Corn Belt in the continental United States (CONUS). We utilize the recently introduced Kolmogorov–Arnold Network (KAN) architecture, which offers an interpretable alternative to the traditional Multi-Layer Perceptron (MLP) approach by utilizing learnable spline-based activation functions instead of fixed ones. By including a KAN in our crop yield prediction framework, we are able to achieve high prediction accuracy and identify the temporal drivers behind crop yield variability. We create a multi-source dataset that includes biophysical parameters along the crop phenology, as well as meteorological, topographic, and soil parameters to perform end-of-season and in-season predictions of county-level corn yields between 2016–2023. The performance of the KAN model is compared with the commonly used traditional machine learning (ML) models and its architecture-wise equivalent MLP. The KAN-based crop yield model outperforms the other models, achieving an R2 of 0.85, an RMSE of 0.84 t/ha, and an MAE of 0.62 t/ha (compared to MLP: R2 = 0.81, RMSE = 0.95 t/ha, and MAE = 0.71 t/ha). In addition to end-of-season predictions, the KAN model also proves effective for in-season yield forecasting. Notably, even three months prior to harvest, the KAN model demonstrates strong performance in in-season yield forecasting, achieving an R2 of 0.82, an MAE of 0.74 t/ha, and an RMSE of 0.98 t/ha. These results indicate that the model maintains a high level of explanatory power relative to its final performance. Overall, these findings highlight the potential of the KAN model as a reliable tool for early yield estimation, offering valuable insights for agricultural planning and decision-making.