2025-05-21 イリノイ大学アーバナ・シャンペーン校
<関連情報>
- https://aces.illinois.edu/news/illinois-study-novel-ai-methodology-improves-gully-erosion-prediction-and-interpretation
- https://www.sciencedirect.com/science/article/pii/S0301479725014549?via%3Dihub#ack0010
スタッキングアンサンブルモデルを用いたSHApley Additive exPlanations(SHAP)法によるガリー侵食感受性の予測 Prediction of gully erosion susceptibility through the lens of the SHapley Additive exPlanations (SHAP) method using a stacking ensemble model
Jeongho Han, Jorge A. Guzman, Maria L. Chu
Journal of Environmental Management Available online: 25 April 2025
DOI:https://doi.org/10.1016/j.jenvman.2025.125478
Highlights
- Stacking models enhance the accuracy of gully erosion susceptibility prediction.
- Ensemble models mitigate biases in individual machine learning models in erosion risk assessment.
- SHAP analysis reveals the importance of features and their interactions at the global and local levels.
- Reliable, explainable erosion maps enhance conservation planning and evidence-based policy decisions.
Abstract
This study develops a novel explainable stacking ensemble model that combines the stacked generalization ensemble method with SHapley Additive exPlanations (SHAP) to enhance the prediction and interpretation of gully erosion susceptibility. Applied to Jefferson County, Illinois, our approach leverages Random Forest (RF), Gradient Boosting Machine (GBM), Logistic Regression (LR), and Deep Neural Networks (DNN) as both base and meta-learners in various configurations, resulting in 44 distinct stacking models. The comparative analysis demonstrated the superior predictive performance of the stacked models when evaluated at 200 randomly gully sites selected points based on LiDAR difference observations; all but three exceeded the highest area under the curve (AUC) value of 0.86 achieved by the best-performing base model (GBM). The LR stacking model, combining RF and GBM as base models with LR as the meta-learner, emerged as the most effective, achieving an AUC of 0.916. The resulting gully erosion susceptibility map by the LR stacking model classified 33 % of the agricultural land (89,208 ha) as the “very high” class, compared to 27 %, 87 %, 27 %, and 55 % predicted by individual RF, LR, GBM, and DNN models, respectively. Crucially, SHAP analysis elucidated how changes in feature values influence model behavior, considering feature interactions within both the base models and the meta-learner. The SHAP identified the annual leaf area index (LAI) as the most influential feature in both RF and GBM base models. Additionally, it highlights the significance of the GBM model in comparison to the RF base model in the final decision-making process of the stacking model. By offering a transparent mechanism to evaluate how different features and models contribute to final decisions, this approach can be extended to broader environmental management and policy-making contexts, facilitating more informed and responsible resource allocation.