欢迎访问《应用生态学报》官方网站,今天是 分享到:

应用生态学报 ›› 2025, Vol. 36 ›› Issue (8): 2407-2419.doi: 10.13287/j.1001-9332.202508.012

• 研究论文 • 上一篇    下一篇

基于Sentinel-2和可解释机器学习的河套平原农田土壤盐分和pH反演

黄华雨1, 丁启东1, 张俊华1*, 潘鑫1, 周跃辉1, 贾科利2   

  1. 1宁夏大学生态环境学院, 银川 750021;
    2宁夏大学地理科学与规划学院, 银川 750021
  • 收稿日期:2025-01-27 接受日期:2025-05-13 出版日期:2025-08-18 发布日期:2026-02-18
  • 通讯作者: *E-mail: zhangjunhua728@163.com
  • 作者简介:黄华雨, 女, 1999年生, 硕士研究生。主要从事遥感监测与土壤质量提升研究。E-mail: huayuhuang1010@163.com
  • 基金资助:
    国家重点研发计划项目(2021YFD1900602)、国家自然科学基金项目(42467036)和宁夏科技创新领军人才项目(2022GKLRLX02)

Inversion of soil salinity and pH in farmland of the Hetao Plain based on Sentinel-2 and explainable machine learning

HUANG Huayu1, DING Qidong1, ZHANG Junhua1*, PAN Xin1, ZHOU Yuehui1, JIA Keli2   

  1. 1College of Ecology and Environmental Science, Ningxia University, Yinchuan 750021, China;
    2College of Geographical Sciences and Planning, Ningxia University, Yinchuan 750021, China
  • Received:2025-01-27 Accepted:2025-05-13 Online:2025-08-18 Published:2026-02-18

摘要: 耕地土壤盐碱化的加剧对农业可持续发展和生态环境构成了重大威胁,土壤含盐量(SSC)和pH值是评估盐碱化程度的关键指标。遥感技术为大范围、高效的土壤盐碱状况监测提供了有力支撑。本研究以河套平原盐碱农田土壤为研究对象,结合实测SSC、pH值和Sentinel-2影像(包括6个波段及24个盐分指数),并引入环境变量、土壤理化属性及合成孔径雷达数据作为建模变量,采用梯度提升机(GBM)进行特征筛选后,基于极端梯度提升(XGBoost)、轻量级梯度提升机(LightGBM)、自适应提升(AdaBoost)、类别提升(CatBoost)、随机森林(RF)和极端随机树(ERT)6种机器学习算法建立SSC和pH值的反演模型,通过夏普利加性解释(SHAP)可视化变量贡献度,并对盐碱化信息空间分布反演制图。结果表明: 研究区土壤盐碱化整体呈轻度至中度,且盐化与碱化存在显著空间异质性。GBM算法通过筛选累积贡献达90%的特征变量,有效降低了模型复杂度,且不同类型变量对盐碱化信息的贡献差异较大。XGBoost和ERT模型分别在SSC和pH值反演中表现最佳,模型验证R2分别为0.925和0.818。SHAP分析显示,盐分指数对SSC和pH值的累计贡献度分别为34.9%和34.2%,位于所有变量之首,其次为土壤理化属性和地形因子,占比为15.7%~23.0%,气候因子和雷达数据的贡献有限,单波段贡献最小。本研究可为类似区域土壤盐碱化信息监测、变量优选及农业改良决策提供参考。

关键词: 土壤盐碱, Sentinel-2影像, 环境变量, 夏普利加性解释, 数字土壤制图

Abstract: The escalating salinization and alkalization of arable soils represents a significant threat to the sustainable development of agriculture and environment. The assessment of salinization and alkalization can be facilitated by measuring crucial indicators including soil salinity content (SSC) and pH. The utilization of remote sensing technology could facilitate the effective and large-scale monitoring of soil salinity and alkalinity conditions. In this study, we selected the saline and alkaline farmland soil in the Hetao Plain as the research object, integrated measured soil salinity content (SSC), pH, and Sentinel-2 images (comprising six bands and 24 salinity indices), and incorporated environmental variables, soil physicochemical attributes, and synthetic aperture radar (SAR) data as mode-ling variables. Following the implementation of feature screening through the utilization of gradient boosting machine (GBM), we established the inverse models of SSC and pH based on six machine learning algorithms, including extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), adaptive boosting (AdaBoost), category boosting (CatBoost), random forest (RF), and extreme random tree (ERT). We further visuali-zed the variable contributions by Shapley additive explanation (SHAP) interpretation, and realized the inverse mapping for the spatial distribution of salinity and alkalinity information. The results showed that the overall soil salinization and alkalization were at mild to moderate levles, with significant spatial heterogeneity between salinization and alkalization. The GBM algorithm could effectively reduce the model’s complexity by filtering the feature variables with a cumulative contribution of up to 90%. The contribution of different types of variables to the salinization and alkalization information varied significantly. The XGBoost and ERT models demonstrated optimal perfor-mance in the SSC and pH inversions, respectively, with model validation R2 values of 0.925 and 0.818, respectively. The SHAP analysis revealed that the salinity index was the most significant variable that contributed 34.9% to the SSC and 34.2% to the pH, respectively. Soil physicochemical properties and topographic factors exhibited a range of 15.7% to 23.0% contributions. There were minimal contributions from climatic factors and radar data, and the least contribution from single band. The study could offer a scientific reference for the monitoring of soil salinization and alkalization, the selection of variables, and the decision-making process concerning agricultural enhancement in analogous regions.

Key words: soil salinization and alkalization, Sentinel-2 imagery, environmental variable, Shapley additive explanation, digital soil mapping