欢迎访问《应用生态学报》官方网站,今天是 分享到:

应用生态学报 ›› 2012, Vol. 23 ›› Issue (02): 452-458.

• 研究论文 • 上一篇    下一篇

互信息理论结合决策树算法的土壤质量预测

林芬芳1,王珂2**,杨宁2,严世光3,郑辛煜2   

  1. 1南京信息工程大学遥感学院, 南京 210044; 2浙江大学农业遥感与信息技术应用研究所, 杭州 310058; 3西南大学资源与环境学院, 重庆 400715
  • 出版日期:2012-02-18 发布日期:2012-02-18

Prediction of regional soil quality based on mutual information theory integrated with decision tree algorithm.

LIN Fen-fang1, WANG Ke2, YANG Ning2, YAN Shi-guang3, ZHENG Xin-yu2   

  1. 1School of Remote Sensing, Nanjing University of Information Science & Technology, Nanjing 210044, China; 2Institute of Agricultural Remote Sensing and Information Technology, Zhejiang University, Hangzhou 310058, China; 3College of Resources and Environments, Southwest University, Chongqing 400715, China
  • Online:2012-02-18 Published:2012-02-18

摘要: 在充分利用土壤类型、土地利用方式、岩性类型、地形、道路、工业类型等影响土壤质量主要因素,准确获取区域土壤质量的空间分布特征的基础上,采用互信息理论对13个辅助变量(岩性类型、土地利用方式、土壤类型、到城镇的距离、到道路的距离、到工业用地的距离、到河流的距离、相对高程、坡度、坡向、平向曲率、纵向曲率和切线曲率)进行筛选,然后通过决策树See5.0预测研究区土壤质量.结果表明: 影响研究区土壤质量的主要因素包括土壤类型、土地利用方式、岩性类型、到城镇的距离、到水域的距离、相对高程、到道路的距离和到工业用地的距离;以互信息理论选取的因子为预测变量的决策树模型精度明显优于以全部因子为预测变量的决策树模型,在前者的决策树模型中,无论是决策树还是决策规则,分类预测精度均达到80%以上.互信息理论结合决策树的方法在充分利用连续型和字符型数据的基础上,不仅精简了一般决策树算法的输入参数,而且能有效地预测和评价区域土壤质量等级.

关键词: 土壤质量, 预测, 决策树, 互信息

Abstract: In this paper, some main factors such as soil type, land use pattern, lithology type, topography, road, and industry type that affect soil quality were used to precisely obtain the spatial distribution characteristics of regional soil quality, mutual information theory was adopted to select the main environmental factors, and decision tree algorithm See5.0 was applied to predict the grade of regional soil quality. The main factors affecting regional soil quality were soil type, land use, lithology type, distance to town, distance to water area, altitude, distance to road, and distance to industrial land. The prediction accuracy of the decision tree model with the variables selected by mutual information was obviously higher than that of the model with all variables, and, for the former model, whether of decision tree or of decision rule, its prediction accuracy was all higher than 80%. Based on the continuous and categorical data, the method of mutual information theory integrated with decision tree could not only reduce the number of input parameters for decision tree algorithm, but also predict and assess regional soil quality effectively.

Key words: soil quality, prediction, decision tree, mutual information