Abstract

The Goal of the research is to introducing the principle of Decision Tree that is being used to forecast river pollution, it provides a new method to evaluate the river pollution based on its water quality. We collected monthly monitoring data of water quality from Dezikou River basin of Yilan County, and the data of fish ecology obtained from ecological survey and report, in which to build a water quality and ecology resources database through an actual field investigation. By using data mining software, IBM SPSS Modeler 14.1’s decision tree, conducting the River Pollution Index. Shannon-Weaver diversity, Pielou’s Evenness Index, Margalef’s Species Richness Index, Fish Tolerant Index and Simpson’s Index of Diversity’s classification and prediction, to build a model for river pollution prediction, and to compare this with the Multiple Logistic Regression Analysis. The results showed that the model for river pollution prediction built under the Decision Tree can obtain a better forecast result. The following are the accurate rates of Decision Tree: 88% for CART, 90% for CHAID, 91.67% for C5.0, and 86.11% for Multiple Logistic Regression Analysis. Therefore, the Decision Tree’s algorithm shows a better result in forecasting than the Multiple Logistic Regression Analysis.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call