Abstract

Cadmium (Cd) accumulation in edible plant tissues poses a serious threat to human health through the food chain. Assessing the availability of soil Cd is crucial for evaluating associated environmental risks. However, existing experimental methods and traditional models are time-consuming and inefficient. In this study, we developed machine learning models to predict soil available Cd based on soil properties, using a dataset comprising 585 data points covering 585 soils. Traditional machine learning models exhibited prediction values beyond the theoretical range, urging the need for alternative approaches. To address this, different models were tested, and the post-constraint eXtreme Gradient Boosting (XGBoost) model was found to possess the best predictive performance (R2 =0.81) outperform traditional linear regression model in terms of accuracy. Furthermore, we explored the relationship between soil available Cd and wheat grain Cd and rice grain Cd. Linear regression models were developed using 302 data points for wheat and 563 data points for rice. Results demonstrated a significant correlation between soil available Cd and wheat grain Cd (R2 =0.487) as well as rice grain Cd (R2 =0.43).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.