Abstract

The economic and environmental performances of the swine farming industry have always resulted in heated discussions in developing countries. Exploring the relationship between these features and the producers’ overall performance is the focus of this paper. For constructing multi-objective features that include the above features, a compromise approach for optimization is taken into consideration. For classifying the overall performance into different levels and detecting the effect of economic and environmental features on such features, an iteration scheme is developed in which the overall performance is treated as a target label. By neglecting this target label, a k-means clustering method is then used to help predict the producer’s overall performance given their economic and environmental features. In data pre-processing, correlation analysis for feature selection shows that the producer’s pollution emission and received regulation intensity largely affect its overall performance, while profit is found to be negatively correlated with pollution emission as regulation intensity is neglected. The classification result derived from the Silhouette Coefficient shows that the data set can be efficiently split into different groups in terms of the producer’s overall performance. The average distance between the objects in the low-performance group is larger than that of the high-performance group. The threshold position between the two groups is found to be largely dependent on the features of pollution emission and regulation intensity. The clustering result obtained by the k-means method shows good effectiveness and efficiency in separating the objects into different groups based on various features other than the overall performance. In 2- and 3-cluster cases, the result also shows evidence of the impact of economic and environmental features on the clustering result. The cross-validation analysis under a set of randomly chosen splitting points shows an increasing out-of-sample prediction quality with increases in training sample size. As one of the by-products of this paper, the geographical distribution in the clustering result is found partially consistent with the official report from Chinas central government regarding advantageous regions within the industry. In addition to current research, the ease of using the knowledge obtained in this paper for transfer learning is discussed.

Highlights

  • The number of live swine in China has been ranked the highest in the world

  • Recalling the distribution of Total MO (TMO) feature, we state that the average distance between objects with low TMO features is relatively small compared to the average distance between objects with higher-valued TMO features

  • For the object similarity derived by averaging the features in the class center, we find it is larger in the worst‐performing class than that in better‐performing one

Read more

Summary

Introduction

The number of live swine in China has been ranked the highest in the world. By the end of 2014, the population had reached 466 million, accounting for 58.8% of the worlds swine population (China Statistical Yearbook, 2015). DA bsyu[r8v]eoynco11n2ducocmtedmberyci[a8l]sownin1e12facrommsminerthciealUsKwsinheowfasrmthsatinpothlleutUioKn snheogwatsivtehlayt apfofelclutstitohne neecognatoimveilcy affects the economic outcome of the industry since pollution causes disease transmission, resulting in a 12% increase in piglet mortality and increasing farming costs to a certain degree From another angle, Jaffe et al [20] states that environmental regulation is bound to increase production cost and bring a negative impact on farmer’s economic benefits. Outcome of the industry since pollution causes disease transmission, resulting in a 12% increase in piglet mortality and increasing farming costs to a certain degree The authors found that a farmer’s income has positive correlation with their ecological farming behavior

Data Mining in Agricultural Issues
Multivariate Statistics Analysis and Knowledge Formalization
Comments
Data Sources and Variable Selection
Variable Selection of Producer’s Environmental Performance
Compromise Approach and Its Implementation Method
Construction of Compromise Multi-Objective Features
Feature Selection
Correlation Analysis of the Compromise Features
Correlation Between Continuous Features
3.10. Comparison with Expert Knowledge Formalization
Classification and Clustering on China’s Swine Farming Industry
Classification
Results on Ascending and Descending TMO
Computation Time on Varied Scales
Summary
Classified Difference in Economic and Environmental Features
Validation Test on Clustering of Two Groups
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.