In the real world, phenomena are often observed and recorded by multiple organizations which results in multiple sources of data. When dealing with such data, the centralized modeling approach aims to achieve collaborative modeling by fusing multiple sources of data into a single data set, which may pose challenges to data privacy. Unlike centralized modeling, the distributed modeling approach can effectively solve the privacy issue. However, modeling approaches based on this idea still suffer from either low prediction accuracy or high communication costs. In this study, we propose a collaborative modeling strategy for multi-source data based on fuzzy rule-based models (FRBMs) to balance the needs of both model prediction accuracy and efficiency. First, we adopt the concept and algorithm of collaborative fuzzy clustering (CFC) to improve prediction accuracy and reduce communication costs by improving the CFC algorithm. Then, we construct a granular FRBM for multi-source data based on the principle of justifiable granularity (PJG) by integrating local models into a more robust and perfect global model. Finally, we improve the performance evaluation index of the existing granular FRBM and propose two model optimization schemes to further improve the performance of the model. We conduct experiments on both synthetic and publicly available data sets to demonstrate the effectiveness of the proposed method.
Read full abstract