Transfer learning via random forests: A one-shot federated approach

Pengcheng Xiang, Ling Zhou, Lu Tang

doi:10.1016/j.csda.2024.107975

Abstract

A one-shot federated transfer learning method using random forests (FTRF) is developed to improve prediction accuracy at a target data site by leveraging information from auxiliary sites. Both theoretical and numerical results show that the proposed approach is at least as accurate as a model trained on the target data alone, regardless of possible data heterogeneity, which includes imbalanced and non-IID data distributions across sites and model mis-specification. FTRF can evaluate the similarity between the target and auxiliary sites, enabling the target site to autonomously select information from the more similar sites to enhance its predictive performance. To ensure communication efficiency, FTRF adopts a model averaging idea that requires only a single round of communication between the target and the auxiliary sites: only fitted models from auxiliary sites are sent to the target site. Unlike traditional model averaging, FTRF uses both the predicted outcomes from other sites and the original variables when estimating model averaging weights, resulting in variable-dependent weights that make better use of auxiliary-site models to improve prediction. Five real-world data examples show that FTRF reduces the prediction error by 2-40% compared to methods that do not use auxiliary information.
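The one-shot workflow described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' FTRF algorithm; it is a stacking-style stand-in, assuming scikit-learn is available, in which the target site fits a forest on its original variables augmented with the auxiliary models' predictions. The data generator `simulate_site`, the helper `augment`, the `shift` values, and all sample sizes are hypothetical choices made for illustration only.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error


def simulate_site(n, shift, rng):
    """Hypothetical data generator; `shift` controls how far a site is from the target."""
    X = rng.uniform(-2, 2, size=(n, 3))
    y = np.sin(X[:, 0]) + 0.5 * X[:, 1] ** 2 + shift * X[:, 2] + rng.normal(0, 0.1, n)
    return X, y


def augment(X, aux_models):
    """Append each auxiliary model's prediction to the original covariates."""
    aux_preds = [m.predict(X) for m in aux_models]
    return np.column_stack([X] + aux_preds)


rng = np.random.default_rng(0)

# Step 1 (auxiliary sites): each site fits a forest locally on its own data.
aux_shifts = [0.0, 0.1, 2.0]  # assumption: the last site is deliberately dissimilar
aux_models = []
for shift in aux_shifts:
    X_k, y_k = simulate_site(2000, shift, rng)
    aux_models.append(
        RandomForestRegressor(n_estimators=100, random_state=0).fit(X_k, y_k)
    )

# Step 2 (single communication round): only the fitted `aux_models` are sent
# to the target site; no raw auxiliary data is shared.

# Step 3 (target site): a small local sample, plus a held-out test set.
X_train, y_train = simulate_site(100, 0.0, rng)
X_test, y_test = simulate_site(1000, 0.0, rng)

# Baseline: forest trained on the target data alone.
target_only = RandomForestRegressor(n_estimators=200, random_state=0)
target_only.fit(X_train, y_train)

# Stand-in for variable-dependent weighting: a forest over the original
# variables and the auxiliary predictions (not the paper's estimator).
combined = RandomForestRegressor(n_estimators=200, random_state=0)
combined.fit(augment(X_train, aux_models), y_train)

mse_target_only = mean_squared_error(y_test, target_only.predict(X_test))
mse_combined = mean_squared_error(y_test, combined.predict(augment(X_test, aux_models)))
print(f"target-only MSE: {mse_target_only:.4f}")
print(f"combined MSE:    {mse_combined:.4f}")
```

Because the combining forest sees both the covariates and the auxiliary predictions, how much it relies on each auxiliary model can vary across the covariate space, which is the intuition behind variable-dependent weights. In this sketch, inspecting `combined.feature_importances_` gives a rough indication of which auxiliary sites the target ends up using; the paper's own similarity evaluation may work differently.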

More From: Computational Statistics and Data Analysis

Similar Papers

Gyroscope in-assembly drift anomaly detection based on decision re-optimized deep auto-encoder
Wuyang Fan ... Shisheng Zhong
Measurement Science and Technology
15 Oct 2024

Multi-Branching Neural Network for Myocardial Infarction Prediction
Zekai Wang ... Bing Yao
20 Aug 2022

CABNet: Category Attention Block for Imbalanced Diabetic Retinopathy Grading
Along He ... Huazhu Fu
IEEE Transactions on Medical Imaging | VOL. 40
29 Dec 2020

Customer response prediction based on case-based reasoning with variable weighting in an imbalanced data environment
...
Journal of Intelligence and Information Systems | VOL. 21
31 Mar 2015
