Abstract

As a tool for producing meaningful and interpretable results, subset or variable selection has been well studied in modern statistics. However, most of the existing methods focus on the independent data and cannot directly extend to the network-linked data where samples are connected with each other. To this end, we propose a subset selection method in the linear regression model by incorporating the network information into the intercept term, which can achieve automatic subset selection and have good network structural interpretability simultaneously. Based on this, we develop an efficient algorithm to recover the true subset, as well as determine subgroups. Simulation studies demonstrate that the proposal outperforms the state-of-art methods in estimation and selection accuracy. We also apply the proposed method on data from the national longitudinal study of adolescent health and show the superiority of selecting variables alone a network by a smaller model size and more accurate prediction.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.