Abstract

We propose a Bootstrap resampling approach to Feature Selection (FS) that uses the weights obtained by a linear Support Vector Machine (SVM) applied to high-dimensional input spaces. We build the approach on a practical application with an extremely high-dimensional input space: the detection of Anastomosis Leakage (AL) after colorectal cancer surgery from a Bag-of-Words representation of the free text in Electronic Health Records (EHRs). Colorectal cancer is the third most common cancer type, and surgery is the only curative treatment, which makes the detection of AL of prime importance. The reduced input space obtained by the proposed FS strategy, in combination with the linear SVM, provided much improved performance for early detection of AL after colorectal cancer surgery (earlier/final sensitivity 97%/100% and specificity 47%/89%). Further extensions of the method can be the basis for a principled FS strategy in high-dimensional input spaces.
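The abstract does not spell out the selection rule, so the following is only a minimal sketch of one way bootstrap resampling of linear SVM weights could drive feature selection. Everything in it is an assumption made for illustration: the function names `fit_linear_svm` and `bootstrap_select`, the subgradient-descent trainer (standing in for whichever SVM solver the authors used), the percentile confidence interval, the rule "keep a feature if its interval excludes zero", and the synthetic Gaussian data used in place of Bag-of-Words features.

```python
import numpy as np

def fit_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Linear SVM without a bias term (hinge loss + L2 penalty),
    trained by full-batch subgradient descent. Labels y must be -1 or +1."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        active = y * (X @ w) < 1.0            # samples violating the margin
        grad = lam * w - (X[active].T @ y[active]) / n
        w -= lr * grad
    return w

def bootstrap_select(X, y, n_boot=100, alpha=0.05, seed=0):
    """Assumed selection rule: refit the SVM on bootstrap resamples and keep
    the features whose percentile interval of weights excludes zero."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    W = np.empty((n_boot, X.shape[1]))
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)      # sample rows with replacement
        W[b] = fit_linear_svm(X[idx], y[idx])
    lo, hi = np.percentile(W, [100 * alpha / 2, 100 * (1 - alpha / 2)], axis=0)
    keep = (lo > 0) | (hi < 0)                # interval does not contain zero
    return keep, W

# Toy data (hypothetical): only features 0 and 1 carry signal.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 20))
score = 2.0 * X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.normal(size=200)
y = np.where(score > 0, 1.0, -1.0)

keep, W = bootstrap_select(X, y)
print("selected feature indices:", np.flatnonzero(keep))
```

With a 95% interval, a few uninformative features may also pass the test by chance; a stricter `alpha` or more resamples would be the natural knobs to tune in this sketch.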
