Abstract
AbstractThe multi-label automatic classification of scientific publications based on a pre-defined taxonomy, also called automatic subject indexing is a continuing research endeavor with significant cross-domain applicability. In this paper, we assess the performance of X-transformer and its variants with other extreme multi-label classification models for the above task. Our model Weak X-transformer achieves a micro F1-score of 0.65 and 64% accuracy on the task outperforming all other methods. We also investigate the impact of incorporating additional unlabelled data and hierarchical structure into the models. Our findings demonstrate that the transformer-based model with weak supervision outperforms other approaches, providing insights into effective strategies for extreme multi-label classification in scholarly publications.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.