A Natural Language Processing Model for Automated Organization and Analysis of Intangible Cultural Heritage

Yan Zheng,Fuqing Li,Rui Cao,Zheyuan Zhang,Noman Sohail,Cui Li

doi:10.4018/joeuc.349736

A Natural Language Processing Model for Automated Organization and Analysis of Intangible Cultural Heritage

Yan Zheng, Fuqing Li + Show 4 more

Open Access

PDF Available

https://doi.org/10.4018/joeuc.349736

Copy DOI

Export

Save

Cite

Journal: Journal of Organizational and End User Computing	Publication Date: Jul 23, 2024
License type: CC BY 3.0

#Effect Of Different Distances #Field Of NLP #F1 Values #Text Dataset #Clustering Results #Evaluation Criteria #Accuracy Of Clustering #Effect Of Distances #Text Similarity Methods #Natural Language Processing Model

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

This paper investigates text similarity methods in the field of NLP, improves upon the WMD, and develops the SWC-WMD distance, forming the basis for a clustering method for long ICH texts. Clustering experiments on the constructed ICH long text dataset using WMD, SWC-WMD, and TF-IDF-WMD distances were conducted. The impact of the number of feature words on clustering results and the effect of different distances on clustering outcomes were assessed based on accuracy and F1 values from the evaluation criteria. The final results show that the SWC-WMD distance improves the accuracy and F1 values of the ICH long text clustering results compared to the other two distances, thereby proving the effectiveness of the methods proposed in this paper.

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Journal of Organizational and End User Computing

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

A Natural Language Processing Model for Automated Organization and Analysis of Intangible Cultural Heritage