A semantic similarity approach to predicting Library of Congress subject headings for social tags

Kwan Yi

doi:10.1002/asi.21351

Abstract

AbstractSocial tagging or collaborative tagging has become a new trend in the organization, management, and discovery of digital information. The rapid growth of shared information mostly controlled by social tags poses a new challenge for social tag‐based information organization and retrieval. A plausible approach for this challenge is linking social tags to a controlled vocabulary. As an introductory step for this approach, this study investigates ways of predicting relevant subject headings for resources from social tags assigned to the resources. The prediction of subject headings was measured by five different similarity measures: tf–idf, cosine‐based similarity (CoS), Jaccard similarity (or Jaccard coefficient; JS), Mutual information (MI), and information radius (IRad). Their results were compared to those by professionals. The results show that a CoS measure based on top five social tags was most effective. Inclusions of more social tags only aggravate the performance. The performance of JS is comparable to the performance of CoS while tf–idf is comparable with up to 70% less than the best performance. MI and IRad have inferior performance compared to the other methods. This study demonstrates the application of the similarity measuring techniques to the prediction of correct Library of Congress subject headings.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A semantic similarity approach to predicting Library of Congress subject headings for social tags

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science and Technology

Lead the way for us

Journal: Journal of the American Society for Information Science and Technology	Publication Date: Apr 26, 2010
Citations: 6

Similar Papers

A Semantic Similarity Approach for Linking Tweet Messages to Library of Congress Subject Headings using Linked Resources: A Pilot Study
Kwan Yi
Advances in Classification Research Online | VOL. 24
Kwan YiKwan Yi
09 Jan 2014
Advances in Classification Research Online | VOL. 24

A semantic similarity approach to predicting Library of Congress subject headings for social tags

Journal of the American Society for Information Science and Technology | VOL. -

01 Aug 2010
Journal of the American Society for Information Science and Technology | VOL. -

برچسب های اجتماعی لایبرری ثینگ در مقابل سرعنوان های موضوعی کتابخانه کنگره: مرور نوشتارها
...
-
, et. al. ...
23 Aug 2018
23 Aug 2018

A conceptual framework for improving information retrieval in folksonomy using Library of Congress subject headings
Kwan Yi
Proceedings of the American Society for Information Science and Technology | VOL. 45
Kwan YiKwan Yi
01 Jan 2008
Proceedings of the American Society for Information Science and Technology | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A semantic similarity approach to predicting Library of Congress subject headings for social tags

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science and Technology