A Case for Automated Large Scale Semantic Annotation

Stephen Dill, Anant Jhingran, Jason Y Zien, Sridhar Rajagopalan, Nadav Eiron, Andrew Tomkins, Tapas Kanungo, R Guha, Daniel Gruhl, Kevin S McCurley, John A Tomlin, David Gibson

doi:10.2139/ssrn.3199010

Abstract

This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages and generate approximately 434 million automatically disambiguated semantic tags, published to the web as a label bureau providing metadata regarding the 434 million annotations. To our knowledge, this is the largest-scale semantic tagging effort to date. We describe the Seeker platform, discuss the architecture of the SemTag application, describe a new disambiguation algorithm specialized to support ontological disambiguation of large-scale data, evaluate the algorithm, and present our final results with information about acquiring and making use of the semantic tags. We argue that automated large-scale semantic tagging of ambiguous content can bootstrap and accelerate the creation of the semantic web.


Journal: SSRN Electronic Journal
Publication Date: Jan 1, 2003
Citations: 5

Similar Papers

SemTag and seeker
Stephen Dill ... David Gibson
01 Jan 2003

A case for automated large-scale semantic annotation
Stephen Dill ... Jason Y Zien
Web Semantics: Science, Services and Agents on the World Wide Web | VOL. 1
01 Dec 2003

Automated Semantic Tagging of Textual Content
Jelena Jovanovic ... Zoran Jeremic
IT Professional | VOL. 16
01 Nov 2014

UNIpedia: A Unified Ontological Knowledge Platform for Semantic Content Tagging and Search
Murat Kalender ... Suzan Uskudarli
01 Sep 2010
