Extracting hot spots of basic and complex topics from time stamped documents

Wei Chen Wei Chen,Parvathi Chundi

doi:10.1109/cidm.2009.4938639

Abstract

Identifying time periods with a burst of activity related to a topic has been an important problem in analyzing time stamped documents. In this paper, we discuss methods to compute a hot spot of a given topic from a time stamped document set. We consider basic topics that contain one or more keywords as well as complex topics that contain topics connected by logical operators and, or, not. We use the temporal scan statistic to assign a discrepancy score to each of the intervals of the time period spanning the given document set. The hot spot of the given topic is the time interval with the highest discrepancy score. We describe efficient algorithms to compute the hot spots of both basic and complex topics. Our preliminary experiments using the SIGMOD/VLDB paper titles data set and the CNN/Reuters news article titles data set collected from the TDT-Pilot Corpus show that our methods to compute the measure and the hot spot of a topic work very well in practice.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Extracting hot spots of basic and complex topics from time stamped documents

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Extracting hot spots of topics from time-stamped documents
Wei Chen ... Parvathi Chundi
Data & Knowledge Engineering | VOL. 70
Wei Chen, et. al.Wei Chen ... Parvathi Chundi
23 Mar 2011
Data & Knowledge Engineering | VOL. 70

Dynamics of Rotating Systems
Giancarlo Genta
-
Giancarlo GentaGiancarlo Genta
01 Jan 2004
01 Jan 2004

Quality and Patient Safety in a Pathology Residency Training Program
R Demkowicz ... S Sapatnekar
American Journal of Clinical Pathology | VOL. 154
R Demkowicz, et. al.R Demkowicz ... S Sapatnekar
28 Oct 2020
American Journal of Clinical Pathology | VOL. 154

Detection of Clostridium difficile infection clusters, using the temporal scan statistic, in a community hospital in southern Ontario, Canada, 2006-2011.
Meredith C Faires ... David L Pearl
BMC Infectious Diseases | VOL. 14
Meredith C Faires, et. al.Meredith C Faires ... David L Pearl
12 May 2014
BMC Infectious Diseases | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extracting hot spots of basic and complex topics from time stamped documents

Abstract

Talk to us

Similar Papers