Abstract

In this paper, we propose a new dynamic template based event detection algorithm (DTED). Candidate template of an event is firstly constructed from a set of texts or their surrogates. Each candidate template contains several terms automatically extracted by the term weighting algorithm proposed in this paper. Then, we classify each text into a candidate event through a new similarity function. Some insignificant candidate templates are deleted. Whether an event template represents a new happened event or not is determined by comparing it with the event templates constructed in previous time window. Some events are merged into existing events and their templates are updated again. To evaluate the proposed DTED algorithm, we construct two datasets for experiment and F-measure is used as performance metric. The experiment result shows that DTED outperforms single-pass algorithm and clustering algorithms implemented in Cluto toolkit; meanwhile, Experimental results on Linguistic Data Consortium (LDC) dataset TDT4 show that DTED gets promising result.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call