Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules

Xiaoshi Zhong,Erik Cambria,Aixin Sun

doi:10.18653/v1/p17-1039

Abstract

Extracting time expressions from free text is a fundamental task for many applications. We analyze the time expressions from four datasets and find that only a small group of words are used to express time information, and the words in time expressions demonstrate similar syntactic behaviour. Based on the findings, we propose a type-based approach, named SynTime, to recognize time expressions. Specifically, we define three main syntactic token types, namely time token, modifier, and numeral, to group time-related regular expressions over tokens. On the types we design general heuristic rules to recognize time expressions. In recognition, SynTime first identifies the time tokens from raw text, then searches their surroundings for modifiers and numerals to form time segments, and finally merges the time segments to time expressions. As a light-weight rule-based tagger, SynTime runs in real time, and can be easily expanded by simply adding keywords for the text of different types and of different domains. Experiment on benchmark datasets and tweets data shows that SynTime outperforms state-of-the-art methods.

Highlights

Time expression plays an important role in information retrieval and many applications in natural language processing (Alonso et al, 2011; Campos et al, 2014)
Occurrence, small vocabulary, and similar syntactic behaviour all reduce the cost of energy required to communicate
We propose a time tagger named SynTime to recognize time expressions using syntactic token types and general heuristic rules

Summary

Introduction

Time expression plays an important role in information retrieval and many applications in natural language processing (Alonso et al, 2011; Campos et al, 2014). The key difference between SynTime and other rulebased taggers lies in the way of defining token types and the way of designing rules. (The test for other languages needs only to construct a collection of token regular expressions in the target language under our defined token types.) we evaluate SynTime against three state-of-the-art methods (i.e., HeidelTime, SUTime, and UWTime) on three datasets: TimeBank, WikiWars, and Tweets.. We propose a time tagger named SynTime to recognize time expressions using syntactic token types and general heuristic rules. We conduct experiments on three datasets, and the results demonstrate the effectiveness of SynTime against state-of-the-art baselines

Related Work

Dataset

Finding

SynTime

SynTime Construction

Time Token Identification

Time Segment Identification

Time Expression Extraction

SynTime Expansion

Experiments

Experiment Setting

Method

Experiment Result

Limitations

Conclusion and future work

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2017
Citations: 74	License type: cc-by

Similar Papers

SynTime: Token Types and Heuristic Rules
Xiaoshi Zhong ... Erik Cambria
-
Xiaoshi Zhong, et. al.Xiaoshi Zhong ... Erik Cambria
01 Jan 2020
01 Jan 2020

A Timed-Release Key Management Scheme for Backward Recovery
Maki Yoshida ... Shigeo Mitsunari
-
Maki Yoshida, et. al.Maki Yoshida ... Shigeo Mitsunari
01 Jan 2006
01 Jan 2006

Time Controlled Expressive Predicate Query With Accountable Anonymity
Yang Yang ... Chunming Rong
IEEE Transactions on Services Computing | VOL. 16
Yang Yang, et. al.Yang Yang ... Chunming Rong
01 Mar 2023
IEEE Transactions on Services Computing | VOL. 16

Application of Time Token Learning to Improve Elementary Students' Communication Skills
Bannaga Taha Al-Zubair Hussen ... Siti Amsarina Pangaribuan
Journal of Contemporary Islamic Primary Education | VOL. 1
Bannaga Taha Al-Zubair Hussen, et. al. Bannaga Taha Al-Zubair Hussen ... Siti Amsarina Pangaribuan
20 Jul 2023
Journal of Contemporary Islamic Primary Education | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules

Abstract

Highlights

Summary

Talk to us

Similar Papers