Abstract

There is no uniform approach in the literature for modelling sequential correlations in sequence classification problems. It is easy to find examples of unstructured models (e.g. logistic regression) where correlations are not taken into account at all, but there are also many examples where the correlations are explicitly incorporated into a (potentially computationally expensive) structured classification model (e.g. conditional random fields). In this paper we lay theoretical and empirical foundations for clarifying the types of problem which necessitate direct modelling of correlations in sequences, and the types of problem where unstructured models that capture sequential aspects solely through features are sufficient. The theoretical work in this paper shows that the rate of decay of auto-correlations within a sequence is related to the excess classification risk that is incurred by ignoring the structural aspect of the data. This is an intuitively appealing result, demonstrating the intimate link between the auto-correlations and excess classification risk. Drawing directly on this theory, we develop well-founded visual analytics tools that can be applied a priori on data sequences and we demonstrate how these tools can guide practitioners in specifying feature representations based on auto-correlation profiles. Empirical analysis is performed on three sequential datasets. With baseline feature templates, structured and unstructured models achieve similar performance, indicating no initial preference for either model. We then apply the visual analytics tools to the datasets, and show that classification performance in all cases is improved over baseline results when our tools are involved in defining feature representations.
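The central quantity in the abstract is the rate of decay of auto-correlations within a label sequence. As a rough illustration of the kind of auto-correlation profile such visual analytics tools could be built on (a minimal sketch of our own; the function name and the biased sample estimator are our choices, not the paper's), the profile of a numerically encoded label sequence can be estimated as:

```python
def label_autocorrelation(labels, max_lag):
    """Biased sample auto-correlation of a numeric label sequence at lags 1..max_lag.

    A slowly decaying profile suggests strong sequential structure; a profile
    near zero suggests labels behave almost independently.
    """
    n = len(labels)
    mean = sum(labels) / n
    var = sum((y - mean) ** 2 for y in labels) / n
    if var == 0:  # constant sequence: correlation is undefined, report zeros
        return [0.0] * max_lag
    profile = []
    for lag in range(1, max_lag + 1):
        cov = sum((labels[t] - mean) * (labels[t + lag] - mean)
                  for t in range(n - lag)) / n
        profile.append(cov / var)
    return profile
```

On a sequence made of long runs of identical labels the lag-1 value is strongly positive, while on a strictly alternating sequence it is strongly negative; an i.i.d. sequence would sit near zero at all lags.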

Highlights

  • Structure modelling permits target variables to collaborate so that ‘informed’ decisions about a set of random variables are based on a collection of beliefs linked together in a graphical structure (Lafferty et al 2001; Sutton and McCallum 2011)

  • To help us understand the use of Logistic Regression (LR) for sequence prediction, we show in Theorem 1 that given certain conditions on the transition potentials of a Conditional Random Field (CRF), unconditional independence can be proved between adjacent nodes

  • Our first experiments assess the difference in classification performance between LR and CRF models over the Word Hyphenation (WH), Activity Recognition (AR) and Occasionally Dishonest Casino (ODC) datasets
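The second highlight claims that, under conditions on the transition potentials, adjacent CRF nodes become unconditionally independent. A toy numeric check of that idea (our own sketch, not the paper's Theorem 1 or its exact conditions): in a two-node chain with joint p(y1, y2) proportional to psi1(y1) * psi2(y2) * phi(y1, y2), a constant transition potential phi makes the joint factorise into its marginals.

```python
import itertools

def chain_joint(node_pot, trans_pot):
    """Exact joint of a 2-node chain by enumeration:
    p(a, b) proportional to node_pot[0][a] * node_pot[1][b] * trans_pot[a][b]."""
    K = len(node_pot[0])
    unnorm = {(a, b): node_pot[0][a] * node_pot[1][b] * trans_pot[a][b]
              for a, b in itertools.product(range(K), repeat=2)}
    z = sum(unnorm.values())
    return {ab: w / z for ab, w in unnorm.items()}

def marginals(joint):
    """Per-node marginals from a 2-node joint distribution."""
    m1, m2 = {}, {}
    for (a, b), p in joint.items():
        m1[a] = m1.get(a, 0.0) + p
        m2[b] = m2.get(b, 0.0) + p
    return m1, m2
```

With a flat (constant) transition potential the joint equals the product of the marginals exactly, so an unstructured per-node classifier loses nothing; with a transition potential that favours agreeing neighbours, the factorisation fails and the dependence carries information.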

Introduction

Structure modelling permits target variables to collaborate so that ‘informed’ decisions about a set of random variables are based on a collection of beliefs linked together in a graphical structure (Lafferty et al 2001; Sutton and McCallum 2011). In such frameworks, an instance can be a list of vectors, each relating to a single target variable in the graph. Marginal distributions in a structured model are explicitly influenced by all possible target permutations over the graph. This can be expensive to compute, but in some applications superior classification performance justifies the time complexity. The abandonment of structure might be considered sub-optimal for many of these applications, yet some are considered ‘solved’ with the unstructured model choice.
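The cost mentioned above can be made concrete with a deliberately naive sketch (our own illustration, not a practical implementation): computing node marginals in a linear-chain model by summing over every one of the K**T label assignments, which is exactly the "all possible target permutations" the text refers to. Real chain-structured models replace this with forward-backward inference, which costs O(T * K**2) instead.

```python
import itertools

def brute_force_marginals(node_pot, trans_pot):
    """Node marginals of a linear-chain model by enumerating all K**T
    label sequences; exponential in sequence length T."""
    T, K = len(node_pot), len(node_pot[0])
    marg = [[0.0] * K for _ in range(T)]
    z = 0.0
    for ys in itertools.product(range(K), repeat=T):  # K**T terms
        score = node_pot[0][ys[0]]
        for t in range(1, T):
            score *= trans_pot[ys[t - 1]][ys[t]] * node_pot[t][ys[t]]
        z += score
        for t, k in enumerate(ys):
            marg[t][k] += score
    return [[m / z for m in row] for row in marg]
```

Even for modest alphabets the enumeration blows up quickly (K = 10 labels over T = 15 positions is already 10**15 terms), which is why the trade-off between structured inference cost and classification gain matters.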
