Probability Model Based on Cluster Analysis to Classify Sequences of Observations for Small Training Sets

Sergey S Yulin, Irina N Palamar

doi:10.19139/soic-2310-5070-690

Open Access


Abstract

Pattern recognition with few training examples is a particularly relevant problem; it arises when collecting training data is expensive or essentially impossible. This work proposes a new probability model, MC&CL (Markov Chain and Clusters), which combines a Markov chain with a clustering algorithm (Kohonen self-organizing map or the k-means method) to classify sequences of observations when the training dataset is small. An original experimental comparison is made between the developed model (MC&CL) and several other popular sequence classification models: HMM (Hidden Markov Model), HCRF (Hidden Conditional Random Fields), LSTM (Long Short-Term Memory), and kNN+DTW (k-Nearest Neighbors algorithm + Dynamic Time Warping algorithm). The comparison uses synthetic random sequences generated from a hidden Markov model, with noise added to the training specimens. When the amount of training data is small, the proposed model shows the best classification accuracy among the models under review.
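The abstract does not spell out how the Markov chain and the clustering step are combined, so the following is only a minimal sketch of one plausible reading, not the authors' MC&CL implementation. It assumes scalar observations, a single k-means codebook shared by all classes, a first-order Markov chain over cluster indices fitted per class, and classification by highest log-likelihood. The function names, the quantile-based initialisation, and the smoothing constant `alpha` are all hypothetical choices made for illustration.

```python
import math


def nearest(centres, v):
    """Index of the cluster centre closest to the observation v."""
    return min(range(len(centres)), key=lambda i: abs(centres[i] - v))


def kmeans_1d(values, k, iters=20):
    """Plain k-means on scalars, initialised at evenly spaced quantiles (assumption)."""
    ordered = sorted(values)
    n = len(ordered)
    centres = [ordered[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in values:
            groups[nearest(centres, v)].append(v)
        # Keep the old centre when a cluster ends up empty.
        centres = [sum(g) / len(g) if g else c for g, c in zip(groups, centres)]
    return sorted(centres)


def fit_chain(sequences, centres, alpha=1.0):
    """Estimate initial and transition probabilities over cluster indices.

    alpha is additive smoothing (an assumption) so that transitions unseen in a
    small training set do not get zero probability.
    """
    k = len(centres)
    init = [alpha] * k
    trans = [[alpha] * k for _ in range(k)]
    for seq in sequences:
        states = [nearest(centres, v) for v in seq]
        init[states[0]] += 1
        for a, b in zip(states, states[1:]):
            trans[a][b] += 1
    init_total = sum(init)
    init = [c / init_total for c in init]
    trans = [[c / sum(row) for c in row] for row in trans]
    return init, trans


def log_likelihood(seq, centres, chain):
    """Log-probability of the quantised sequence under one class's chain."""
    init, trans = chain
    states = [nearest(centres, v) for v in seq]
    ll = math.log(init[states[0]])
    for a, b in zip(states, states[1:]):
        ll += math.log(trans[a][b])
    return ll


def train(data_by_class, k=3):
    """data_by_class maps a class label to a list of training sequences."""
    all_values = [v for seqs in data_by_class.values() for s in seqs for v in s]
    centres = kmeans_1d(all_values, k)
    chains = {label: fit_chain(seqs, centres) for label, seqs in data_by_class.items()}
    return centres, chains


def classify(seq, centres, chains):
    """Return the label whose chain gives the sequence the highest log-likelihood."""
    return max(chains, key=lambda label: log_likelihood(seq, centres, chains[label]))
```

Calling `train` with a dictionary of labelled sequences returns the cluster centres and the per-class chains, which `classify` then uses to label a new sequence. Sharing one codebook across classes keeps the number of estimated parameters small, which is one reason such a scheme could suit small training sets; whether the paper does this is not stated in the abstract.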

Highlights

Sequences of observations are classified when solving recognition problems for speech [1, 2], handwritten text [3], hand and head gestures [4, 5], and states of technical objects [6, 7, 8]. Because computer-aided learning is being introduced intensively into various areas of human activity, machine learning engineers often have to deal with small training sets whose structure and characteristics are almost unknown.
The MC&CL (Markov Chain and CLusters) method develops and modifies a probability model that we developed previously, which is based on a Markov chain and a Kohonen self-organizing map or growing neural gas [21, 22].
Sequences of observations for the training and test datasets are generated from a hidden Markov model with random distribution parameters.
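As a rough illustration of the data generation described above, the sketch below samples sequences from a hidden Markov model whose parameters are drawn at random and adds noise to the training sequences only. The Gaussian emissions, the parameter ranges, the additive Gaussian noise, and all function names are assumptions made for this example; the page does not state the distributions the authors used.

```python
import random


def random_stochastic_vector(n, rng):
    """Draw n positive weights and normalise them so they sum to 1."""
    weights = [rng.random() + 1e-9 for _ in range(n)]
    total = sum(weights)
    return [w / total for w in weights]


def random_hmm(n_states, rng):
    """HMM with random initial, transition and Gaussian emission parameters."""
    return {
        "init": random_stochastic_vector(n_states, rng),
        "trans": [random_stochastic_vector(n_states, rng) for _ in range(n_states)],
        "means": [rng.uniform(-10.0, 10.0) for _ in range(n_states)],  # assumed range
        "stds": [rng.uniform(0.5, 2.0) for _ in range(n_states)],      # assumed range
    }


def sample_sequence(hmm, length, rng):
    """Walk the hidden chain and emit one scalar observation per step."""
    n = len(hmm["init"])
    state = rng.choices(range(n), weights=hmm["init"])[0]
    seq = []
    for _ in range(length):
        seq.append(rng.gauss(hmm["means"][state], hmm["stds"][state]))
        state = rng.choices(range(n), weights=hmm["trans"][state])[0]
    return seq


def add_noise(seq, sigma, rng):
    """Additive zero-mean Gaussian noise (an assumed noise model)."""
    return [x + rng.gauss(0.0, sigma) for x in seq]


rng = random.Random(42)
hmm = random_hmm(3, rng)  # one such model would be drawn per class
train_set = [add_noise(sample_sequence(hmm, 20, rng), 0.5, rng) for _ in range(5)]
test_set = [sample_sequence(hmm, 20, rng) for _ in range(5)]
```

Seeding `random.Random` makes the generated sets reproducible, which helps when comparing several classifiers on the same small training set.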

Summary

Introduction

Sequences of observations are classified when solving recognition problems for speech [1, 2], handwritten text [3], hand and head gestures [4, 5], and states of technical objects [6, 7, 8]. Because computer-aided learning is being introduced intensively into various areas of human activity, machine learning engineers often have to deal with small training sets whose structure and characteristics are almost unknown. The following machine learning methods have been widely used to classify sequences of observations: Hidden Markov Model (HMM), Hidden Conditional Random Fields (HCRF), Long Short-Term Memory (LSTM), and the k-Nearest Neighbors algorithm (kNN) with the Dynamic Time Warping algorithm (DTW). The kNN method is a popular metric, non-parametric classification algorithm. It is based on computing the distance between a test specimen and the specimens in the training set. Several studies applying the kNN method together with the DTW algorithm have been carried out by Professor Eamonn Keogh and his colleagues.
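The kNN+DTW baseline mentioned above can be sketched in a few lines. This is a generic textbook version, assuming scalar observations, an absolute-difference local cost, no warping-window constraint, and majority voting; it is not taken from the paper or from Keogh's code, and the function names are hypothetical.

```python
import math


def dtw_distance(a, b):
    """Classic O(len(a) * len(b)) dynamic time warping distance."""
    n, m = len(a), len(b)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible warping steps.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def knn_dtw_classify(query, training, k=1):
    """training is a list of (sequence, label) pairs; majority vote among the k nearest."""
    neighbours = sorted(training, key=lambda pair: dtw_distance(query, pair[0]))[:k]
    votes = {}
    for _, label in neighbours:
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)


training = [([0, 1, 2, 3], "up"), ([3, 2, 1, 0], "down")]
label = knn_dtw_classify([0, 0, 1, 2, 3], training)
```

Because DTW aligns sequences of different lengths, the query above matches the "up" template even though it repeats its first value. The method stores every training sequence and compares against all of them at prediction time.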

Journal: Statistics, Optimization & Information Computing
Publication Date: Feb 18, 2020
Citations: 1
License type: cc-by
