Adaptive context trees and text clustering

J.-P Vert

doi:10.1109/18.930925

Adaptive context trees and text clustering

J.-P Vert

Open Access

https://doi.org/10.1109/18.930925

Copy DOI

Journal: IEEE Transactions on Information Theory	Publication Date: Jul 1, 2001
Citations: 33

#Context Trees #Text Clustering + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In the finite-alphabet context we propose four alternatives to fixed-order Markov models to estimate a conditional distribution. They consist in working with a large class of variable-length Markov models represented by context trees, and building an estimator of the conditional distribution with a risk of the same order as the risk of the best estimator for every model simultaneously, in a conditional Kullback-Leibler sense. Such estimators can be used to model complex objects like texts written in natural language and define a notion of similarity between them. This idea is illustrated by experimental results of unsupervised text clustering.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: IEEE Transactions on Information Theory

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.