Normalized Maximum Likelihood Research Articles

In this paper, we develop a code length principle which is invariant to the choice of parameterization on the model distributions, that is the code length remains the same under smooth transformations on the likelihood parameters. An invariant approximation formula for easy computation of the marginal distribution is provided for Gaussian likelihood models. We provide invariant estimators of the model parameters and formulate conditions under which these estimators are essentially posteriori unbiased for Gaussian models. An upper bound on the coarseness of discretization on the model parameters is deduced. We introduce a discrimination measure between probability distributions and use it to construct probability distributions on model classes and show how this may induce an additional code length term <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$ kover 4log _2k$</tex> for a <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$k$</tex> -parameter model. The total code length is shown to be closely related to the normalized maximum likelihood (NML) code length of Rissanen when choosing Jeffreys prior distribution on the model parameters together with a uniform prior distribution on the model classes. Our model selection principle is applied to a Gaussian estimation problem for data in a wavelet representation and its performance is tested and compared to alternative wavelet-based estimation methods in numerical experiments.

This paper studies the problem of class discrimination based on the normalized maximum likelihood (NML) model for a nonlinear regression, where the nonlinearly transformed class labels, each taking M possible values, are assumed to be drawn from a multinomial trial process. The strength of the MDL methods in statistical inference is to find the model structure which, in this particular classification problem, amounts to finding the best set of feature genes. We first show that the minimization of the codelength of the NML model for different sets of feature genes is a tractable problem. We then extend the model for selecting the feature genes to a completely defined classifier and check its classification error in a cross-validation experiment. Also the quantization process itself involved in getting the required entries in the model, can be evaluated with the NML description length. The new classification method is applied to leukemia class discrimination based on gene expression microarray data. We find classification errors as low as 0.03% with a quadruplet of binary quantized genes, which was top ranked by the NML description length. Such a length of the class labels, obtained with various sets of feature genes in the nonlinear regression model, allows intuitive comparisons of nested feature sets.

Normalized Maximum Likelihood Research Articles

Articles published on Normalized Maximum Likelihood

DNA sequence compression - Based on the normalized maximum likelihood model

On the normalized maximum likelihood and Bayesian decision theory

An Invariant Bayesian Model Selection Principle for Gaussian Data in a Sparse Representation

Model selection by normalized maximum likelihood

Clustering Time Series Gene Expression Data Based on Sum-of-Exponentials Fitting

An efficient normalized maximum likelihood algorithm for DNA sequence compression

On some properties of the NML estimator for Bernoulli strings

Convergence Rate of the Distributions of Normalized Maximum Likelihood Estimators for Irregular Parametric Families

Classification and feature gene selection using the normalized maximum likelihood model for discrete regression

Zipf's Law in Importance of Genes for Cancer Classification Using Microarray Data

Strong optimality of the normalized ML models as universal codes and information in data

Accuracy of normal approximation for the maximum likelihood estimator and Bayes estimators in the Ornstein–Uhlenbeck process using random normings

A DISCRETE HMM FOR ONLINE HANDWRITING RECOGNITION

Unit Root Tests Based on Unconditional Maximum Likelihood Estimation for the Autoregressive Moving Average

Estimation of the convergence rate for the distributions of normalized maximum likelihood estimators in the case of a discontinuous density

Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Normalized Maximum Likelihood Research Articles

Articles published on Normalized Maximum Likelihood

DNA sequence compression - Based on the normalized maximum likelihood model

On the normalized maximum likelihood and Bayesian decision theory

An Invariant Bayesian Model Selection Principle for Gaussian Data in a Sparse Representation

Model selection by normalized maximum likelihood

Clustering Time Series Gene Expression Data Based on Sum-of-Exponentials Fitting

An efficient normalized maximum likelihood algorithm for DNA sequence compression

On some properties of the NML estimator for Bernoulli strings

Convergence Rate of the Distributions of Normalized Maximum Likelihood Estimators for Irregular Parametric Families

Classification and feature gene selection using the normalized maximum likelihood model for discrete regression

Zipf's Law in Importance of Genes for Cancer Classification Using Microarray Data

Strong optimality of the normalized ML models as universal codes and information in data

Accuracy of normal approximation for the maximum likelihood estimator and Bayes estimators in the Ornstein–Uhlenbeck process using random normings

A DISCRETE HMM FOR ONLINE HANDWRITING RECOGNITION

Unit Root Tests Based on Unconditional Maximum Likelihood Estimation for the Autoregressive Moving Average

Estimation of the convergence rate for the distributions of normalized maximum likelihood estimators in the case of a discontinuous density

Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions