Graphical Models: Modeling, Optimization, and Hilbert Space Embedding

Xinhua Zhang

doi:10.25911/5d7a2ce02b007

Abstract

Over the past two decades graphical models have been widely used as powerful tools for compactly representing distributions. On the other hand, kernel methods have been used extensively to come up with rich representations. This thesis aims to combine graphical models with kernels to produce compact models with rich representational abilities. Graphical models are a powerful underlying formalism in machine learning. Their graph theoretic properties provide both an intuitive modular interface to model the interacting factors, and a data structure facilitating efficient learning and inference. The probabilistic nature ensures the global consistency of the whole framework, and allows convenient interface of models to data. Kernel methods, on the other hand, provide an effective means of representing rich classes of features for general objects, and at the same time allow efficient search for the optimal model. Recently, kernels have been used to characterize distributions by embedding them into high dimensional feature space. Interestingly, graphical models again decompose this characterization and lead to novel and direct ways of comparing distributions based on samples. Among the many uses of graphical models and kernels, this thesis is devoted to the following four areas: Conditional random fields for multi-agent reinforcement learning Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of observation and label pairs. Underlying all CRFs is the assumption that, conditioned on the training data, the label sequences of different training examples are independent and identically distributed (iid). We extended the use of CRFs to a class of temporal learning algorithms, namely policy gradient reinforcement learning (RL). Now the labels are no longer iid. They are actions that update the environment and affect the next observation. From an RL point of view, CRFs provide a natural way to model joint actions in a decentralized Markov decision process. They define how agents can communicate with each other to choose the optimal joint action. We tested our framework on a synthetic network alignment problem, a distributed sensor network, and a road traffic control system. Using tree sampling by Hamze & de Freitas (2004) for inference, the RL methods employing CRFs clearly outperform those which do not vii

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Graphical Models: Modeling, Optimization, and Hilbert Space Embedding

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Conditional random fields for multi-agent reinforcement learning
Xinhua Zhang ... Douglas Aberdeen
-
Xinhua Zhang, et. al.Xinhua Zhang ... Douglas Aberdeen
20 Jun 2007
20 Jun 2007

Finding Out Biological Terms from Texts with CRFs for Reinforcement Learning
Zhao Hui Wang ... Wei Huang
Applied Mechanics and Materials | VOL. 198-199
Zhao Hui Wang, et. al.Zhao Hui Wang ... Wei Huang
01 Sep 2012
Applied Mechanics and Materials | VOL. 198-199

Extracting Terms from Texts with Conditional Random Fields
Xun Lu ... Yixuan Li
-
Xun Lu, et. al.Xun Lu ... Yixuan Li
01 Jan 2015
01 Jan 2015

EDA-RL: EDA with Conditional Random Fields for Solving Reinforcement Learning Problems
Hisashi Handa
-
Hisashi HandaHisashi Handa
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Graphical Models: Modeling, Optimization, and Hilbert Space Embedding

Abstract

Talk to us

Similar Papers