Kernel and Moment Based Prediction and Planning : Applications to Robotics and Natural Language Processing

Zita Marinho

doi:10.1184/r1/6720311.v1

Abstract

This thesis focuses on moment and kernel-based methods for applications in Robotics and Natural Language Processing. Kernel and moment-based learning leverage information about correlated data that allow the design of compact representations and efficient learning algorithms. We explore kernel algorithms for planning by leveraging inherently continuous properties of reproducing kernel Hilbert spaces. We introduce a kernel based robot motion planner based on gradient optimization, in a space of smooth trajectories, a reproducing kernel Hilbert space. We further study a kernel-based approach in the context of prediction, for learning a generative model, and in the context of planning for learning to interact with a controlled process. Our work on moment-based learning can be decomposed into two main branches: spectral techniques and anchor-based methods. Spectral learning describes a more expressive model, which implicitly uses hidden state variables. We use it as a means to obtain a more expressive predictive model that we can use to learn to control an interactive agent, in the context of reinforcement learning. We propose a combination of predictive representations with deep reinforcement learning to produce a recurrent network that is able to learn continuous policies under partial observability. We introduce an efficient end-to-end learning algorithm that is able to maximize cumulative reward while minimizing prediction error. We apply this approach to several continuous observation and action environments. Anchor learning, on the other hand, provides an explicit form of representing state variables, by relating states to unambiguous observations. We rely on anchor-based techniques to provide a form of explicitly recovering the model parameters, in particular when states have a discrete representation such as in many Natural Language Processing tasks. This family of methods provides an easier form of integrating supervised information during the learning process. We apply anchor-based algorithms on word labelling tasks in Natural Language Processing, namely semi-supervised part-of-speech tagging where annotations are learned from a large amount of raw text and a small annotated corpus.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Kernel and Moment Based Prediction and Planning : Applications to Robotics and Natural Language Processing

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Hidden Markov Model based Part of Speech Tagging for Nepali language
Abhijit Paul ... Bipul Syam Purkayastha
-
Abhijit Paul, et. al.Abhijit Paul ... Bipul Syam Purkayastha
01 Sep 2015
01 Sep 2015

A Ruled-Based Part of Speech (RPOS) Tagger for Malay Text Articles
Rayner Alfred ... Joe Henry Obit
-
Rayner Alfred, et. al.Rayner Alfred ... Joe Henry Obit
01 Jan 2013
01 Jan 2013

Combination of Genetic Algorithm and Brill Tagger Algorithm for Part of Speech Tagging Bahasa Madura
Nindian Puspa Dewi ... Ubaidi Ubaidi
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7
Nindian Puspa Dewi, et. al.Nindian Puspa Dewi ... Ubaidi Ubaidi
01 Oct 2020
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7

Part-of-speech tagging for Arabic tweets using CRF and Bi-LSTM
Wasan Alkhwiter ... Nora Al-Twairesh
Computer Speech & Language | VOL. 65
Wasan Alkhwiter, et. al.Wasan Alkhwiter ... Nora Al-Twairesh
31 Jul 2020
Computer Speech & Language | VOL. 65

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kernel and Moment Based Prediction and Planning : Applications to Robotics and Natural Language Processing

Abstract

Talk to us

Similar Papers