Abstract

Parameter tying is a regularization method in which the parameters (weights) of a machine learning model are partitioned into groups using prior knowledge, and all parameters in each group are constrained to take the same value. In this paper, we consider the problem of parameter learning in Markov networks and propose a novel approach called automatic parameter tying (APT), which uses automatic (rather than a priori) and soft (rather than hard) parameter tying as a regularization method to alleviate overfitting. The key idea behind APT is to set up the learning problem as the task of jointly finding parameters and a grouping of the parameters such that the likelihood plus a regularization term is maximized, where the regularization term penalizes models whose parameter values deviate from their group's mean value. We propose a block coordinate ascent algorithm to solve this optimization task. We analyze the sample complexity of the new learning algorithm and show that it yields optimal parameters with high probability when the groups are well separated. Experimentally, we show that our method improves upon L2 regularization, and we suggest several pragmatic techniques for good practical performance.
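
To make the alternating scheme concrete, below is a minimal NumPy sketch of one way such a block coordinate ascent could look: one block reassigns parameters to groups via 1-D k-means on their current values, the other takes gradient steps on the log-likelihood minus a squared-deviation penalty toward the group means. The function names (apt_fit, group_step) and the toy quadratic log-likelihood are illustrative assumptions for this sketch, not the paper's exact formulation.

import numpy as np

def group_step(theta, k, iters=20):
    # Block 1: assign each parameter to one of k groups via 1-D k-means
    # on the current parameter values; centers are the group means.
    centers = np.quantile(theta, np.linspace(0.0, 1.0, k))
    for _ in range(iters):
        assign = np.argmin(np.abs(theta[:, None] - centers[None, :]), axis=1)
        for j in range(k):
            if np.any(assign == j):
                centers[j] = theta[assign == j].mean()
    return assign, centers

def apt_fit(grad_loglik, theta0, k=4, lam=1.0, lr=0.05, outer=50, inner=20):
    # Alternate between regrouping and gradient ascent on the penalized
    # objective: log-likelihood - lam * sum_i (theta_i - mean of its group)^2.
    theta = theta0.copy()
    for _ in range(outer):
        assign, centers = group_step(theta, k)   # block 1: groups + means
        for _ in range(inner):                   # block 2: parameter values
            penalty_grad = 2.0 * lam * (theta - centers[assign])
            theta += lr * (grad_loglik(theta) - penalty_grad)
    return theta, assign

# Toy usage: the log-likelihood gradient pulls theta toward `target`
# (a stand-in for a real Markov-network likelihood gradient), while the
# penalty softly ties the 30 parameters into 3 groups.
rng = np.random.default_rng(0)
target = np.repeat([-1.0, 0.0, 2.0], 10) + 0.1 * rng.standard_normal(30)
theta, assign = apt_fit(lambda t: -(t - target), target.copy(), k=3, lam=0.5)
print(np.round(theta, 2), assign)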
