SC2Net: Sparse LSTMs for Sparse Coding

Joey Tianyi Zhou,Sinno Jialin Pan,Jiawei Du,Yong Liu,Xi Peng,Hao Yang,Rick Siow Mong Goh,Ivor Tsang,Zheng Qin,Kai Di

doi:10.1609/aaai.v32i1.11721

Abstract

The iterative hard-thresholding algorithm (ISTA) is one of the most popular optimization solvers to achieve sparse codes. However, ISTA suffers from following problems: 1) ISTA employs non-adaptive updating strategy to learn the parameters on each dimension with a fixed learning rate. Such a strategy may lead to inferior performance due to the scarcity of diversity; 2) ISTA does not incorporate the historical information into the updating rules, and the historical information has been proven helpful to speed up the convergence. To address these challenging issues, we propose a novel formulation of ISTA (named as adaptive ISTA) by introducing a novel \textit{adaptive momentum vector}. To efficiently solve the proposed adaptive ISTA, we recast it as a recurrent neural network unit and show its connection with the well-known long short term memory (LSTM) model. With a new proposed unit, we present a neural network (termed SC2Net) to achieve sparse codes in an end-to-end manner. To the best of our knowledge, this is one of the first works to bridge the $\ell_1$-solver and LSTM, and may provide novel insights in understanding model-based optimization and LSTM. Extensive experiments show the effectiveness of our method on both unsupervised and supervised tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SC2Net: Sparse LSTMs for Sparse Coding

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 29, 2018
Citations: 29

Similar Papers

Machine-learning-based model and simulation analysis of PM2.5 concentration prediction in Beijing
...
工程科学学报 | VOL. 41
, et. al. ...
20 Mar 2019
工程科学学报 | VOL. 41

Long short-term memory - Fully connected (LSTM-FC) neural network for PM2.5 concentration prediction
Jiachen Zhao ... Jie Chen
Chemosphere | VOL. 220
Jiachen Zhao, et. al.Jiachen Zhao ... Jie Chen
21 Dec 2018
Chemosphere | VOL. 220

Vector Decomposed Long Short-Term Memory Model for Behavioral Modeling and Digital Predistortion for Wideband RF Power Amplifiers
Hongmin Li ... Yikang Zhang
IEEE Access | VOL. 8
Hongmin Li, et. al.Hongmin Li ... Yikang Zhang
01 Jan 2020
IEEE Access | VOL. 8

Forecasting daily PM2.5 concentrations in Wuhan with a spatial-autocorrelation-based long short-term memory model
Zhifei Liu ... Yixuan Zhang
Atmospheric Environment | VOL. 331
Zhifei Liu, et. al.Zhifei Liu ... Yixuan Zhang
23 May 2024
Atmospheric Environment | VOL. 331

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SC2Net: Sparse LSTMs for Sparse Coding

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence