Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

Yilmaz, M. Akin; Tekalp, A. Murat

doi:10.48623/aperta.71679

Yayınlanmış 1 Ocak 2019 | Sürüm v1

Konferans bildirisi Açık

Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

1. Koc Univ, Dept Elect & Elect Engn, Istanbul, Turkey

We analyze the performance of feedforward vs. recurrent neural network (RNN) architectures and associated training methods for learned frame prediction. To this effect, we trained a residual fully convolutional neural network (FCNN), a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both stateless and stateful training for recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal to noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. The CRNN can be trained stably and very efficiently using the stateful truncated backpropagation through time procedure, and it requires an order of magnitude less inference runtime to achieve near real-time frame prediction with an acceptable performance.

Dosyalar

bib-8b777699-010e-4cb1-817b-58e937b83f40.txt

Dosyalar (192 Bytes)

Ad	Boyut	Hepisini indir
bib-8b777699-010e-4cb1-817b-58e937b83f40.txt md5:1113b12d5fbd8357fe5f603a66088ce2	192 Bytes	Ön İzleme İndir

	Tüm sürümler	Bu sürüm
Görüntüleme	65	65
İndirilenler	15	15
Veri miktarı	2.9 kB	2.9 kB

Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

Dosyalar

bib-8b777699-010e-4cb1-817b-58e937b83f40.txt

Dosyalar (192 Bytes)

TÜBİTAK ULAKBİM

İLETİŞİM

Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

Oluşturanlar

Açıklama

Dosyalar

bib-8b777699-010e-4cb1-817b-58e937b83f40.txt

Dosyalar (192 Bytes)