Caffe con Troll: Shallow Ideas to Speed Up Deep Learning.

Stefan Hadjis,Firas Abuzaid,Christopher Ré,Ce Zhang

doi:10.1145/2799562.2799641

Caffe con Troll: Shallow Ideas to Speed Up Deep Learning.

Stefan Hadjis, Firas Abuzaid + Show 2 more

Open Access

https://doi.org/10.1145/2799562.2799641

Copy DOI

Journal: Proceedings of the Fourth Workshop on Data analytics at sCale (DanaC 2015) : May 31st, 2015, Melbourne, Australia. Workshop on Data Analytics in the Cloud (4th : 2015 : Melbourne, Vic.)	Publication Date: May 31, 2015
Citations: 49

Affiliation: Stanford University

#Throughput Improvement #Hardware Architectures + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We present Caffe con Troll (CcT), a fully compatible end-to-end version of the popular framework Caffe with rebuilt internals. We built CcT to examine the performance characteristics of training and deploying general-purpose convolutional neural networks across different hardware architectures. We find that, by employing standard batching optimizations for CPU training, we achieve a 4.5× throughput improvement over Caffe on popular networks like CaffeNet. Moreover, with these improvements, the end-to-end training time for CNNs is directly proportional to the FLOPS delivered by the CPU, which enables us to efficiently train hybrid CPU-GPU systems for CNNs.

Full Text