Edwards, C. (2015). Growing Pains for Deep Learning. Commun. ACM, 58(7), 14–16. http://doi.org/10.1145/2771283
Summary

This article provides an overview of the evolution of neural networks from the 1990s, when only a single hidden layer was typically used for performance reasons, to 2006, when work on deep architectures (i.e. neural networks with more than one hidden layer) achieved a breakthrough thanks to the development of effective training techniques (such as pre-training) for networks with multiple hidden layers. These architectures enable the first layers to extract features that subsequent layers can combine for more efficient classification, yielding considerably lower error rates. Nevertheless, a number of obstacles still make deploying neural networks challenging:
- massive network sizes: the first layers of an image-processing network may require millions of neurons (one neuron per pixel), a number that is then multiplied by the depth of the network. Recent systems therefore draw upon GPUs or field-programmable gate arrays (FPGAs) to handle such massive networks. Another successful approach replaces a single deep neural network with a) data pre-processing and b) multiple smaller networks (i.e. committees).
- training settings: the systems have considerable problems with tasks where the reward depends on successfully completing a sequence of stages (e.g. traversing a maze), although they still do well in situations where success is delayed but can be learned from random responses.
- massively missing data: in medical data, for example, each patient has usually undergone only a fraction of all available tests and exams. Lawrence and others have suggested using layers of Gaussian processes in such settings.
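The committee idea mentioned above can be illustrated with a minimal sketch: several small, independently initialized networks each produce class probabilities, and the committee averages them. This is an illustrative toy, not the article's implementation; the network sizes, the use of random (untrained) weights, and the simple averaging rule are all assumptions made here for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_small_net(n_in, n_hidden, n_out):
    """One committee member: a single-hidden-layer network with random
    weights (illustrative only -- a real committee trains each member)."""
    W1 = rng.normal(scale=0.5, size=(n_in, n_hidden))
    W2 = rng.normal(scale=0.5, size=(n_hidden, n_out))

    def forward(x):
        h = np.tanh(x @ W1)                                # hidden features
        logits = h @ W2
        e = np.exp(logits - logits.max(axis=-1, keepdims=True))
        return e / e.sum(axis=-1, keepdims=True)           # softmax probabilities

    return forward

# A committee: several small networks whose outputs are averaged.
committee = [make_small_net(n_in=4, n_hidden=8, n_out=3) for _ in range(5)]

x = rng.normal(size=(2, 4))                  # two toy 4-dimensional inputs
avg = np.mean([net(x) for net in committee], axis=0)
print(avg.shape)                             # (2, 3): averaged class probabilities
```

Averaging the members' probability outputs keeps each individual network small (and cheap to train) while the ensemble as a whole recovers some of the accuracy a single large network would provide.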