Summary of AlexNet

Title: ImageNet Classification with Deep Convolutional Neural Networks (2012)

Authors: Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton

Point of the Paper: “GPU memory enables training large networks on large image datasets.”

At a time when human performance was achieved on relatively small image datasets like MNIST and CIFAR-10, computer vision researchers were eager to achieve the same with large, high-resolution image datasets. Although datasets such as ImageNet provided large and real-world data, simple convolution neural networks (CNN) did not perform well on such high-resolution datasets. Innovative trials were limited by computational resources. AlexNet solved these problems by introducing deep CNN that achieved the state-of-the-art in the ImageNet LSVRC-2010 contest, and GPU parallelization.

Read More