Densely Connected Convolutional Networks

Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger

2017 · CVPR

Densely Connected Convolutional Networks

Problem

Framing

Residual nets ease optimization but still fuse features by addition, so earlier representations are not preserved explicitly and parameter reuse stays weak. DenseNet closes this gap by concatenating each layer into all later layers, cutting redundancy and reaching 3.46% error on CIFAR-10+.

Currently Used Methods

Foundational

Proposed Method

Architecture

A dense block concatenates all previous feature maps into each new layer. CIFAR models use three dense blocks with transition layers of batch normalization, 1times11 \\times 1 convolution, and 2times22 \\times 2 average pooling; DenseNet-BC adds bottlenecks and compression θ\theta.

Architecture: an input image passes through three dense blocks with internal all-to-all concatenative links, separated by convolution and pooling transition layers, then global pooling and a linear classifier.

Loss / Objective

The core design is the connectivity rule inside each block.

mathbfxell=Hell([mathbfx0,mathbfx1,ldots,mathbfxell1])\\mathbf{x}_{\\ell} = H_{\\ell}([\\mathbf{x}_0, \\mathbf{x}_1, \\ldots, \\mathbf{x}_{\\ell-1}])

Algorithm

Channel width grows linearly with growth rate kk, then transition layers compress it.

mathrmchannels(mathbfxell)=k0+k(ell1)\\mathrm{channels}(\\mathbf{x}_{\\ell}) = k_0 + k(\\ell - 1)

Training Procedure

Evaluation

Datasets

Metrics

Headline results

Results plot: ImageNet validation error versus parameter count, comparing ResNets and DenseNet-BC variants; DenseNet models lie below similarly accurate ResNets.

Ablations

Method Strengths and Weaknesses

Strengths

Weaknesses

Suggestions from the authors

Links

Prior Papers

Further Papers