Skip to content

Read -- Deep Image Prior

Deep convolutional networks <—> DIP
DIP:

the structure of a generator network is sufficient to capture a great deal of low-level image statistics $prior$ to any learning.
a randomly initialized neural network can be used as a handcrafted prior
can be used to invert deep neural representations to diagnose them

Inpainting

a binary mask : $m\in{0,1}^{H\times W}$

$m_{i, j}=0$ means this is a missing pixel.
$m_{i, j}=1$ means on position ${i, j}$ pixel is observed.

$\odot$: Hadamard product
$E(x; x_0)$ : the pixel difference between visible pixels

Convolutional Sparse coding (CSC):
represent a signal (such as image or an audio waveform) as
a sum of convolutions between

a small set of learned filters (the “dictionary”) and
corresponding sparse activation maps

Filters ${d_k}$—the small weight matrices that define what patterns to look for, and
Activation maps ${z_k}$—where and how strongly each filter occurs in the image.
The reconstruction $\sum_k d_k * z_k$ adds up each filter’s contribution across the image.