Meta-learning Sparse Implicit Neural Representations
Jaeho Lee / Meta-learning Sparse Implicit Neural Representations / NeurIPS 2021
Implicit Neural Representations (INRs), also known as coordinate-based representations, provide a generic way of representing many kinds of data modalities. An INR is obtained by training a neural network with continuous activation functions to predict the coordinate-to-value mapping. Take an image as an example: the network maps pixel locations to RGB values, and the image is stored directly as the parameters of the network.
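To make this concrete, below is a minimal sketch of fitting such a coordinate-based network to a single image in PyTorch. The layer sizes, learning rate, and number of steps are illustrative assumptions rather than the exact configuration from the paper.

```python
import torch
import torch.nn as nn

class INR(nn.Module):
    """Minimal coordinate-based network: (x, y) -> (r, g, b), with SIREN-style sine activations."""
    def __init__(self, hidden=256, depth=4):
        super().__init__()
        dims = [2] + [hidden] * depth + [3]
        self.layers = nn.ModuleList(nn.Linear(i, o) for i, o in zip(dims[:-1], dims[1:]))

    def forward(self, xy):
        for layer in self.layers[:-1]:
            xy = torch.sin(layer(xy))      # continuous, sinusoidal activation
        return self.layers[-1](xy)

def fit(image, steps=2000, lr=1e-4):
    """Fit the network to one H x W x 3 image in [0, 1]; the trained weights are the representation."""
    H, W, _ = image.shape
    ys, xs = torch.meshgrid(torch.linspace(-1, 1, H),
                            torch.linspace(-1, 1, W), indexing="ij")
    coords = torch.stack([xs, ys], dim=-1).reshape(-1, 2)
    target = image.reshape(-1, 3)
    model = INR()
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(steps):
        loss = ((model(coords) - target) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()
    return model                           # storing model.state_dict() stores the image
```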
Scalability is a major challenge today, since discrete representations require a huge amount of storage and computation. INRs provide a generic way to represent data while compressing it. Furthermore, using a continuous representation instead of the typical discrete one makes the data independent of spatial resolution, which allows high-dimensional processing. However, training INRs requires many parameters, a lot of memory, and many computation steps. In this paper, the authors propose an efficient algorithm for training sparse neural networks as INRs for a large number of signals.
Existing work on implicit neural representations is parameter-heavy; as a result, it requires a lot of memory and many computation steps.
There are three main approaches to training INRs on multiple signals. The first uses latent code vectors to share information across signals. The second uses meta-learning to pre-train the model. The last adds a uniform quantization of the weights of each INR. None of these approaches combines efficiency in both memory and computation with the ability to generalize to new signals.
The authors' solution to this problem is called Meta-SparseINR. This new framework adds neural network pruning to the meta-learning approach in order to obtain sparse INRs. Network pruning removes parameters from the neural representation by fixing them to zero, thus creating sparse networks. The "winning ticket" line of work demonstrated that a sparse subnetwork can match the results of the original dense model when trained from scratch, and that a winning ticket can be transferred to a new, highly related dataset. Existing works focus on pruning randomly initialized models, but the authors propose to prune a meta-trained model instead.
To find a well-initialized sparse subnetwork structure, the authors developed an algorithm called Meta-SparseINR that alternates between meta-learning and pruning. First, the weights are initialized according to the standard scheme for INRs. Then, the algorithm proceeds in three steps:
Meta-learning of the signals using Model-Agnostic Meta-Learning (MAML): MAML is a form of few-shot meta-learning that provides a method for fast adaptation of neural networks, and it can be used with any model trained by gradient descent. MAML relies on the idea that some internal features are more transferable than others, so the model can generalize well with only a few gradient steps and little training data. To obtain a sparse subnetwork, the authors augment MAML with a binary mask applied to the parameters through an element-wise product.
Magnitude-based pruning: the algorithm computes the magnitude scores of the learned INR weights and removes a percentage of the connections. Connections whose magnitude falls below a single threshold shared across every layer are cut off, and the shared meta-model is pruned once rather than pruning every individual signal.
Retrain and repeat: the algorithm alternates these steps until the target sparsity level is reached (see the sketch below).
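Putting the three steps together, the sketch below shows one possible shape of the alternation, reusing the `INR` network from earlier. It is a simplification under several assumptions: a Reptile-style first-order meta-update stands in for the paper's MAML step, sparsity is maintained by re-applying binary masks after every update, `sample_signal()` is a hypothetical helper returning the coordinates and RGB targets of one training image, and the 20% per-round pruning fraction is an illustrative choice.

```python
import copy
import torch

def apply_masks(model, masks):
    """Element-wise product with the binary masks keeps pruned weights at exactly zero."""
    with torch.no_grad():
        for p, m in zip(model.parameters(), masks):
            p.mul_(m)

def meta_train(model, masks, meta_iters=1000, inner_steps=2, inner_lr=1e-2, meta_lr=1e-1):
    """Step 1: meta-learn the surviving weights (first-order stand-in for the paper's MAML)."""
    for _ in range(meta_iters):
        coords, rgb = sample_signal()                 # hypothetical: one training image
        fast = copy.deepcopy(model)                   # inner-loop copy, adapted to this signal
        opt = torch.optim.SGD(fast.parameters(), lr=inner_lr)
        for _ in range(inner_steps):
            loss = ((fast(coords) - rgb) ** 2).mean()
            opt.zero_grad(); loss.backward(); opt.step()
            apply_masks(fast, masks)
        with torch.no_grad():                         # move meta-weights toward adapted weights
            for p, q in zip(model.parameters(), fast.parameters()):
                p.add_(meta_lr * (q - p))
        apply_masks(model, masks)

def prune_by_magnitude(model, masks, frac=0.2):
    """Step 2: global magnitude pruning with one threshold shared across every layer."""
    surviving = torch.cat([p[m.bool()].abs() for p, m in zip(model.parameters(), masks)])
    threshold = torch.quantile(surviving, frac)
    with torch.no_grad():
        for p, m in zip(model.parameters(), masks):
            m.mul_((p.abs() > threshold).float())
            p.mul_(m)

# Step 3: alternate meta-training and pruning until the target sparsity is reached.
model = INR()
masks = [torch.ones_like(p) for p in model.parameters()]
for _ in range(10):                                   # 10 rounds at 20% -> ~10.7% of weights remain
    meta_train(model, masks)
    prune_by_magnitude(model, masks, frac=0.2)
```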
Datasets for image regression: CelebA for face data, Imagenette for natural images, and 2D SDF for geometric patterns.
Model: the Meta-SparseINR algorithm applied to multi-layer perceptrons with sinusoidal activation functions.
Baselines: Random Pruning, Dense-Narrow, MAML+OneShot, MAML+IMP, Scratch.
Evaluation metric: peak signal-to-noise ratio (PSNR), which measures the reconstruction quality of images and videos.
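For reference, PSNR is derived from the mean squared reconstruction error. A minimal version, assuming pixel values in [0, 1], could look like this:

```python
import torch

def psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio in dB; higher means a more faithful reconstruction."""
    mse = torch.mean((pred - target) ** 2)
    return 10.0 * torch.log10(max_val ** 2 / mse)
```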
The authors' goal is to propose a method for training compressed INRs that is efficient in terms of both memory and computation steps.
Memory performance
In Figure 1, the authors compare the PSNR of Meta-SparseINR against the baselines for different numbers of model parameters. Meta-SparseINR outperforms all baselines on all three datasets. The results are particularly significant for very sparse models, since the baselines' scores drop off earlier than Meta-SparseINR's.
As the table shows, Meta-SparseINR needs half as many parameters as the strongest baselines, Dense-Narrow and Random Pruning, to obtain a similar PSNR. Meta-SparseINR is therefore a memory-efficient method for training INRs.
Computation performance
In another experiment, the authors compared the PSNR of Meta-SparseINR against the baselines after training for a small number of optimization steps (fewer than 100). Meta-SparseINR consistently delivers better results than the baselines.
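In code, this comparison roughly amounts to adapting each meta-learned model to an unseen signal for a handful of gradient steps and reporting PSNR. A minimal sketch, reusing the hypothetical pieces above (with an assumed step budget and learning rate):

```python
import copy
import torch

def adapt_and_evaluate(meta_model, masks, coords, rgb, steps=100, lr=1e-2):
    """Fit a copy of the meta-learned sparse INR to one unseen image, then report PSNR."""
    model = copy.deepcopy(meta_model)
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for _ in range(steps):
        loss = ((model(coords) - rgb) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()
        apply_masks(model, masks)          # the sparse structure stays fixed at test time
    with torch.no_grad():
        return psnr(model(coords), rgb)
```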
Since the authors' model requires fewer computation steps and less memory to achieve the same PSNR as the other models, they succeeded in providing an efficient method for training INRs.
In this paper, the authors proposed an efficient way to train implicit neural representations on a large set of signals. They achieved this by combining meta-learning and network pruning. Using MAML allowed them to learn initial weights from which each signal can be fit with a small number of gradient steps, in a few-shot learning fashion, while network pruning introduces sparsity by reducing the number of parameters. By pruning the shared network once instead of pruning each signal individually, they keep the method computationally efficient. In the end, they outperformed the usual dense models with half the number of parameters. Since the main contribution of this paper lies in the new pipeline, they used relatively simple methods for the implementation. Adding layers of complexity to the pipeline could enable other tasks, such as neural compression, or yield better representations.
Alternating between meta-learning and network pruning to find a well-initialized sparse INR is efficient in terms of both memory and computation while maintaining the generalization ability of the model.
Emeline BAGORIS
KAIST, Graduate School of AI
GitHub implementation: https://github.com/jaeho-lee/MetaSparseINR
Meta-Learning Sparse Implicit Neural Representations: https://arxiv.org/abs/2110.14678