Finding non-uniform quantization schemes using multi-task Gaussian processes
We propose a novel method for neural network quantization that casts the neural architecture search problem as one of hyperparameter search to find non-uniform bit distributions throughout the layers of a CNN. We perform the search assuming a Multi-Task Gaussian Processes prior, which splits the pro...
Những tác giả chính: | , , |
---|---|
Định dạng: | Internet publication |
Ngôn ngữ: | English |
Được phát hành: |
2020
|