Finding non-uniform quantization schemes using multi-task Gaussian processes

We propose a novel method for neural network quantization that casts the neural architecture search problem as one of hyperparameter search to find non-uniform bit distributions throughout the layers of a CNN. We perform the search assuming a Multi-Task Gaussian Processes prior, which splits the pro...

Mô tả đầy đủ

Chi tiết về thư mục
Những tác giả chính: Gennari do Nascimento, M, Costain, TW, Prisacariu, VA
Định dạng: Internet publication
Ngôn ngữ:English
Được phát hành: 2020