Mirror Descent view for Neural Network quantization

Quantizing large Neural Networks (NN) while maintaining the performance is highly desirable for resource-limited devices due to reduced memory and time complexity. It is usually formulated as a constrained optimization problem and optimized via a modified version of gradient descent. In this work, b...

Cijeli opis

Bibliografski detalji
Glavni autori: Ajanthan, T, Gupta, K, Torr, PHS, Hartley, R, Dokania, PK
Format: Conference item
Jezik:English
Izdano: Journal of Machine Learning Research 2021