Mirror Descent view for Neural Network quantization

Quantizing large Neural Networks (NN) while maintaining the performance is highly desirable for resource-limited devices due to reduced memory and time complexity. It is usually formulated as a constrained optimization problem and optimized via a modified version of gradient descent. In this work, b...

Cijeli opis

Bibliografski detalji
Glavni autori:	Ajanthan, T, Gupta, K, Torr, PHS, Hartley, R, Dokania, PK
Format:	Conference item
Jezik:	English
Izdano:	Journal of Machine Learning Research 2021

Mirror Descent view for Neural Network quantization

Slični predmeti