Text this: Deep reinforcement learning-based beam training with energy and spectral efficiency maximisation for millimetre-wave channels