Reinforcement learning-based dynamic band and channel selection in cognitive radio ad-hoc networks