Text this: Asynchronous hybrid reinforcement learning for latency and reliability optimization in the metaverse over wireless communications