Text this: OFFER: Off-environment reinforcement learning