Text this: Trading performance for stability in Markov decision processes