This document: Trading performance for stability in Markov decision processes