An MRP formulation for supervised learning: generalized temporal difference learning models