Zaslat SMS: VariBAD: variational bayes-adaptive deep RL via meta-learning