Text this: A conditional deep generative model of people in natural images