SMS dit: Learning affordances in object-centric generative models