Text this: Document similarity in repeatedly translated corpora