Text this: A computational framework for complex disease stratification from multiple large-scale datasets.