A Submodular Optimization Framework for Imbalanced Text Classification With Data Augmentation