Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training