Text this: Feature Encodings and Poolings for Action and Event Recognition: A Comprehensive Survey