Describir: An NLP-based technique to extract meaningful features from drug SMILES