Text this: Automatic Image Annotation by Sequentially Learning From Multi-Level Semantic Neighborhoods