Deep Fragment Embeddings for Bidirectional Image Sentence Mapping